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HIS volume is the seventh in a series of 

publications* designed to assist test 

users in education, industry, psychiatry, 
and psychology to locate and to evalute tests and 
books on testing. A detailed account of the initia- 
tion and growth of this service to test users has 
been presented in the Introduction of The Nine- 
teen Forty Mental Measurements Yearbook. 
Consequently, the earlier publications in this 
series will-be described only briefly here. 


EARLIER PUBLICATIONS 


The first two publications in this series were 
noncritical bibliographies of tests and were pub- 
lished in 1935 and 1936, respectively. The third 
publication, published in 1937, was expanded to 


‚ 1 Buros, Oscar К. Educational, Psychological, and Personal- 
ity Tests of 1933 and 1934. Rutgers University Bulletin, Vol. тт, 
No. 11; Studies in Education, No. 7. New Brunswick, N.J.: 
School of Education, Rutgers University, May 1935. Pp. 44. 
Paper. Out of print. * $ ^ 
Buros, Oscar K. Educational, Psychological, and Personality 
Tests of 1933, 1934, and 1935. Rutgers University Bulletin, Vol. 
13, No. 1; Studies in Education, No. o. New Brunswick, N.J.: 
School of Education, Rutgers University, July 1936. Pp. 83. 
ue $0.50. * For reviews, see 40:B856, 38:B325, and 36: 
46. x 
Buros, Oscar К. Educational, Psychological, and Personality 
Tests of 1936: Including a Bibliography and Book Review Digest 
of Measurement Books and Monographs of 1933-36. Rutgers 
niversity Bulletin, Vol. 14, No. 2A; Studies in Education, No. 
11, New Brunswick, N.J.: School of Education, Rutgers Uni- 
versity. August 1937. Pp. 141. Paper. $0.60. * For reviews, see 
40:B857 and 38:B326. n 3 : У 
Buros, Oscar E Editor. The Nineteen Thirty Eight 
Mental ‘Measurements Yearbook 2 the School of Education, 
Rutgers, University. Foreword by Clarence Elmar Partch. New 
Brunswick, N.J.: Rutgers University Press, 1938. Pp. xv, 415. 
Out of print. * For reviews, see 40:В858., 
Buros, Oscar Krisen, Editor. The Nineteen Forty Mental 
Measurements Yearbook. Highland Park, N.J.: Gryphon Press, 
1941, Pp. xxv, 674. Out of print. * For reviews, see B7o and 


3:788. 

M Oscar Krisen, Editor. The Third Mental Measurements 
Yearbook, New Brunswick, N.J.: Rutgers University Press, 
1949. Pp. xv, 1047. $12.50. * For reviews, see B71. 


include a bibliography of books on testing and 
closely related fields along with critical excerpts 
from published book reviews. 

With the introduction of critical test reviews 
in 1938, the series title Educational, Psychologi- 
cal, and Personality Tests was replaced by the 
more appropriate title The Mental Measure- 
ments Yearbook. The Nineteen Thirty Eight 
Mental Measurements Y earbook was expanded 
to include 331 original reviews of tests by 133 
educators, psychologists, subject matter special- . 
ists, teachers, and test technicians. Since one of 
the two book-review-excerpt sections, “Research 
and Statistical Methodology Books," was of 
value to research workers not interested in test- 
ing, it was also published as a separate mono- 
graph.? ч 

The fifth publication, The Nineteen Forty 
Mental Measurements Y earbook, was published 
in 1941. The number of test reviewers was nearly 
doubled and the number of original test reviews 
and excerpts greatly increased. In order to pro- 
vide more space for test reviews, the section on 
research and statistical methodology books was 
published as a separate volume as one of a new: 
series ê of reference works. 

The Third Mental Measurements Yearbook, 
published in January 1949, covered the period 
1940 through 1947. This first postwar volume 


2 Buros, Oscar Krisen, Editor. Research and Statistical Meth- 
odology Books and Reviews, 1933-1938. Highland Park, N.J.: 
GE Press, 1938. Pp. vii, 100. $1.50. * 

8 Buros, Oscar Krisen, Editor. The Second Yearbook of Re- 
search and Statistical Methodology Books and Reviews. High- 
land Park, N.J.: Gryphon Press, 1941. Pp. xx, 383. $5.00. *- 

Buros, Oscar Krisen, Editor. "Statistical Methodology Reviews, 
1941—1950. New York: John Wiley & Sons, Inc., 1951. Pp. xi, 
457. $7.00. (London: Chapman & Hall, Ltd. 565.) * 
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included 663 tests, 713 test reviews by 320 re- 
viewers, 66 test-review excerpts, 3,368 test ref- 
erences, 549 books, and 785 book-review ex- 
cerpts from 135 journals. 


THE FOURTH YEARBOOK 


This volume covers the period 1948 through 
1951. Plans to publish a yearbook within two or 
three years after the publication of The Third 
Yearbook had to be abandoned because of inade- 
quate financial support. If the income derived 
from sales of The Fourth Yearbook is sufficient 
to finance the preparation of future volumes, the 
Editor hopes to reduce е intervalbetween suc- 
cessive yearbooks to three years. Asin the past, 
each yearbook will consist entirely of new ma- 
terial and will, therefore, supplement rather than 
supplant earlier yearbooks in the series. 


TESTS AND’ REVIEWS 


The section “Tests and Reviews” lists 793 
tests, 596 original test reviews by 308 reviewers, 
53 excerpts from test reviews in 15 journals, 
and 4,417 references on the construction, valid- 
ity, use, and limitations of specific tests. Fifty- 
four per cent of the 793 tests have been reviewed 
by one or more reviewers ; 18.5 per cent by two 
or more reviewers ; and 2.3 per cent by three or 
more reviewers. Some of these tests have been 

,reviewed in earlier yearbooks; many will be 
again reviewed in future yearbooks. It is planned 
that most tests will eventually receive at least 
three reviews. 

The yearbook attempts to list all commercially 
available tests—educational, psychological, and 
vocational—published as separates in English- 
speaking countries in the four-year period 1948— 
1951. The commercially available tests also 
include older tests selected for review and 
tests published. during the nineteen-year period 
(1933-1951 ) covered by this series of yearbooks 
and bibliographies but not previously listed. 

For the first time in this yearbook series, many 
tests which are available only as a part of highly 
restricted testing programs have been listed and 
reviewed. The following organizations agreed 
to make their confidential tests available to Men- 
tal Measurements Vearbook reviewers: Asso- 
ciation of American Medical Colleges, College 
Entrance Examination Board, Educational Teste 
ing Service, Life Insurance Agency Manage- 
ment Association, National’ League of Nursing 
‚ Education, and Psychological Corporation. Un- 
fortunately, because of space limitations, most of 


these restricted tests and testing programs have 
been listed without reviews in this volume. 

The objectives of the section “Tests ‘and Re- 
views” are the same as those reported for the 
corresponding section in The 1940 Yearbook: 


1) To make readily ayailable comprehensive and up- 
to-date bibliographies of recent tests published in all 
English-speaking countries. 

2) To make readily available hundreds of frankly 
critical test reviews, written by persons of outstanding 
ability representing various viewpoints, which will assist 
test users to make more discriminating selections of the 
standard tests which will best meet their needs. 

3) To make readily available comprehensive and ac- 
curate bibliographies of references on the construction, 
validation, use, and limitations of specific tests. 

4) To impel authors and publishers to place fewer 
but better tests on the market and to provide test users 
with detailed and accurate information on the construc- 

. tion, validation, uses, and limitations of their tests at 
* the time that they are first placed on the market. — 

5) To suggest to test users better methods of arriving 
at their own appraisals of both standard and non- 
standard tests in light of their particular values and 
needs. 

6) To stimulate cooperating reviewers—and others 
to a less extent—to reconsider and think through more 
carefully their beliefs and values relevant to testing. 

7) To inculcate upon test users a keener awareness of 
both the values and dangers which may accompany the 
use of standard tests. 

8) To impress test users with the desirability of sus- 
pecting all standard tests—even though prepared by 
well-known authorities—unaccompanied by detailed 
data on their construction, validation, use, and limita- 
tions. 


BOOKS AND REVIEWS 


The section “Books and Reviews" lists 429 
books on measurements and closely related field: 
and 758 excerpts from book reviews in 121 jour- 
nals, An attempt has been made to list all meas: 
‘urements books published in English-speaking 
countries in the four-year period 1948-1951. In 
addition, a few older books are listed when ac- 
companied by review excerpts not previously 
published in this series. The range of Mots is the 
same as that in The Third Yearbook with one 
exception. Books on statistical methods in edu- 
cation and psychology published in the eleven- 
year period 1941—1951 are included but without 
accompanying reviews. Instead, cross references 


are given to reviews in Statistical Methodology > 


Reviews, 1041—1050. уы д... 
The objectives of the section “Books and Re- 
, views" are the same as those reported for the 
` corresponding section in The 1940 Yearbook: 


1) To make readily available comprehensive and up- 
to-date bibliographies of recent books published in all 
English-speaking countries in measurements and closely 
associated fields. 

2) To make readily available evaluative excerpts 
from hundreds of book reviews appearing in a great 
variety of journals in this country and abroad in order 
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to assist test users to make more discriminating selec- 

tions of books for study and purchase. . 

_ 3) To stimulate readers to develop more critical at- 

titudes toward what they read. — 

4) To make readily available important and pro- 
vocative statments which, though appearing in book 
reviews, have considerable value-entirely apart from а 
consideration of the book under review. - 

5) To point out books which are not being reviewed 
but which probably merit review. 

6) To improve the quality of book reviews by stimu- 


lating review editors to make greater efforts to choose . 


competent reviewers who will contribute frankly criti- 
cal reviews. ^ 

7) To improve the quality of book reviews by stimu- 
lating reviewers "to take their responsibilities more 
seriously" by refusing to review books which they can- 
not appraise competently and honestly. 


SUGGESTIONS FOR TEST REVIEWERS 


The following instructions were sent to each 
test reviewer: 


I) Test reviews should be written with the following 
major objectives in mind : 

а) To provide test users with carefully prepared 
appraisals of tests for their guidance in selecting and 
using tests. 

b) To stimulate progress toward higher profes- 
sional standards in the construction and use of tests 
by commending good work, by censuring poor work, 
and by suggesting improvements. 

c) To impel test authors and publishers to present 
more detailed information on the construction, valid- 
ity, uses, and misuses of their tests. 

2) Concise reviews of 300 to 700 words are wanted. 
The average length of the reviews written by one per- 
son may not exceed 800 words unless arrangements are 
made otherwise with the Editor. 

3) Reviews should be frankly critical, with both 
strengths and weaknesses pointed out in a judicious 
manner. Descriptive comments should be kept to the 
minimum necessary to support the critical portions of 
the review. Criticism should be as specific as possible; 
implied criticisms meaningful only to testing specialists 
should be avoided. Reviews should be written primarily 
for the rank and file of test users. An indication of the 
relative importance and value of a test with respect to 
competing tests should be presented whenever possible. 
Ifa reviewer considers a competing test better than the 
one being reviewed, the competing test should be specif- 
ically named. 

4) If a test manual gives insufficient, contradictory, 
or ambiguous information regarding the construction, 
validity, and use of a test, reviewers are urged to write 
directly to authors and publishers for further informa- 
tion. Any comments which a reviewer may make based 
upon unptiblished information should clearly indicate 
the source of the information. Test authors and pub- 
lishers should be held responsible for presenting ade- 
quate data in test manuals—failure to do so should be 

j дый. 


pointed out. ¢ 


5) If for any reason a reviewer thinks he is not in 


a position to write a frankly critical review in a scholarly 
and unbiased manner, he should request the Editor to 


substitute other tests for review.. CM 
6) Each test will be reviewed by two or more per- 


sons in order to secure better representation of various 
viewpoints. Noncritical content which excessively over- 
laps similar material presented by another reviewer will 
be deleted by the Editor. Reviews will be carefully 
edited, but no important changes will be made without 
the consent of the reviewer. Galley proofs (unaccom- 


panied by copy) will be submitted to reviewers for 
checking. 

7) The Editor reserves the right to reject any review 
which does not meet the minimum standards of the year- 
book series. 

8) Each review should conclude with a paragraph 
presenting a concise summary of the reviewer’s overall 
evaluation of the test. 

9) The use of sideheads and centered heads is op- 
tional with reviewers. For an example of the use of 
headings, see Herbert S. Conrad’s review of test 20 in 
The Third Mental Measurements Yearbook. 

10) Reviewers will be furnished with bibliographic 
entries which will precede their reviews. Information 
presented in the entry should not be repeated in reviews 
unless such repetition appears desirable. 

11) Reyiewers may not invite others to collaborate 
with them in writing a review unless permission is 
secured from the Editor. i 

12) A separate review should be prepared for each 
test. Each review should begin on a new sheet. The test 
and forms reviewed should be clearly indicated. Your 


= name, title, position, and address should precede each 


* 


‘review, e.g.: Oscar К. Buros, Director, Institute of 
Mental Measurements, Rutgers University, New Bruns- 
wick, New Jersey. The review should begin immediately 
after the address. | 

13) Type your review in triplicate; send two copies 
to The Mental Measurements Yearbook office. 

* 14) Each reviewer will receive a complimentary copy 
of The Fourth Mental Measurements Yearbook and the 
Index to Tests. The latter will be a bibliography of tests 
in print and an index to the contents of the first four 
Mental Measurements Yearbooks. 

15) The due date for your reviews is „Айа 
few cases arrangements may be made to submit reviews 


. at a later date—especially where additional time is 


needed for trying out the tests. 


HOW TO USE THIS YEARBOOK 


The following suggestions should be read care- 
fully by all persons wishing to get the maximum 
value out of The Fourth Mental Measurements 
Yearbook. 

1) Table of Contents. The Table of Contents 
should be consulted first to get an overall picture 
of the volume’s contents and the classification 
plan used. The Table of Contents lists all the 
headings and the main subheadings under which 
tests are classified in the section “Tests and Re- 
views.” The subdivisions under the subheading 
“Specific Vocations” are not listed in the Con- 
tents. ( " Wr 

2) Classified Index of Tests. After examining 
the Table of Contents, the reader may find it 
profitable to turn to the Classified Index of Tests 
at the end of the volume. The Classified Index, 


.an expanded table of contents of the section 


“Tests and Reviews," presents a complete list of 
all tests and reviewers represented in this vol- 
A 
ume. 
* 3) Page and Entry Numbers. Page numbers 
appear in the running heads next to the inside 
margins. Entry numbers (i.e., the numbers аѕ-* 


. signed to specific tests and books) appear in the ` 
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running heads next to the outside margins. The 
entry numbers on facing pages represent the first 
and last tests or books listed on those pages. The 
Table of Contents refers to page numbers; to 
cross references and the indexes refer to entry 
numbers; except when using the Table of Con- 
tents, the reader will have no need to use page 
numbers. 

4) Stars, Asterisks, and Ellipses. A star pre- 
ceding an entry indicates a new book or test not 
previously listed in this series of yearbooks and 
bibliographies. An asterisk preceding an entry 
indicates a book or test which has been revised 
or supplemented in some way since being last 
listed. An asterisk following a book entry, a test 
entry, or a test reference indicates that the entry 
or reference was prepared from a firsthand ex- 
amination of the publication in question. Aster- 
isks and ellipses in quotations and excerpts indi- 
cate omissions; asterisks indicate a break in the 
continuity of reading, and ellipses indicate con- 
tinuity of reading. — 

5) Citations for Cross References. Cross ref- 
erences to reviews and references in this and 
earlier volumes are frequently made. Cross refer- 
ences should be interpreted thus: “see 416" re- 
fers to entry 416 in this-yolume; "see 3: 368" 
refers to entry 368 in The Third Mental Meas- 
urements Y earbook; “see 40 : 1236” refers to en- 
try'1236 in The Nineteen Forty Mental Meas- 
urements Yearbook; “see 38:932” refers to 
entry 932 in The Nineteen Thirty Eight Mental 
Measurements Yearbook; and “see 36: B210" 
refers to entry B210 in Educational, Psycholog- 
ical, and Personality Tests of 1936. 

6) Test Entries. For each test, an attempt 
has been made to present in the bibliographic en- 
try the following information in the order given : 


a) TITLE. The test titles are printed in boldface type. 
Secondary and series titles are set off from main titles 
by a colon. Subtest or booklet titles are printed in small 
capital letters. When the titles on the test booklet and 
the test manual differ; the better known title is used. 

b) DESCRIPTION OF THE GROUPS FOR WHICH THE TEST 
IS INTENDED. The grade, chronological age, or semester 
range is usually given. “Grades 1B, 1A, 2-3, 4-12, 13- 
17" indicates that there are five test booklets: a booklet 
for the first half of grade r, a booklet for the second 
half of grade 1, a booklet for grades 2 and 3, a booklet 
for grades 4 to 12 inclusive, and a booklet for under- 
graduate and graduate students in colleges and univer- 
sities. “First, second semesters” indicates that there are 
two test booklets: one covering the work of the first 
semester, the other covering the work of the second 
semester. "I, 2 semesters” indicates that the second 
booklet covers the work of the two semesters. 

_¢) DATE OF COPYRIGHT OR PUBLICATION. The inclusive 
range of copyright dates (or publication dates if not 
copyrighted) for the various forms, accessories, and 
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editions of a test is reported. When the publication date 
differs from the copyright date, both dates are given; 
eg. “1948, c1946-48" means that the test was copy- 
righted both in 1946 and in 1948 but was not published 


,until 1948. 


d) PART scores. The number of part scores is pre- 
sented along with their titles or descriptions of what 
they presumably represent. 

€) INDIVIDUAL OR GROUP TEST. All tests are group tests 
unless otherwise indicated, à 

f) MACHINE SCORABLE TESTS. Tests which may be 
scored by the International Test Scoring Machine (see 
3:397) are marked “IBM.” Е 

9) FORMS, PARTS, AND LEVELS. All available forms, 
parts, and levels are listed with the most recent date of 
publication. Out-of-print forms are usually listed but 
without accompanying dates. ў 

А) RELIABILITY AND VALIDITY. Ап important change 
in the test entries is the inclusion of statements regard- 
ing the complete absence of data in the test manual on 
the reliability and validity of the test. — 

i) cost ОЕ TESTS. Price information is given correct. 
as of early 1952. Although every precaution has been 
taken to ensure accuracy, some prices may be in error 
and other prices may have changed. For full and up-to- 
date information on test prices, the latest catalogs of test 
publishers should be consulted. М 

j) TIME. The number of minutes of actual working 
time allowed examinees and the approximate length of 
time needed for administering a test are reported when- 
ever obtainable. The latter figure is always enclosed in 
parentheses. Thus, “50(60) minutes" indicates that the 
examinees are allowed fifty minutes of working time 
and that a total of sixty minutes is needed to administer 
the test. 

k) AuTHOR. For most tests, all authors are reported. 
In the case of tests which appear in a new form each 
year, only authors of the most recent forms are listed. 
Names are reported exactly as printed on test booklets. 
Names of editors are not reported for tests edited by 
two or more persons. 

1) PUBLISHER. The full name of the publisher or 
distributor is reported for each test. For addresses of 
the publishers, see the Publishers Directory and Index. 


7) Test References. All known references, 
published and unpublished, on the construction, 
validity, use, and limitations of each test are re- 
ported immediately after the test entry. These 
references are arranged in chronological order 
by year of publication and alphabetically by au- 
thors within years. The test bibliographies are 
believed to be fairly complete through 1951; no 
references for 1952 are included. In order to as- 
sist students who wish to do selected reading on 
a particular test, references are given to ab- 
stracts in Psychological Abstracts. For example, 
“(РА 26: зот)” refers to abstract 301 in Vol- 
ume 26 of Psychological Abstracts; *( РА 27: 
587, title only)" indicates that the reference was 
listed but not abstracted in Psychological Ab- 
stracts. 

8) Original Test Reviews. Original test re- 
views of a particular test are arranged in alpha- 
betical order by reviewers. Cited references 
which are also references for the test under re- 
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view are indicated by the use of italic numbers 
in parentheses. Cited references which are not 
among the test references are indicated by the 
use of superscripts which refer to footnotes. 

9) Excerpted Test Reviews. Excerpts from 
test reviews first published elsewhere appear im- 
mediately after the original test reviews in alpha- 
betical order by journal. In general, only critical 
comments were excerpted. 

10) Cross References to Test Reviews. Cross 
references to reviews of the same or earlier edi- 
tions of tests in earlier yearbooks and to related 
reviews in this volume follow the original and 
excerpted reviews. 

11) Book Entries. The books listed in the sec- 
tion “Books and Reviews” are arranged in al- 
phabetical order by authors with anonymous 
books preceding the others. Rather complete bib- 
liographic information—including both domestic 
and foreign publishers and prices—has been pre- 
sented for each book. In addition to measure- 
ment books, the section “Books and Reviews” 
lists books on statistical methodology in educa- 
tion and psychology. Review excerpts are not, 
however, presented for the statistical books since 
such review excerpts are available in another 
publication, Statistical Methodology Reviews, 
1941-1950. Cross references to the statistical 
methodology book reviews are given. 

12) Classified Index of Books. A roughly 
classified index presented at the beginning of the 
section “Books and Reviews” will assist the 
reader to locate books on a particular subject. 
In addition to using this index, readers are 
urged to skim over titles and excerpts in search 
of works which might otherwise be overlooked. 

13) Book Reviews. Excerpts from book re- 
views first published elsewhere are arranged 
under each book in alphabetical order by jour- 
nals. In general, only critical comments were ex- 
cerpted. Those books which deal with specific 
tests are also included in the section “Tests and 
Reviews” immediately following the relevant 
test ; review excerpts for these books appear only 
in the section “Tests and Reviews." 

14) Cross References to Book Reviews. Cross 
references to reviews of the same or earlier edi- 
tions of books either in this volume or in earlier 
volumes follow the excerpted reviews. 

15) Catchwords. The running heads present 
catchwords to assist readers in the location of 
particular material. 

16) Periodical Directory and Index. The ma- 
jor purpose of this directory and index is to serve 


as a key to the abbreviations used for excerpted 


journals. 
17) Publishers Directory and Index. The 


. major purpose of the directory and index is to 


provide addresses of test and book publishers. 
Stars indicate test publishers which issue cata- 
logs devoted entirely or in large part to tests; 
asterisks indicate other publishers of one or more 
tests listed in this volume. 

18) Index of Titles. If the exact title of a test 
or book is known, the quickest way of locating 
it is to use the Index of Titles, an alphabetical 
listing of all books and tests'in this volume. Book 
titles are set in italic type and testi titles i in roman 
type. 

19) Index of Names. This Tits the names of 
all authors, editors, and others mentioned in 
book entries, test entries, references, reviews, 
review excerpts, and cross references. The ab- 
breviations and numbers “following the names 
may be interpreted thus: “rev, 68” indicates au- 
thorship of an original review of test 68; “test, 
96” indicates authorship of test 96; “bk, B128” 
indicates authorship of book B128; “exc, 256” 
indicates authorship of an excerpted review; 


“ref, 126(34)” indicates authorship of reference- 


34 for test 126 ; “crossi ref, 1 86" indicates that the 
name is mentioned іп а^ cross. reference. under 
test 186; “f, 67” indicates that the name is men- 


tioned ina footnote toa review of test 67 5. “other, à 


574" indicates that the name is mentioned in the 
entry for test 574 in a manner not classifiable 
under any of the above fateBori s 
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ACHIEVEMENT BATTERIES 


Reviews вү Oliver F. Anderhalter, Harry D. Berg, Benjamin S. Bloom, Miriam M. Bryan, 

Herbert S. Conrad, Paul L. Dressel, Max D. Engelhart, Warren G. Findley, Gustav J. Froehlich, 

Eric F. Gardner, J. Raymond Gerberich, Paul R. Hanna, J. Thomas Hastings, Virgil E. Herrick, 

Charles R. Langmuir, Roger T. Lennon, Gordon N. Mackenzie, John V. McQuitty, Stanley D. 

Nisbet, Claude E. Norcross, A. Harry Passow, Ralph C. Preston, Alvin W. Schindler, J. Harlan 
Shores, Anton Thompson, and David V. Tiedeman. 


[1] 

*American School Achievement Tests. Grades 1, 
2-3, 4-6, 7-0; 1041-47; 4 levels; Robert V. Young, 
Willis E. Pratt, and Frank Gatto (I); Public School 
Publishing Co. * 

а) PRIMARY BATTERY I. Grade 1; 1941; 3 scores: word 
recognition, word meaning, numbers; Forms A, B; $3 
per 100, postage extra; 30 per specimen set, postpaid ; 
35(50) minutes. 

i5 PRIMARY BATTERY II. Grades 2-3; 1041; 7 scores: 
sentence and word meaning, paragraph meaning, arith- 
metic computation, arithmetic problems, language us- 
age, spelling, total; Forms A, B, C; norms ['41] ; $4.80 
per 100, postage extra; 36¢ per specimen set, postpaid; 
85(105) minutes. 

C) INTERMEDIATE BATTERY. Grades 4-6; 1942-43; 7 
scores: same as Primary Battery II; Forms A (42), 
B (743), С ('43) ; manual (’42) ; $7.20 per 100, postage 
extra; 366 per specimen set, postpaid; 127(147) min- 
utes. 


d) ADVANCED BATTERY. Grades 7-9; 1947; 7 scores: 


sentence meaning and word meaning, paragraph mean- 
ing, arithmetic computation, problem solving, language 
and spelling, total; Forms A, B, С; $7.20 per E yu 
age extra; 366 per specimen set, postpaid; 127(147) 
minutes. 
REFERENCES 

A E R. “Validity of Tests in Beginnin; 

Rea ori ARN seins F 4з. * (PA 18:2605) a z 


Rarrz C. Preston, Associate Professor of Ed- 
ucation, University of Pennsylvania, Philadel- 
phia, Pennsylvania. 


This test compares favorably with similar in- 
struments which have been constructed to sur- 
vey achievement in the basic academic skills at 
the elementary school level. 

This test's inclusion of a battery exclusively ` 
for the first grade is sensible. To be sure, the 
manual is not in accord with good practice when 
it suggests that the test be used during the first 
semester. Readiness tests would be more appro- 
priate at the beginning of the first grade, and this 
battery would seem to have its chief utility near 
the close of the school year. Word recognition is 
wisely tested through both auditory and visual 
stimuli. Word recognition and word meaning 
constitute the entire language portion of the first- 
grade battery. Thus, reading (as opposed to 
word calling) is not tested. This decision will be 
regretted by those teachers who are more inter- 
ested in a child's ability to read with understand- 
ing than in his ability to recognize isolated 
words. One of the splendid features of this first 
grade battery is its provision for the test ad- 
ministrator to give oral directions for each item 
throughout most of the test. For example, the 
administrator says, “Put a cross beside the word 
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“е?” and “Put a cross in the box beside the 
number that is larger than 25." Such detailed 
guidance is superior to having children of this 
age work independently. It reduces the preva- 
lence of low scores which commonly occur at the 
first grade level due to the child's failure to 
understand directions and to his inexperience 
with tests. 

The other three batteries, covering grades 2 to 
9, follow the usual pattern. The word meanings 
subtest is arranged in the following manner: 
“When one is ill he is " Four words fol- 
low, one of which is to be chosen. This arrange- 
ment makes for a more valid and functional vo- 
cabulary test than the common type in which 
each item contains a word for which the child 
is to select a synonym. 

In general, the alternative responses in the 
items of all areas are good and contribute to a 
discriminating test. The alternative responses 
are the product of careful study of errors com- 
monly made by children. The multiple choice 
device is happily used in the arithmetic subtests 
and better measures the child's number concepts 
than those tests which depend entirely on his 
skill at manipulating figures. 

Unfortunately, rigid time limits for the vari- 
ous subtests result in mixing measures of speed 
with measures of other factors such as paragraph 
comprehension and arithmetic problem solv- 
ing ability. Hence, the comprehension and skill 
Scores of the slow but accurate child may be 
quite misleading. The test under review, how- 
ever, sins no more in this respect than other com- 
parable achievement tests. Still, experience in 
the reviewer's clinic with all types of tests has 
led the reviewer to conclude that more accurate 

-evaluation may be secured when comprehension 
is measured independently of speed (as through 
the "repeating scales" of the Cooperative Eng- 
lish Test: Reading Comprehension at the high 
School and college levels, and as in the relatively 
untimed Gates Reading Survey at the elemen- 
tary and junior high school level). If di authors 
do not choose to do this, they should then report, 
as do the authors of the California Reading 
Tests, what percentage of pupils have been found 
to be able to complete the test within the speci- 
fied time limits. 

Most schools would find the spelling subtests 
unsatisfactory since they are tests of recognition 
of correct spelling rather than tests of ability to 

, reproduce from memory the correct spelling of 

a word. This is an illustration of how the con- 


venience of the multiple choice type of item may 
interfere with test validity. 

Are the norms sufficiently representative*so 
that almost any school could use them with 
reasonable confidence? That question cannot be 
answered from the information available in the 
test manuals. Data are not supplied concerning 
the geographical location of the school systems 
utilized in the standardization process, and it 
should be emphasized that norms for the Inter- 
mediate and Advanced Batteries are based on 
as few as 1,000 and 2,000 pupils, respectively, 
in varying types of school districts (the distribu- 
tion of numbers among the varying types is not 
reported). The urban districts used have popu- 
lations “of 5000 or more," which tells little of 
the size of these cities. It is to be hoped that 
eventually test publishers will realize that com- 
plete, forthright, and precise descriptions of 
standardization enormously and properly en- 
hance a test's prestige. In this connection, it is 
to the authors' credit that detailed information 
is supplied regarding the selection of test items. 
The care with which items were chosen for most 
of the subtests justifies confidence in their valid- . 
ity. 

Of major concern to consumers of tests will 
be the relatively low reliabilities reported for 
many of the subtests, some of the r’s running as 
low as .77, .72, .74, .76, etc. The Intermediate 
Battery is the most consistently reliable of the 
four batteries with r’s ranging from .86 to .94. 


For reviews by Walter W. Cook, and Gordon 
N. Mackenzie and Glen Hass of the tests for 
Grades 1—6, see 3:1. 


[2] 
*California Achievement Tests. Grades 1—4.5, 4-6, 
7-9, 9-14; 1933-51; a revision of Progressive Achieve- 
ment Tests (see 3: 15) ; 10 scores: reading vocabulary, 
reading comprehension, total reading, arithmetic rea- 
soning, arithmetic fundamentals, total arithmetic, me- 
chanics of English and grammar, spelling, total lan- 
guage, total; IBM for grades 4-14; 4 levels; tests in 
reading, arithmetic, and language are available as sep- 
arates; Forms AA ('so), BB ('so), CC (о), DD 
C50) ; manuals (’51) ; postage extra; 356 per specimen 
set of any one level, postpaid; Ernest W. Tiegs and 
Willis W. Clark; California Test Bureau. * 
а) PRIMARY BATTERY, Grades 1-4.5; $3 per 25; 90(110) 
minutes. 
b) ELEMENTARY BATTERY. Grades 4-6; $3 per 25; sep- 
arate answer sheets may be used; 46 per IBM answer 
sheet; 7¢ per scoreze answer sheet; $1.40 per set of 
stencils for machine scoring of answer sheets; бо@ per 
set of stencils for hand scoring of answer sheets; 
120(135) minutes. 
C) INTERMEDIATE BATTERY. Grades 7-9; prices same as 
for Elementary Battery ; 150(165) minutes. 
d) ADVANCED BATTERY. Grades 9-14; Forms AA, BB, 
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CC only; $1.80 per set of stencils for machine scoring 
of answer sheets ; other prices same as for Elementary 
Battery ; 150(165) minutes. 


REFERENCES 
1. See 40:1193. 
2-4. See Su: 
5. CLARK, ILLIS W. “Вазїс Skills in War Time." Ed 63: 


КЕЕ ЫП, J Am Assn Col Reg 20:367-82 Ap 45. * (PA 
19:2750 

7. CHuncH, Атғвер M. “The Standardized Testing Program 
Summary Report: Territorial Public Schools, 1946-47.” Hawaii 
Ed R 36:49-- О '47. * (PA 22:4627) 

8. SCHELLENBERG, Ernest Davin, A Study of the Relation- 
ship of the Progressive Achievement Tests and the California 
Mental Maturity Subtests, Master's thesis, University of South- 
ern California (Los Angeles, Calif.), 1948. 

9. Barley, Heren К. “A Study of the Correlations Between 
Group Mental Tests, the Stanford-Binet, and the Progressive 
Achievement Test Used in the Colorado Springs Elementary 
Schools.” J Ed Res 43:93-100 О '49. * (PA 24:2825) 

то. SPIVEY, GORDON | AURICE. The Relationship Between 
Temperament and Achievement of a Selected Group of John 
Muir College Students. Doctor's thesis, University of Southern 
California (Los Angeles, RE 1950. (Abstracts of Disserta- 
tions....1950, 1951, pp. 298-300. 

її. SPAULDING, GERALDINE. “The Use of the Progressive 
Achievement Test at the Ninth-Grade Level in Independent 
Schools," pp. 73-7. (PA 25:6477) In 1950 Fall Testing Program 
in Independent Schools and Supplementary Studies. Foreword 
by Ben D. Wood. Educational Records Bulletin, No. 56. New 
York: Educational Records Bureau, January 1951. Pp. xiii, 89. 
Paper, lithotyped. * ү й 

12. THOMPSON, ANTON. “Measurement of Pupil Progress in 
Learning to Spell Selected Words.” Calif J Ed Res 2:104-10 


My ’sr. 

WannEN С. Finptey, Director of Test Develop- 
ment, Educational Testing Service, Princeton, 
New Jersey.* 

The California Achievement Tests are, as the 
tests and manuals state, for the most part identi- 
cal with the 1943 edition of the Progressive 
Achievement Tests with minor editorial revi- 
sions. These tests may be hailed, as their prede- 
cessors have been, for the general suitability of 
their material to the grades for which they are 
intended, for the convenient outlines provided 
for analyzing types of errors made by pupils, and 
for the effort to provide a continuous measure- 
ment of growth in basic skills from primary 
grades through junior college. The California 
Achievement Tests can be criticized, as were 
their predecessors, for the excessive difficulty of 
much of the primary battery, for an undesir- 
able use of the term “diagnostic” to describe 
the analytical procedures favorably mentioned 
above, and for the vagueness of statement in 
the accessory materials about the standardiza- 


tion procedures. In view of the availability of - 


these latter criticisms in previous issues of The 
Mental Measurements Y earbook, it is difficult to 
understand why these faults have been allowed 
to persist, especially in a test that has been given 


a new name. 
A frequent criticism of standardized achieve- 


1 The author of this review is indebted to Clarence Derrick 
of the Test Development Department of Educational Testing 
Service for a memorandum on the reading and language tests 
and to Eleanor J. Riegel of the same department for a memo- 
randum on the arithmetic tests. 
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ment tests is that they measure a limited range 
of objectives, either those which are easiest to 
evaluate or, in the case of long established bat- 
teries, those which closely follow patterns set 
up some decades ago. Within each of the skill 
areas covered by these tests, the range of objec- 
tives tested is broad ; and, although the number 
of items in each subcategory is not sufficiently 
large to permit accurate measurement of indi- 
viduals in each such subtest, the subscores are 
useful indicators of the presence of difficulties in 
individuals and groups. Each complete battery 
affords from 18 to 21 such subscores. A con- 
venient profile form for these subscores appears 
on the back of each test booklet, and the inside 
back cover provides for summarization of the 
pupil's performance with respect to a still more 
refined set of subcategories. These tests were 
clearly designed for instructional use and lend 
themselves to such use by teachers and super- 
visors sufficiently sophisticated in testing to re- 
alize the limitations of measures based on vary- 
ing numbers of items. 

The individual test questions are generally 
good, testing significant understandings in func- 
tional form. They are clearly and efficiently pre- 
sented, and all batteries except the primary are 
set up for use with or without separate answer 
sheets, the response forms being completely ob- 
jective. 

A significant feature of these tests is that they 
offer scores for the same то part, subtotal, and 
total categories from grade 1 through grade 14. 
The continuity in interpreting scores over a wide 
grade range thus afforded is immediately useful 
at all levels and provides an aid not previously 
available to those who wish to plan curriculum 
or evaluation programs that stress continuous * 
growth of all students at all grade levels with re- 
spect to basic skills and common learnings. The 
use of grade scores to provide a continuous scale 
over so wide a range of grades results in special 
difficulties of interpretation at the higher grade 
levels where reading, arithmetic, and language 
are not generally taught. This limitation is miti- 
gated by the provision of tables of percentile 
norms for these grades, but even these fail to 
reflect growth and progress relative to earlier 
achievement. A new type of scale to indicate 
growth from the lowest measurable level to 
either mastery or distinctly superior adult ac- 
complishment is needed if interlocking batteries 
like these are to attain maximum usefulness. 

The single-grade reliability coefficients for 
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these tests are satisfactory, the values for the six 
basic components at the four levels ranging from 
.83 to .96 with a median of .go. The subtotal and 
total scores, of course, show progressively higher 
reliabilities. The manuals report that the grade 
Scores, percentile norms, and other normative 
data are based on samples of 50,000 to 100,000, 
stratified with respect to IQ distributions, 
amount of acceleration and retardation, and pro- 
portions of Caucasian, Mexican, Negro, and 
other ethnic groups. An effort has been made, 
then, to provide scales basic to growth stud- 
ies and to continuous interpretation of scores 
made by the same individual on adjacent batter- 
ies. 

On the basis of statements and data in the 
manuals alone, however, one might well be wor- 
ried over the soundness of the basic normative 
procedures. In particular, it is simply incredible 
that the grade placement and age norms and 
percentile-in-grade norms should be identical for 
all forms of a particular test at a particular level. 
But this is true of all the normative statistics for 
all то categories of part, subtotal, and total scores 
in every battery. Basic statistics will not roll 
over and play dead in this neat fashion as any 
seasoned test'builder will testify. The explana- 
tion of this surprising uniformity in the norms 
is to be found in the procedure used in building 
and norming the tests. Form AA in each battery 
constitutes a model for the other forms; each 
item in each of the other forms tests the same 
function as the correspondingly numbered item 
in Form AA. Moreover, these parallel items 
were carefully screened for equivalence of diffi- 
culty to the items in Form AA from a larger 
number of parallel items tried out. In each bat- 
tery, Form AA alone has been standardized in 
carefully selected samples of schools in eight 
geographical districts into which the publishers 
divide the United States for standardization pur- 
poses. It has then been inferred that the norms 
found for Form AA may be assumed to afford 
the best interpretation of achievement on Forms 
BB, CC, and DD. This reviewer is disposed to 
applaud the test-building procedure and to rec- 
ognize a certain common sense in using a single 
table of norms for interpreting achievement in 
a specific classroom. This will make good sense 
to the classroom teacher. For administrative, 
guidance, research or any other comparative pur- 
pose, however, the inevitable differences in true 
Scale equivalents of part and total scores are 
suppressed. Perhaps the solution is two sets of 


norms, one for ordinary classroom use, the other 
for comparative studies. 

Because of the unconventional procedures 
used in establishing the basic norms, the tables 
for correcting expectancies for groups with IQ's 
that differ from those reported as typical for each 
grade in the norms tables become a dubious aid, 
Although grave damage is often done to educa- 
tion and to individual personalities in the name 
of the IO, corrections for IQ are desirable in 
evaluation of group performance, and tables of 
such corrections constitute a step in the right 
direction. It is highly regrettable, therefore, that 
at best these constitute a refinement on otherwise 
crudely smoothed percentile ranks, measures 
that hardly warrant reporting of corrections to 
two decimal places. 

The use of the term “diagnostic” in so much 
of the literature about these tests is a disservice 
to those who wish to develop a properly objective 
terminology in educational measurement. Per- 
haps it is too much to hope that test users asa 
group will observe the niceties of usage, but it” 
seems reasonable to ask, as an earlier reviewer | 
did, that a term like “analytical” be substituted 
for “diagnostic” in the description of tests like 
these, saving the term “diagnostic” for measures 
that involve testing exhaustively the elements in 
a category, like the 100 additive combinations of 
two single-digit numbers or measures that de- 
scribe individual differences within a single 
grade with respect to such a category with some 
specified reliability, say .90. This reviewer has 
used the term semidiagnostic to refer to sub- 
scores like the 18 to 21 provided by the Cali- 
fornia batteries and then only when each was as 
reliable as the six basic component scores in 
each of these batteries. Adherence to a standard 
terminology should help unsophisticated test 
users to avoid the pitfall of placing too great 
weight on unreliable differences or gains in 
scores, 

The tests appear not to be speeded. In fact, the 
directions specify that 90 per cent of the pupils 
be allowed to finish unless this has not happened 
by a specified time limit which appears generous. 
This work-limit feature is desirable but leads one 
to underline the question of comparability of 
scores raised by Davis in an earlier review. Most 
students would certainly be expected to show 
to better advantage on norms in a slow group in 
which the maximum time was allowed than in a 
fast, able group that could complete the same 
material more rapidly. In a fast group.a particu- 
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lar student might be one of the 10 per cent who 
failed to finish, while in a slower group he might 
have finished easily and had time to review his 
answers. 

The component tests of the intermediate and 
advanced batteries are well adapted to meas- 
urement of poor learners, but most of the pri- 
mary and elementary tests are inadequate in this 
regard. For example, in the reading comprehen- 
sion test in the primary battery, intended for 
grade 1 through lower 4, the grade equivalent of 
the chance score that might be expected from 
random guessing is 2.4, and even a single right 
answer in a 30-item multiple choice test earns a 
grade score of 1.6. In the elementary battery, for 
all tests except vocabulary, the grade equivalent 
of a chance score based on random guessing is 
3.4 to 3.9, leaving no basis for determining the 
extent of deficiency of poor learners. On the 
other hand, for all tests in the intermediate bat- 
tery chance scores are at least a full grade below 
the lowest grade for which the tests are intended, 
and for the advanced tests the margin is several 
grades. 

The ceilings for the elementary and inter- 
mediate tests are quite adequate, the grade score 
equivalents of the maximum possible scores 
being consistently more than two grades above 
the highest grade level for which the tests are 
proposed. The same might be said of the ad- 
vanced tests insofar as they are used at the high 
school level, but the margin is less adequate for 
junior college testing. Similarly, the primary 
tests have adequate ceiling for grade 3, but 
hardly for grade 4. 

The format of the tests and accessories is at- 
tractive and clear. It is difficult, however, to 
distinguish quickly between materials for the 
different grade levels because the grade designa- 
tions are subordinated in the headings. The use 
of diagonal fraction lines is a real, but minor, 
flaw. The printing of reading selections and 
questions about them on different pages that 
require turning is a more serious criticism. 

A special feature, emphasized in connection 
with all California Test Bureau offerings, is the 
Scoreze. This is a special answer sheet made of 
two sheets sealed together at the edges, enclos- 
ing a sheet of catbon paper. One outer surface 
is identical with the one-page IBM answer sheet 
for the test, while the inner surface of the other 
sheet, which receives the carbon copy of the stu- 
dent's answer marks, has printed on it a scoring 
stencil and the same form for recording errors 


by type that is printed on the back of the test 
booklet. The other combination of outer and 
inner surfaces are duplicate copies of a record 
of raw scores and grade score profile for the 
student's performance on the subparts of the 
test. If the purpose is to encourage teachers and 
students to use test results to plan and motivate 
learning, the difference in cost between Scoreze 
and a regular answer sheet is amply justified. 
Scoring is speedy and accurate, and the two 
sheets, when separated, permit the original to 
serve as a record for administrative or guidance 
purposes of a central office, while the carbon be- 
comes the teacher's itemized record to use with 
the student. 

Before commenting on the batteries, level by 
level, this reviewer would like to question seri- 
ously the use of primary batteries of standard- 
ized achievement tests generally. Reading tests 
may well be used for instructional purposes in 
the primary grades, but the inclusion of stand- 
ardized measures in arithmetic and language 
tends either to stress mechanical aspects of learn- 
ing in these areas because they can be easily 
tested objectively or to measure with false pre- 
cision the skills of problem solving and verbal 
expression that inevitably involve reading com- 
prehension as a major factor. Administrators 
who have felt it desirable to extend formal test- 
ing programs down to the primary grades to 
obtain complete coverage of their schools may 
profitably ask whether any values they derive 
from the results justify imposing this strain on 
pupils and the implied standards on teachers of 
such young children. Authorities in child de- 
velopment stress the desirability of helping chil- 
dren in the primary grades to get a happy start 
on the road to learning in school at their own 
tempos and without undue pressure. After all, 
the payoff educationally is achievement by the 
end of the regular period of schooling in adoles- 
cence; while it may be true that early pressure 
brings earlier rote mastery, it may well be less 
efficient in producing mastery based.on under- 
standing. It should be repeated that this criti- 
cism is general and applies to primary batteries 
in all series. 

PRIMARY BATTERY. This is the weakest seg- 
ment of the offering. Wholly apart from the gen- 
eral criticism of primary batteries just presented, 
this battery is planned for use over such a wide 
range for the growth that takes place at this 
level as to be merely a source of frustration to 
most primary children, especially in grades т 
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and 2. This limitation mars an otherwise well 
conceived reading test. The language test of the 
primary battery measures knowledge of capi- 
talization, punctuation, and spelling, skills that 
should be mastered eventually but hardly 
stressed in primary grades. At best, this test is 
suitable for beginning fourth graders. The tests 
in arithmetic are out of focus, presenting too few 
problems that first or second graders can handle 
and inefficient measurement in third and fourth 
grades. The written directions the pupils must 
read and the verbally stated problems are fully 
as much a test of reading as of quantitative 
reasoning. 

ELEMENTARY BATTERY. Except for the fact 
that chance scores are not low enough to per- 
mit measurement of poor learners in grades 4 
and 5, these tests are generally satisfactory. The 
use of both synonyms and antonyms subtests in 
the vocabulary test may be questioned as a 
needless source of confusion to pupils when per- 
formance on the two item-types is generally so 
highly correlated. The language test is probably 
as suitable for national use as any other since it 
avoids most of the disputable points of gram- 
mar. The directions for the punctuation section 
do not make sufficiently clear that in certain in- 
stances more than one mark of punctuation is 
needed at the numbered position. The arithme- 
tic tests are well designed to measure ability to 
translate written numbers and Roman numerals 
into Arabic numerals, judgment of the relative 
magnitude of numbers, ability to solve word 
problems, and facility in the fundamental opera- 
tions. 

INTERMEDIATE BATTERY. The tests at this level 
are generally the most satisfactory of the series. 
The 9o items in the vocabulary section have 
been grouped by the four principal areas of the 
curriculum : mathematics, science, social science, 
and general reading. This helps guarantee bal- 
anced sampling and facilitates an emphasis on 
teaching and learning key terminology in each 
subject area. Teachers should use these separate 
scores, however, with full knowledge that they 


are highly correlated and only moderately re- ` 


liable in comparison with the total score for vo- 
cabulary. A number of questions on reading se- 
lections can be answered without reading the 
selections. Although the questions themselves 
require reading, this is faulty technique. The 
reading difficulty or problem should be integrally 
related to reading the selection, especially in a 
test whose analytical use is stressed. The word 
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classification exercises in the language tests аге 
especially commendable in that the words are 
presented in sentences. Classification of words 
thus presented reinforces the concept that a 
given word may be different parts of speech de- 
pending on its use. The arithmetic tests possess 
the virtues and limitations of the elementary 
tests in arithmetic. It is a question whether the 
algebra items introduced at this level are par- 
ticularly useful for analyses since measurement 
of competence in algebraic manipulation and 
problem solving would require separate sections 
as long as the sections on corresponding arith- 
metic operations. Ability to interpret and use 
charts, tables, and graphs might well have been 
required at this level. 

ADVANCED BATTERY. If these tests are used 
only to the end of high school, they have ade- 
quate ceiling and plenty of range above chance 
Scores to measure poor learners. At junior col- 
lege levels they provide less margin at the top 
to measure superior students. In all other re- 
spects these tests share the strengths and limi- 
tations of the corresponding measures in the — 
intermediate battery. i 

SUMMARY. The California Achievement Tests 
may be recommended except at the primary level 
for their content and scope and for their instruc- 
tional as well as administrative and guidance 
uses. The emphasis on continuity of growth in 
basic skills over a wide grade range is another 
desirable feature. The method of constructing — 
the norms renders them simpler to interpret for 
classroom use, but prevents rigorous compara- 
tive interpretations of individual and group 
scores for administrative, guidance, and reseatch 
purposes. 


Atvin W. ScuiNpLEn, Professor of Education, 
University of Maryland, College Park, Mary- 
land, 

These tests are essentially survey tests in read- 
ing, arithmetic, and language. They may be used 
to determine achievement, grade placement, and 
percentile rank of pupils in relation to the gen- | 
eral school population; for these purposes the ~ 
tests have merit. The extensive claims which the 
authors make for the tests as diagnostic instru- 
ments are not well founded. True enough, the 
Scores reveal whether or not a pupil is achieving 
at his grade level. They may also reveal in what | 
respects and to what extent the pupil is achiev- 
ing above or below his grade. In this sense the 
tests have some general values in the initial | 
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stages of diagnosis. However, the number of 
items on a given skill is too small to provide re- 
liable information on specific difficulties and 
needs of a pupil or class. A tabulation of scores, 
as suggested in the manual, would be of some 
value ; nevertheless, the tabulated data for three 
items on Roman numerals or for two items on 
carrying in addition would be less reliable than 
the information which the teacher may gain by 
observing her pupils at work in everyday situa- 
tions. The test results are of some value in indi- 
cating areas toward which observation should 
be directed in order to discover difficulties and 
needs. 

The “diagnostic analysis of learning difficul- 
ties," as presented in each test booklet, provides 
an excellent overview of the types of skills which 
the battery samples. As the manuals suggest, 
this may be used as a guide in making tabulations 
for diagnostic purposes; but it is not truly a 
"diagnostic analysis of learning difficulties." 

The general pattern of the "diagnostic pro- 
file" has many commendable features. There 
should, however, be more space for the chart: 
the numbers and the spaces between them are 
too small. These defects could be lessened with 
changes which are possible without using a 
larger sheet. Instead of short white lines to as- 
sist in noting grade placement, the white lines 
might be solid from top to bottom. 

The capitalization and punctuation tests for 
the primary grades are difficult to score and the 
data which they provide are not extremely im- 
portant for pupils in grades 1 and 2. The pos- 
sible score of 15 has little potentiality for either 
survey or diagnostic purposes. The language 
test, with the exception of spelling, might well be 
eliminated from the primary battery. The “me- 
chanics of English and grammar" tests provide 
a meager sampling of language skills. 

A glance at the diagnostic profile for the 
primary grades also reveals that the reading vo- 
cabulary subtest is the only part of the reading 
test with which the average pupil in grade 1 can 
cope with any degree of success: in fact, the 
profile indicates that the primary battery as a 
whole is of little value for grade 1. 

In the elementary, intermediate, and advanced 
examinations certain subtests do not permit pu- 
pils of superior ability and achievement to reveal 
fully their attainments. Accordingly, when the 
Scores of such pupils are compared with their 
Scores on the California Test of Mental Maturity 
the results tend to show unrealistic retardation. 


An examination of the test materials and ob- 
servation of the diagnostic profile reveal that the 
materials in some subtests are not properly dis- 
tributed in terms of difficulty. 

The tests obviously fail to measure some im- 
portant reading and arithmetic skills. Further- 
more, the sampling of items on a given skill is 
usually meager. This shortcoming should be re- 
membered when the test results are used to 
evaluate achievement within a school. The arith- 


-metic tests, especially in the batteries above the 


primary level, emphasize the getting-the-right- 
answer aspect of arithmetic. There 15 evidently 
quite a gap between the goals measured by these 
tests and the goals currently emphasized in the 
theory on teaching arithmetic. 

The time allowed for the arithmetic tests in 
the primary battery is slightly more than the 
combined time for reading and language; in the 
elementary battery the time for arithmetic is 
equal to that for reading and language com- 
bined. This seems unjustifiable, especially at the 
primary level. The reading test should have more 
material to provide a better sampling of basic 
reading skills. The total testing time for the 
three tests in each battery is reasonable, rang- 
ing from 95 minutes for the primary battery to 
152 minutes for the intermediate battery. 

The manuals, including the various norms and 
related explanations, deserve more praise than 
the tests themselves. The directions on adminis- 
tering, scoring, and tabulating are adequate. The 
suggestions on interpreting and using the re- 
sults are helpful, although somewhat ambitious 
in their claims. | 

The California Achievement Tests have two 
rather significant advantages. First, the four bat- 
teries provide a testing program for grades 1 to 
14, inclusive. Secondly, since these tests are ac- 
companied by the California Test of Mental Ma- 
turity, comparisons between the results on the 
two tests make possible a more helpful evalua- 
tion of achievement. The tests may be used with 
better than average advantage in spite of weak- 
nesses mentioned above. " 


J. Hartan Ѕнокеѕ, Professor of Education, 
University of Illinois, Urbana, Illinois. 
TECHNICAL ASPECTS AND DIAGNOSTIC VALUES. 
From several technical standpoints these tests 
are to be highly commended. More than fifty 
thousand cases formed the basis of the standardi- 
zation at each test level. Norms are based upon 
approximately 10 times this number of students 
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drawn from school districts in 20 states. While 
this latter fact is impressive with respect to 
representativeness, it is also a disadvantage in 
that norms representative of such an extensive 
and heterogeneous sampling of areas and school 
districts do not indicate normal expectancy for 
schools of any particular kind. 

Reliability coefficients of the separate tests of 
reading, language, and arithmetic (mathematics 
at the advanced level) at each test level are above 
:90 in each instance. The part tests (Reading 
Comprehension, Vocabulary, Arithmetic Rea- 
soning, etc.) continue to be highly reliable. Of the 
36 major tests in the complete battery, 25 have 
reliability coefficients in the .go’s and none falls 
below .83. While a reliability coefficient of .83 
is generally satisfactory, it is somewhat unfor- 
tunate that this lowest coefficient is found with 
Reading Comprehension (Intermediate Level). 
Published tests of reading comprehension usu- 
ally achieve higher reliability coefficients, as is 
the case for the other reading tests in this bat- 
tery. 

Only the highest admiration can be had for 
the manner in which the California Achievement 
Tests care for the mechanics of testing. Manuals 
are clearly and simply written, and the tests are 
easily administered and scored. A recent com- 
bination answer sheet and diagnostic profile in- 
novation called "scoreze," available at all levels 
except the primary, enables a considerable sav- 
ing of hand scoring and recording time. Print- 
ing, packaging, quality of paper, organization of 
tests and items, and other such physical features 
are excellent. 

Sections of the manuals dealing with interpre- 
tation of results offer many helpful suggestions. 

. The authors are generally careful to protect 
themselves in the manuals against overstate- 
ments concerning desirable uses to which the 
results can or should be put. However, the claim 
that these tests have any reliable diagnostic value 
is misleading. Although this claim is made only 
by implication and is shared by nearly all other 
authors of general achievement tests, it is never- 
theless unfounded. The test user is told in 
guarded language in each manual that the scores 
on the separate sections of each test are not re- 
liable, that is, that they “should be used only as 
guides to indicate the presence of student diffi- 
culties." Reliability coefficients on these subtests 
are not even published. Yet the “diagnostic pro- 
file" includes these unreliable subtest headings 

‚ with space for recording these scores and pro- 


files. The language encouraging the use of these 
data is not so guarded. The manual for the Cali- 
fornia Reading Test, Intermediate Level, read; 
as follows: “The profile graphically illustrates 
the achievement of the student in terms of grade 
placement in Reading Vocabulary and Reading 
Comprehension, and in the seven sections which 
comprise the two tests." (Italics mine.) The {аф 
is that the subtests lack sufficient reliability to 
be used in the diagnostic profile. Furthermore, i 
these subtests were removed from the profile, 
the battery would have little diagnostic value 
This criticism, of course, does not indicate that 
the battery is inadequate as a test of general 
achievement in the skill areas. 

VALIDITY, AND USEFULNESS. The test manuals 
state that the test items in the original 1933 edi- 
tion of the battery were selected only “after care- 
ful study of the most modern city and state 
courses of study." Although items have been 
changed and standardization procedures re 
peated in four later revisions, there is no indica: 
tion that any systematic referral to objectives 
has been made since the 1933 edition. If this isa 
fact and not an omission in the manual, the bat- 
tery might be a more valid test of the reading, 
arithmetic, and language objectives of the 1951 
School if it were rebuilt in terms of more recent 
conceptions of both the nature of these ski 
processes and the generally agreed upon obje 
tives in each. 

Parts of several of the tests of reading compre 
hension may be answered from common general 
knowledge without recourse to the reading pas 
sage. In situations of this sort, it is difficult 
know what is being measured by a reading con 
prehension test. 

One gains the general impression from an ex 
amination of the California Achievement Tests 
or of nearly any other of the widely used general 
achievement tests, that testmakers are much 
more concerned with improving the technicali 
ties of testing than they are with improving tech- 
niques of evaluating the objectives of the public 
schools. Such an observation does not deny the 
importance of proficiency in reading, language, 
and arithmetic as instructional goals. Instead it 
suggests that only. the easily measurable surface 
aspects of these processes are being measured: 
It suggests further that it might be a wiser cours 
for educators and test authors to satisfy them 
selves with less accurate measures of more 
worthwhile aspects of the skill processes. 

Examples of this general failure to select and 
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measure the most useful, desirable, or needed 
aspects of the skill processes are plentiful in the 
reading tests. Reading is now recognized as a 
process involving thought about printed ma- 
terial. Furthermore, the kind and direction of 
this thought is determined, at least in part, by 
what it is that the reader intends to get from 
the passage—the reader's purpose. This and 
other tests of reading largely ignore the kinds 
of thought required to satisfy specific purposes 
of readers pursuing work-type reading. In fact 
the testee finds his purpose only when he gets 
to the questions following the passage. He may 
then find that he was expected to recall specific 
facts from story-type material that seemed to 
be written only to be appreciated. 

However, even if the processes to be meas- 
ured were much more carefully analyzed for the 
elements most significant to educational objec- 
tives, one further caution is in order. The basic 
elements of these processes (reading, arithme- 
tie, language) as analyzed by skill specialists 
may indicate the form these processes assume 
in perfect adult form. These same processes may 
take a different form in immature development. 
For example, to know the steps in a process of 
critical reading as this process is carried on by 
adults who are mature and disciplined in their 
reading and thought processes indicates little 
about what should be measured to determine 
whether children are showing desirable growth 
and development in these processes and disci- 
plines. 

SUMMARY. The California Achievement Tests 
are useful for a general survey of those aspects 
of reading, arithmetic, and language commonly 
measured by tests of general achievement. 
Within this framework they are probably as ac- 
curate and well constructed as other widely used 
achievement batteries. They have no equal for 
ease of administration, scoring and recording of 
data. Use of the diagnostic profile including sub- 
test scores with very doubtful reliability is to be 
discouraged. Such advantages as breadth of 
Scope and relatively short testing time cause 
tests of general achievement to be quite inade- 
quate for diagnostic purposes. The fact that 
these tests probably do not adequately measure 
many of the most crucial objectives sought in 
the skill areas by modern public schools is a de- 
fect shared by all general achievement tests. Im- 
provement in this type of logical validity and 
utility is much more needed than further per- 
fection of existing measures. 
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For reviews by Gerald V. Lannholm and 
Robert C. Pooley of the language test, see 151; 
for a review of the arithmetic test by Robert L. 
Burch, see 411; for reviews by John C. Flanagan 
and James R. Hobson of the reading test, see 
530; for a review by Paul A. Witty of the 1943 
edition, see 3:15; for reviews by C. W. Odell 
and Hugh B. Wood of an earlier edition, see 
40:1193; for reviews by Harry A. Greene and 
J. Paul Leonard of an earlier edition of the 
language test, see 40:1292; for reviews by 
C. L. Thiele and Harry Grove Wheat of an 
earlier edition of the arithmetic test, see 40:1459; 
for a review by Frederick B. Davis of an earlier 
edition of the reading test, see 40:1563; for a 
review by D. Welty Lefever of an earlier edition 
and an excerpt from a review, see 38:876; for a 
review by W. A. Brownell of an earlier edition 
of the arithmetic test, see 38:893; for reviews by 
Ivan A. Booker and Joseph C. Dewey of an 
earlier edition of the reading test, see 38:1110. 


[3] 
*Contemporary Affairs: Every Pupil Test. 
Grades 7-12; 1939-51; new form usually published 
each April and December; form December 1951; no 
data on reliability and validity; no manual; norms 
(^51) ; 2126 per test; 1€ per answer key; postpaid; 40 
(45) minutes; Ohio Scholarship Tests, Ohio State 
Department of Education. * 


[4] 

*Cooperative Contemporary Affairs Test for 
College Students. College; 1938-51; 4 scores: pub- 
lic affairs, science and medicine, literature and fine 
arts, total; new form published annually; IBM; Form 
1951; no data on reliability and validity; no specific 
manual; norms compiled after publication of test; gen- 
eral Cooperative manual ('51) ; separate answer sheets 
must be used; $2.50 per 25; 80¢ per 25 IBM answer 
sheets; 156 per scoring stencil; cash orders postpaid; 
50€ per specimen set, postpaid; 60(70) minutes; Form 
1951: E. F. Lindquist, Robert L. Ebel, John Haefner, ` 
Victor Harris, Jack T. Johnson, William Porter, Wen- 
dell Smith, and Leo Sweeney; Cooperative Test Divi- 
‘sion, Educational Testing Service. * 
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BENJAMIN S. Broom, College Examiner, The 
University of Chicago, Chicago, Illinois. [Re- 
view of Form 1951] 

It was this reviewer’s dubious pleasure to 
review the Cooperative General Culture Test in 
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The Third Mental Measurements Yearbook. 
For some reason, known only to the editor of 
this yearbook, I am slowly and critically proceed- 
ing through the National College Sophomore 
testing program. The present test is the second 
of the recommended and widely used parts of this 
battery. Since these tests are administered to 
an average of 160 colleges and 20,000 students 
annually, they have a considerable effect on both 
the individual students as well as the institutions 
involved. Although it may be claimed or desired 
that tests should not influence the curriculum, 
it is impossible for tests to be really neutral. The 
very number of individuals and schools involved, 
the content of the tests, and the almost insep- 
arable relation between functioning educational 
objectives and tested objectives result in stand- 
ardized tests serving to stimulate or retard think- 
ing about educational problems. 

The charge that the Cooperative General Cul- 
ture Tests were the same from year to year can 
not be levelled at this test. Many of the earlier 
forms of this test required little more than an 
acquaintance with specific contemporary events, 
persons, and places. As such, they provided an 
index of the extent to which the examinees read 
the newspapers or news journals, but gave little 
evidence of what the present test is intended to 
measure—"the student's understanding of re- 

‚ cent social, scientific and cultural developments, 
and the extent to which he is making sound and 
critical observations about developments in our 
contemporary civilization." 

‚х Part I, the Public Affairs section of this test, 
is especially good in that at least one half of the 
questions require the student to recognize re- 

‘lationships among events as well as to deter- 
mine the immediate cause, effect, or explanation 
of some happening. The questions are about sig- 

© nificant events in that they affect the lives of 

“most of us in relatively direct ways. Part П, 

Science and Medicine, and Part III, Literature 

and Fine Arts, are somewhat improved over 

‚ some of the previous forms, but the questions do 

not appear to be as penetrating and searching as 
those in the first part. 

‘About all that a reviewer can do at this point 
is to urge the test writers to move still further 
in developing questions which measure their 
stated objectives. Such questions must do more 
than determine that the student reads about 
current affairs. They must measure the extent 
to which he is able to discern relationships among 
events, whether he understands the background 


* 


and the context in which events occur, and 
whether he can foresee the consequences of par- 


do little more than give accounts of particular 
happenings, with little or no background and 
with little attempt to relate events which are 
separated by time or space. Perhaps, however, 
the test writers might refer to the New York 
Times News of the Week section for a model 
of the kind of understanding the informed citi 
zen really needs if the world about him is to be 
more than a quick-moving kaleidoscope. 


Јонх V. McQuirry, University Examiner, 
University of Florida, Gainesville, Florida. [Re 
view of Form 1951.] і 
This one-hour, 120-item test covers Public 
Affairs, Science and Medicine, and Literature 
and Fine Arts. It is one of the basic tests offered 
in the National College Sophomore Testing Pro 
gram, the National College Freshman Testing 
Program, and the College Preparatory Testing 
Program. The fact that this test is used in na 
tional programs furnishes considerable norma: 
tive and other comparative data which might not 
otherwise be available. 1 
In 1946 the content of the test items was 
shifted to require ability to comprehend and in 
terpret news rather than to remember headlines 
This reviewer is very much in sympathy wi 
this shift. In using items requiring comprehen- 
sion and interpretation, care must be exercised 
to prevent the items from being so complex that 
a specialist's understanding of the subject cov- 
ered by the item is demanded. The authors of 
the test have selected a happy medium in this 
regard. Any person who has carefully read the 
material covered by the item should be able to 
handle it. The level of difficulty does not seem 
to be above that of the average college sopho- 
more ; and the test is very suitable for use with 
college freshmen. 
Ina test as general as one titled Contemporary 
Affairs the authors are confronted with the baf 
fling problem of content. One might have a much 
longer test and still be forced to omit very im- 
portant phases which might properly be in- 
volved. In deciding to use this test, school or col- 
lege personnel might well ask, “From what 
sources will the students be able to obtain the 
information needed to answer the items on the 
test?” It seems to the reviewer that all бо items 
on Public Affairs might be answered through 
knowledge gained from any опе of several 
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sources: (a) radio commentators; (b) daily 
newspapers; (c) news magazines. The radio 
would probably serve as a source for answering 
half of the items on science and medicine; the 
newspaper would likely be a somewhat better 
source; and news magazines probably would 
cover most of the items in the section. For the 
items on literature and fine arts, sources would 
seem much more restricted. It is doubtful 
whether any one of the three sources previously 
mentioned would be entirely adequate, although 
all three together might be. Undoubtedly the best 
overall source would be a course in contemporary 
affairs. Students who are taking such courses 
would seem to have a decided advantage. 

The content of the examination is very de- 
cidedly American. For example, of the 60 ques- 
tions on public affairs, 51 involve the United 
States either at home or abroad. In fact, many 
of the items are so written that one must assume 
that they apply to the United States to answer 
them. In the other two sections about one half 
of the items center on the United States and its 
citizens. 

The period of time covered by the items varies 
from section to section. The items on public af- 
fairs are the most restricted in time, being largely 
concerned with the year 1951. About half the 
items in each of the other two parts cover a con- 
siderably longer period, and: might well be an- 
swered correctly by persons who had gained 
their information prior to 1951. 

Although the authors are undoubtedly aware 
of the importance of time designation in this test, 
it might be well to caution them to be even more 
precise than at present. For example, a ques- 
tion such as “What has happened recently to the 

Commission?" might have one answer 
today and a different one a few days hence. The 
authors might also be cautioned to exercise care 
that the information ih some items does not sup- 
ply the answers to other items. This is true of 
at least two of the items in the section Public 
Affairs. р 

The editor might be chided slightly for say- 
ing in the general directions that the examina- 
tion "requires 75 minutes of working time" 
when actually but 60 minutes are required. 

No information is furnished test purchasers 
regarding validity and reliability. The test con- 
tent is probably carefully enough selected and 
the authors are expert enough in test construc- 
tion that sufficient reliability can be expected. 
The matter of validity is something else, and the 


authors and publishers should make every pos- 
sible effort to secure and provide for their users 
data on validity. 

SUMMARY. New and comparable forms of this 
test are constructed annually to provide meas- 
ures of understanding of current affairs at a 
moderate level of insight in the specific areas in- 
volved. Since the test is widely used, the pub- 
lishers are able to provide unusually adequate 
norms and other comparative data. The use of 
the test is recommended where reliable informa- 
tion is desired concerning an examinee’s knowl- 
edge of current affairs at the level of the informed 
citizen of today rather than at the level of the 
college scholar. 


For a review by Н. T. Morse of Form 1946, 
see 3:2; for arevie by Ralph W. Tyler of Form 
1939, see 40:1182; for areview by Paul M. Lim- 
bert of Form 1937, see 38:948. 


[51 
*Cooperative General Achievement Tests, Re- 
vised Series. Grades 10-12 and college entrants; 
1940-51; 3 scores: terms and concepts, comprehension 
and interpretation, total; IBM ; 3 tests (also listed sep- 
arately) ; Forms X (47), Y (48), Z (51); Forms 
OR-T out of print; no norms for part scores; no spe- 
cific manual; general battery folder (’51); general 
Cooperative manual ('51) ; norms ['41]; $2.50 per 25 
of any one test; Form Z available only to colleges; 50¢ 
per specimen set of any one test, postpaid; separate 
answer sheets may be used; 80 per 25 IBM answer 
sheets of any one test; 156 per stencil for scoring ап-+ 
swer sheets of any one test; cash orders postpaid; 


40(45) minutes per test; Cooperative Test Division, А 


Educational Testing Service. * 
а) TEST 1, A TEST OF GENERAL PROFICIENCY IN THE 
FIELD OF SOCIAL STUDIES. Jeanne M. Bradford (X) and 
Elaine Forsyth Cook (Y,Z). ^ 
b) TEST П, A TEST OF GENERAL PROFICIENCY IN TH 
FIELD OF NATURAL SCIENCE. Paul J. Burke (X), Carl 
A. Pearson (Y), and John G. Zimmerman (Z). " 
C) TEST III, A. TEST OF GENERAL PROFICIENCY IN THE 
FIELD OF MATHEMATICs. Paul J. Burke (X) and Ber- 
nice Orshansky (Y,Z). 

А 
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PauL L. Dresset, Chairman, Board of Exam- 
iners, Michigan State College, East Lansing, 
Michigan. [Review of Forms T, X, and Y.] 

PURPOSES OF THE TESTS. A 4-page leaflet dated 
1940 entitled The Cooperative General Achieve- 
ment Tests (Revised Series): Information Con- 
cerning Their Construction, Interpretation and 
Use appears still to be the most up-to-date state- 
ment of the rationale of these three tests: 


The revised series.., beginning with Form QR, 
were developed to provide measures of general pro- 
ficiency in the fields...as contrasted with the mastery 
of the more specific content of separate courses. * items 
requiring knowledge of descriptions, explanations, or 
events are specifically avoided. The emphasis is placed 
to as great an extent as possible on the broader objec- 
tives at which instruction in each of the various fields 
is aimed * [The tests] indicate the proficiency of the 
student in dealing with typical practical and academic 
materials in a broad field rather than knowledge of the 
body of facts included in a particular textbook. 


Regarding possible use of the tests, a number 
of statements are made: 


The Cooperative General Achievement Tests may be 
used as survey tests at the end of the school year * Re- 
peated administrations of comparable forms permit the 
School to make accurate estimates of the progress of 
the various groups. These results are also valuable in 
prognosis and placement and....ability grouping. Guid- 
ance officers should find total scores on the tests use- 
ful in recommending future courses of study. * a stu- 
dent's performance on the two sections should indicate 
whether his greatest weakness lies in lack of knowl- 
edge of vocabulary or in ability to comprehend mate- 
rials in the field. * The test is especially useful in 
placement of freshmen entering college. * [The tests] 
provide indications of student ability which are some- 
pat ерсе} of the specific courses taken in high 
school. 


In order to appraise the extent to which the 
tests fulfill the descriptions and purposes just 
stated, it is desirable to consider separately the 
tests in each area and then to examine the entire 
set of tests on the basis of certain common qual- 
ities and objectives. Each of the various forms 
.of the three tests follows the same pattern of 
division into two parts: the first is largely a 
vocabulary test which attempts to test under- 
standing rather than rote memory, and the sec- 
ond attempts to test the ability to comprehend 
and interpret paragraphs or materials adapted 
from newspapers, magazines, or textbooks. The 
reviewer found it desirable to work through each 
test in its entirety both to check on the items and 


to determine by introspection the actual tasks 
presented by the items, 


SOCIAL STUDIES. For the social studies test, the 
stated objectives correspond rather well with 
those commonly subsumed under the term criti- 
cal thinking. The vocabulary section is, of 
course, a test of knowledge. As might be ex- 
pected, Form T, published in 1942, involves 
some outdated items, such as those dealing with 
the CCC and "the present war." Form Y like- 
wise includes an item dealing with groups cov- 
ered by an out-of-date version of the Federal 
Board Security Act. These items suggest the 
need for new forms or continual revision of old 
forms if tests of this type in the social studies 
field are to be entirely satisfactory and still be 
current. 

Despite attempts at avoidance, rote associa- | 
tion of terms crops up reasonably frequently in 
the vocabulary section. As an example, note 
Item 18, Part I, Form T wherein the stem and 
correct answer combined reads, “An ideal state _ 
where social conditions are perfect is called a ` 
utopia." The probable pat association of ideal 
and utopia would seem to render the item in- 
significant. 

The range of vocabulary in the various forms 
is tremendous. The basis for selection of the ` 
terms should be clarified for the sake of the test 
user. The reviewer suggests that a more mean- 
ingful vocabulary section would be obtained by 
testing in Part I for the meaning of key words 
in the reading selections given in Part II. This 
association of the parts would provide a much 
clearer basis for interpreting differential per- 
formance. 

Only one actually unsatisfactory item was 
found in the vocabulary section of the three 
forms. Item 43, Part II, Form Y speaks of the 
daily range of temperature exceeding the an- 
nual range of temperature. Except for some 
restricted special definition of annual range of | 
temperature, apparently not generally known to | 
professors or weather bureau employees, the 
item is meaningless. 

It is the reviewer's impression that the items ` 
of Part II of the various forms were not written | 
to any carefully established blueprint insuring 
the testing in balanced fashion of all the objec- 
tives stated. Form T, which probably should be | 
withdrawn from circulation in any case, seems 
particularly weak on this score, appearing to | 
test only comprehension and an occasional draw- 
ing of an inference. Forms X and Y, while better | 
balanced, still lean very heavily to the usual read- | 
ing comprehension type of item, More can and 
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should be done in testing other of the stated ob- 
jectives of critical thinking. 

NATURAL SCIENCES. Test II in its various 
forms is open to many of the same criticisms 
made of the social studies test. Vocabulary items 
vary from testing the fact that the Adam's apple 
is another name for the larynx to selecting that 
one of the animals which has a chitinous exo- 
skeleton and jointed appendages. Again, the re- 
lationship between the objectives stated for Part 
II and the actual items does not seem to have 
been carefully planned. Form X contains many 
items of simple factual nature which appear to be 
almost independent of the material presented. 
In Form Y there is more evidence of an attempt 
to test ability to draw inferences and to make 
generalizations. The tests are weighted in the 
direction of the physical sciences, probably con- 
sciously, although this is not so stated. Items, in 
general, are well written and have no technical 
flaws. 

MATHEMATICS. The various forms of Test III 
show a rather marked cleavage between Parts I 


and II. Part I in each case deals with mathe- - 


matical vocabulary ranging in terms and con- 
cepts from simple arithmetic to trigonometry, 
solid geometry, and advanced algebra. Part II 
items are much more practical and realistic in 
terms of "proficiency" than much of the material 
involved in Part I: they utilize, in general, only 
the simpler arithmetical concepts and depend 
heavily on the use of tabular and graphical ma- 
terial. It is unfortunate that more is not done to 
get at the essential nature of mathematics and 
of mathematical reasoning. It is the reviewer's 
opinion that this aspect of mathematics—too lit- 
tle emphasized and difficult to test as it is—has 
as much general applicability as has the reading 
and interpretation of graphical, tabular and other 
quantitative materials. Items 17-23 in Form X, 
which are built around the fundamental laws of 
arithmetic, exemplify a step in the direction of 
such testing. E 

RELIABILITY. Information on the reliability of 
all of these tests is provided by showing on the 
scoring key the standard error of measurement 
of a scaled score. While impossible to read ac- 
curately in this form of presentation, the size of 
this error provides assurance of adequate relia- 
bility. From this and other data available the 
reviewer estimated that the usual reliability co- 
efficients—which would in this case be correla- 
tions between equivalent forms—would be in the 
neighborhood of .go. 


VALIDITY. Essentially no information on va- 
lidity, however defined, is available on any of 
these tests. Although objectives measured by the 
tests are stated, there is no evidence that any 
definite scheme was followed to insure adequate 
sampling of these objectives. Despite suggestions 
that the tests may be used for prediction, for 
placement, for studies of progress, and for rec- 
ommending future courses of study, no correla- 
tions with high school or college grades are re- 
ported. Other than class means, no evidence is 
reported on the amount of change which has 
been noted or which might be reasonable in a 
group of students over a period of a year or 
more. 

Intercorrelations of the tests and parts of 
tests are reported for the earlier Form OR, thus 
providing some data upon which judgments 
might be made about relative performance on 
different tests or on part scores of a test. How- 
ever, the use of the tests in comparing vocabu- 
lary knowledge and interpretation skills is rather 
effectively discouraged by the fact that only one 
Score is provided for on each of the tests in 
mathematics. 

SUMMARY. The type of testing represented by 
these instruments is desirable and consistent 
with the present concern about general educa- 
tion objectives. However, there is considerable 
doubt that prediction, placement, or advice on 
selection of courses is sufficiently consistent with 
measurement of the outcomes of general educa- 
tion for one instrument to function in all these 
ways. It would seem desirable to reconsider the 
function of these tests, to make that function and 
the associated objectives specific, and then to 
develop the tests around a well considered pat- 
tern to insure adequate coverage of the objec- 
tives. Although a specific recommendation pre- 
ceding such rethinking is somewhat out of ` 
order, it seems to the reviewer that it would be 
preferable to relate the vocabulary section more 
closely to the interpretation section and then to 
provide for two scores which might enable the 
teacher to discern discrepancies between knowl- 
edge of terminology and ability to use these in 
context or in problem situations. Furthermore, 
if the tests are to measure development, norms 
of this kind should be provided in addition to the 
customary status norms. Finally, should the tests 
be oriented toward providing an index of gen- 
eral education outcomes, a similar test covering 
the broad field of the humanities should be de- 
veloped. 
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For a review by John V. McQuitty of Forms 
S and T, see 3:3; for a review by Harry D. 
Berg of Test r, Forms S and T, see 3:596; for 
a review by Palmer O. Johnson of Test 2, Forms 
S and T, see 3:548; for a review by John F. Ran- 
dolph of Test 3, Forms S and T, see 3:316. 
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*Cooperative General Culture Test. College; 
1930-51; 6 scores: history and social studies, litera- 
ture, science, fine arts, mathematics, total; Form XX 
(’51—a revision of Form X); Forms 1930, 1933-37, 
O-U, W-Y out of print ; no data on reliability and valid- 
ity; no specific manual; no description of normative 
population; norms ('si—sophomores); general Co- 
operative manual ('51) ; $3.90 per ei separate answer 
sheets must be used; $1.70 per 25 IBM answer sheets ; 
45¢ per set of scoring stencils; cash orders postpaid; 
50@ per specimen set, postpaid; 150(160) minutes; 
Norman J. Blair, Jeanne M. Bradford, Miriam M. 
Bryan, Paul J. Burke, and Herbert Danzer; Coopera- 
tive Test Division, Educational Testing Service. * 
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Jouw V. McQuirry, University Examiner, 

` University of Florida, Gainesville, Florida. [Re- 
‚ view of Forms Y and XX.] 
"This is one of the three basic tests offered in 
the National College Sophomore Testing Pro- 
gram. Form Y was used in the 1950 program 
and Form XX in the 1951 program. The test is 
also available for use in connection with the Na- 
tional College Freshman Testing Program, al- 
though it is not included in the basic battery. 
The fact that this test is used in a program of 
the scope of the Sophomore Program provides 
sources of normative and other comparative data 
that might not otherwise be available. 
The test is designed to provide a profile of the 
wd background in several important areas 
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of the college curriculum. In Form Y, the last 
of several parallel forms, the areas tested include 
current social problems, history and social stud- 
ies, literature, science, fine arts, and mathemat- 
ics. In Form XX, a revision of earlier Form X, 
the section dealing with current social problems 
has been omitted. This is a sensible revision, 
since the test is usually administered in conjunc- 
tion with the Cooperative Contemporary Affairs 
Test for College Students, thus making for a 
considerable degree of overlap in this particular 
area. 

The test has enjoyed wide usage, and its suc- 
cessive forms have emerged with amazing paral- 
lelism in spite of critical reviews. Apparently 
there is considerable divergence of opinion be- 
tween authors and reviewers. А very thorough 
four-page review of Form W by Benjamin S. 
Bloom appeared in The Third Mental Measure- 
ments Yearbook. 'That review and earlier ones 
leave the present writer but little opportunity for ^ 
new comments or approaches. A possible proce- ` 
dure is to investigate the effects, if any, on ` 
Forms Y and XX of Bloom's review. | 

In his review Bloom offered the perennial 
complaint of reviewers that while the test has 
apparently been carefully made, it is little more 
than a miscellaneous collection of items and in- 
formation and that course boundaries are not - 
completely disregarded, in spite of the publish- ^ 
er's claims to the contrary. This complaint ap- 
plies as much to Forms Y and XX as to the 
earlier forms. It seems extremely doubtíul if 
items dealing with “Who did this?" “Who said - 
this?" “What is this?" and “With what is this ^ 
concerned?” can add up to anything more than ^ 
a measure of knowledge of facts. Items dealing ^ 
with “What brought this about?" “What is the ` 
contribution of this?" “How does this differ | 
from or agree with that?" “Why was this more 
important than that?” “Where will this trend 
lead?” and “What have we learned from this?” 
would at least provide a-better measure of un- 
derstanding in the several areas covered by the 
test. І 

But, Bloom wrote, “It is not the test, in itself, 
but the ways in which the test is used and the 
possible effect of its use on educational institu- 
tions, instructors, and students that cause one 
to be alarmed.” This reviewer agrees with Bloom 
that the wide use of the test year after year would 
seem to indicate that a large number of college 
administrators and instructors may be influ- 
enced to accept the definition of general culture 
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as exemplified by the specific test items employed 
and "may even, consciously or not, organize 
some of their instruction in such a way that their 
students may do reasonably well on the test" ; 
and that, worse still, since college faculties evi- 
dently approve the test, students are likely “to 
accept the definition of general culture provided 
by the test and to guide their study accordingly.” 
This would indeed be tragic—general culture 
is far too broad ever to be tested adequately 
in 215 or 3 hours with any such collection of 
items. 

In his review also, Bloom contrasted the con- 
tent of the Cooperative General Culture Test 
with the courses proposed in A Design for Gen- 
eral Education * and found a number of gaps in 
the test, largely in the areas of social adjustment, 
marriage and the family, vocational orientation, 
philosophy and religion, and personal and com- 
munity health. These gaps still exist. 

The writer has seen no studies giving the re- 
liability and validity of the test. The test length 
and the care with which it has been prepared 
should insure a somewhat respectable reliability. 

‚ As indicated by the comments above, the re- 
viewer would question seriously the validity of 
the test, even though, according to the publisher, 
the content “has been determined by the con- 
sensus of a number of scholars in various fields.” 

SUMMARY. This is a very carefully prepared 
test which should have sufficiently high relia- 
bility to afford individual measurement. The 
test is widely used and extensive norms are avail- 
able. Where measures of a student's acquaint- 
ance with a wide area of information (largely 
academic) are desired, the test can be recom- 
mended. The test can scarcely be said to encom- 
pass general culture, to measure cultural aware- 
ness, to assess the ability to make cultural 
interpretations, or to fathom depth of cultural 
understanding. 

Reviewers of this test have consistently 
pointed out that the test is a miscellaneous com- 
pilation of items, largely factual in nature. The 
authors of the test have, with equal consistency, 
held to their original basic concepts. 


For reviews by Benjamin S. Bloom and H. T. 
Morse of Form W, see 3:4; for reviews by La- 
vone А. Hanna, Edward S. Jones, and Hilda 


1 American Council on Education, Committee on Design for 
General Education. T. R. McConnell, Chairman. 4 Design for 
General Education: For Members of the Armed Forces. Fore- 
word by George F. Zook. American Council on Education Stud- 
les, Vol. 7, Series т, No. 18. Washington, D.C.: American Coun- 
cil on Education, June 1944. Pp. vii, 186. Paper. * 


Taba of Form P, sce 40:1184; for a review by 
F. S. Beers of Form 1936, see 38:871. 
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*Cooperative Test on Recent Social and Sci- 
entific Developments. Grades 9-12; 1946-51; 3 
scores: social-economic-political, scientific-technical- 
medical, total; IBM; Form 1951; no data on reliabil- 
ity and validity; manual (’51) ; norms ('50); 40(50) 
minutes; E. F. Lindquist, Robert L. Ebel, John Haef- 
ner, Jack T. Johnson, and Wendell Smith; Coopera- 
tive Test Division, Educational Testing Service. [Be- 
ginning with the 1952 form, future tests in this series 
will be published as the Jowa Test of Understanding of 
гона» Affairs by Science Research Associates, 
nc. 


REFERENCES 
1. ANDERSON, Howard К. “A Nationwide High-School Test- 
ing Program,” pp. 65-71. (PA 22:3208) In National Projects 
in Educational Measurement: A Report of the 1946 Invitational 
Conference on Testing Problems, New York City, November 2, 
1946, Herschel T. Manuel, Chairman. Edited by К. W, Vaughn. 
Sponsored by the Committee on Measurement and Guidance. 


American Council on Education Studies, Vol. 11; Series 1, 


Reports of Committees and Conferences, No. 28. Washington, 
D.C.: the Council, August 1947. Pp. vii, 80, Paper. * 

2. CocHRAN, SAMUEL W., AND Б Freverick В. “Pre- 
dicting Freshman Grades at George Peabody College for Teach- 
ers.” Peabody J Ed 27:352-6 My ’so. * 

Rocer T. Lennon, Director, Division of Test 
Research and Service, World Book Company, 
Yonkers, New York. [Review of Form 1951] 

Each spring, from 1946 through 1951, the 
Cooperative Test Division sponsored the Na- 
tionwide High School Testing Program involv- 
ing the administration of an annually prepared 
form of the Cooperative Test of Recent Social 
and Scientific Developments. The purpose of the 
program was to help schools evaluate the prog- 
ress made by students toward a knowledge and 
understanding of world events, and to stimulate 
more critical study of contemporary social and 
scientific developments. Following the national 
program each year the test was made available 
for general purchase. 

The test consists of 75 four-option multiple 
choice items. Part I (Items 1-50) is concerned 
with social, economic, and political develop- 
ments; and Part II (Items 51-75), with medi- 
cal, scientific, and technical developments. A 
2-page manual is devoted entirely to directions 
for administering and scoring the test. None of 
the information on validity, reliability, interpre- 
tation of results, etc., that one customarily looks 
for in a manual is there reported. A separate 
8-page pamphlet, Norms for Public School 
Achievement, was made available to users of 
the test after results of the national program had 
been summarized. 

It is hard to know quite how to appraise the 
validity of an instrument of this kind. There are 
in this area no courses of study, no textbooks, 
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no consensus of expert judgment, such as the 
achievement test maker customarily relies on. 
Whether the particular phases of contemporary 
developments that the authors have chosen to 
cover, and the relative emphases which they have 
placed on the various topics, are appropriate and 
reasonable must be determined largely by each 
user in relation to his own program of instruc- 
tion. About half the material in Part I is con- 
cerned with political developments, the other 
half being divided more or less equally between 
social and economic developments; about half 
of Part II is concerned with developments in 
the field of medicine, with the remainder dis- 
tributed widely over other scientific and techni- 
cal fields. The reviewer's general impression is 
that the coverage of the several areas is a reason- 
ably well balanced one. 

The individual questions in the test by and 
large reflect a high level of authorial and editorial 
competence. Questions are clear, options well 
chosen. While the authors have not been entirely 
successful in their efforts to avoid items that 
measure merely superficial acquaintance with 
contemporary events, or that deal with trivial or 
transitory matters, the number of such items is 
small. Some of the items in Part II (e.g., 64, 67, 
70, 72, and 73) are not, strictly speaking, con- 
cerned with "recent developments." In Part 
I, Item 24 gives a clue to the answer for Item 
4, and Item 25 for Item 23. But these are in- 
significant defects in a test that is, on the whole, 
very well written. 

The norms include percentile norms for each 
grade from 9 through 12 for interpreting total 
scores of individual students, and for interpret- 
ing school averages on Part I, Part II, and total 
test. The norms are based on a stratified sample 
of 4,429 students, selected from some 46,000 
whose answer sheets were returned by 364 
schools that participated in the 1951 program. 
There is no reason to doubt that the normative 
group is typical of all the students that partici- 
pated in the program and no claim is made that 
it is more widely representative. On the basis of 
the percentile norms, it would appear that the 
test is of suitable difficulty for 11th and reth 
graders, somewhat hard for 9th and roth grad- 
ers. Social studies teachers—and others—may 
be discouraged to note the relatively small in- 
crements in average score from grade to grade. 
As students progress through high school, they 
advance in knowledge and understanding of con- 
temporary affairs at a very modest rate indeed, 
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if their performance on this test is a true index, 
The norms pamphlet also gives the per cent of 
students in successive grades answering cor. 
rectly each question іп the test. These item-dif. 
ficulty values represent a useful feature that 
should encourage schools to undertake a ques- 
tion-by-question study of their own results and 
thus add to the instructional value of the testing, 
No estimates of the reliability of the test are 
given. Kuder-Richardson estimates of the relia- 
bility of the total score, calculated by the re- 
viewer on the basis of the item difficulty values 
reported, were .88 for grade 10 and .89 for grade 
12. Failure to present reliability data is regret- 
table, particularly in light of the recommenda- 
tion that the scores be used for guidance pur- 
poses and made a part of the student's cumulative 
record. Working time is 40 minutes for the 75 
items, which means that some of the slower- 
reading students were probably handicapped. 
Educators and laymen alike agree that an in- 
creased awareness and understanding of what 
is happening in our political, social, economic, 
and scientific world is an important objective of 
secondary education. Results of this test should 
help schools discover to what extent their stu- 
dents are progressing toward this goal, and 
should have a beneficial influence in motivating 
both student and teacher to pay greater айеп- 
tion to these areas. | 
The reviewer must admit to some misgivings” 
about the wisdom of encouraging schools to pur- | 
chase a test which they have not seen, and about ` 
whose validity and reliability they know noth-- 
ing; their faith in the authors and distributors” 
might not always be so well placed as in this 
instance. It is difficult, however, to see how this 
is to be avoided in the case of tests prepared | 
anew each year and useful for a single year only. © 


For reviews by Ernest W. Tiegs, Ralph W. 
Tyler, and Edgar B. Wesley of Form 1946, set 
3:5. 
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1 
*Coordinated Scales of Attainment. Grades 1, ai 
3, 4, 5, б, 7, 8; 1946-50; IBM for grades 4 and overi 
8 levels; primary and advanced master manuals ('50); | 
separate answer sheets must be used in grades 4-8: 
$1 per specimen set, postpaid; soc per master manual; | 
postage extra; spelling: James A. Fitzgerald; Eng- 
lish: Dora V. Smith; reading: Ullin W. Leavell; his- 
tory: Edgar B. Wesley; geography: M. E. Branom; 
arithmetic: Leo J. Brueckner; literature: Ellen Frog- 
ner; science and hygiene: Victor C. Smith and August 
Dvorak; Educational Test Bureau, Educational Pub: 
lishers, Inc. * | 
а) BATTERY I. Grade 1; 1047-50; 8 scores: picture 
word association, word-picture association, vocabulary | 


17 ] 


recognition, reading comprehension, arithmetic expe- 
rience, number skills, problem reasoning, computation; 
Forms А (’50), B (49) ; directions for administering 
(Form A—s0, Form B—49) ; $1.75 per 25. 

b) BATTERY 2. Grade 2; 1047-50; 9 scores: same as 
for Battery 1 plus spelling; Forms А (о), B (^49) ; 
directions for administering (Form A—’50, Form B— 
'49) ; $1.75 рег 25. 

с) BATTERY 3. Grade 3;, 1947-50; 7 scores: picture- 
word association, word-picture association, vocabulary 
recognition, reading comprehension, problem reason- 
ing, computation, spelling; Forms А (’50), B (49); 
directions for administering (Form A—’s0, Form 
'49) ; $1.75 per 25. 

d) BATTERY 4. Grade 4; 1946-50; II scores: punctua- 
tion, usage, capitalization, reading, history, geography, 
science, literature, computation, problem reasoning, 
spelling; IBM; Forms А (46), B (до); directions 
for administering (Form А— 50, Form B—'50) ; $2.90 
per 25; $1.80 per 25 IBM scorable answer booklets; 
$1.45 per 25 hand scoring answer booklets; 50¢ per set 
of machine or hand scoring stencils; 50¢ per 25 normal 
progress charts. 

е) BATTERY 5. Grade 5; 1946-50; II scores: same as 
for Battery 4; IBM; Forms A (746), В (’49) ; direc- 
tions for administering (Form A—'5o, Form B—'50) ; 
prices same as for Battery 4. 

f) BATTERY 6. Grade 6; 1946-50; II scores; same as 
for Battery 4; IBM; Forms A ('46), В (40); direc- 
tions for administering (Form А— 50, Form В— 50); 
prices same as for Battery 4. 

g) BATTERY 7. Grade 7; 1946-49; II scores: same as 
for Battery 4; IBM; Forms A (’46), В (49); direc- 
tions for administering (Form A—'5o, Form B—'50) ; 
prices same as for Battery 4. 

h) BATTERY 8. Grade 8; 1946-49; II scores: same as 
for Battery 4; IBM; Forms A (46), B (40); direc- 
tions for administering (Form A—'5o, Form В—'50) ; 
prices same as for Battery 4. 


ALVIN W. SCHINDLER, Professor of Education, 
University of Maryland, College Park, Mary- 
land. 

This battery was designed to provide data on 
achievement level and rate of progress. These 
purposes should be recognized in evaluating the 
battery. 

The manuals are worthy of commendation. 
There is one for each grade, and there are also 
two master manuals—one for grades 1-3 and 
one for grades 4-8. The directions for the pupils 
as read by the teacher are clear and brief. Ex- 
planations for teachers on administering and 
scoring the tests and on deriving and interpret- 
ing the scaled scores are complete and helpful. 
Nevertheless, group meetings to clarify the co- 
ordinated plan which permeates the entire pro- 
gram should be arranged in schools using the 
scales. The master manuals are worthy of study, 
although it is unfortunate that they provide little 
information on procedures used in arriving at 
the scaled scores and the norms. 

The answer sheets and the scoring keys for 
Batteries 4-8 are arranged to facilitate scoring 
and derivation of scaled scores. Three types of 


TESTS & REVIEWS: ACHIEVEMENT BATTERIES 8 


answer sheets are available for these grades. In 
the primary grades the pupils record their re- 
sponses at various places in the test booklets. 
The scoring of these tests is therefore incon- 
venient, since an answer key and a test booklet 
each contains 18 pages. 

A convenient "profile of scores" is provided 
on each booklet or score sheet. The profile uti- 
lizes scaled scores. Norms on “grade equivalent" 
and "age in grade" are provided with the profile, 
thus eliminating the inconvenience of separate 
tables. The norms with a given battery provide 
“grade equivalent" and “age in grade" for three 
years only. Data for "percentile rank interpre- 
tation" accompany the cutout scoring keys for 
Batteries 4-8. The norms should be based on a 
larger number of pupils. 

The format of the test booklets is commend- 
able. The type is appropriate. There is adequate 
spacing between lines. An exception may be 
noted in the responses to questions in the pri- 
mary reading tests. These responses are in un- 
necessarily small type, and the spacing is inade- 
quate. In the Batteries 4-8 the small type used 
for responses in multiple choice items may also 
justify criticism. 

The indicated testing time is 90 minutes for 
the primary grades and 256 minutes-for Bat- 
teries 4-8. Testing periods of 30 to 45 minutes 
are recommended. The pupils are allowed to 
work on a given section until 9o per cent of them 
have completed it. On the basis of unlimited time 
the authors claim that the tests are “power tests.” 
Test content as well as unlimited time is an 
important factor in determining whether or not 
a test is a power test. 

Batteries 1-3 include four types of reading ex- 
ercises but they do not measure all types of im- 
portant reading abilities. The arithmetic tests 
for grades 1 and 2 measure number skills as well 
as fundamental processes ; nevertheless, there is 
a very noticeable gap between test content and 
recognized objectives of primary arithmetic. The 
reading tests for these grades are as good as 
other standardized reading tests, and the arith- 
metic tests are above average. 

The reading and arithmetic tests in Batteries 
4-8 differ rather sharply from those at the 
primary level. The strategy in measuring read- 
ing achievement utilizes paragraphs followed by 
multiple choice questions. The exercises require 
desirable types of reading skills, but they do not 
measure all types. The major emphasis is on 
reading to grasp the central thought, to note de- 
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tails, and to note meaning of vocabulary in con- 
text. The authors claim that the reading tests 
also measure ability to interpret content, to inte- 
grate dispersed ideas and to draw inferences, but 
their claims in these respects are probably over- 
done. Some of the items which are intended for 
these purposes simply require noting of details. 
However, in some instances they demand infer- 
ences or interpretations which are not justified 
by the content. The arithmetic tests emphasize 
problem reasoning and computation in Batteries 
3-8. As in grades 1 and 2 the test content is too 
narrow to satisfy current theory on desired out- 
comes. Nevertheless, the arithmetic tests have 
more than average merit. 

Sections on science, geography, history and 
literature are included in Batteries 4-8. They are 
inferior to the language, reading and arithmetic 
sections. Part of the inferiority may result from 
inevitable difficulties in the selection of content. 
Nevertheless, there is unnecessary reliance on 
recall of isolated facts. Furthermore, the items 
call for much information which is unimportant. 
The literature items refer to important books; 
nevertheless scores on tests of this kind do not 
present proof that objectives of literature are 
being achieved. Teachers who desire tests in 
these four areas will find that these materials 
compare favorably with other tests, These tests 
and the tests on skills should be in separate book- 
lets. 

The construction of the multiple choice items 
merits some praise and considerable adverse 
criticism. On the credit side it may be noted 
that each item usually contains one or two plau- 
sible responses. In some items, especially in the 
reading test, some wrong responses are too 
nearly correct. However, many of the wrong re- 
sponses are not at all plausible. An item analysis 
would probably reveal that one wrong response 
could be eliminated from most of the items with- 
out loss of effectiveness. No data are given to 
show the extent to which individual items were 
checked for discriminating power. Much testing 
time could probably be saved by eliminating the 
weakest response in each five-response item and 
by eliminating items which have low discriminat- 
ing power. Data to refute this conclusion are not 
presented in the manuals. 

Tables for converting raw scores to scaled 
scores are found on the answer sheets for Bat- 
teries 4-8, and with the answer keys for Bat- 
teries 1-3. An analysis of these tables reveals 
that changes of 4 to 7 points in the raw score 
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may mean an entire grade difference in grade 
equivalent or an entire year difference in 
equivalent. This may indicate that the content of 
each battery has a marked insufficiency of ma. 
terial really at the level of the pupils for whom 
the battery is intended. This fact may seriously 
undermine the value of the scales for the stated 
purposes. The authors do not provide data on 
the consistency with which the scales show rate 
of progress. 

The "guide to remedial work" provides a 
commendable outline of test content. It may be 
used to get a view of what each battery attempts 
to measure. The guide does not offer suggestions 
on remedial teaching, although the outline may 
be used as a framework for diagnosis. 

For an elementary school testing program | 
which stresses information and skills, the Co- | 
ordinated Scales of Attainment are а good 
choice. They permit a maximum comparison of | 
scores. However, it should be remembered that | 
the scores of a given pupil merely reveal where 
he ranks among other pupils on the kinds of 
achievement,measured by the materials. 


For reviews by Roland L. Beck, Lavone A. 
Hanna, Gordon N. Mackenzie and Glen Hass, | 
and C. C. Ross of Batteries 4—8, see 3:6. 
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*Essential High School Content Battery. Grades 
9-13; 1951—52, C1950-51; 5 scores: mathematics, sci- 
ence, social studies, English, total; IBM; Forms AM 
C51), BM ('52) ; manual (’51) ; $4.75 per 25; separate 
answer sheets must be used; $1.40 per 25 IBM answer 
sheets; postage extra; 356 per specimen set, postpaid ; 
200 (215 according to manual, 225 according to test 
booklet); David P. Harry and Walter N. Durost; 
World Book Co. * 

Hersert 5. Conran, Chief, Research and Sta- 
tistical Standards, Office of Education, Federal 
Security Agency, Washington, D.C. 

The four fields covered in the test battery are: 
mathematics (66 items, 45 minutes), science (70 
items, 40 minutes), social studies (9o items, 40 | 
minutes), and English (249 items, 80 minutes). 
Including a 10-minute general instruction pe | 
riod, the working time for the entire battery is 
215 minutes. The test in mathematics “samples | 
arithmetic skills, general mathematics, algebra, 
geometry, and, to a very slight degree, trigo- 
nometry and advanced algebra." In the test on 
science, “about 46 per cent of the test's content 
is drawn from the physical sciences, about 41 
per cent from the biological sciences, and about 

1 The opinions expressed in this review are those of the - 


author, and have ‘ho bearing on any policies or views of the - 
Office of Education or the Federal ecurity Agency. 
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I3 per cent is common to both." The social 
studies test "includes content from American 
History, World History, Civics and Govern- 
ment, Economics, and Problems of Democracy." 
The English test includes reading, vocabulary, 
business definitions, use of references, literature 
acquaintance, language usage, capitalization, 
punctuation, and spelling. The Essential High 
School Content Battery is described as “а com- 
prehensive battery of high school achievement 
tests....for use as a survey-type instrument." 
I. CURRICULAR AND Face VALIDITY. A. 
SELECTION OF ITEMS. The manual outlines in 
unusually commendable detail "the. major 
sources which, together with textbooks and 
courses of study, were used in determining the 
content of the four tests." An analytic inventory 
of the content of each test is also provided: in 
the social studies test, for example, Items 1—10 
deal with "acquaintance with contributions of 
famous Americans" ; Items 11—20 deal with “un- 
derstanding of current social and political prob- 
lems”; Items 21-30 deal with “understanding 
of vocabulary of social studies"; etc. Since the 
reviewer is not a subject matter expert, he did 
not undertake more than a casual inspection of 
individual items (and even this was restricted 
to Form Ам, since this is the only form for 
which the specimen set includes item-analysis 
data). In general, the items have impressed this 
reviewer favorably. In connection with a few 
items, however, some question did arise, which 
was not always allayed by the empirical results 
of the item analysis. Thus, according to the 
key, one subtracts in order to “find how much 
change is due when more than the exact amount 
of the bill is offered in payment" (Mathematics, 
Item 29) : the reviewer's observation is that one 
generally does better by adding. (This item is 
fairly easy, according to the item analysis, but 
the validity coefficient is low.) In the test on 
science, the student is expected to refute the 
"common misconception" that "tuberculosis can 
be inherited," by using the principle that “only 
the physical structure of an organism has been 
demonstrated to be controlled by genes." This 
seems bad to the reviewer on the ground that, 
according to modern thought, (a) genes do not 
by themselves completely "control" anything ; 
(b) genes are known to influence function and 
behavior, as well as “physical structure"; and 
(c) the implied irrelevance of “physical struc- 
ture" to function and malfunction is unwar- 
ranted, Again in the science test (Item 61), a 


single generalization is taken as supplying the 
"cause" when actually at least two or three other 
factors enter in (viz., a shorter workday, in- 
creased prosperity, and decreased costs of amuse- 
ments). Such gross oversimplification is what 
students in science should be trained against, 
not toward. In the social studies test, this re- 
viewer raised his eyebrows at the statement 
(according to the key) that Mustapha Kemal 
"established a democratic republic in Turkey." 
Here there seems to be confusion between the 
form and the reality (Kemal being virtually a 
pure dictator)—a confusion to which the teach- 
ing of government in the schools is perhaps ad- 
dicted, but which tests should not duplicate. 
The language usage test includes a few items 
(60-63) that belong more properly in the test. 
on spelling. The spelling test is of the proof- 
reading type, which is not as valid as dictation. 
The test on capitalization and punctuation also 
seems of limited validity, since it focuses atten- 
tion on the points in a sentence that require at- 
tention; this gives the student help which he 
would not have if he were writing the sen- 
tences himself. The foregoing, it should be em- 
phasized, are exceptions to what appear to the 
reviewer as generally well constructed, excellent 
items. 

B. OMISSION OF CONTENT. The authors state 
that "within the confines of any test it is im- 
possible to measure all the important outcomes 
in any area." This is of course true: thus, the 
oral aspects of English (including articulation 
and effectiveness of expression) cannot be meas- 
ured in a written test. It seems to the reviewer 
a poor defense, however, to say that "in a test 
designed for machine scoring, there are restric- 
tions....which make it impossible to measure cer- 
tain types of skills or certain objectives... [such 
as] composition ability, original solution of 
mathematical or scientific problems, spelling 
ability in a dictation-type situation, and other 
worth-while objectives of particular courses." 
The solution is obviously to make use of hand 
scoring as needed, rather than to consign whole 
sections of the curriculum to nonmeasurement. 
While each section of the battery might be 
criticized for omissions (criticism is easy!), 
perhaps the most prominent inadequacy occurs 
in the test devoted to literature. This test (named 
Literature Acquaintance) consists exclusively 
of brief quotations which are to be identified 
with various novels, essays, plays, etc. Surely 
the years in high school spent on literature 
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should yield more than merely the ability to 
remember the source of a quotation! 

C. BALANCE. In general, the apportionment of 
items to different fields seems judicious and 
correct; there are, however, a few instances 
where exception may be taken. In mathematics, 
for example, 6 items are devoted to “interpre- 
tation of mathematical graphs," and 8 to “ap- 
plication of mathematics to life problems" : this 
reviewer would prefer to see the latter somewhat 
increased, if necessary at the expense of the 
former. Similarly, in social studies, r1 items are 
devoted to “knowledge of contributions of world 
leaders" and то to “knowledge of international 
relations" : again, this reviewer would prefer to 
see the latter increased, if necessary at the ex- 
pense of the former. A serious imbalance seems 
to occur in the test on English: here 60 items 
are devoted to language usage, бо to capitaliza- 
tion and punctuation, and 60 to spelling—leav- 
ing only 69 items for the other five parts of the 
test, of which only 15 are devoted to literature. 
Since all items are given equal weight in scoring, 
literature becomes, in terms of number of items, 
only six per cent of the total English course as 
tested ! 

IL PRELIMINARY Tryout. “Two experi- 
mental forms of the Mathematics, Science, and 
Social Studies tests were prepared, and six 
forms of most of the parts of the English test. * 
Approximately 1,000 students in each grade 
from 9 to 12 took the experimental forms of each 
test." These data were employed for an item 
analysis of the tests in mathematics, Science, and 
social studies, and for the calculation of relia- 
bility coefficients. A detailed mimeographed re- 
port of this “pre-publication experimentation” 
(Supplementary Technical Report No. I) is 
available from the test publishers on request. 
In the case of the English subtests (except 
Vocabulary and Literature Acquaintance), a 
"novel plan" was followed. Intercorrelations be- 
tween the six forms of each subtest were calcu- 
lated, and those two forms were selected (in- 
tact) which showed the highest intercorrelation 
while also showing "satisfactory distribution 
characteristics.” This “novel plan,” utilizing as 
it does whole groups of items, involves less sta- 
tistical labor than item analysis ; unfortunately, it 
cannot be trusted to select the best individual 
items from the total supply. 

ПІ. STANDARDIZATION SAMPLE. For pur- 
poses of standardization, the test battery was 
administered at the close of school in May 1950 


to 46,000 students in 288 high schools locate 
in 35 states. “The distribution of size of school 
for the normative group...agrees closely with | 
that for....high schools generally.” Data regard. - 
ing the age and Terman-McNemar IQ's of the _ 
students in the standardization sample indicate 
that the sample is “typical” in these respects, 
Finally, "the normative data for various geo. 
graphical divisions of the country were... 
weighted that the final norms are based on a 
group whose geographical distribution corre- 
sponds closely to that of the national high schog 
population.” Thus it appears that a strenuous 
(and probably successful) effort was made to _ 
obtain a fair national standardization sample, | 
Two possible shortcomings of the sampling plan 
may, however, be noted: (a) It is not clear 
whether or not ће 11-year school systems found _ 
in some areas of the South were included in the | 
sample; very likely they were, but the таппа | 
is silent on this point. (b) No direct attempt — 
was made to insure a fair distribution of the 
quality of schooling in the high schools selected | 
for the standardization sample. This may be an 
important omission, since superior teaching in 
a well administered school undoubtedly leads to 
superior achievement test scores. Salary data | 
(indicative of quality of schooling) are generally _ 
available in state departments of education; it 
is time that such data were put to use in the 
selection of samples for the standardization of 
educational tests. | 
IV. Formar. Although the format of the test _ 
battery is in general excellent, the following | 
criticisms or suggestions appear to deserve con- | 
sideration: (a) It might be better to issue the 
test battery in four booklets (one each for math- | 
ematics, science, social studies, and English), in- И 
stead of one. If this were done, it would not be | 
necessary each time to give the directions, “Don't | 
go ahead" (to steal a glimpse at the items for 
the next session of testing), or “Don’t go back" 
(to work on items covered at the previous ses- 
sion). (b) It is time-consuming for the test- 
scorer to indicate, in handwriting (on page ! 
of the answer booklet), which set of norms was 
employed. (c) The combining of scores for the 
two parts of the English test is awkward and 
time-consuming. First, the total score for Parts 
A, B, C, and D is recorded on the third page of | 
the answer sheet; then the page is turned, and |. 
the score is recorded on the other side, i.e., on 
the fourth page; then the total score for Parts | 
E, F, G, and H is recorded on the fourth page: _ 
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and then the two scores are added (horizontally) 
to obtain a grand-total English score. The whole 
process is not only time-consuming, but invites 
error. 

V. Directions, Although the scoring for- 
mula provides no penalty for guessing, the di- 
rections include the admonition, *Do not make 
wild guesses." This of course gives a premium 
to the student who does guess, whether the guess 
is wild or not. The conservative, conscientious 
student is especially handicapped in the English 
test, where 2-choice items abound (there are 180 
of these) ; but even in the 5-choice items, the 
"wild guesser" gets a 20 per cent premium on 
guessed items. In the reviewer's judgment it 
might be sounder testing, and would certainly 
be better ethics, for the instructions to state 
forthrightly that there is no penalty for guessing. 

VI. ArrERNATE Forms. According to the 
Supplementary Technical Report No. 1, “items 
were allocated to the final forms so as to yield 
two forms closely parallel as to content, and 
equivalent with respect to item difficulty and 
item validity." In spite of this procedure, the 
correlation between alternate forms of the four 
tests—Mathematics, Science, Social Studies, and 
English—is in only one case above .9o. The 
highest alternate form reliability coefficients are 
for the most part found for the mathematics test 
(.85, .92, and .87, in grades 10, 11, and r2, re- 
spectively) ; the lowest coefficients are uni- 
formly found for the science test (.67, .78, and 
.81, in grades то, 11, and 12, respectively). 

VII. RELzABILITY. Both alternate form and 
split half reliability coefficients are presented for 
each of the four tests of the battery (the alter- 
nate forms were administered within an interval 
of a week) ; but only the split half coefficients are 
presented for the battery as a whole. The split 
half coefficients are all higher than the alternate 
form coefficients (median difference, .045), sug- 
gesting that a speed factor may be boosting the 
split half coefficients somewhat. The reliability 
coefficients, in general, are based on rather small 
groups (median N — 155). It is not stated 
whether the reliability groups are drawn from 
a single high school, or whether each group is 
composed of stadents drawn at random from the 
various schools in the standardization sample; 
however, differences in the size of the reliability 
groups (which vary, for example, from 96 to 
297 in grade 12) suggest that the former, and 
preferable, technique was employed. What the 
typical test user needs to know, of course, is the 
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reliability of each test for a group such as is 
found in his own school, or in his own school 
system. (A national sample, reflecting regional 
differences in curricula, grade placement of 
course content, teacher efficiency, adequacy of 
previous education of students, and student dif- 
ferences in intelligence and interest, would yield 
spuriously high coefficients.) 

The magnitude of the reliability coefficients 
has already been indicated in the preceding sec- 
tion VI. Since not one of the tests has an alter- 
nate form reliability coefficient consistently as 
high as .90, this reviewer cannot escape the con- 
clusion that the tests do not come up to the 
standard ordinarily required for individual meas- 
urement. True, the split half coefficients (which 
are the only ones quoted in the advertising mat- 
ter) are higher; but it is the alternate form co- 
efficients which are more pertinent (unless we 
are interested in students’ consistency only at 
the time of taking one form of the test) ; and 
even the split half coefficients are below .90 in 
six instances out of twelve. The battery as a 
whole shows a single-grade split-half reliability 
of .95 or better. 

The manual recommends, either explicitly or 
implicitly, that use be made of various difference 
scores, as follows: (a) the difference between 
scores on the various tests of the battery—as a 
measure of "strengths or weaknesses"; (b) the 
difference between a student’s test score, and 
the score that would most probably be expected 
on the basis of the individual’s intelligence—this 
difference between actual and expected per- 
formance is analogous to the “regression АО”; 
and (c) the difference between a student’s suc- 
cessive scores in different grades—this of course 
is a measure of “growth.” As is commonly the 
case in other manuals, no indication is given in 
this manual of the reliability of any of these dif- 
ference scores ; nor is the statistical information 
supplied from which such reliabilities could be 
definitely calculated. In view, however, of the ` 
meager reliability of the tests, it is certain that 
the reliability of all the proposed difference 
scores would be much too low for valid use in the 
assessment or guidance of individual students. 

VIII. Norms. “Standard scores" and per- 
centile norms are presented for each test (Math- 
ematics, Science, Social Studies, and English) 
for three groups: (а) all students; (b) students 
taking “academic or scientific” (i.e., college pre- 
paratory) courses; and (c) students taking 
"commercial or general" courses. Similar norms 
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for median test score are given for all students 
only. End-of-term (May) norms are the ones 
printed in the manual received by the reviewer ; 
but midyear norms will also be provided in later 
manuals, or on request. This richness of norms 
is very commendable, though the presentation of 
separate norms for the academic and nonaca- 
demic groups raises a question regarding the de- 
sirability of norms for the (heterogeneous) total 
group. 

A useful and unusual feature is the presenta- 
tion of the complete distribution of average 
standard scores made by the schools in the stand- 
ardization sample (these data are presented 
separately for each test, and for each grade). 
This information enables a principal to judge 
not merely whether his school's average (on a 
given test, for a given grade) is above or below 
the national average, but whether his school 
falls within, for example, the middle ten per 
cent of all schools or above the 75th-percentile 
school. There are, to be sure, definite risks in 
any such comparison: thus, the high position of 
a school in (say) social studies may be explicable 
not by superior teaching or superior student 
effort, but by some peculiarity of the curricu- 
lum in the school, or by the unusually small 
percentage of nonacademic (“commercial” or 
"general") students in the school. The possi- 
bility of the latter condition suggests that a 
principal should, in general, calculate two av- 
erages for each grade: one for the academic 
students, and one for the nonacademic. It would 
be desirable, then, for the distribution of school 
averages also to be divided into these two classes. 

Another unusual and desirable feature is a 
table showing the mean standard score on the 
mathematics and the science tests (separately by 
grade) for students who have had, respectively, 
0, I, 2, 3, .. . , 9 semesters of instruction in 
the subject matter of the test. This information 
permits observation of whether a student is 
above or below the average of others with simi- 
lar training ; the table does not, however, enable 
one to say whether the student is at (say) the 
6oth percentile, the 7oth percentile, etc. 

Also presented is the mean raw score for each 
part of the English test (separately by grade), 
for use by those who may be especially interested 
in the part scores on this test. 

ІХ. Manvat. The manual provides adequate 
directions for administration of the test and 
gives more information than usual on the de- 
velopment and statistical characteristics of the 


test. The manual is strengthened by the Sup- 
plementary Technical Report No. 1, already ге. 
ferred to. Also to be available are statistical data 
on the relation between the test scores and in- 
telligence. On the debit side must be mentioned: 
the absence of data on the intercorrelations 
among the tests of the battery ; the general ab- 
sence of standard deviations in the statistical ' 
tables; and the absence of information on the 
reliability of recommended difference-scores, 

X. DISREGARD oF INDIVIDUAL DIFFERENCES, 
The Essential High School Content Battery 
was apparently devised with the thought that 
there is need for a battery which can be 
given to any high school student at any 
time—regardless of what subjects he has stud- 
ied, how long he has studied them, or when 
he studied them. The only adjustment to indi- 
vidual differences is made through the fairly 
elaborate set of norms: norms for students in 
the academic or college preparatory course vs. 
the nonacademic course; norms for grades 9, 
IO, II, and 12; and norms (of a limited char- 
acter) for those with a differing period of in- 
struction in mathematics and science. To the i 
reviewer, however, this procedure seems all 
wrong. The proper adjustment to individual dif- 
ferences in the curriculum is made by the use | 
of separate tests, in the subjects which the stu- 
dent has actually taken, administered at a series 
of appropriate dates (such as at the beginning of 
the course, at mid-term, at the end of instruc- 
tion, and perhaps one or two years after the end). 
Such a procedure recognizes and respects in- | 
dividual differences directly and completely. The 
use of a uniform battery for all students at al- 
most any time in the semester contradicts the 
very soul and purpose of the measurement move- 
ment. Testing should have as its uncompromis- 
ing purpose the discovery of the individuality of 
each person by the most accurate and valid 
measures feasible. 

There is not one use to which the Essential | 
High School Content Battery may properly be 
put which cannot be served better by a battery 
of specialized tests. Consider, for example, the 
test on science. The content of this test is drawn | 
principally from the three subjects, biology, 
chemistry, and physics. What sense is there in 
giving such a test in the ninth grade (or the 
tenth grade) to a student who has not taken 
any of these courses? How much sense is there 
in giving the science test to a girl in the twelfth 
grade who has taken only biology in the ninth 
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grade and no science since then? What sense 
is there in giving the mathematics test to a girl 
specializing in secretarial studies? These are not 
specially exceptional cases, and the answer is 
that there is little or no sense. Obviously what 
is needed is a supply of tests adapted to the in- 
dividual curriculum of the individual student 
and administered to the student at appropriate 
times. Nothing else does justice to the student. 

According to the manual, “the basic premise 
underlying the construction of this battery....is 
that there is a common body of knowledges and 
skills which it is reasonable to expect a high 
school graduate to possess." A test to measure 
such a "common body of knowledges and skills" 
would be very useful. But the Essential High 
School Content Battery (despite its name) is 
simply not that kind of test, as is shown by three 
facts: (a) The content of each test goes con- 
siderably beyond “the common body of knowl- 
edges and skills which it is reasonable to expect 
a high school graduate to possess." Should every 
high school graduate be expected to know, for 
example, the algebra and geometry (not to men- 
tion the trigonometry and advanced algebra) in- 
cluded in the mathematics test? (b) The dif- 
ficulty of each test (mean score typically less 
than half the total number of items) must lead 
one to conclude that more has been included 
than "the common body of knowledges and 
skills, etc."—either so, ог the schools have been 
astoundingly and impossibly unsuccessful! (c) 
The provision of separate norms for academic 
vs. nonacademic students, and of some norma- 
tive data for students with a varying number of 
semesters of instruction in mathematics and sci- 
ence, belies the premise that the test measures 
only the “common body of knowledges and skills, 
etc,” If the knowledges and skills are truly 
“common,” why the need for separate norms for 
different groups? Actually, we have here just 
another achievement test battery which prob- 
ably measures the ‘common body of knowledges 
and skills” as well as (or perhaps better than) 
other batteries, but which sacrifices adjustment 
to the individual student’s curriculum for the 
sake of administrative convenience. 

The principal technical objection to a set of 
separate, “customtailored” tests as compared 
with a uniform battery is that comparable norms 
are difficult to obtain, since it is not feasible to 
administer the tests to the same set of schools. 
The reviewer questions this objection, because 
it is feasible to administer separate tests, if not 
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in the same schools, at least in equivalent schools. 
For this purpose, the Directory of Secondary 
Schools (available from the U.S. Government 
Printing Office) constitutes a new resource 
which can be of great assistance. 

XI. Titre or THE Barrery. The title of the 
Essential High School Content Battery is ob- 
jectionable for three reasons. First, as has al- 
ready been stated in the previous section, the 
test goes beyond the measurement of only the 
"essential" high school content. Second, what 
is "essential" or unessential is an individual mat- 
ter not to be decided en masse for all high school 
students by any test constructor or test pub- 
lisher. [For the girl planning to enter an office 
after graduation, typewriting (as well as spell- 
ing) may be "essential"; for the boy planning 
to study engineering, trigonometry may be “ез- 
sential"; for the unstable boy or girl (requir- 
ing a larger measure of self-understanding or 
self-expression) a course in human relations or 
in carpentry or in drawing may be "'essential."] 
Third, in the context of the current struggle for 
more realistic and improved high school cur- 
ricula, the title, Essential High School Content 
Battery, must be regarded as unfortunate. If the 
content of this battery represents the “essen- 
tial,” does not the implication arise that other 
high school content is wnessential or dispensable? 
Is this what the World Book Company wishes 
to imply? We have heard from “taxpayers’ 
leagues” about the so-called “fads and frills” of 
modern education. It is surprising indeed, how- 
ever, to see an educational test publisher accept 
and take over the language of the enemy. 

XII. ConcLUDING REMARKS. In this review- 
er's judgment, the items of the Essential High 
School Content Battery are well constructed, 
and the norms are more reliable and complete 
than usual. The manual supplies a great deal of 
valuable information. On the debit side, the re- 
liability of the four component tests (Mathe- 
matics, Science, Social Studies, and English) is 
somewhat lower than desirable. As usual, the 
recommended difference scores (to measure 
growth, special strengths or weaknesses, and 
deviation from the score to be expected on the 
basis of intelligence) are not reliable enough 
for use in individual assessment or guidance. 
Exception is taken to the name of the battery, 
most importantly on the ground that what should 
be considered “essential” high school content is 
an individual matter. Finally, this reviewer be- 
lieves that, though this battery is good (or even 
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superior) of its kind, it is not of the right kind. 
What is needed is a supply of separate tests, 
comparably standardized, adapted to the courses 
which the student has actually taken, and ad- 
ministered at appropriate times in the student's 
career. i 


J. Tuomas Hastines, Associate Professor of 
Education. Director, Unit on Evaluation, Bu- 
reau of Research and Service, College of Edu- 
cation; and University Examiner; University 
of Illinois, Urbana, Illinois. 

The authors and publisher of this test have 
done an excellent job of presentation of ma- 
terials. The care and thought which have gone 
into preparing the manual of directions, which 
is well indexed with a guide to the contents, make 
the manual a paragon which should be studied 
closely by other test authors and publishers. Al- 
most all of the necessary statistics for proper 
interpretation of the test materials have been 
presented in the manual. Certain additional ma- 
terials to those presented in the manual—re- 
sults of an item analysis showing item difficulty 
and internal consistency and tables showing re- 
lationship of intelligence tests to the results of 
the Essential High School Content Battery— 
are available on request from the publisher with- 
out charge. The standardization group is care- 
fully described, and it is a satisfactory sample 
for standardization of such a test. Both alternate 
form and split half reliabilities have been pre- 
sented by grade level, and differential norms 
according to course of study of the student are 
given. This reviewer feels it is unfortunate that 
intercorrelations among the tests are not pre- 
sented since they would be of a good bit of aid 
in interpreting the meaning of differences in 
scores on the various tests in profile form. It 
would help if standard deviations were presented 
for part scores so that the user might see their 
relative weights in the total. 

The authors have cautioned test buyers not 
to use certain of the part scores within tests for 
individual diagnosis because of their low relia- 
bilities (the subparts are quite short). In their 
discussion of validity the authors emphasize that 
any complete evaluation within a school will 
call for supplementation of the results on these 
tests by the use of “other tests, including many 
locally prepared tests.” Such cautionary state- 
ments are correct, but they are unfortunately 
rare in published test manuals. There is also 
material in the manual which should be of help 


to schools in preparing for administration of the 
tests as well as material which should facilitate 
the use of the test results by all people concerned, 
It would be well for any school using the tests 
to see that its personnel had an opportunity 
to study the manual and then discuss it 
several weeks prior to the actual giving of the 
test. 

So far as content of the test is concerned, there 
can be no question concerning the relevance and 
authority of the sources used for objectives, 
functions, and content. There can be a question, 
however—and there probably will be in many 
schools—regarding the relevance or importance 
(not the “роойпеѕѕ”) of many of the items for 
a general education program. Many of the items 
are heavily “academic,” and it seems to this 
reviewer that this is especially true of the math- 
ematics section, although not confined to it. 
Some support is given this statement by a study 
of the differences in percentiles for the same 
standard scores between the sample taking aca- 
demic and scientific courses and the sample 
taking commercial and general courses. This is 
certainly not to say that the tests are therefore 
not “good,” but it serves to stress the statement 
made by the authors themselves—“Thus, judg- 
ment as to the validity of a particular test for 
particular uses must be made by the user of the 
test in terms of his own purposes and his local 
situation.” It is entirely probable that the test 
will be better received by schools which have a 
philosophy of academic college preparation than 
it will be by those schools which stress general 
education, It certainly could be somewhat useful 
even to the latter as a supplement to other ex- 
aminations if they kept themselves fully aware 
of the nature of the items in each of the examina- 
tions. 

In format and organization both the tests and 
the manual are easily satisfactory. It would be 
well for test users to check carefully any errors 
made by their pupils in Part D of the English 
test since a fairly unusual setup is used with a 
master list. Pupils who are not familiar with 
tests of the separate answer sheet variety may 
have special difficulty with this part. 

In summary, the Essential High School Con- 
tent Battery appears to be an exceptionally well 
developed test, and the authors should be praised 
for having made meaningful data available to 
the user and for having prepared an unusually 
useful manual of directions. It is hoped that the 
authors and the publisher will be able to keep this 
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test up to date and will be able to publish new 
data as the need develops. 


A. Harry Passow, Research Associate, Citizen- 
ship Education Project; and Gorvon N. Mac- 
KENZIE, Professor of Education; Teachers Col- 
lege, Columbia University, New York, New 
York. 

That there exists a “common body of knowl- 
edges and skills which it is reasonable to ex- 
pect a high school graduate to possess" is the 
basic premise underlying the construction of 
the Essential High School Content Battery. The 
battery's announced purpose is to provide “con- 
tinuing measures of the student's growth and 
development with respect to these knowledges 
and skills." 

If there is "substantial agreement among ex- 
perts in various fields as to important outcomes 
of secondary school offerings in those fields" and 
one agrees that curriculums should be standard- 
ized, achievement batteries may have some cur- 
ricular validity. On the other hand, if tests 
should meet the needs and objectives of the 
user, as the authors themselves declare, "in 
terms of his own program and his local situa- 
tion," curricular validity has hardly been “well 
established," despite the publisher's claims. This 
is probably the major issue in the use of any 
achievement battery. 

The test items were based on an analysis of 
textbooks, courses of study, and professional 
literature, Which texts and courses of study 
were examined are not indicated although the 
püblisher claims an "extensive analysis." The 
professional literature and expert opinion con- 
sulted are primarily yearbooks of the learned 
Societies. The statistical validity of the test is 
not discussed in detail. 

The authors suggest the test results may per- 
mit better appraisals of achievement, guidance, 
improvement of instruction, graduation exami- 
nation, college admission and placement, and 
System wide surveys. The use of batteries for 
some of these purposes is questionable. For in- 
Stance, test results will enable improvement of 
instruction, the authors claim, by providing "de- 
pendable information on the success of the pro- 
gram in accomplishing its objectives." By pre- 
suming that these objectives are or should be 
the same everywhere, achievement batteries tend 
to crystallize programs and thus block their an- 
nounced aim, improving instruction, at least in 
so far as this can be done by local groups plan- 
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ning cooperatively to meet locally determined 
goals, 

For those who see value in achievement bat- 
teries and their valid uses, this battery does have 
merit. The manual which accompanies the two 
alternate forms includes clear, concise instruc- 
tions for administering the tests and for using 
the perforated stencil keys in handscoring; di- 
rections for machine scoring are provided by 
the publisher on request. The authors indicate 
that a test designed for machine scoring places 
restrictions on the types of items which can be 
used and therefore is limited as to the kinds of 
objectives which can be measured. How to inter- 
pret results and how the tests were standardized 
are also discussed as well as its statistical relia- 
bility. 

The battery was administered to 46,000 stu- 
dents in 288 schools from 35 states (these fig- 
ures, incidentally, differ in the three places they 


_appear). Data for geographical divisions of the 


country were weighted so that the “final norms 
are based on a group whose geographical distri- 
bution corresponds closely to that of the na- 
tional high school population.” The authors, who 
do not explain this weighting, might better have 
prepared norms for the various geographical 
divisions, if these are significantly different, In- 
stead, they merely advise administrators to take 
account of any appreciable deviation from the 
normative group (in median IQ and chronologi- 
cal age) when interpreting results. 

End-of-year percentile norms by grade and 
test cover the total school population, students 
taking academic or scientific courses, and stu- 
dents taking commercial and general courses. 
Mean standard scores for mathematics and sci- 
ence according to number of semesters of in- 
struction supplement the other norms, although 
the manual does not explain how the teacher can 
best use this information. Certainly these dif- 
ferences must affect the end-of-year percentile 
norms as well. 

The two forms are not exactly comparable in 
difficulty. A sampling by the authors indicated 
that the science tests were equivalent, and that 
Form Bm of the other three tests was slightly 
easier than Form Am. These differences, which 
the authors claim are minor ones, are supposedly 
balanced by converting the “number right" to 
a standard score. 

Although the emphasis is supposed to be on 
social utility and on understandings, only three 
of the eight parts of the mathematics test stress 
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the type of consumer mathematics generally as- 
sociated with social utility. Instead, the test 
samples arithmetic, general mathematics, alge- 
bra, geometry, and, very slightly, trigonometry 
and advanced algebra. , 

The science test attempts to measure func- 
tional knowledge of factual material and ability 
to understand and apply scientific principles, 
concepts, and methods. Since many students 
have had only a course in general science, some 
of the language may be too technical. Of course, 
students can guess that the organic acid in 
grapefruit is citric acid without knowing what 
an organic acid is. The section on application of 
methods of science does go beyond facts, as 
might be expected. Presumably the differences 
in number of semesters and the contact that 
students have had with the mathematics and sci- 
ence areas are taken care of in the norms pro- 
vided. 

The social studies test attempts to sample 
American and world history, civics, govern- 
ment, economics, and problems of democracy. 
The test reflects the authors’ view that “unless 
the student has a certain minimum store of fac- 
tual information, it is unlikely that he will be 
able to function effectively in the areas of un- 
derstanding, or reasoning about, social prob- 
lems.” While agreeing with this view, one still 
questions whether the facts the authors test 
are either minimum essentials or of permanent 
worth. The last item from which price control 
was removed after World War II and the leader 
of the naval action around the Solomon Islands 
are two examples of isolated information. The 
section on global geography is limited primarily 
to Africa and South America and seems to neg- 
lect Europe, Asia, and the Middle East. 

The section on English which takes two pe- 
riods appears to be particularly vulnerable. The 
authors point out that “reliability figures for the 
parts of the English test indicate that these part 
scores, in most cases, ought not be used for indi- 
vidual diagnosis, but can be used satisfactorily 
in the interpretation of class results.” This type 
of English test perpetuates a traditional, teacher- 
centered English program which ignores com- 
munication and gives stress to formal, unrealistic 
grammar. Reading for information requires a 
student to memorize the details of a paragraph 
which, in all probability, holds little interest for 
him except as a means to "pass" this test. 

The part dealing with references presupposes 
that all libraries are equipped with standard ref- 
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erences familiar to children. The literature sec- 
tion implies that every child in every school 
should have read standard selections and should 
be able to recognize a selected sentence or two, 
These are the “classics” which many schools 
and teachers supposedly rate "essential." Those . 
teachers who have encouraged more suitable 
reading for children may find their students hay- 
ing difficulty with this kind of item. 

The punctuation and capitalization section 
requires more proofreading skills than communi- 
cation ability. The value of language as com- 
munication is evidently less important than writ- 
ing a formal grammar style—even if the latter 
does sound unrealistic. 

Accompanying the manual of directions are 
individual profile charts, class record sheets, and 
item difficulty analysis sheets—all helpful but 
hardly the “wealth of new interpretive devices” 
which the publisher claims. 

The time limits make it impossible for very 
many students to finish the tests or answer all 
items. It is doubtful whether an achievement 
battery should stress speed as well. No battery 
can measure achievement as such, of course, 
while ignoring intelligence or reading ability as 
they affect test taking. No provision is made for 
correction for guessing. In a number of cases 
understanding of concepts is not necessary in 
order to successfully guess the proper answer. 

The Essential High School Content Battery, 
despite its good features, will fall short of the 
publisher’s implied promises. It may have local 
uses but to imply that it will answer: Where 
does your high school stand? What progress are 
your students making? How can you determine 
individual and class strengths and weaknesses 
efficiently, accurately, economically ?—is prob- 
ably to claim too much for any battery of tests 
to be used nationally. Some of the “essentials” 
are questionable and others cannot be tested by 
this kind of battery. 


[10] 

*Graduate Record Examinations Profile Tests. 
Senior year college through graduate school and can- 
didates for graduate school; 1937-51; available only 
in Graduate Record Examinations programs (see 527) ; 
6 scores: physics, chemistry, biological science, social 
studies, literature, fine arts; 240(260) minutes; Educa- 
tional Testing Service. * 
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. STAFFORD, JoHN W. “The Prediction of S $ 
c School." Abstract. 4m Psychol 6:298 Л sry dE 


Max D. ENGELHART, Director, Division of Stu- 
dent Examinations, Department of Secondary 
Education, Chicago Public Schools, Chicago, Il- 
linois. 

The first form of the Profile Tests of the Grad- 
uate Record Examination was prepared in 1937 
by committees of four or more scholars nomi- 
nated by the deans of the graduate schools of 
Columbia, Harvard, Princeton, and Yale. There 
was widespread critical review of the exercises 
and preliminary tryout of questions prior to 
final selection. In 1945-46, the Profile Tests 
were converted to answer sheet form without 
significant changes in the content of the tests. In 
1949, the content of the tests was reviewed and 
a few of the items revised which had become ob- 
solete, or where responses needed revision be- 
cause of historical events. No recent item analy- 
ses have been made. Since their introduction in 
1937, more than 250,000 students have taken 
these tests in the National Program for Gradu- 
ate School Selection. The tests have been eval- 
uated in a number of studies to determine their 
prognostic value and have been subjected to fac- 
tor analysis research. 

A number of characterizations of the tests 
may be quoted as indicative of their nature and 
functions. 

The questions in total represent a significant sam- 
pling of the knowledge regarded as important in the 
subjects. * The Examination is aimed directly at test- 
ing the student's mastery of the subjects. * The Pro- 
file Tests were designed originally to assist in the ap- 
praisal of the educational backgrounds of prospective 
graduate students, * to portray the student's strengths 
and weaknesses in broad fields of knowledge * heavily 
loaded with a general factor which is highly correlated 
with the Verbal Test, is virtually equivalent to a long 
test of verbal intelligence * Among these purposes are 
evaluation of the outcomes of instruction, general edu- 
cational guidance, and certification for promotion or 
the awarding of degrees. 4 
The last sentence applies to the Graduate Record 
Examination Institutional Testing Program as 
a whole with the implication that the Profile 
Tests contribute to these functions. 

On the undergraduate level an important 
characteristic of the trend in higher education 
toward general education is the extent to which 
it is increasingly recognized that mastery of 
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knowledge is not a significant objective unless 
the student also acquires the intellectual skills 
and abilities requisite to application of knowl- 
edge to thought-provoking and worthwhile prob- 
lems. One might argue, however, that instruc- 
tors on the graduate level, though they may not 
widely accept the objectives of general educa- 
tion, do value mastery of knowledge in a variety 
of subject fields. If this were the case, however, 
should not the possession of knowledge in vari- 
ous fields be a factor indirectly contributing to 
success in graduate study as evaluated by grad- 
uate marks or other ratings? Most of the cor- 
relations between scores on the Profile Tests 
and such marks or ratings are relatively low. 
Six tests are included in the battery of Profile 
"Tests and these tests vary greatly in length: 50 
exercises in physics, 40 exercises in chemistry, 
130 in literature, 9o in fine arts, 175 in blology, 
and 202 in social studies. The 50 multiple an- 
swer exercises in physics are reasonably well, 
distributed over mechanics, electricity, sound, 
and light. Almost half of the exercises involve 
understanding and in a number of cases applica- 
tion of important general principles, but the ma- 
jority of exercises require memory of specific 
facts. The 40 exercises of the chemistry test are 
also of the multiple answer type. Not more than 
one fourth of these exercises pertain to general 
principles, and while the rest of the exercises 
pertain to a variety of facts, certain very im- 
portant topics in the area of elementary college 
chemistry are wholly neglected. The 130 exer- 
cises in the test on literature make this test al- 
most exclusively a test of literary acquaintance, 
although it is a comprehensive one. Fortunately, 
the ability to match titles with authors is rela- 
tively unmeasured. In most cases, the item stem 
pertains to a significant quotation, to an im- 
portant character or event, to the theme of a 
work, or to a characterization of the general 
nature of the writings of a particular author. 
While the content of the exercises ranges widely 
over world literature, there is no direct attempt . 
to evaluate student knowledge of interrelation- 
ships. The 9o items of the test in fine arts are 
largely concerned with the matching of brief 
characterizations or titles of the works of archi- 
tects, painters, sculptors, and composers with the 
names of these persons and with the matching 
of brief definitions of important terms in the 
various artistic fields with the terms. The items 
are well distributed over the various fields, but 
all pertain to relatively specific facts. The 175 
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items of the biology test are varied in character. 
Only the first 42 involve matching of definitions 
with important biological terms or brief char- 
acterizations with the names of great biologists. 
While many of the remaining exercises require 
nothing more than knowledge of facts, the ma- 
jority of the exercises require broad understand- 
ings and applications of general principles. Cer- 
tain of the series of exercises in the later parts 
of this test contain items requiring knowledge of 
interrelationships and for the examinees should 
present novel thought-provoking problems. In 
the opinion of this reviewer, this is the best of 
the Profile Tests. The 202 items of the social 
studies test range rather widely over world his- 
tory and to a much smaller extent over economics 
and political science. Over half the items require 
matching of characterizations of persons with 
their nàmes, brief definitions with terms, and the 
like. As in the biology test, but to a smaller ex- 
tent, there are series of related items requiring 
discrimination. 

Tn general, the items of all of the Profile Tests 
are satisfactory with respect to form. An occa- 
sional exercise contains one or two distractors 
which seem too obviously incorrect, a very few 
exercises contain errors of fact, and a very small 
number could be improved by providing a more 
significant or accurate correct answer. One com- 
mendable feature of the examination is the pro- 
portion of items for which the alternatives are 
greater than five. 

Several types of reliability coefficients are re- 
ported for each of the tests. Those computed by 
means of the Kuder-Richardson formula 21 are 
as follows : physics .87, chemistry .88, literature 
.94, fine arts .95, biology .96, and social studies 
.96. The test-retest reliability coefficients for an 
interval of one year are: physics .87, chemistry 
-89, literature .90, fine arts .90, biology .89, and 
social studies .97. With the possible exception 
of those for physics and chemistry, these are 
highly satisfactory reliability coefficients. (Ac- 
tually, in terms of numbers of exercises the 
physics and chemistry tests are as reliable as 
the others.) In Lannholm and Schrader's sum- 
mary (3) of the various studies which have 
been made of the predictive efficiency of the 
Graduate Record Examinations, numerous va- 
lidity coefficients are reported where the criteria 
were graduate course marks, average marks, or 
ratings. In some cases the predictor was an 
evaluation of Profile Test scores rather than the 
Scores on individual tests. A tally of 30 first- 
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order coefficients relevant to the present tests 
reveals 1 coefficient above .70, 3 between .60 and 
.69, 9 between .50 and .59, 7 between .40 and 
.49, 6 between .30 and .39, 3 between .20 and 
.29, and 1 between .10 and .19. The median of 
these coefficients falls between .40 and .49. In 
general, the effectiveness of prediction is en- 
hanced when the scores, or evaluation of scores, 
on the Profile Tests is combined with evaluation 
of previous scholastic record. There is some evi- 
dence that the Profile Tests are more useful in 
identifying relatively weak candidates than in 
predicting the degree of success. Of course, in 
appraising the data of such validity studies, one 
should recognize that the fallibility, restricted 
validity, and limited range of graduate marks, 
are factors contributing to the lowering of the 
validity coefficients. The true predictive effi- 
ciency of the tests is probably greater than is 
indicated by the coefficients reported. 

Comprehensive and varied norm data are re- 
ported for the Profile Tests for both advanced 
undergraduate and graduate students. These 
data are classified according to sex, geographical 
regions, and major fields of specialization. This 
reviewer is inclined to raise only one question 
about the norms, or better, with respect to the 
use of scaled scores. For the group participating 
in the original standardization, a highly select 
group, the means and standard deviations of 
the scaled scores on each test were respectively 
500 and тоо. Now the means and standard de- 
viations of more recently tested and more repre- 
sentative groups vary from test to test. Doesn't 
this imply that the scaled scores are not com- 
parable from test to test? Of course, interpreta- 
tions can be made in terms of percentile ranks 
and may always have to be made in terms of 
such measures for the various subgroups recog- 
nized in the standardization data, but would it 
not be better to use a system of scaled scores 
based on data pertaining to each subgroup, ог 
on the data pertaining to the total sample used 
in obtaining norms? 

The Statistical Analysis Department of the 
Educational Testing Service has commented as 
follows with respect to the points made in the 
preceding paragraph. “Maintaining the original 
scale is desirable so that studies based on the 
original scale will continue to hold for the new 
scale. Thus, even though the scale no longer 
corresponds to the norms, it has value in tying 
in with earlier forms of the test. It is not claimed 
that the present norms are based on a repre- 
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sentative sample, but that they are descriptive 
statistics based on users." The first comment 
seems a good reason for retaining the original 
scale, except for the possibility that re-scaling, 
through enhancing the comparability of the 
scaled scores, might promote easier interpreta- 
tion on the part of the users of the tests. It 
seems reasonable to suppose also that the more 
recent and comprehensive data are more repre- 
sentative than the original data and, in the 
opinion of this reviewer, are a more legitimate 
basis for scaling. 

In preparing this review, the writer studied 
each of the new three-hour Advanced Tests of 
the Graduate Record Examination. Each of 
these tests seems far superior to the Profile Tests 
in face validity with respect to intellectual skills 
and abilities transcending knowledge. One can 
hypothesize that the current Profile Tests do 
not significantly enhance the predictive effi- 
ciency of the Graduate Record Examination 
where each candidate for admission to a gradu- 
ate school takes the Aptitude Test and the Ad- 
vanced Test in his field of specialization. On the 
other hand, for the sake of guidance, for eval- 
uation of undergraduate instruction, and for 
screening from graduate schools those who do 
not acquire in some measure knowledge and in- 
tellectual skills and abilities which range more 
widely than a single subject field, this reviewer 
would not exclude some general testing in place 
of the current Profile Tests. This could require 
a revision of the present tests with a maximizing 
of exercises involving applications of principles 
and understandings of relationships, or it could 
mean substitution of the Tests of General Edu- 
cation. In the long run, revision of the kind just 
mentioned, or substitution of the T'ests of Gen- 
eral Education, could enhance the predictive ef- 
ficiency of the battery—when graduate faculties 
more universally come to value the ability to 
apply knowledge learned in various fields, or the 
attainment of the objectives of general education. 


[11] 

X*Graduate Record Examinations Tests of Gen- 
eral Education. Grades 13-16; 1946-51; available 
only in the Institutional Testing Program of Graduate 
Record Examinations (see 527); 9 scores: general 
mathematics, physical science, biological science, social 
studies, literature, arts, effectiveness of expression, vo- 
cabulary, general education index; 480(540) minutes 
in two sessions; Educational Testing Service. * 


REFERENCES 
т. Heston, Josern C. “Educational Growth as Shown by 
Retests on the Graduate Record Examination.” Ed & Psychol 
MS 10:367-70 au 'so. * (PA 25:3402) Addendum: 11:532 
51. 
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*The Gray-Votaw-Rogers General Achievement 
"Tests. Grades 1-3, 4-6, 5-9, 7-9; 1934-51; a revision 
of The Gray-Votaw General Achievement Tests (see 
3:9); IBM for grades 5-9; 2 editions; cash orders 
postpaid; Hob Gray, David F. Votaw, and J. Lloyd 
Rogers; Steck Co. * 
a) [COMPLETE EDITION.] Grades 1-3, 4-6, 7-9; 3 lev- 
els; manuals (’48) ; $3 per 30 of any one level; 25¢ per 
specimen set of any one level. 

1) Primary. Grades 1-3; 1934-51; 6 scores: reading 

comprehension, reading vocabulary, spelling, arith- 

metic reasoning, arithmetic computation, total; 

Forms Q (748), R (48), S (51), Т (51); (55-70) 

minutes. 

2) Intermediate. Grades 4-6; 1934-50; ІІ scores: 

same as Primary level, plus elementary science, lan- 

guage, literature, social studies, health and safety; 

Forms Q (48), К (48), S (50), T (^50); (140- 

150) minutes. 

3) Advanced. Grades 7-9; 1934-50; II scores: same 

as Intermediate level; Forms О (48), R (48), S 

C50), Т (50): (140-150) minutes. 
b) ABBREVIATED EDITION. Grades 5-9; 7 scores: ele- 
mentary science, social studies, knowledge of litera- 
ture, choice of words, reading, arithmetic, total; IBM ; 
Forms U (°51), V (51), W (51), X (751) ; manual 
(51); $2.75 per 30; separate answer sheets may Бе. 
used; $1.25 per 50 answer sheets; $1.75 per 50 IBM 
answer sheets; 30¢ per set of stencils for machine scor- 
ing of answer sheets; 15¢ per stencil for hand scoring 
of answer sheets; $1.50 per 100 profile charts; 25¢ per 
specimen set; 73(80) minutes. 

REFERENCES 
т. See 40:1187. 
2-4. See 3:9. 


OLIVER Е. ANDERHALTER, Director,eBureau of 
Institutional Research, St. Louis University, St. 
Louis, Missouri. 

I. DESCRIPTION OF TESTS. A. AREAS COVERED 
AND PURPOSE. The battery of tests consists of a 
primary test, an intermediate test and an ad- 
vanced test, each with four forms, intended to 
measure achievement in a number of subject 
matter areas in grades 1 through 9. The primary 
test gives scores in reading comprehension, read- 
ing vocabulary, spelling, arithmetic reasoning, 
arithmetic computation, and total ; while the in- 
termediate and advanced tests give scores in 
these same areas in addition to elementary sci- 
ence, language, literature, social studies, and 
health and safety. The tests are designed for 
“diagnostic and supervisory purposes at any 
time during the school year." As specific appli- 
cations of the tests, the authors indicate their 
use for measuring the results of remedial pro- 
grams, for determining promotions, for provid- 
ing information to the counselor in the guid- 
ance program, and for the sectioning of pupils. 

The intended purposes of the tests are, as a 
whole, consistent with accepted applications of 
achievement batteries, although it should be rec- 
ognized that objective test results cannot be the 
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sole determiners of such things as promotions. 
There may be some question concerning such 
blanket statements regarding the use of the test 
as: "sectionizing of pupils may be made safely 
on the basis of the tests." Of special note is the 
effort the authors have made in the directions 
to insure against such misuses of the tests as 
drilling classes on the items and teaching for the 
tests. 

B. CONTENT OF THE AREAS. The arithmetic 
computation and arithmetic reasoning sections 
follow the conventional pattern for measure- 
ment of these traits, with the arithmetic compu- 
tation consisting of problems in the basic arith- 
metic processes and the arithmetic reasoning 
section consisting of word problems. The method 
of requiring filling in of answers rather than 
selecting from alternate answers has disadvan- 
tages in so far as scoring is concerned but prob- 
ably adds to the reliability of the scores. 

The vocabulary section consists of an incom- 
plete sentence and several alternates from which 
to select the word to complete the sentence. In 
some cases these sentences call for strict defini- 
tions, such as: “Beautiful means brown red 


old pretty," while in other instances the sen- . 


tence calls for application, such as: “We taste 
with ours fingers toes tongues ears.” This 
latter feature of the vocabulary section appears 
to be extremely realistic. 

The science, literature, social studies, and 
health and safety sections of the two top level 
batteries are conventional multiple choice items 
covering pertinent subject matter within the 
fields. Although the reviewer’s familiarity with 
the content in several of these areas is limited, 
the material covered appears to adequately sam- 
ple the fields. i 

The language section of the battery is similar 
to the language section of the Stanford Achieve- 
ment Test in which the student selects one of 
two possible words within a sentence to make 
the sentence grammatically correct; and the 
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in the primary battery in which the word is 
simply used) and then repeats the word. The 
use of the sentence defining the word probably 


overcomes the difficulty of the student's misun-- 


derstanding the pronounced word which is in- 
herent in dictated spelling tests; but does not 
overcome the possibility of clues to the correct 
spelling through the manner in which the word 
is pronounced. 

C. GENERAL FORMAT. The general format of 
the test is of a high standard in all levels of the 
test. In the primary test, the lettering is suf- 
ficiently large and well enough spaced to insure 
clarity to students at this level. The paper ap- 
pears to be of good quality, and the gloss effect 
is negligible. 

II. DIRECTIONS FOR ADMINISTRATION AND 
Ѕсовіхс. The directions for administering, as 
given, should be clear to both the test administra- 
tor and the student being tested. Pretesting in- 
structions are included to insure a testing situa- 
tion close to normal. 

The scoring directions are included on the 
scoring keys and should allow little chance for 
improper scoring. 

III. INTERPRETATIVE INFORMATION. A. NORMS 


“AVAILABLE. The norms consist of age and grade 


equivalents for each part score, along with per- 
centile ranks for the part scores. In addition, 
arithmetic means and standard deviations are 
presented for each part score at each grade 
level. A unique feature of the test is the inclusion 
of such means and standard deviations for 
“high” and “low” groups at each age and grade 


"level. The stated purpose of such norms is to 


indicate the variability which occurs at each 
grade level and consequently to “lessen the 
temptation to ‘double’ promote” students. While 
such norms are likely to emphasize the varia- 
bility within each group, much the same informa- 
tion could be obtained the percentile norms. 

The population upon which the norms were 
based consisted of 2,160 pupils “selected to 


reading comprehension section is similar to parts ‘sample the school population." No information 
of the Iowa Silent Reading Test in which the is given concerning the manner in which this 


student first reads a paragraph (or group of 
short descriptive sentences in the primary bat- 
tery) and then answers questions, of a multiple 
choice type, concerning the content in the para- 
graph. 

The spelling test calls for the test proctor to 
read the word to be spelled, but is somewhat 
unique in that the proctor follows the word with 
a sentence in which the word is defined (except 


sample was obtained, the geographic breadth of 
the sample, etc. 

Of special note are the facts : first, that the au- 
thors either directly administered the tests us 
for the collection of norms, or directly super- 
vised the administration—with no tests used for 
this purpose being administered by teachers 1n 
the schools; and second, that an 18-month span 
was used to define students in age for grade 
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rather than the more conventional 12-month 
span. The direct supervision of the test adminis- 
tration of tests used for normative purposes is 
extremely commendable and should allow more 
confidence in such results. The use of the 18- 
month span for defining students in age, does, 
in this reviewer's opinion, come closer to ap- 
"proximating the actual school situation than does 
the use of the 12-month span. 

B. AIDS TO INTERPRETATION. Ап individual 
profile sheet is provided on the cover of each 
test. This profile allows comparing the student's 
scores with his own average grade, with the 
average of his class, and with the age or grade 
norms. The manual gives adequate directions 
for the formation of the profile and should pro- 
mote desirable use of such profiles through an 
interpreted example. 

The manual also gives desirable assistance 
in the use of the test results for class analysis. 
Among other assistance given, the manual de- 
tails the procedures for the construction of fre- 
quency distributions, for computation of means 
and standard deviations, and for evaluating the 
differences between means made by different 
groups. The completeness of the directions for 
the use of both the individual profile and the 
class analysis record is among the best features 
of the test. 

IV. VariprTv. Claims to validity rest prima- 
rily in the manner in which the tests were con- 
structed. The authors state that the items in the 
original form were based upon an analysis of 
courses of study and texts, an analysis of teach- 
ers outlines and tests, the opinions of expert 
supervisors, and on analyses of various spelling 
and other scales. They go on to state that the 
items were given an experimental tryout, and 
only those items that showed an increase in the 
number of correct responses through successive 
grades were selected for the final tests, to be М 
divided into е four forms. 

While the method of selecting original items 
is beyond criticism, the manual falls somew 
short in indicating the scope of the materials 
analyzed, particularly in so far as prevalence of 
use in different geographic areas is concerned. 

Further statistical analyses of the items and 
of the test as a whole would add to the evidence 
presented for validity. While the method of se- 
lecting items insures discrimination between age 
groups, the use of other conventional types of 
item analysis would have provided information 
concerning the item discrimination within the 


age groups. In addition, correlation studies in- 
volving other tests of similar content areas would 
have provided added assurance of the value of 
the tests. 

V. Revrasiity. Split half reliability coeffi- 
cients are presented for each of the part scores 
in each level of the test. No data are presented 
as to which form (or forms) were used in the 
computation of these coefficients. While the au- 
thors mention that reliability coefficients were 
also computed by the alternate forms method, no 
such coefficients are reported in the manual. 

The split half coefficients reported were all 
based upon heterogeneous groups (the coeffi- 
cients for the intermediate tests are for grades 
4-6 and the coefficients for the advanced tests 
are for grades 7-9), which does not give a clear 
picture of the reliability of the tests as used in 
practice. Reporting of reliability coefficients for 
single grade levels would be much more desirable 
—as would the reporting of standard deviations 
for single grade levels. 

There is some question as to the correctness 
of using split half reliability coefficients for these 
tests. In a trial administration of the language 
section in the intermediate battery and the arith- 
metic computation section in the advanced bat- 
tery, this reviewer found that a majority of the 
students were unable to finish in the required 
time limits—but successfully answered a num- 
ber of items in additional time given. It would 
appear that speed might be a contributing factor 
toa sufficient extent to make split half coefficients 
too high. Reporting of alternate forms coeffi- 
cients would be a desirable safeguard. 

For each reliability coefficient given, the 
manual indicates a “probable error of estimated 
true score." There is some doubt as to what 
is meant by this probable error as given. A 
number of checks were applied to the standard 
deviations and reliability coefficients presented. 
The conventional and desirable probable error 
of measurement (indicating the error involved 
in using the observed score as the true score— 
which is what is done in practice) was found to 
give results somewhat larger than the values 
reported in all cases; and the probable error of 
predicting the true score from observed score 
by means of regression equations (which is of 
little significance in ordinary uses of tests) gives 
results sometimes smaller and sometimes larger 
than those presented. The practice of indicat- 
ing a probable error of measurement on the in- 
dividual profile is commendable, 
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VI. EguivALENCE oF Forms. The equiva- 
lence of forms is based upon the authors' state- 
ment that the means and standard deviations of 
scores on each form are equal. While the actual 
values for each form are not presented in the 
manual, the authors state that the average 
difference between scores on equivalent forms 
given on consecutive days is one tenth of a point. 

While the equality of means and standard de- 
viations is essential for equivalence of forms, the 
errors of measurement could still be unequal and 
the tests would not be actually "equivalent." 
Gulliksen has shown that equivalence is assured 
if the means, standard deviations and intercor- 
relations are equal, and provides a technique 
for statistically testing the equality of these three 
parameters simultaneously. The presentation of 
the intercorrelations of forms and the application 
of such a technique would consequently add to 
the assurance of equivalence. 

VII. OvERALL EvALUATION. In many as- 
pects, the test compares favorably with similar 
tests in the field. A number of the procedures 
used, such as the direct administration or super- 
vision of the tests used for the analyses, are 
highly commendable. Primary shortcomings of 
the test rests in the lack of, or lack of infor- 
mation concerning, certain desirable statistical 
analyses, along with insufficient reporting of in- 
formation such as the normative population, and 
the scope of texts and courses of studies ana- 
lyzed. 


For a review by Roland L. Beck, see 3:9; for 
reviews by Joseph E. Moore and C. C. Ross of 
the Advanced Test, see 40:1187. 


[13] 
*The Harlow Achievement Tests for Texas, 
Grades 1, 2, 3, 4, 5, б, 7, 8; 1942-50; 8 levels; manual 
for grades 3-8 ('50) ; no data on reliability; specimen 
set not available; postage extra; Harlow Publishing 
Corporation. * 
а) FIRST GRADE. 1942; title on test booklet is The Har- 
low Achievement Tests for First Grade; 7 scores: 
word recognition, sentence meaning, nature study, 
health, spelling, number work, total; Form G; manual 
['42]; 90€ per 10; 20-90, 606 per 10; (90-110) min- 
utes. 
b) SECOND GRADE. 1942; title on test booklet is The 
Harlow Achievement Tests for Second Grade; 
scores: meaning of words, paragraph meaning, health 


spelling, arithmetic, total; Form G; manual ['42] ; 90d 2 


рег 10; 20-00, 70€ рег 10; (90-110) minutes. 

C) THIRD GRADE. 1949; 9 scores: arithmetic computa- 
tion, arithmetic reasoning, meaning of words, meaning 
of paragraphs, correct use of words, spelling, geogra- 
phy, science, total; Form L Revised; 90¢ per 10; 20- 
90, 70€ per 10; (120) minutes. 

d) FOURTH GRADE. I949; IO scores: same as Third 


Grade plus health; Form L Revised; 90¢ per 10; 20- 
90, 806 per 10; (120) minutes. 

@) FIFTH GRADE, 1949; IO scores: same as Third Grade 
plus health; Form L Revised; $1.10 per 10; 20-90, $1 
per 10; (155) minutes. 

f) SIXTH GRADE. 1949; 11 scores: same as Third Grade 
plus hygiene, United States history; Form L Revised; 
$1.10 per 10; 20-90, $1 per 10; (170) minutes. 

0) SEVENTH GRADE. 1049; II scores: same as Third 
Grade plus hygiene, Texas history; Form L Revised; 
$1.60 per 10; 20-90, $1.30 per 10; (150) minutes. 

А) EIGHTH GRADE. 1049; I2 scores: same as Third 
Grade plus hygiene, United States history, citizenship; 
Form L Revised; $1.60 per 10; 20-90, $1.30 per 10; 
(190) minutes. 


[14] 

*The Harlow Battery Achievement Test. Grades 
1-3, 4-6, 4-8, 7-9; 1945-47 ; 4 levels; no data on relia- 
bility ; мы! per manual (manual free with orders of 30 
or more); specimen set not available; postage extra; 
Harlow Publishing Corporation. * 
а) PRIMARY BATTERY. Grades 1-3; 1945-47; 7 scores: 
English, spelling, reading comprehension, word mean- 
ing, arithmetic computation, arithmetic reasoning, to- 
tal; Forms I (45), II (45), III (47) ; Form I man- 
ual ['45], Form II manual ['45], Form III manual 
[47] ; $1.10 per 10; 115 (135) minutes. 
b) INTERMEDIATE GRADE BATTERY. Grades 4-6; 1945; 
IO scores: same as Primary Battery plus history, geog- 
raphy, hygiene; Forms І, II, III; manual for each 
form [’45] ; $1.10 per 10; (125) minutes. 
C) UPPER GRADE BATTERY. Grades 4-8; 1945-46; 10 
scores: same as Intermediate Grade Battery; Forms I 
(46), II (45); Form III out of print; Form I man- 
ual [46], Form II manual [45] ; $1.10 per 10; (210) 
minutes. 
d) JUNIOR HIGH BATTERY. Grades 7-9; 1945-47; 10 
scores: same as Intermediate Grade Battery; Forms I 
(745), П (45), ITI (47) ; Form I manual [45], Form 

manual [45], Form ПІ manual [’47] ; $1.60 per 10; 
(140) minutes. 


[15] 
*Iowa Every-Pupil Tests of Basic Skills. 
a) NEW EDITION. Grades 3-5, 5-9; 1940-47; IBM for 
grades 5-9; 2 levels; 4 tests are listed separately) ; 
Forms L (^40), М ('41), N (42), О (^43) ; battery 
manual ('47) ; $6 per 25 sets of complete Elementary 
Battery (grades 3-5); $6.40 per 25 sets of complete 
Advanced Battery (grades 5-9); $1.60 per 25 of any 
one test in Elementary Battery; $1.75 per 25 of any 
one test in Advanced Battery; 40 per 25 cumulative 
record forms; 456 per general manual; 336 per speci- 
men set of Elementary Battery; 33% per specimen set 
of Advanced Battery; separate answer sheets may be 
used with Advanced Battery; postage extra; Elemen- 
tary Battery: 196(230) minutes; Advanced Battery: 
263 or 271(325) minutes; H. F. Spitzer in collabora- 
tion with Ernest Horn, Maude McBroom, H. A. 
Greene, and E. F. Lindquist; Houghton Mifflin Co. * 
I) Test A, Silent Reading Comprehension. 3 scores: 
reading comprehension, vocabulary, total; 636 per 
25 IBM answer sheets; 20¢ per stencil for machine 
scoring of answer sheets; Elementary Battery: 
46(50) minutes; Advanced Battery: 68(85) minutes. 
2) Test B, Work-Study Skills. 6 scores: map read- 
ing, use of references, use of index, use of dictionary, 
alphabetizing (Elementary Battery) or graphing 
(Advanced Battery), total; 636 per 25 IBM answer 
sheets; 40€ per set of stencils for machine scoring 0: 
answer sheets; Elementary Battery: 47(55) min- 
utes; Advanced Battery: 77(90) minutes. 
3) Test C, Basic Language Skills. 5 (Advanced Bat- 
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tery) or 6 (Elementary Battery) scores: punctua- 
tion, capitalization, usage, spelling, sentence sense 
(Elementary Battery only), total; 756 per 25 IBM 
answer sheets; 40€ per set of stencils for machine 
scoring of answer sheets; Elementary Battery: 
46(60) minutes; Advanced Battery: 55 or 58(70) 
minutes. 
4) Test D, Basic Arithmetic Skills. 4 scores: funda- 
mental knowledge, fundamental operations, prob- 
lems, total; Advanced Battery, Form P ('44) ; IBM 
for Forms O and P only; 636 per 25 IBM answer 
sheets; 40€ per set of stencils for machine scoring of 
answer sheets; Elementary Battery, 57(65) minutes ; 
Advanced Battery, 63 or 68(80) minutes. 
b) SRA EDITION. Grades 5-9; 1949-50; 4 tests in one 
booklet (tests, part scores and time allowances same 
as New Edition) ; Forms I ('49), II ('50) ; separate 
answer sheets must be used; tests rented rather than 
sold; rental service includes scoring, tabulating and 
reporting results; prospectus and price information 
sent upon request; 273(330) minutes; prepared under 
the direction of E. F. Lindquist, with the assistance of 
Ernest Horn, H. F. Spitzer, Maude McBroom, H. A. 
Greene, Robert Ebel, Katherine Hunt, and Julia Peter- 
son; Science Research Associates, Inc. * 
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I. Tug CoMPLETE BATTERIES. Successive 
forms of these tests, developed as a professional 
service for the public schools of Iowa, have been 
available for nationwide use for more than a 
decade. Their present widespread distribution 
is evidence of their continuing popularity among 
teachers and school administrators. 

The tests are designed to measure and eval- 
‚ uate the pupil's functional mastery of a wide 
variety of critical skills involved in reading, 
work-study, language, and arithmetic. A de- 
tailed study of the tests gives the impression that, 
in general, the skills in each of these areas have 
been expertly analyzed and those that are cru- 
cial to total educational growth quite thoroughly 
sampled. 

The complete batteries are longer than most 
other achievement batteries. While from the 
very nature and purposes of the tests, each sub- 
test might be made somewhat longer without 
any considerable overlapping of testing situa- 
tions, the present length would appear to be 
about as great as the practical consideration of 


available testing time will permit. 
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The tests are recommended for use in individ- 
ual diagnosis. Those who insist upon arbitrary 
standards for statistical reliability for tests to 
be used for purposes of individual diagnosis may 
question the suitability for such use of those sub- 
tests whose reliabilities fall consistently in the 
-50's, .60’s, .70’s, and low .80’s. Those who 
recognize the many elements that enter into the 
computation of reliability coefficients and that 
must be taken into consideration in their inter- 
pretation (in this case, notably, the liberal test- 
ing time and the unusual homogeneity of the 
sample groups on whose scores the coefficients 
are based) will not be seriously disturbed about 
these lower coefficients so long as the tests can 
be satisfactorily demonstrated to qualify as ade- 
quate measures of skills whose development they 
consider to be among the goals of their programs 
of instruction. 

THE NEW EDITION. The tests in this edition 
are presented in two batteries, Elementary and 
Advanced, with four separate tests comprising 
each battery: Test A, Test of Silent Reading 
Comprehension; Test B, Test of Work-Study 
Skills; and Test C, Test of Basic Language 
Skills; and Test D, Test of Basic Arithmetic 
Skills. The Elementary Battery, with 15 sub- 
tests, is recommended for use in grades 3-5; 
and the Advanced Battery, with 14 subtests (a 
test of sentence sense included in Test C of the 
Elementary Battery is omitted in the Advanced), 
is recommended for use in grades 5-9. There 
is some question as to the suitability of the Ad- 
vanced Battery for use in grade 5, especially in 
those schools which follow the grade placement 
program of the modern curriculum, It would, 
therefore, seem to be important that the selec- 
tion of the battery for this grade level be made 
only after the tests have been carefully examined 
to determine which battery will be most.suitable 
in terms of the local situation. 

Four forms of the complete batteries are 
currently provided : Forms L, M, N, and O. (In 
addition, Test D, Advanced Form, is available 
in Form P.) Since all forms of each test are 
directly comparable, it is thus possible for the 
administrator who is interested in a continuous 
testing program to give the tests annually over 
a period of eight years without ever repeating 
the same test. Few achievement batteries offer 
opportunities for such an extensive program. 

The comprehensive directions in the Ex- 
aminer's Manual which accompanies each test 
makes the task of administering and scoring the 
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tests a comparatively simple one. In the Ele- 
mentary Battery answers are recorded directly 
in the booklet; scoring is perforce by hand. In 
the Advanced Battery answers are recorded on 
a detachable cover page or on separate hand or 
machine scorable answer sheets. The latter are 
available for all forms of Tests A, B, and C, and 
for Forms О and P of Test D. Convenient scor- 
ing stencils are provided for all forms. 

The format of the Elementary Battery is some- 
what superior to that of the Advanced. The type 
is larger and clearer, and the items are better 
spaced; the pupil is not confronted with the 
cluttered answer sheets which accompany the 
earlier forms of Tests C and D of the Advanced 
Battery. The quality of the paper on which the 
tests are printed is not such as to show them off 
to particularly good advantage. { 

The use of these tests as diagnostic instru- 
ments has been anticipated by the provision in 
the Examiner's Manual of an item-by-item clas- 
sification of the specific skills tested. These tables 
are sufficiently comprehensive to be effectively 
used to determine those aspects of individual 
pupil development and of the whole instructional 
program which need attention. Helpful sugges- 
tions are given for remedial teaching aimed at 
the development of greater proficiency in each 
skill and for the modification of teaching pro- 
cedures in order to achieve better teaching re- 
sults. Test users are reminded that these sug- 
gestions are not presented as universally ap- 
plicable, but must be adapted to individual pupil 
needs and to the local course of study. 

The norms accompanying these tests are prob- 


ably the most extensive provided for achieve- 


ment batteries at the elementary school level. 
For the evaluation of individual achievement 
there are grade norms, percentile norms within 
grade, age-at-grade norms, and chronological 
age norms. In addition, there are special norms 
for school averages, these making possible the 
evaluation of the average score made by a class 
of pupils against norms established specifically 
for such averages rather than against norms of 
individual pupil achievement. The norms are 
based on a total of 196,851 pupils in 350 differ- 
ent school systems. While the majority of the 
schools are located in Iowa, over 60,000 pupils 
were tested in other states. The opinion of the 
authors that while the norms may best be char- 
acterized as Midwestern norms, they are prob- 
ably “аз representative of nationwide public 
school кы. л as any so-called national 


norms that have been established for stand- 
ardized tests" will probably not be challenged. 
The authors caution test users that the norms 
for these tests do not, any more than do the 
norms for other achievement batteries, provide 
“standards” for the individual pupil, class, or 
school, but offer only information concerning 
average achievement in all schools. They further 
caution that since courses of study vary in differ- 
ent systems, the scores on these tests should be 
interpreted in terms of the relationship of the 
test content to the content of the local course of 
study. 

These cautions are included in the extensive 
discussion of considerations in the interpretation 
of test results in general and of results on these 
tests in particular which is included in the Man- 
ual of General Information, a booklet of some 
60 pages which is available to test users who 
desire more detailed information concerning all 
aspects of the program than that given in the 
Examiner’s Manuals. For the test user, ех- 
perienced or inexperienced, this discussion and 
the one on the nature of reliability are highly 
recommended by the reviewer. 

THE SRA EDITION. Since 1949 a “de luxe” 
single booklet edition of the Advanced Battery 
has been available on a rental basis from Science 
Research Associates. Forms I and II are cur- 
rently offered in this edition. The booklets, аї-` 


. tractively printed and bound, are designed for 


multiple use. Separate answer sheets are re- 
quired, a single answer sheet serving for the 
entire battery. This compact answer sheet is 
undoubtedly most convenient for administering 
and scoring purposes, but the answer boxes are 
necessarily so small and so close together that the 
pupil must take particular care to record his 
answers correctly. ' 

The tests in the SRA Edition are similar in 
organization to the tests in the New Edition, but 
they reflect а refinement of test content and 
some improvement in item writing technique 
over the earlier forms. It is of interest to note 
that scores on the tests in the SRA Edition are 
corrected for guessing; this is not true of the 
scores on the tests in the New Edition. 

Under the rental arrangements, the tests are 
administered by local personnel, and scored and 
interpreted by SRA. The test results reported 
include part and total grade equivalent scores 
for individual pupils and class averages for each 
test and subtest. Norms provided for pupil scores 
and school averages (first semester, grades 6- 
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9; second semester, grades 5-9) are based on 
the scores of 48,650 pupils in approximately 
2,000 schools participating in the 1950 Iowa 
program. 

Among the supplementary materials provided 
users of the SRA Edition, many of which repre- 


sent adaptations of materials provided for the 


New Edition, is an interesting profile leaflet 
which is given to the pupil before the testing to 
acquaint him with the nature and purposes of the 
battery, and again after the testing so that he 
can plot his own profile. This would appear to 
have interesting possibilities for helping the in- 
dividual pupil to become aware of his strengths 
and weaknesses in skills covered by the battery, 
and for stimulating him to take greater respon- 
sibility for the maintenance of skills in which 
he is already strong and the improvement of 
skills in which he is deficient. 

IL Tue IwprvipUAL Tests. The separate 
tests making up the batteries are described in 
some detail below. Comments concerning the 
Advanced Forms are equally applicable to the 
New Edition and to the SRA Edition unless 
otherwise specified. 

TEST A, SILENT READING COMPREHENSION. 
This test is divided into two parts. Part 1, 
which concerns general reading comprehension, 
includes items designed to test paragraph com- 
prehension, the ability to grasp and understand 
significant details, the organization of ideas, and 
the ability to appreciate the total meaning of 
selections read. No attempt is made to provide 
any measure of speed of comprehension. Part II, 
concerned with specific word meanings, is de- 
signed to measure the extent of the pupil's read- 
ing vocabulary. The items throughout are mul- 
tiple choice items. Working time is 46 minutes 
for the Elementary Form and 68 minutes for 
the Advanced Form. The test can be adminis- 
tered in a single period or in two shorter periods. 

The items in Part I are based on selections 
representative of different kinds of reading that 
the pupil is required to do in connection with 
his school work. The number and type of items 
vary with the length and content of the selection. 
Selections in the Elementary Form range from 
simple one-paragraph selections followed by 
three to six items to one reasonably difficult full 
page selection made up of several paragraphs 
followed by 15 to 17 items. Seven selections in 
all are included. For the Advanced Form there 
are five reading selections, four of which are of 
full page length, thus allowing for more adequate 


testing of ability to understand and work with 
large units of subject matter than is possible in 
the average reading test. There are as many as 
20 items on some of these selections. 

: The selections themselves provide extremely 
interesting and enjoyable reading. In the Ele- 
mentary Form some story materials are used, 
together with simple expository selections from 
the social studies and science areas; in the Ad- 
vanced Form the materials are limited to in- 
formative selections in these two subject areas. 
The biographical sketches and the historical nar- 
ratives in the Advanced Form are exceptionally 
well chosen. It is to be regretted that there are 
no fictional narratives included in the Advanced 
Form. Certainly the basic skills involved in 
reading fiction are as many and as vital as are 
those involved in reading nonfiction; and the 
ability to read fiction skillfully is, for many 
pupils, likely to serve as useful a purpose, both 
immediately and ultimately, as ability to read 
study-type materials. 

From the point of view of vocabulary and 
concepts involved, the selections seem appro- 
priate for the grade levels concerned. As in any 
reading test designed for a spread of three or four 
grades, the poorest readers at the lowest grade 
level served will find most of the materials ex- 
tremely difficult, and the best readers at the high- 
est grade level served will find most of the mate- 
rials comparatively simple. However, at least one 
selection in each form is simple enough so that it 
should present few hazards to the least able 
reader, and similarly, at least one selection is 
sufficiently difficult so that it should challenge 
the most able reader. The range of difficulty of 
the “in-between” selections is probably about 
as satisfactory as can be obtained in a test of 
this type. 

The items cover a variety of skills considered 
to be significant components of ability to com- 
prehend reading materials of the type presented. 
Unfortunately, strict comparability of items 
from form to form is not possible since ready- 
made selections form the basis for the question- 
ing; nevertheless, the overall coverage within 
each form seems to be adequate. The items them- 
selves are skillfully designed so that they re- 
quire genuine understanding of meaning rather 
than ability to “hunt and find” matching words 
and phrases. Occasionally, not often, an item 
contains technical vocabulary difficult enough to 
confuse pupils who might readily recognize the 
correct response more simply stated. For ex- 


1$ THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 36 


ample, in Item 34, Elementary Form N, which 
asks “Why was the hole in the ground [for a 
grass fire] made longer than it was wide?" 
fourth or fifth grade pupils who might reply “So 
that air could reach the fire" might not under- 
stand the more technical “To provide a draft 
for the fire.” 

The words in the vocabulary section, 40 in 
the Elementary Form and 50 in the Advanced 
Form, are not presented in isolation but in 
phrases and sentences, a commendable feature. 
As would be expected, some of the words in 
the Elementary Form are beyond the reading 
vocabulary level of most third grade pupils, but, 
except in Form M, the number is not too large. 
In Forms L, N, and P, 50 per cent or more of the 
words in this part appear in Stone’s Graded 
Vocabulary for Primary Reading, a good repre- 
sentation indeed. A previous reviewer has rec- 
ommended making the vocabulary section a tool 
in the interpretation of the comprehension re- 
sults by selecting the words for the section from 
among the harder words in the selections in 
Part I. This would appear to have interesting 
possibilities. 

The directions call for selecting the word that 
has the same meaning or most nearly the same 
meaning as the italicized word in the phrase or 
sentence. For the most part, the correct re- 
sponses are aptly selected. In some cases, how- 
ever, the correct response is a considerably more 
difficult word than the stem word, as, for ex- 
ample, “A vain attempt"—"futile" (Advanced 
Form L, Item 14; Advanced Form N, Item 47) 
and “Persuade him"—"induce" (Advanced 
Form O, Item 26). In some cases, also, the 
correct response represents somewhat less than 
the same meaning or nearly the same meaning 
as the stem word. “A head bandage” is not al- 
ways a “wrap” (Elementary Form L, Item 23) ; 
“A temporary absence" is not necessarily a 
“short” one (Advanced Form N, Item 23); 
and for “Horse and wagon,” "vehicle" might 
read more precisely “a kind of vehicle” (Ele- 
mentary Form O, Item 36)—except that “ve- 
hicle" is too difficult a choice anyway. 

Because of the high interrelationships among 
the several abilities involved in general reading 
comprehension, only a single score for Part I 
is provided. However, in the section of the Ex- 
aminer's Manual devoted to the "Classification 
of Skills and Suggestions for Remedial Work," 
the items in this part are identified in terms of 
the several skills they purport to measure. While 


the classification is admittedly somewhat sub- 
jective and the suggestions for remedial work 
in some cases overlapping, much information of 
practical value is provided here in identifying 
the skills in which the pupil is weakest and in 
planning instruction aimed at the improvement 
of those skills. Suggestions for the systematic 
improvement of vocabulary are also given. 

TEST B, TEST OF WORK STUDY SKILLS. This test 
is organized into five parts. Both the Elementary 
Form and the Advanced Form are concerned 
with the skills involved in map reading, the use 
of references, the use of an index, and the use of 
the dictionary. In addition, the Elementary Form 
includes a test of alphabetization, and the Ad- 
vanced Form a test on the use of graphs, charts, 
and tables. The Elementary Form requires 47 
minutes and the Advanced Form 77 minutes of 
actual working time. The test may be given 
either in one long testing period or in two shorter 
testing periods. 

This test is of particular interest not only be- 
cause it provides a comprehensive measure of 
abilities which are of significance in effective 
study but also because it is the only test which 
measures some of these abilities. It is of particu- 
lar importance because the development of these 
vital skills is often neglected in the instructional 
program on the assumption that the pupil will 
acquire them incidentally. 

The purposes the test is designed to serve are 
sound, the coverage of the test is most compre- 
hensive, and the individual items are ingeniously 
constructed. The most serious questions which 
occur to this reviewer are concerned not with 
the test itself but with the extent of its suitability 
for use in the lowest grade for which each form 
is intended. It is difficult to see how most third 
grade pupils could fail to be greatly frustrated 
on all parts of the Elementary Form. They have 
had little or no experience in map reading and 
very meager experience with reference books; 
they have had no formal training in the use of 
the index or even in the use of the dictionary; | 
and what practice they have been given in alpha- 
betization has been limited principally to simple 
initial-letter exercises. In the Advanced Form à 
majority of the items on the reading of pattern 
maps and graphs, charts, and tables require the 
application of more mature skills than the aver- 
age fifth grade pupil can reasonably be expected 
to have developed. It would, therefore, seem 
likely that in most situations the Elementary 
Form of the test should provide a more mean- 
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ingful measure than the Advanced Form for 
pupils at this grade level. 

The map reading section, Part I, is most in- 
triguing. Both forms include items designed to 
test ability to visualize a represented area and 
to read directions correctly, to recognize and 
interpret standard map symbols, to identify and 
visualize with the use of a key symbols that are 
not standardized, and to apply knowledge of 
physical facts in the interpretation of map situa- 
tions. The Advanced Form includes also items 
designed to test ability to get facts and informa- 
tion from parallels and meridians and to read 
and interpret facts from pattern maps. All of 
the maps depict imaginary lands, insuring in 
their reading and interpretation the application 
of geographical knowledge and understanding 
rather than casual information. It is interesting 
to note that no global maps for the Air Age are 
as yet included. 

The maps in the Elementary Form are neatly 
drawn, and they are clearly readable. About one 
fourth of the items at this level involve simple 
deductive reasoning based upon information 
supplied by the maps. The maps in the Advanced 
Form are extremely complicated, and their 
proper reading and interpretation demand con- 
siderable attention to minute detail. The most 
difficult items at this level are those requiring 
the pupil to obtain information from several 
pattern maps showing population distribution, 
elevation, rainfall, railroads, crop areas, min- 
eral resources, etc., and to combine that informa- 
tion with general geographical knowledge in 
reaching conclusions as to how man probably 
lives in a particular area. Items of this type can 
be very useful so long as it is kept in mind that 
they involve considerably more than basic map 
reading skills. For example, the question that 
asks, “Which of these states probably leads in the 
production of hogs?" (Form L, Item 28; and 
Form М, Item 28) cannot be answered intelli- 
gently by the pupil who can read the map sym- 
bol for "corn" but does not know that corn 
makes up a substantial part of the hog's diet. 

Part II covers the use of the references usu- 
ally found in school libraries. General references 
like atlases and maps, dictionaries, encyclo- 
pedias, and yearbooks, and specific references 
such as textbooks, government publications, 
and magazines are included. The items are in- 
teresting and realistic, and they have more va- 
riety than one would expect in view of the fact 
that they must be based on a rather small selec- 


tion of references. The value of this section to 
the individual school will, of course, depend 
more than any other section on the extent of the 
materials accessible to the pupils for practical 
reference purposes. 

The sections on the use of the index, Part III, 
and the use of the dictionary, Part IV, are excel- 
lent. In the former the ability to alphabetize and 
to follow cross references are among the skills 
measured ; in the latter the ability to alphabetize, 
to interpret the symbols used, to use the pro- 
nunciation key, and to select meanings to em- 
ploy in a given context are all included. Both 
parts require the pupil to read carefully and to 
use good judgment. Except for a few uncom- 
mon words like “nyctalopic,” “praenomen,” and 
"gynarchy" on the dictionary page for the Ad- 
vanced Form, the entries are well chosen, and 
the items are simple and practical. For the in- 
dex entries involving subtopics, the organization 
followed in the SRA editions rather than that 
followed in the earlier editions is favored. For 
example, in this entry: “Gold, 108-109; mining, 
114; use of, 112; value of, 110” (from Ele- 
mentary Form O), there is reference to a vague 
"something" about gold on pages 108—109, but 
there is no indication as to what that “ѕоте- 
thing" is. A more useful arrangement is this 
one: "Inflation: during the Revolution, 106; 
in Jackson's time, 355; and the farmers, 677— 
681; problems of, 750-751” (from SRA Form 
I), with page references all identified in terms 
of the specific subtopics covered. 

Part V in the Elementary Form is limited to 
exercises involving simple alphabetizing and the 
use of guide words. In the Advanced Form this 
part is devoted to the reading of graphs, charts, 
and tables. Bar graphs, line graphs, circle graphs, 
and pictographs are all represented. Items are 
designed to measure ability to recognize what 
information can be obtained from a particular 
graph or table; to read amounts and to com- 
pare values shown; to grasp outstanding facts 
portrayed ; and to determine trends and under- 
lying relationships through correct interpreta- 
tion of the data presented. The situations pic- 
tured give information of the type that the pupil 
will have need to read and interpret in real life. 

The format of the Elementary Form, designed 
to require marking of responses directly in the 
test booklet, is simple and attractive. The format 
of the Advanced Form, set up to require the 
use of an answer sheet, is more complicated. To - 
allow for the use of an answer sheet printed on 
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the inside of the cover page, the items-are so 
arranged that the pupil must start his map read- 
ing work on the last page of the test booklet and 
then go back to the first page. This format is 


separate answer sheet is required. 


discussion of the various skills measured by the 
test and extensive suggestions for developmental 
and remedial instruction. The manual empha- 
sizes the fact that these skills are most clearly 
and understandingly developed by use in the 
actual preparation of lessons in the various sub- 
jects and not by isolated drill. 

TEST C, TEST OF BASIC LANGUAGE SKILLS. This 
test is designed to measure the pupil’s ability to 
apply his knowledge of essential skills in the 
mechanics of correct expression in recognizing 
what is right or wrong in writing of the type he 
is likely to be called upon to do. Exercises in- 
volving punctuation, capitalization, usage, and 
spelling are included in both the Elementary 
and Advanced Forms, with a test of sentence 
sense added in the Elementary Form. The work- 
ing time required is 47 minutes for the Elemen- 
tary Form and 55 to 58 minutes for the Ad- 
vanced Form. Again, the test may be admin- 
istered in one or in two periods. 

Insofar as fulfillment of the purposes set forth 
by the authors is possible in an objective test, 
that fulfillment has, in the opinion of the re- 
viewer, been pretty largely accomplished. The 
situations have been skillfully designed to ap- 
proximate actual writing experiences, and spe- 
cific points of emphasis have, for the most part, 
been judiciously selected. 

The punctuation and capitalization sections, 
Parts I and II, are unusually comprehensive. 
They do not appear to involve any especially 
controversial items, certainly an achievement 
when one considers the total number of items 
in all forms. 

The usage section, Part III, is made up of 
a number of short sentences, in each of which 
the pupil is asked to choose from two words or 
phrases the one which correctly completes the 
sentence. On the whole, the items in this part 
reflect the newer philosophy with regard to 
which usage elements should be emphasized, 
and at which grade levels certain common errors 
should be attacked for elimination. However, 
some traces of older language practice still per- 
sist. Several items in the Elementary Form are 
concerned with usages not now generally treated 
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until. junior or senior high school, and several 
in the Advanced Form with usages not now 


-treated-until senior high school, if at all. Mind- 
.. ful of the necessity for challenging the most able 
not followed in the SRA edition, for which'a `` 
; ,tlie reviewer is,not inclined to be so critical of 
The Examiner’s Manual contains a detailed 


pupils in the highest grade served by each form, 


the inclusion of items involving usages with 


"which pupils may not have had formal class- 


room experience, as of the inclusion of items 
involving usages concerning the correctness or 
incorrectness of which there is considerable dis- 
agreement among language authorities. Con- 
sider, for.example, the following items from the 
Advanced: Form: “My ankle doesn’t feel so 
(ей, оой) today". (Form L, Item 41) ; “Will 
you -(Іоап, lend) me your pencil?" (Form M, 
Item 77) ; “(Try to, try and) get it" (Form N, 
Item 22); "Divide it (among, between) us 
three” (Form О, Item 11); “Did you hear 
about (our, us) winning the game?" (SRA 
Form I, Item 78) ; *( Whom, Who) did you call 
on the telephone?" (SRA Form II, Item 67). 
Controversial items like these, for which choices 
formerly considered to be incorrect are now 
considered by a respectable number of language 
authorities to represent "established usage" can 
no longer serve any useful purpose in a usage 
test in which the testee is required to make an 
"either-or" response. . 

In Part IV, the spelling section, the pupil is 
asked to choose the correct spelling from among 
four different spellings of the same word. In the 
Elementary Form the words are presented in 
context; in the Advanced Form they are pre- 
sented in isolation. The examiner who recog- 
nizes the validity of the multiple choice spelling 
item should be impressed with this part ; it will 
not be acceptable to the examiner who believes 
that ability in spelling can be measured only in 
actual writing situations. 

The spelling words are well chosen, and the 
incorrect spellings, representing the errors chil- 
dren аге most likely to make, are most convinc- 
ing. A check of the spelling words against the 
words in Rinsland's A Basic Vocabulary of Ele- 
mentary School Children and in two leading 
spelling series for the elementary grades shows 
that only a small number of the words are beyond 
the classroom or free writing experiences of 
most pupils in the grades for which the test is 
intended; and these words the pupils should be 
able to spell independently on the basis of gen- 
eralizations growing out of their spelling in- 
struction, 


1$ 
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Part V of the Elementary Form is designed | 


to test the ability of the pupil to recognize “good” 


sentences as opposed to “wrong” exercises in 
which such common errors as-these'are герге-.' 


sented: parts of sentences written as complete 


sentences, two sentences written as.a single sen-. 


tence, and parts of sentences added to otherwise 
good sentences where they do not belong. у 

In general, the format of the test is simple and 
attractive, and most of the sections are easy to 
administer and easy to score. A previous re- 
viewer (Ayer, 3: 10) has criticized the punctua- 
tion exercises in the Advanced Form аз be- 
ing somewhat complicated to administer, with 
choices and answer boxes interspersed with the 
text. This format is, fortunately, limited ое 
earlier Forms L, M, and N; in Form O the 
choices are printed on the answer sheet only, 
while in SRA Forms I and II the choices are 
printed in the test booklet in a.column opposite 
the text. The same reviewer has criticized the 
procedure for scoring the punctuation and cap- 
italization sections in the Elementary Form as 
making only partial allowance for overpunctua- 
tion and overcapitalization. (The pupil's free 
responses are recorded directly in the test book- 
let; the scoring stencil exposes only those points 
believed to present likely punctuation and cap- 
italization problems.) While it would be desir- 
able to provide for more accurate scoring, it is 
difficult to see how that could be accomplished 
without making the scoring procedure too un- 
wieldy to be practical in the average testing sit- 
vation. This criticism does not apply to the Ad- 
vanced Form, in which the pupil is given a 
choice of responses at designated points only. 

In the Examiner's Manual the detailed classi- 
fication of items in terms of the punctuation, 
capitalization, and usage skills covered in the 
test is worthy of special mention. For situations 
in which careful diagnosis of specific strengths 
and weaknesses in these areas is planned in antic- 
ipátion of extensive remedial instruction, this 
classification should prove to be a valuable ref- 
erence. Because the spelling of most words does 
not follow definite rules, no attempt is made to 
classify the skills involved in the spelling section ; 
however, brief suggestions are given for de- 
veloping spelling ability. 

TEST D, BASIC ARITHMETIC SKILLS. This test 
is composed of three parts. Part I is concerned 
With vocabulary and fundamental knowledge; 
Part II measures computational skills in the 
four fundamental processes; and Part III tests 


ability to solve verbal problems. Actual working 
time is 57 minutes for the Elementary Form 
“and 63 to 68 minutes for the Advanced Form. 
i Like the other tests in the battery, the test is 
‘so organized that it may be administered in a 
„single long period or in two shorter periods. 

Part I represents a serious attempt to measure 
the basic understandings that the modern arith- 
metic program aims to develop. The pupil is 
given an opportunity to show the extent of his 
technical arithmetic vocabulary and of his abil- 
ity to apply mathematical reasoning when he 
performs certain fundamental operations. Spe- 
cific skills involved include reading and writing 
numbers ; understanding of the number system, 
the common processes, quantitative measures, 
and numerical facts, terms, and symbols; and 
ability to identify common geometric figures, to 
make quantitative estimates, and to compare 
the size of numbers and fractional parts. In spite 
of time limitations, the coverage is comprehen- 
sive. 

The few adverse criticisms that can be directed 
at this part are concerned with specific items 
and item types rather than with the specified 
aims and objectives or the overall organization 
of the part. For one thing, the desirability of 
including even a few items of the strictly defini- 
tion type would seem rather questionable. Abil- 
ity of the pupil to define "area" or to recognize 
a “denominate number” as such does not neces- 
sarily indicate ability to apply the knowledge of 
the definition in a problem situation. Separate 
items testing only definition or recognition might 
well be omitted entirely, since recognition of 
terms can be tested in situations requiring ap- 
plication of arithmetical understanding. Where 
definitions are given, there should be only one 
acceptable answer, and that should be precise. 
One item requiring the definition of a rectangle 
(Advanced Form L, Item 26) has two equally 
acceptable answers: "It has four right angles 
with the sides longer than the ends" and “It has 
four right angles with all four sides equal." A 
more precise answer would be “It has four right 
angles and four straight sides,” For another 
thing, it would seem desirable that the exercises 
in this section should insofar as possible be so- 
cially useful ones. The social utility of items like 
this one (Elementary Form N, Item 18) is 
questionable: "If you add one bushel to one 
million bushels, how many bushels will there 
be?" Such a situation will rarely, if ever, need 
to be dealt with in real life. And rarely will the 
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child be called upon to estimate the height of 
a man in yards, as in Item 5, Advanced Form 
M, and Item 9, SRA Form II, or to translate an 
item number as high as XLI in any test that 
he may be taking (Item 22, SRA Form II). 

The items in Part II are computational ex- 
amples-involving whole numbers, fractions, and 
decimals in the Elementary Form, with items 
on percentage and denominate numbers added in 
the Advanced Form. The examples are, as the 
authors state, within the area of social utility, 
except for a few “posers” included in order to 
test adequately the pupils who are unusually 
skillful in computation. That the authors have 
been somewhat sensitive to changes in grade 
placement of topics during the past decade is 
evident by slight changes in content from form 
to form. For example, in the earlier editions of 
the Elementary Form there are examples in- 
volving multiplication and division of fractions, 
the addition of columns of decimals (all tenths), 
4-place multipliers, and 3-place divisors. In the 
latest form, all of these, except for the 4-place 
multipliers, are wisely omitted. It is, by the way, 
hard to understand how 4-place multipliers and 
3-place divisors ever got into the Elementary 
Form since they have never appeared in the Ad- 
vanced Form. 

Analysis of the content of Part II points up 
the need for cautioning the prospective test user 
that extreme care must be taken in choosing the 
form to be used with fifth grade pupils in order 
to insure that they will be able to attempt a 
large enough number of items to give an ade- 
quate measure of their computational ability. 
While the Elementary Form and the Advanced 
Form might have been equally useful at this 
grade level in 1940 and may still be equally use- 
ful in systems which follow a traditional program 
of instruction, it seems unlikely that the Ad- 
vanced Form could provide any adequate meas- 
ure for fifth grade pupils in a modern arithme- 
tic program. With emphasis on multiplication 
and division of fractions and decimals now gen- 
erally delayed until the sixth grade, the average 
fifth grader could be expected to work correctly 
only about one third of the examples presented 
in Forms L through P in the section dealing 
with whole numbers and fractions. In the sec- 
tion dealing with percentage, decimals, and de- 
nominate numbers, he could handle only the ex- 
ample or two involving the simple changing of 
inches to yards, pints to gallons, minutes to 
hours, etc. In the SRA editions, approximately 


one fourth of the examples presented in th 
section on fundamental processes are within th 
experience of the fifth grader in a modern aritl 
metic program. 

The problems in Part III are concerned 
more plausible and practical situations than am 
frequently utilized. Isolated problems and re 
lated problems are both presented, with th 
number of related problems being slightly large; 
for the Advanced Form. The selections aroun 
which the problems are built are on the whol 
well chosen, an unusually large number of 
involving the type of experiences in which chil 
dren actually find opportunities to apply thei 
arithmetical skills. The pictures and diagram 
accompanying the problems are clearly and a 
tractively drawn. In the related problems in 
volving the purchase of food items in the Ele 
mentary Form the prices do not show any effe 
of recent inflation. The pupil who figures cor 
rectly that “Mr. Barnes bought a dozen ba 
nanas, 6 apples, 5 lemons, and 3 dozen plums 
for 87 cents (Form О, Item 19) is in for a sur. 
prise if he goes looking for a similar bargain a 
the corner fruit store. | 

Since the problems in Part III reflect th 
emphasis in Part II, this part of the Advanced 
Form would be frustrating to many fifth grad 
pupils. The prospective test user should there 
fore examine this part with the same care wit 
which he examines Part II in order to detet 
mine whether the Advanced Form or the Eli 
mentary Form will be more useful in his pa 
ticular situation. 

For the Elementary Form the pupil's wor 
is recorded directly in the test booklet. Part 
requires the marking of one of four response 
for each item, and Parts II and III the recordin| 
of the results of the pupil’s own computation й 
boxes supplied for that purpose. The Elementar 
Form is easy to take, and it is conveniently sco: 
by hand. For the Advanced Form the pupil 
responses may be recorded on the inside of # 
test cover or on a special answer sheet. 
Forms L, M, and N multiple choice items аї 
presented only in Part I. Forms О and P ай 
SRA Forms I and II, the only forms which @ 
machine scorable, are made up of multiple choit 
items throughout. Special answer sheets 
provided with these forms. The machine sco! 
able forms would seem to be easier to administe 
and they are certainly more convenient to scot 
than the earlier forms. An innovation in th 
machine scorable forms is the use in Parts IT ai 
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III of the “None” response, standing for “None 
of the preceding answers is correct.” The va- 
lidity of this type of response, when there is one 
indisputably correct answer to the question asked 
and the responses accompanying the “None” 
have been carefully constructed to cover the 
commonest errors, has been well established. 

The Examiner’s Manual for this test contains 
an excellent discussion of the “Need for Devel- 
oping Better Understanding of Arithmetic Proc- 
esses.” The philosophy underlying the meaning 
theory of arithmetic and its implications for 
teaching and testing are presented briefly but 
clearly and effectively. Of particular importance 
to test users is the suggestion that care should 
be taken in interpreting the scores made on that 
portion of the test which emphasizes mechani- 
cal computational procedures (Part II) since 
pupils in arithmetic programs which attempt 
to develop meaning and understanding will 
probably not be able to score as high on that 
part, especially in the earlier stages of learning, 
as will pupils in programs which concentrate 
on the computational phase. This observation is 
made, however: “In the long run, children 
taught by methods which develop meaning and 
understanding will be superior, even though 
they may seem temporarily retarded in the early 
stages of instruction by insistence on under- 
standing.” У 

ПІ. Summary. The Тоша Every-Pupil Tests 
of Basic Skills are designed to measure educa- 
tional growth in certain specific skills involved 
in learning. They are not concerned with the 
measurement of subject matter achievement as 
such. In terms of their objectives, the tests ap- 
pear to be quite valid. In the opinion of the re- 
viewer, the scores are, with some exceptions, 
sufficiently reliable for judicious use in individual 
diagnosis. 

Тһе test content is of generally high quality, 
and the individual items are on the whole skill- 
fully written and judiciously selected. Excep- 
tions noted by the reviewer have been recorded 
above. Probably the most outstanding feature 
insofar as content is concerned is the inclusion 
of a work-study test, the scope of the content of 
which is unique with these batteries. The suit- 
ability of the Advanced Battery for use at the 
fifth grade level in many schools is highly ques- 
tionable. Selection of the proper battery at this 
grade level must be made with special care. 

The instructions for administering and scoring 
the tests and recording test results are adequate 


and clear. The suggestions for analyzing and 
interpreting the results and for planning con- 
sequent instruction are extensive and sound. The 
test user who will devote some time to studying 
the manuals and other supplementary materials 
will find in them much valuable technical in- 
formation. 


ANTON Тномрзок, Supervisor of Research, 
Long Beach Public Schools, Long Beach, Cali- 
fornia. 

These tests were designed to measure selected 
skills in reading, work-study, language, and 
arithmetic. The Elementary Battery, recom- 
mended for use in grades 3-5, consists of four 
separate tests which yield 15 part scores. The 
Advanced Battery, used in grades 5-9, yields 
14 part scores, 13 of which bear the same titles 
as those of the Elementary Battery. Although 
there are many test batteries that provide satis- 
factory measures of achievement in the areas of 
reading, written English, and arithmetic, the 
Iowa tests have pioneered in the measurement 
of such important work-study skills as the ability 
to interpret a map. 

The authors have followed recommended pro- 
cedures in attempting to insure curricular va- 
lidity. However, the prospective test purchaser 
must carry the final responsibility for deter- 
mining the validity of each test in terms of the 
needs of a specific local situation. 

This test series has many commendable fea- 
tures. By including a fairly broad sampling of 
items in each subtest, the authors have increased 
the reliability of the scores derived. Instead of 
inflating the reliability coefficients reported by 
giving the tests to a heterogeneous pupil popula- 
tion, the authors have reported the reliability 
coefficients for each subtest on the basis of 
scores derived from a single class in a single 
school. They caution test users against attaching 
undue significance to small irregularities in the 
profile of a pupil's scores since some of the 
shorter subtests (e.g., Use of References) are 
not reliable enough to warrant such confidence. 

The tests are easy to give and to score. Teach- 
ers who follow the authors' suggestions for ana- 
lyzing the pupils' errors find the tests useful as 
diagnostic instruments. The test manuals supply 
helpful suggestions for the remedial work that 
should follow the giving of the tests. Several 
tables of norms are provided so that test users 
may convert raw scores into grade, age, age-at- 
grade, or percentile equivalents. Norms for 
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school averages are also supplied. The fact that 
there are four or more equivalent forms of each 
test available makes it possible to appraise a 
child's progressive development in a skills area 
without giving the same form of the test twice. 
The Manual of Information accompanying the 
tests is unusually complete. 

Some features of the Iowa tests may limit 
their use. The norms weré based on a Mid- 
western population; the majority of the schools 
in the standardization sample were from Iowa. 
The authors candidly state that “There is some 
evidence that these norms are somewhat higher 
than those provided with many of the other cur- 
rent standardized tests." A teacher or school 
administrator located in a community in which 
the public is critical of the teaching of the basic 
skills will not be inclined to purchase a survey 
battery with norms “somewhat higher than those 
provided with many of the other current stand- 
ardized tests.” 

Occasionally the grade equivalent of a pupil's 
raw score for a total test is slightly out of line 
with the grade equivalents earned on the com- 
ponent parts of the test. For example, if raw 
Scores of 27 and 23 are earned on Parts I and 
II of Form N of the Elementary Reading Com- 
prehension Test, the grade equivalents are 4.1 
for each part score, but the grade equivalent of 
the total raw score of 50 is 4.0. Cases of this 
type sometimes lead to criticisms by pupils and 
teachers. 

Although the Elementary Battery is intended 
for use in grades 3-5, it is much more satisfac- 
torily administered in grades 4 and 5 than in 
grade 3. The Elementary Battery has wider ac- 
ceptance with fifth grade teachers than does the 
Advanced Battery. Teachers' opinions differ 
sharply concerning the appropriateness of many 
of the items in the Work-Study Skills Test. 

Many elementary teachers object to the test- 
ing technique used in measuring spelling ability. 
A word is spelled in four ways (e.g., “booke, 
buk, book, boke") and pupils are asked to check 
the correct spelling. Even though results on 
such a multiple choice test correlate highly with 
those from a dictation type test, teachers tend to 
lack confidence in the validity of multiple choice 
spelling scores. 

Science Research Associates has recently ar- 
ranged with the original publishers to issue two 
forms of the Advanced Battery. Under the SRA 
plan, multiple use booklets containing the com- 
plete battery are rented to a school. Separate 


answer sheets are returned to SRA for scor- 
ing, tabulating, and reporting of results. The 
bibliography in the SRA Manual for Interpre- 
tation includes no reference published since 1939, 
Some teachers will oppose the изе of SRA's 
"self interpreting profiles" with pupils as imma- 
ture as those in grades 5 and 6. The explanation 
which accompanies these profiles places stress 
on interpupil comparisons without recognition 
of individual differences in level of mental abil- 
ity. 

Taken as a whole, the Jowa Every-Pupil 
Tests of Basic Skills represent one of the best 
series available for measuring many fundamental 
skills. They deserve careful consideration in any 
school system which is planning a systematic 
evaluation program. 


For reviews by Frederic L. Ayer, Gustav J. 
Froehlich, and Ralph C. Preston, see 3:10; for 
reviews by James R. Hobson and Constance M. 
McCullough of Test A, see 3:501; for reviews 
by William A. Brownell and Leroy H. Schnell 
of Test D, see 3:334; for reviews by Harriet 
М. Barthelmess, William A. Brownell, J. Mur- 
ray Lee, and Charles W. Odell of the 1937 form, 
see 38:872. 
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Edition, Grades 12-13; 1924-43; 5 scores: English 
and literature, mathematics, science, history and social 
studies, total; IBM; Forms L (’43), М (’43) ; manual 
(43) ; $3 per 25 sets of test and answer sheet; separate 
answer sheets must be used; $2.10 per 100 hand scored 
answer sheets; 4¢ per IBM answer sheet; 30¢ per set 
of machine scoring stencils; 15¢ per hand scoring sten- 
cil; 25¢ per manual; postage extra ; 35¢ per specimen 
set, postpaid; 75(85) minutes; D. B. Stuit, H. A. 
Greene, and G. M. Ruch; Bureau of Educational Re- 
search and Service, State University of Iowa. * 
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Davin V. ТівремАМ, Lecturer on Education, 
Harvard University, Cambridge, Massachusetts. 

This battery consists of a 20-minute, 100-1tem 
test of English and literature, a 20-minute, 60- 
item test of mathematics, a 15-minute, 75-1tem 
test of science, and a 20-minute, 100-item test 
of history and social studies. The English and 
literature section samples the student's vocabtr 
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lary, grammar, and literary acquaintance. The 
authors claim that the student’s literary compre- 
hension is also sampled but the items provide 
little opportunity for the student to demon- 
strate his comprehension of literature. The math- 
ematics section stresses algebraic concepts heav- 
ily but it does include some concepts usually 
taught in a geometry course. Knowledge of 
physics, chemistry, and biology is tested by the 
science section with physical concepts receiving 
the greatest emphasis. The history and social 
studies section includes items from the “fields 
of ancient, medieval, and modern history with a 
heavy emphasis on the American contributions 
to modern history.” 

The authors claim a two-fold purpose for the 
examination: “(1) the evaluation of achieve- 


ment and mastery at the secondary school level, . 


and (2) the use of this information concerning 
achievement and mastery for guidance and prog- 
nostic purposes at the college and university 
level." 

The authors are to be commended for their 
statement that, so far as the first purpose is 
concerned, the chief measure of the examina- 
tion’s validity “is the agreement between what 
the test measures and competent analyses of 
the content of academic high school curricula.” 
The authors indicate that considerable time and 
attention were given to analysis of the content 
of academic high school curricula and to con- 
struction of items which would adequately sam- 
ple such content. However, they fail to report 
the textbooks and courses of study which were 
included in the analysis so that it is impossible 
to describe precisely the type of high school pro- 
gram for which the sample of items is valid. 

It would be nice if we were able to say cate- 
gorically that a test were valid or invalid. In 
reality, however, the job is not that easy. The 
decision concerning validity is a function of the 
use for which the results are being considered. 
The same test may be valid for several purposes 
and invalid for others. In the final analysis each 
person who is considering use of a particular 
test should procure a copy of the test and judge 
for himself if it will satisfy the purpose he has 
in mind. 

If I were in charge of the testing program in 
a school which I knew well, and the high school 
principal asked me if we ought to give the Jowa 
High School Content Examination to evaluate 
the “achievement and mastery” of our seniors, 
I would be inclined to give a copy of the exami- 
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nation to an English, a mathematics, а science, 
and a social studies teacher in the school and to 
have them tell me whether they thought what 
they had been trying to teach in their subjects 
during the four years of high school was ade- 
quately represented by the questions in the 
several sections. I am inclined to believe that 
each would complain that one can't possibly test * 
for all of the content they taught in a 15- or 
20-minute examination. They would be likely 
to say that I already have an indication of the 
seniors’ performance on two, three, four, or 
more 3-hour examinations in the several fields. 
In addition, the English, science, and social 
studies teachers are likely to complain that 
“achievement and mastery” in their subjects is 
not indicated when a student has to-average five 
answers a minute to complete the section. The 
mathematics teacher would probably lodge a 
similar complaint about the three answers a 
minute the student would have to average to 
complete the mathematics section. The English 
teacher is likely to protest that the grammar and 
vocabulary she has been teaching is poorly repre- 
sented by the few questions on the examination, 
that she doesn’t much care if a student can 
identify the name of a poem or the poet from a 
quoted line or two, that much of the literature 
she has covered in her classes is not sampled, 
and that there is no opportunity to identify those 
students who can turn a “neat phrase in composi- 
tion.” The science and social ‘studies teachers 
will probably lodge similar protests about in- 
adequate sampling of content and will in addi- 
tion decry the unimaginativeness of the tasks ọn 
which the students are to demonstrate their mas- 
tery. Both teachers are likely to say that the 
science and social studies sections of the exami- 
nation require very little reasoning with the 
content of their respective courses. The social 
studies teacher may complain about several 
items which are “correct” because one or two 
authors have made such statements. The mathe- 
matics teacher will probably not be too harsh 
on the algebra items in the mathematics section. 
However, he will probably object to the scarcity 
and superficial nature of the geometry items. 
He will say that trigonometry and solid geome- 
try are not represented at all. 

As a result of all this, I would advise the 
principal that the teacher evaluations of the 
students which had been accumulated over the 
past four years were a more valid but possibly 
less reliable indication of the students’ “achieve- 
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ment and mastery" of what was being taught in 
his high school than the 75-minute Тоша High 
School Content Examination. 

On the other hand, if this were my first year 
as a guidance director in an Iowa high school, 
I would be inclined to give the Тоша High School 
Content Examination to the seniors in the col- 
lege preparatory curriculum in order to learn 
what the grades of various teachers mean in 
terms of competition outside my school. 

In the latter case, I would also be inclined to 
take 85 minutes of the time of an eleventh or 
twelfth grade college preparatory student who 
had just transferred to my high school from 
one about which I knew little, in order to learn 
how he stacks up in these four basic areas. 

The authors report that correlations between 
high school and college grades and scores in the 
individual sections of the test range from .50 to 
‚бо. Correlations between total test score and 
general grade point average are said to be of 
the same general magnitude. Therefore, stu- 
dents who score high on a particular section 
tend to receive high grades in that subject and 
students who score low on the section tend to 
receive low grades. But what of the individual 
student? Does a student who achieves a score 
higher than those of 90 per cent of the students 
invariably have a grade which exceeds those 
of 9o per cent of the students? Obviously not 
when correlations between test scores and grades 
range from .50 to .60. How then can we find 
what the average grades associated with various 
test scores are likely to be and what the variation 
of grades about these expectations is? How do 
we know whether these expectations apply to 
our high school or to the college our counselee is 
thinking of attending? The answer is that we 
don’t know. The test manual doesn’t give enough 
information for us to compute grade expecta- 
tions and variation of grades about these ex- 
pectations. Nor does it tell us the high schools 
‘апі colleges for which the correlations are ap- 
propriate. 3 

Kuder-Richardson “foot-rule” reliability co- 
efficients for both forms are in the neighbor- 
hood of .89 in grades 11 and 12 for the English 
and literature, mathematics, and history and so- 
cial studies sections. The reliability coefficient 
of the science section is about .84 in the eleventh 
grade and .87 in the twelfth grade. The relia- 
bility coefficients of all four sections are around 


= _ 91 for grade 13. Reliability coefficients for the 


total score on the test range from .9I to .95. 


Thus, for groups which vary as much as those o íi 
which the authors computed their reliability co. 
efficients presumably did (variation not ге 


other tests of a similar nature. 

Is the test reliable then? Again the ans 
cannot be a categorical one. We must ask, “ 
liable enough for what purpose?” As soon as we 
ask this question we are no longer interested in 
reliability coefficients ; we are interested in stand- 
ard or probable errors of measurement. We 
find that for Form L scores in grade 12, when 
we employ the dubious procedure of using 
Kuder-Richardson “foot-rule” reliability co- 
efficients in the formula which calls for an esti- 
mate of the ratio of the variance of the true 
scores to the variance of the obtained scores, 
the probable error of measurement is about 3 
standard score units for each section and about 
б standard score units for the total score. What: 


L score in grades 11 and 13 or for a Form M 
score in any grade is not reported. However, for 
two twelfth graders who took Form L we can 
say that the test is reliable enough so that we 
may suspect that their true scores are different: 
їп апу section if they make obtained scores which: 
differ by about то standard score units or more. 
If we knew what standard of mathematica 
achievement is necessary for successful comple- 
tion of a college mathematics course, we could 
say that the test is reliable enough to believe t 
Johnny's true score exceeded this standard i 
his obtained score were 7 or more standard scoti 
units above the standard. Thus for some диез 
tions and for some individuals, the probable 
error of measurement is such that we will an- 
swer affirmatively the question, “Is the test гез 
liable enough ?” and for other questions and other 
individuals the reliability question will be an= 
swered negatively. We can not give an absolute 
answer to the question. 

The total score for the examination is the s 
of the standard scores for each section. Standard 
scores were computed for a sample of “2519 
dividuals seventeen and eighteen years of age 
and in the fourth year of high school at the ti 
of the testing.” Although it is not stated, it 
presumed that the standard scores are a simple 
linear transformation of the raw scores rather 
than normalized scores. | 

Percentile ranks corresponding to the stand- 
ard scores of each section and to their sum (total 
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score) are reported for Grades 11 and 12 and for 
college freshmen. Percentile ranks are based on 
the scores of 3,042 eleventh graders, 6,364 
twelfth graders, and 622 college freshmen. The 
percentile ranks provide a means of comparing 
the achievement of a student with that of the 
students in the above groups. However, they 
provide no information for important questions 
like: “What standard of achievement in the 
various subjects is necessary for entrance to a 
particular college ?" “When is extra work neces- 
sary?" "When should a student be advised 
against taking the next course in a science cur- 
riculum ?" In fact, like most tests, the Jowa High 
School Content Examination provides only a 
means of comparing the performance of a stu- 
dent you know in a situation you know with the 
performance of students you don't know in 
situations you don't know. For all one knows, 
percentile ranks in the neighborhood of one or 
ten may be an acceptable standard of perform- 
ance for the situation which a counselor is con- 
sidering. 

In general, the Тоша High School Content 
Examination, Form L and its hand scored an- 
swer sheet fall just a little short of a craftsman- 
like job. Some confusion may be caused by the 
few questions where the stem surrounds the op- 
tions or where the options introduce a question. 
Also, some difficulty may result from the illegi- 
bility of the roots in questions 32 and 38 of the 
mathematics section. In addition, scorers of the 
hand scored answer sheet may complain that the 
paper on which the answer sheet is printed is 
so thin that marks on one side of the answer 
sheet show through the reverse side, that X's 
made with a pencil in small circles don't show 
clearly enough through the scoring key, that the 
scoring key lacks precision of drilling of correct 
answer spaces, that the upper guide mark of 
the scoring key for the front of the answer sheet 
is not drilled, and that the answer sheet is just 
a little too large to fit well into the fold of the 
scoring key. 

All of the things that are usually done in ex- 
amination construction and reporting have been 
done for the Тоша High School Content Exam- 
ination and have been done well in general. The 
reviewer feels, therefore, that the examination 
compares favorably with other available ex- 
aminations of general high school achievement. 
However, the usual practice falls far short of 
what we want. Until test users give up the 
idea that you can tell all about the 4-year high 
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school achievement of a student by means of a 
75-minute examination, we are going to find test 
publishers selling more shorter tests than longer 
tests. Until we give up using tests because Jones 
does, we are going to continue to have vague 
validity statements and to get many scores on 
many students which are never used. Until we 
know what we want to test for and what we are 
going to do if scores come out one way or an- 
other, we are going to have general and not too 
meaningful validity statements, general and not 
too meaningful reliability statements, and re- 
ports of relative standing which have only slight 
meaning. 

The only one who can evaluate this test for 
your purposes is you. Do so and you will be 
more content with it if you decide to use it. 


For a review by Maurice E. Troyer, see 3:11. 
[17] 


*The Iowa Tests of Educational Development. 
Grades 8.5-13.5; 1942-48; 10 scores on 9 subtests: un- 
derstanding of basic social concepts, general back- 
ground in the natural sciences, correctness and appro- 
priateness of expression, ability to do quantitative 
thinking, ability to interpret reading materials in the 
social studies, ability to interpret reading materials in 
the natural sciences, ability to interpret literary mate- 
rials, general vocabulary, subtotal of the previous eight 
scores, uses of sources of information; Forms X-2 
(42), Y-2 (42); school administrator's manual 
(48) ; examiner's manual (48) ; teachers’ and coun- 
selors’. manual (’48) ; separate answer sheets must be 
used; tests rented rather than sold; administrators 
must agree to give entire battery to every pupil en- 
rolled in the high school, or during the first year, to 
every pupil in one full grade; rental service includes 
scoring, tabulating, and reporting results; prospectus 
and price information sent upon request; 450(480) 
minutes; Paul Blommers (Test 4), Robert Ebel (Test 
2), John Gerber (Test 3), Julia Peterson (Test 7), 
J. William Maucker (Test 1), and Kenneth W. 
Vaughn (Tests 5, 6, 8 and 9), under the direction of 
E. F. Lindquist; Science Research Associates, Inc. * 
REFERENCES 

т. Puearman, Leo T. “Comparisons of High-School Grad- 
uates Who Go to College With Those Who Бо Not.” J Ed 
Psychol 40:405-14 N '49. * 


tional Development Tests.” Abstract. Calif J Ed Res 2:182 S 
5I. 

Enric Е: GARDNER, Associate Professor of Edu- 
cation, Syracuse University, Syracuse, New 
York. 

Forms X-2 and Y-2 are essentially similar to 
the older Forms X-1 and Y-1. Subtest 1 in each 
of the new forms contains the same items as 
appeared in the older forms except for a few 
modifications and replacements. Subtests 2 
(General Background in Natural Science) and 
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3 (Correctness and Appropriateness of Expres- 
sion) have been entirely rewritten by new 
authors, Robert Ebel and John Gerber, respec- 
tively. Except for minor changes in the abbre- 
viated titles, subtests 4-9 are identical, item for 
item, in the corresponding forms of the new and 
old editions. Details noted in earlier reviews will 
generally apply to the new forms. 

The previous rental policy of making these 
tests available only as an entire battery which 
must be returned immediately after administra- 
tion to Science Research Assotiates for scoring 
and tabulation has been modified. The same tests 
are offered in a new separate booklet, self-scor- 
ing edition (1951), as well as in the single book- 
let centrally-scored form. Publication of sepa- 
rate booklets issued for sale rather than rental 
provides for local scoring and analysis of indi- 
vidual items by teachers and school research 
staffs. 

The manual reports that “Each of Tests 
1-8....was designed to yield a coefficient of re- 
liability of .91 (on the average) for pupils in a 
single grade in a single school." A table of relia- 
bility coefficients ranging from .81 to .94 for each 
subtest in each form is given. These are not the 
usual reliability coefficients, but apparently are 
obtained from a sample of pupils in grades 9-12 
where the within-grades within-school sums of 
squares and sums of cross products have been 
used to obtain correlation coefficients between 
the scores on the odd- and even-numbered items 
and where the resulting reliability coefficients 
have been corrected by the Spearman-Brown 
formula. Unfortunately, no evidence is reported 
as to the variability of the coefficients which is 
characteristic of different classroom groups nor 
is there any information about the reliability of 
the tests in successive grades. A careful exami- 
nation of the many excellent items leads one to 
believe that Forms X-2 and Y-2.are comparable 
with respect to content measured. However, data 
to present evidence relative to the statistical 
comparability of the two current forms are lack- 
ing, e.g., such as correlation coefficients between 
subtests of these two forms and distributions of 
item difficulty indices. 

Scale scores are presented which are claimed 
to have the same meaning for all tests. These 
standard score scales “are so fitted to the raw 
score scales that a representative sample of pu- 
pils from grades 9-12 inclusive shows the same 
normal distribution of standard scores on each 
of the tests.” Scores obtained from such a scal- 


[ 46 


ing procedure can be considered comparable 
provided the mean performances of a representa- 
tive sample on each test are considered equiva- 
lent and the frequency distributions of the true 
scores for different functions are identical. The 
likelihood of lack of bias in favor of any particu- 
lar subtest is indicated by the description of the 
selection of the normative sample. Subtests which 
show the same normal distribution of standard 
scores will also have true score distributions 
which are identical to each other provided all 
the subtests in question have the same reliability, 

The use of the probable error of measurement 
as the size of the scale unit to eliminate “the pos- 
sibility that teachers will attach undue signifi- 
cance to small differences in test scores” is a 
worthwhile effort. Regarding the comparability 
of units from.one function to another, it is ob- 
vious that unit size on a standard score scale is 
related to reliability and will vary from scale 
to scale unless the reliabilities are equal. This 
unit size, however, will represent the actual 
probable error of measurement at any particular 
point along the scale only if the standard error 
of measurement is constant throughout all por- 
tions of the scale, i.e., the reliability of the tests 
is equal in successive grades for constant varia- 
bility. 

The table of intercorrelations of subtest scores 
given in the manual presents values from .397 to 
.791 with a median correlation of about .62. The 
average within-grades within-schools reliability 
of the difference between every possible pair of 
scores on Tests 1-8 was given as .71. This value 
is presented as evidence that the various tests do 
measure different things. It seems to this re- 
viewer, that the overall evidence presented in 
the manual would allow one to emphasize the 
common variance equally as well as the residual 
variance. 

The norms are based on approximately 50,000 
pupils in 290 high schools located in Iowa an 
neighboring states. The reviewer wishes to com- 
mend the care taken to obtain a really representa 
tive sample of the schools in Iowa and vicinity, 
since the value of any set of norms depends upon 
the adequacy with which the normative group 
has been defined. He also wishes to point out the 


use in both the selection of the sample and some | 


of the resulting statistical analysis of the prac 
tice of considering the school as the unit rather 
than the individual pupil. These norms até 
claimed to be adequately representative of na- 


tionwide achievement, on the grounds that the _ 
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median performance and interquartile range on 
the Тоша Tests of Educational Development of 
a group of 30,000 graduating seniors, selected 
by the armed forces as a national stratified sam- 
ple, corresponded closely with the scores of the 
Towa portion of the sample. If national norms 
are claimed, it would seem preferable that re- 
sults obtained from a large representative na- 
tional sample be used. 

The tests are not based on the analysis of the 
content of specific courses, but are constructed 
to test what an educated person needs to know 
and understand about certain important educa- 
tional areas ; hence, face validity and the informa- 
tion that items with high discriminating power 
were selected are offered as evidence of validity. 
A desirable emphasis is placed upon the user's 
personal examination of the items as an impor- 
tant criterion of validity ; however, this task is 
practically impossible without some frame of 
reference to describe the contents sampled and 
to assist the user in classifying the items into 
meaningful categories. Such a frame of refer- 
ence could profitably be included in the manual. 
The new General Manual does present data rela- 
tive to the validity of the total score for predict- 
ing success in the first year of college. 

From the point of view of the technical as- 
pects of test construction, the Towa Tests of 
Educational Development constitute an excel- 
lent battery. The format, directions, scaling and 
item construction are well done. As measures of 
certain broad aspects of the pupil’s educational 
development they are definitely superior tests. 
Their relative merit in comparison with other 
high school subject matter tests of achievement 
depend upon one’s philosophy of testing. 


For reviews by Henry Chauncey, Gustav J. 
Froehlich, and Lavone A. Hanna, see 3:12. 
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*Metropolitan Achievement Tests. Grades 1, 2, 
3-4, 5-7.5, 7-9.5; 1931-50; 5 levels; Forms К (46), 
S (47), T (048), U (49), V (’s0); directions for 
administering ('47); subtests from elementary, inter- 
mediate, and advanced batteries in arithmetic and 
reading are available as separates; 806 per interpreta- 
tion manual (’48), postage extra; 356 per specimen 
set of any one level, postpaid; 30€ per handwriting 
Scale for grades 1-3; Gertrude H. Hildreth with the 
Collaboration of Richard D. Allen, Harold H. Bixler, 
William L, Connor, and Frederick B. Graham; World 
Book Co, * 

4) PRIMARY I BATTERY. Grade I; 1931-48; 4 scores: 


Word pictures, word recognition, word meaning, num- 
bee orms В, S, T only; $1.80 per 25; 45(60) min- 
es, 


b) primary II BATTERY. Grade 2; 1932-48; 5 scores: 


reading, word meaning, arithmetic fundamentals, arith- 
metic problems, Spelling ; Forms R, S, T only; $2.10 
рег 25, postage extra; 85(100) minutes. 

c) ELEMENTARY BATTERY. Grades 3-4; 1932-49; 6 
Scores: reading, vocabulary, arithmetic fundamentals, 
arithmetic problems, language usage, spelling; Forms 
R, S, T, U only; $2.60 per 25; 135(150) minutes. 

d) INTERMEDIATE BATTERY. Grades 5-7.5; 1932-50; 2 
editions, ) 

I) Complete. 10 scores: reading, vocabulary, arith- 

metic fundamentals, arithmetic problems, English, 

literature, geography, history and civics, science, 

spelling ; 3320 рег 25; 200(240) minutes. _ у 

2) Partial. 6 scores: reading, vocabulary, arithmetic 

fundamentals, arithmetic problems, English, spelling ; 

$2.60 per 25; 155(180) minutes. j 
€) ADVANCED BATTERY. Grades 7-9.5; 1932-50; 2 edi- 
tions. 

1) Complete. 10 scores: same as complete Interme- 

diate Battery; $3 per 25; 220(240) minutes, y 

2) Partial. 6 scores: same as partial Intermediate 

Battery ; $2.60 per 25; 165(180) minutes. 
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Жр. bes Pe *Measurement of Pupil Progress in 


Learning to Spell Selected Words." Calif J Ed Res 2:104-10 
My 's1. * 
Warren С. Finney, Director of Test Devel- 
opment, Educational Testing Service, Prince- 
ton, New Jersey. { 
Important advances have been made in edu- 
cational measurement generally and in the 
Metropolitan Achievement Tests, in particular, 
since this achievement battery was published in 
its original form in 1931. Much that is favorable 
can be said of the way this battery has main- 
tained, and sometimes set the pace of develop- 
1 The author of this review is indebted to the following mem- 
bers of the Test Development Department of Educational Test- 
ing Service for memoranda on the indicated subject areas of the 
tests: Elizabeth C. Adams (social studies), Bernard S. Cayne 


(science), Clarence Derrick (English), Eleanor J. Riegel (arith- 
metic). ‘ 
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ment: the expansion of the test manual as an 
aid to users, the thorough and expert standardi- 
zation of the tests, the provision of varied norms 
including some for special types of schools, em- 
phasis on the desirability of using tests early in 
the year for instructional purposes, inclusion of 
a section on elementary science. There is even 
specific evidence of responsiveness to criticisms 
of this test in earlier yearbooks, or to criticisms 
of a similar nature. On the other hand, one may 
note certain rigidities in successive editions of 
this nationally used instrument : the tendency to 
hold closely to old subject matter outlines and 
old item types, relative emphasis on knowledge 
of formal and. factual content as distinguished 
from understanding and application of knowl- 
edge, failure to indicate the greater applicability 
of standardized tests in the upper grades than 
in primary grades. 

The manual for the 1931 batteries was a long 
step forward over previous manuals for achieve- 
ment batteries and the Manual for Interpreting 
which accompanies the 1947 revision is even 
more extensive and helpful. Separate materials 
for each form give directions for scoring, other 
materials at each level provide directions for 
administering the test, and the several types of 
norms available are published in separate book- 
lets. The 120-page manual proper consists of 
90 pages of helpful guidance to test users and 30 
pages of technical information about the con- 
struction and standardization of the 1947 re- 
vision. The manual and related materials, then, 
are major features of this test offering and con- 
stitute a good frame of reference for general ap- 
praisal of the offering. 

The manual opens with an excellent, succinct 
summary of ways in which test results may be 
used by teachers and administrators at all levels 
of a school system. Very usable detailed notes 
are given for planning a testing program, in- 
cluding an illustrative motivating administrative 
memorandum that is a gem. In this same section 
the arguments for fall and spring testing are 
given, with emphasis on the more constructive 
forward look that goes with fall testing. Group 
scoring procedures are outlined and recom- 
mended. The advantages and limitations of 
grade scores, age scores, percentile-in-grade 
norms, and modal age-grade norms are carefully 
explained. Very clear and detailed explanations 
áre given, with illustrative samples, of how the 
teachers may use the summary of the pupil's 
part scores on the front of the test booklets, the 


individual profile chart, the class record sheet, 
the class analysis chart and a form for analysis 
of errors on the separate items ‘of a test. Sys- 


tematic methods for combining results from dif- 


ferent classes and schools are illustrated. Over 
twenty pages are then devoted to suggestions 
for teachers that combine a sure understanding 
of teaching method and a sound mental hygiene 
approach. Ample bibliographical references on 
teaching are presented, including a number on 
remedial teaching. The final chapter explains 
in full detail the steps of item selection, item 
analysis, equating old and new tests, and stand- 
ardization of final forms. The explanation leaves 
no room for doubt that a thorough and compe- 
tent job has been done of establishing a revised 
series of batteries that (a) afford several equiva- 
lent forms at each of five educational levels, (5) 
can be used with confidence as equivalent to 
corresponding batteries of the previous edition, 
and (c) permit continuous interpretation of 
achievement on the same scales over several 
grades. 

The booklets of norms are separately printed 
in five sections. The first includes introductory 
information and tables relating the new forms to 
the corresponding forms of the earlier edition. 
The second gives traditional grade scores, age 
scores and percentile-in-grade scores. The third 
gives grade scores and percentile-in-grade 
scores for modal-age groups, those in each grade 
who are at the age for that grade of pupils who 
have been regularly promoted. These are es- 
pecially useful for schools following a program 
of regular annual promotions of virtually all 
pupils. They also serve as a more stable yard- 
stick for normally promoted pupils in other 
communities since they are not influenced by 
differences in percent promoted in the commu- 
nity and the national sample. The fourth booklet 
gives grade norms based on 100,000 parochial 
school pupils, while the fifth gives grade norms 
based on 20,000 pupils in segregated Negro 
schools. Care is taken to point out that these 
samples, though large, may not be strictly repre 
sentative. They provide such schools, however, 
with useful information not available for most 
tests. 

Mimeographed tables of regional norms ate 
available for the New England States, New York 
State and the Southern States. Some users will 
also be interested in the comprehensive norms 
available for relating scores on this 1947 revision 
of the Metropolitan Achievement Tests to the 
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five forms of the 1940 revision of the Stanford 
Achievement Test. 

The chief criticism of these batteries is that 
the description of the tests in the manual and 
the tests themselves give the distinct impression 
of old wine in new bottles. New items have been 
prepared, new and clearer drawings have been 
provided in the pictorial sections, grade place- 
ment of some materials has been changed, but 
the tests are aptly characterized by the state- 
ments that "the pattern of the first edition of 
this series has been adhered to closely in con- 
structing the new forms" and “every effort was 
made to use only items that represented the typi- 
cal content of instruction rather than the most 
recent innovations." The tests themselves, even 
more than the items, show this highly conserva- 
tive trend. The breakdown of the total testing 
time into units of school subjects is still the 
breakdown that prevailed when such tests were 
originally devised for administrative purposes 
of surveying average tendencies in the local 
school in comparison with national norms in 
traditional subject matter subdivisions. If the 
tests had been built primarily to serve those 
teacher uses so well explained in the manual, 
separate tests of specific improvable skills within 
subjects would have been the rule, as in the 
Iowa Every-Pupil Tests of Basic Skills, par- 
ticularly Test B: Work-Study Skills with its 
separate components measuring (a) how to in- 
terpret maps, (5) how to use an index, (c) how 
to use a dictionary, (d) where to find informa- 
tion, (e) how to read charts, tables and graphs. 
Arithmetic is no longer simply the province of 
the twin goddesses of fundamentals and prob- 
lems, but an area involving at least the separate 
skills of (a) understanding the language of arith- 
metic problems, sometimes called social arith- 
metic, (b) ability to carry out specified compu- 
tations, (c) ability to determine what operations 
are required for solution of a problem, (d) abil- 
ity to determine when data are sufficient for 
solving a problem or what additional informa- 
tion is needed, (e) ability to interpret charts, 
tables and graphs. These are the types of units 
that would help teachers "to diagnose an indi- 
vidual pupil's difficulties in learning" or "to 
provide a basis for counseling parents regarding 
a pupil’s achievement." They would be more 
useful for administrative purposes, too, because 
they would tend to focus attention on the signif- 
icant details of teaching and learning. Scores on 
component tests could always be combined if 
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subject composites were desired for a specific 
purpose. 

Item types could be improved. The Inter- 
mediate and Advanced batteries could be made 
completely objective without loss. Good objec- 
tive tests of spelling can be made. Arithmetic 
computation and problem solving can be pre- 
sented in multiple choice form. Contrary to the 
statement on page 6 of the manual, it is not 
"difficult to get satisfactory range of measure- 
ment in reading if recognition type items alone 
are used." It is being done regularly at high 
school and college levels. The chief limitation of 
the reading material in this test is that it con- 
centrates on ability to reproduce the factual con- 
tent of material read. The spread of difficulty in 
this skill is narrow, but ability to infer, gen- 
eralize, apply from what is read is not narrow 
and is what is needed to measure higher reaches 
of reading ability. The completion type item 
used is not a natural reading type and requires 
skill in expression in addition to reading ability, 
which may lead to very different response sets 
on the part of examinees, varying from quick 
acceptance of any reasonable appearing answer 
to persevering search for the best word to use in 
every case at the expense of normal progress 
through the test. 

It is to be hoped that the progress that has 
been made in this revision along lines of pro- 
ducing a highly helpful manual and interpretive 
materials and of establishing the standardiza- 
tion on such a firm footing will be matched in 
the next revision by progress in modifying the 
content and form of questions to relate them 
more closely to the learning objectives of modern 
education and in organizing the tests to yield . 
more detailed breakdowns of subject knowl- 
edge. This reviewer is aware that the present 
batteries represent a genuine effort to match 
typical school content and subject matter or- 
ganization. Items testing broader outcomes were 
tried out and found very difficult for pupils at 
the grade levels for which the items were de- 
signed. Much thought and experimentation will 
be required to produce effectively discriminat- 
ing items that test ability to apply knowledge 
and skill But such efforts appear to be emi- 
nently worthwhile, nay essential if the batteries 
are to contribute to the improvement of educa- 
tion and to avoid the unintended effect of per- 
petuating undesirable emphases that may still 
be typical, but happily are declining. 

The component tests in the batteries appear to 
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be highly reliable. Corrected split half reli- 
ability coefficients for appropriate single-grade 
groups vary from .79 to .97 with a median of 
ol. Test-retest coefficients would have been 
more conclusive here because it is impossible to 
tell whether any of the tests are speeded enough 
to affect spuriously the reliability estimates ob- 
tained by the split half technique. 

This brings us to the question of whether the 
tests are speeded or are primarily power tests. 
It would be helpful in connection with evaluat- 
ing these tests if we could be assured that “80% 
of all pupils in grade 4 completed every test in 
the Elementary Battery" or otherwise to be able 
to conclude whether the test was speeded or not. 
In the absence of such information, we may sus- 
pect speededness in tests requiring more than 
three responses per minute, especially if median 
scores in the norms run below 50 per cent right 
—after allowing for average chance success— 
in the highest grade for which the test is offered ; 
but the conclusion is not sure. Judged by these 
criteria, the word-picture association test of the 
Primary I battery, all tests except arithmetic 
problems in the Primary II battery, and the 


social studies tests and science tests in the' 


Intermediate and Advanced batteries may be 
speeded. Although the statistical evidence on the 
literature tests would indicate that they are 
speeded, this is more doubtful because the va- 
riety of literary works read by children in dif- 
ferent schools means that on any test broad 
enough to represent all reading lists, the per- 
centage of books represented that would have 
been read by a typical pupil must necessarily be 
small. 

- The advisability of including tests of knowl- 
edge as distinguished from skills in an achieve- 
ment battery intended for national use is a knotty 
problem. This problem is reflected in the fact 
that partial batteries are provided by the pub- 
lishers at the two highest levels. These partial 
batteries cover only the skills tested in the Ele- 
mentary battery and omit the tests of substantive 
knowledge. At present, educators who properly 
desire to give weight to substantive knowledge 
in their testing must choose between using the 
complete batteries with their items in literature, 
social studies, and science that may not be at- 
tuned to local courses of study and using only 
the partial batteries together with local tests of 
literature, social studies and science. Local tests, 
of course, permit no comparison with outside 
standards and their content may therefore seem 


to some pupils and teachers less important be- 
cause omitted from the standardized testing, It 
is perhaps unnecessary to point out that this is 
a dilemma that affects the use of all achievement 
batteries in some degree. 

The adequacy of a number of the component 
tests to measure achievement of poor learners 
at the various levels may be questioned. The 
grade equivalents of test scores that might be 
achieved by random guessing are less than six 
months below the lowest grade level for which 
the test is intended in all of the multiple choice 
tests in the three lowest batteries. (In the Pri- 
mary I battery, the chance score on the Word 
Recognition Test has a grade equivalent of 1.4.) 
In the higher level batteries, chance scores are 
at least a full grade below the lowest grade for 
which the test is intended, but in the multiple 
choice tests of the Intermediate battery and in 
the spelling test of the Advanced battery, the 
margin is less than two grades. 

There would appear to be adequate ceiling in 
each test. Only in the arithmetic tests of the 
Primary II and Elementary batteries are the 
grade score equivalents of the maximum possible 
scores consistently less than two grades above 
the highest grade level for which the tests are 
proposed and this phenomenon is doubtless ex- 
plained by the sequential nature of teaching in 
arithmetic at such grade levels which makes it 
unlikely that many pupils will be far ahead of 
grade. 

The format of the tests is unattractive, but 
clear. The paper seems quite inferior and tears 
easily. This doubtless saves cost and is an unim- 
portant criticism of tests that are expended with 
one use. 

This reviewer questions the value of includ- : 
ing formal standardized testing in the primary 
grades as part of a program. Instructional use 
can be made of good reading tests at this level 
without including such tests in a formal pro- 
gram of testing that seems to imply summariza- 
tion and checking on children. In these grades, 
authorities in child development stress the de- 
sirability of helping children to find themselves 
and to begin to acquire the basic skills without 
competitive pressure and the imposition of ex- 
pectancies of performance. Achievement is not 
like bricklaying; more may be gained by less 
pressure at first, and the payoff educationally is 
achievement at the end of the normal course; in 
a long race, no prizes are given to leaders at 
the end of the first lap. Readiness tests and read- 
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. ing tests have a place, but alert teachers should 
use even these measures as informal aids to 
supplement or correct personal impressions. 
Early emphasis on abstract number ¢ombina- 
tions and on spelling is out of place at this level 
according to such criticism. Again, it should 
be pointed out that this criticism would apply 
equally to other series of achievement test bat- 
teries that include batteries for primary grades. 

PRIMARY I BATTERY. The reading tests at this 
level appropriately stress the skills of word- 
picture association, word recognition, and ele- 
mentary word meaning. The chance factor in the 
word recognition test has been mentioned in 
the general discussion. The numbers test con- 
tains many good items testing elementary quanti- 
tative concepts, but many others on abstract 
number combinations. The Metropolitan Readi- 
ness Tests, also published by World Book Com- 
pany, should be considered as an alternative by 
schools that do not place heavy emphasis on ex- 
pected achievement in primary grades. 

PRIMARY II BATTERY. The reading tests at this 
level are more difficult versions of the types 
offered in the previous battery. The authors are 
to be commended for employing a readability 
measure, the Lorge Readability Index, to con- 
trol the difficulty of the reading matter. A check 
of the spelling words against Rinsland’s A Basic 
Vocabulary of Elementary School Children 
shows that the words are appropriate to second 
grade. The arithmetic problems test contains a 
desirable combination of some oral, and some 
written presentation. The arithmetic fundamen- 
tals test is just that. If the local program stresses 
abstract number and spelling, these tests are 
relevant, otherwise not. 

ELEMENTARY BATTERY. The reading tests at 
this level are less satisfactory than those at the 
primary levels. Over 95 per cent of the items 
can be answered by referring to the selection 
for specific information ; some questions can be 
answered without reading the selections ; in other 
cases, the stem of one question indicates the 
correct answer to a preceding question; the 
dubious completion-type item is introduced at 
this level. The spelling test is appropriately 
based. Language usage is first tested at this level, 
but the emphasis is not well balanced. For ex- 
ample, the development of sentence sense 15 not 
tested although it is a more important objective 
of instruction at this grade level than distinctions 
between “may-can” and “It’s me—It’s I” which 
are covered. Again, the arithmetic problems test 


TESTS & REVIEWS: ACHIEVEMENT BATTERIES 18 


is superior to that on arithmetic fundamentals. 
Even so, some problems testing further refine- 
ment in pupils’ concepts of shape and time might 
well have been included. It would be more ap- 
propriate, too, to test concepts of common frac- 
tions pictorially and in the testing of measure- 
ment than to give formal exercises in addition 
and subtraction of fractions. It should be noted 
here that the problems in fractions in this test, 
which involve adding and subtracting like frac- 
tions, are an improvement over the previous edi- 
tion in which combination of unlike fractions 
was criticized by previous reviewers. 
INTERMEDIATE AND ADVANCED BATTERIES. 
The same ten tests are included in both batteries. 
Comments made on the limited range of objec- 
tives measured by the reading tests at lower 
levels also apply here. Similarly, the language 
usage test stresses a formal correctness of usage 
at the expense of more basic objectives. The 
spelling test is generally satisfactory, but the test 
in the Advanced battery does not measure poor 
spellers adequately. The literature test, intro- 
duced at this level, suffers from the limitation of 


^ most tests in this area at all levels in that it can 


test only for superficial memory of detail, inci- 
dent, or character as evidence of acquaintance ; 
understanding is hardly tested. The arithmetic 
tests are generally sound. There is a good sam- 
pling of computational problems and interpre- 
tation of graphic materials is required. The items 
sampling understanding of algebra and geometry 
are few in number and not well chosen. They 
should either be expanded or omitted, The prob- 
lems test has a very high ceiling for the number 
of items involved. The social studies tests cover- 
ing history and civics are well balanced between 
American history (one half), civics (one third), " 
and world backgrounds of American history. So- 
cial and economic aspects of history, as well as 
political, are treated. The Intermediate test gives 
most emphasis to developments before the Civil 
War, the Advanced test puts more weight on 
events since the Civil War, probably a sound cur- 
ricular emphasis for most schools that use the 
tests. The emphasis is factual and the tests are 
apparently speeded. Advanced terms like “sink- 
ing fund,” “basilica,” “redoubt,” and “refectory” 
are to be found in the Intermediate test. The Ad- 
vanced test includes a good chronology section, 
but little on the U.S. Constitution. The social 
studies test of geography is well balanced be- 
tween physical and social aspects and between 
the geography of the Western Hemisphere and 
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that of the rest of the world. It involves a number 
of "thought" questions in both batteries and 
good map questions in the Intermediate battery. 
Why map questions are omitted from the Ad- 
vanced battery is unclear. The geography test at 
both levels appears speeded. The science test re- 
flects a good balance of subject matter of ap- 
propriate difficulty. The emphasis is on recall of 
specific fact with little on understanding or ap- 
plication of knowledge. The wording of 20 per 
cent of the items is ambiguous or imprecise. In 
some of the cases, the wording will not confuse 
the reader, but editing is in order. This test may 
also be speeded, as explained earlier. 

SUMMARY. The Metropolitan Achievement 
Tests are superlatively standardized measures 
of what they measure and are accompanied by an 
excellent manual and varied norms. The reader 
must be the judge of whether they measure what 
he wants measured in the form in which he wants 
it measured. 


For a review by Robert L. Burch of the arith- 
metic test, see 416; for reviews by James К. 
Hobson and Margaret G. McKim of the read- 
ing test, see 543; for reviews by E. V. Pullias 
and Hugh B. Wood of an earlier edition, see 
40:1189; for reviews by Peter L. Spencer and 
Harry Grove Wheat of an earlier edition of the 
arithmetic test, see 40:1458.1; for a review by 
D. A. Worcester of an earlier edition of the read- 
ing test, see 40:1551; for reviews by Jack W. 
Dunlap, Richard Ledgerwood, and Charles W. 
Odell of an earlier edition see 38:874; for re- 
views by Foster E. Grossnickle and Guy М. 
Wilson of an earlier edition of the arithmetic 
test, see 38:892; for reviews by Ivan A. Booker 
and Joseph C. Dewey of an earlier edition of the 
reading test, see 38:1105. 


[19] 
*Modern School Achievement Tests: Skills Edi- 
tion, Grades 2-8; 1931-49; 6 scores: reading compre- 
hension, reading speed, arithmetical computation, arith- 
metical reasoning, spelling, reading accuracy; Revised 
Forms I (49), II (’49); no data on reliability and 
validity; manual (48); $8.75 per 100; 356 per speci- 
men set; postpaid; 08(120) minutes; Arthur I. Gates, 
Paul R. Mort, and Ralph B. Spence; Bureau of Publi- 
cations, Teachers College, Columbia University. * 
REFERENCES 
1-3. See 40:1190. 


СнАкгз К. LANGMUIR, Associate Professor of 
Education, Syracuse University, Syracuse, New 
York. 

When these tests were first published in 1931, 
their validity was entirely based on an appeal to 
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the authoritative judgment of nine listed au- 
thors, the course-of-study ratings of the Bu- 
reau of Curriculum Research, Teachers College, 
Columbia University, and checks by “master 
teachers." The original manual contained no 
usable data. 

The revised edition issued 18 years later is 
essentially the same test with different items. All 
of the basic inadequacies of the 1931 issue are 
faithfully reproduced. The small editorial re- 
visions add up to a net loss. An effort to econo- 
mize on scoring the reading comprehension sub- 
test results in complicating the directions to the 
point where the items are practically puzzles. 
'The reading speed and accuracy subtests have 
been shortened and made more difficult. A. score 
of one item right is now reported as the norm 
for grade 2.0. The content of arithmetic com- 
putation has been improved slightly by inclusion 
of a graph, an equation, and geometric mensura- 
tion. Time is saved by having grades 8 to 12 
(sic!) begin at item 19. Since the scoring di- 
rections are incomplete, some users will be sur- 
prised to find the maximum eighth grade score 
(30 items) equivalent to grade 7.2, whereas a 
bright fifth grader might get 47 right and be 
equated to grade 12.7. Similar scoring difficul- 
ties apply to arithmetic reasoning. The problem 
can be solved in the key which should state that 
the score is the number of items right plus the 
number not given. In the original test the au- 
thors avoided extrapolations outside the observ- 
able range of grade equivalents by using the 
symbols H, V, and L, meaning high, very high, 
and low. But this sensible solution to extreme 
scores was dropped in favor of extrapolations 
which are patently absurd. 

The three authors of the revised edition ap- 
peal to the authority of the original 1931 edition. 
Furthermore, the manual states, “because the 
items have been carefully selected from standard 
curricula on file in the Curriculum Laboratory of 
Teachers College the tests can be used for di- 
agnostic study if the teacher so desires. One 
method is to examine the individual test papers 
to see whether significant patterns characterize 
the test results * Another method is to tabulate 
the records of all pupils on all items. This per- 
mits the diagnosis of both class and individual 
patterns. Does the class tend to show an usual 
number of successes ог failures...?" This is а 
very fine recommendation which could be fol- 
lowed if test authors would supply basic informa- 
tion. Just what observed result is unusual? 
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Probably the most outstanding fact about the 
tests is that they are catalogued, advertised, and 
issued for sale complete with a manual which 
reports not one single fact about the scores. The 
entire justification for the grade equivalent scale 
consists of one 43-word paragraph which is here- 
with printed in toto, “The norms for the revised 
tests were established by equating the new tests 
to the original tests. Twelve cities, with ex- 
penditure levels about average for the country 
and representative of different sizes of com- 
munities and of geographic sections, were chosen 
for the equating.” 

The manual contains a number of optimistic 
suggestions and questions addressed to teachers 
on the problems of pupils and schools which 
imply the value of test results in meeting the 
problems. The solution of all of these problems 
requires abundant experimental data presented 
in usable form. The requirement of usable form 
implies a logical substructure underlying both 
the construction of the tests and their experi- 
mental application. The Modern School Achieve- 
ment Tests do not offer any approach to solv- 
ing the questions put by the authors to their 
prospective consumers. 

The test could not be successfully or eco- 
nomically administered below grade 4. Any user 
at any grade level would have to depend wholly 
on his own experimental results for statistical 
information bearing on the evaluation of local 
findings. In summary, publication of this re- 
vision reveals an example of no progress what- 
ever over a period of two decades. 


For reviews by William A. Brownell, Herbert 
S. Conrad, and Herschel Т. Manuel of the first 
edition, see 40: 1190. 


[20] 

*Municipal Battery: National Achievement 
Tests. Grades 3-6, 6-8; 1938-50; 2 levels; 2 editions ; 
subtests except in spelling and health published as sep- 
arates; 50¢ per specimen set of any one level of any 
one edition; postage extra; Robert K. Speer and Sam- 
uel Smith; Acorn Publishing Co. * 
а) COMPLETE BATTERY. Grades 3-6, 6-8; 1038-50: 
Grades 3-6: Forms A (’48—a reprinting of the 1938 
test), B. (’s1—a reprinting of the 1939 test); Grades 
6-8: Forms A ('so—a reprinting of the 1938 test), B 
('50—a reprinting of the 1939 test); 10 subtests; no 
norms for part scores; directions for administering 
(’38—English norms revised in 1950) ; $3.25 рег 25; 
grades 3-6; 203 (225) minutes; grades 6-8; 202 (225) 
minutes, 7 1 

1) Reading Comprehension. 4 scores; following di- 

E sentence meaning, paragraph meaning, 

total. 

s Reading Speed. 

3) Spelling. 
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4) Arithmetic Fundamentals, 3 scores: computation, 
number comparisons, total. 

5) Arithmetic .Reasoning. 4 scores: comparisons, 
problem analysis, problems, total. 

) English. 5 scores: language usage—words, lan- 
guage usage—sentences, punctuation and capitaliza- 
tion, expressing ideas, total. 

7) Literature. 3 scores: motives and moods, miscel- 

laneous facts, total. 

8) Geography. 3 scores: geographical ideas and com- 

parisons, miscellaneous facts, total, 

9) History and Civics. 3 scores: lessons of history, 

historical facts, total. 

10) Health. 
b) PARTIAL BATTERY. Grades 3-6, 6-8; 1938-39; 6 sub- 
tests; Forms A (38), B (730); directions for admin- 
istering (738); $275 per 25; grades 3-6: 138(155) 
minutes; grades 6-8: 137(155) minutes. 

1) Subtests same as 1-6 in Complete Battery. 


RarrH C. Preston, Associate Professor of 
Education, University of Pennsylvania, Phila- 
delphia, Pennsylvania. 

This battery has several praiseworthy fea- 
tures. Its format is acceptable (although the 
printing is inferior—mutilated letters, words, 
and lines appear in all forms). The spelling sub- 
test has the helpful arrangement of providing a 
sentence with a blank line on which the dictated 
word is written. This should reduce errors 
which so commonly arise in spelling tests as a 
result of faulty auditory discrimination on the 
part of the pupil or faulty enunciation on the 
part of the test administrator. The literature 
subtest goes beyond testing mere recall of factual 
material and calls upon skill in interpreting mo- 
tives and moods in reproduced passages from 
familiar literature. 

Another of the commendable features is the 
table of norms for each of three levels of intelli- 
gence—IQ’s below 90, IQ’s from 9o to 109, and 
1075 above 109. Unfortunately, information is 
not given concerning the intelligence test or 
population upon which these norms are based. 
When properly used, however, this device should 
help to reduce the still common malpractice of 
judging achievement in terms of grade place- 
ment instead of in terms of capacity. Teachers 
should be reminded, however, that mental age 
is a better criterion than IQ, and that IQ will 
serve adequately, therefore, only in schools 
which obtain a high degree of homogeneity with 
respect to chronological age in any given grade. 

The authors of this test laudably attempt to 
test “the power to apply facts” and “ability to 
make accurate judgments.” They hope to avoid 
the pitfall of other tests which give an unfair ad- 
vantage to the pupil who is a good memorizer 
yet a superficial thinker. Certainly this is a 


worthy aim, and it is applied in the language arts 
and arithmetic portions of the test (though in- 
explicably absent in the social studies portions 
as will be discussed later). However, one of the 
regrettable and unnecessary results is that some 
of the items in the reading comprehension sub- 
test resemble too closely items that would seem to 
belong more appropriately in an intelligence test. 
Some of the items tend to call for a type of logical 
analysis and a mental agility which are not re- 
quired in typical reading situations. For ex- 
ample, one item reads: “Look at these five let- 
ters: h g f o s. Draw a circle around the middle 
letter of the five letters. On the line at the end 
of this sentence, write the one letter which comes 
just before the letter around which you drew a 
circle." 

'The reading speed subtest will not measure 
reading speed for all children. In one form, the 
child is directed to draw a line under every 
sentence which tells that John came home—a 
statement which occurs frequently in the 18 
paragraphs which follow. Although the direc- 
tions request the child to read every sentence 
quickly, it is unlikely that all will do so. A per- 
fect score could be secured by doing little more 
than locating and underlining the phrase “John 
came home" or variations of it (such as “John 
soon came home"). A few foils occur, such as 
*he came home," requiring the child to deter- 
mine whether John or Fred is referred to. For 
some children, however, this subtest is more 
nearly a test of speed of skimming for the loca- 
tion of a particular phrase than a test of genuine 
speed of reading. 

The social studies subtests fail to reflect cur- 
riculum trends. For example, despite the shift 
to greater fusion in social studies in the ele- 
mentary school, geography and history are 
treated in isolation (although, to be sure, history 
and civics are merged). Nor do these subtests 
recognize the unmistakable shift from emphasis 
on discrete facts to emphasis on generalizations 
and "cross-fertilized" facts. Typical multiple 
choice items are: “Which country, among the 
following, was England's strongest competitor 
in world trade before 1914?” and “Which of 
these countries has sold the most goods to the 
United States?" One cannot quarrel with in- 
formational items in a social studies achievement 
test, but one can expect, in view of the broad- 
ened objectives of social studies instruction, to 
find many more items dealing with principles 
and generalizations. This is a particularly justi- 
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fied expectation from a test which claims, as 
does this one, to “add a new emphasis....on the 
power to apply facts and on the pupil's ability to 
make accurate judgments." 

One of the conspicuous lacks is an adequate 
science subtest. A health test alone represents 
the science area. It is hard to understand this 
gap, since comprehensive science programs have 
been rapidly developing among elementary 
schools for a number of years. 

Schools which аге contemplating the use of 
the Municipal Battery would be well advised to 
secure the partial batteries which deal with skill 
areas only. This is because the content areas in 
this battery consist chiefly of items dealing with 
discrete facts (as opposed to items dealing with 
generalizations). The elementary school cur- 
riculum is so dynamic and so lacking in uni- 
formity in the content areas (social studies, 
science, and literature), that it is unlikely that 
factual subtests of this sort would fairly evaluate 
learning outcomes for any given school. A con- 
tent-area test, to be helpful under these circum- 
stances, needs to emphasize universal principles 
and skills. Certain other tests have made good 
starts in this direction. 

Final judgment concerning the utility of this 
battery cannot be made because of the apparent 
unavailability of precise information concern- 
ing its development. This test has been on the 
market for 13 years, yet the authors and pub- 
lishers have not troubled themselves to publish 
complete information concerning construction, 
equivalence of forms, or standardization. Fur- 
thermore, the copyright dates may prove mis- 
leading. A page-by-page comparison of one of 
the 1938 forms with the same form dated 1950 
reveals that the latter is merely a reprint with 
no differences in items or in norms. 


For reviews by A. M. Jordan of the complete 
battery for grades 6-8 and Hugh B. Wood of 
the batteries for grades 6-8, see 40: 1191. 


[21] 
*Ohio Eighth Grade Test. Grade 8; 1934-51; 5 
scores: English, arithmetic, science, history, total ; new 
form published annually ; no data on reliability and va; 
lidity; no manual; norms compiled after publication © 
test; 5¢ per test; 2¢ per answer key; 5€ рег copy 0 
1950 form for review; postpaid; Ohio Scholarship 
Tests, Ohio State Department of Education. * . 


[22] 
*Ohio General Scholarship Test for High School 
Seniors. Grade 12; 1930-50; 6 scores: English, his- 
tory, mathematics, science, reading, total; new form 
published annually; IBM; Form 2-50 (750) ; no data 
on reliability and validity; no manual; Ohio norms 
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(50); Ohio schools: 50@ per test including scoring, 
tabulating, reporting of results, and norms; 10¢ per 
sample copy ; postpaid; 150(180) minutes; Ohio Schol- 
arship Tests, Ohio State Department of Education. * 


For a review by C. C. Ross of the 1947 edi- 
tion, see 3:14. 


[23] 

Progressive Tests in Social and Related Sci- 
ences. Grades 4-8; 1946-48; IBM; 3 parts; Forms A 
(46), В (48) ; manual ('47—designated Form А); 
356 per specimen set, postpaid; separate answer sheets 
may be used; 3€ per IBM answer sheet; 40 per sten- 
cil for machine scoring of answer sheets; 40€ per sten- 
cil for hand scoring of answer sheets; postage extra; 
nontimed (150) minutes, (160) minutes if separate 
answer sheets’ are used; Georgia Sachs Adams and 
John A. Sexson; California Test Bureau. * 

a) PART I, SOCIAL STUDIES I. 8 scores: exploration and 
colonization, the westward movement, later develop- 
ment of the nation, understanding of democracy, the 
American heritage (total of preceding 3 subtests), peo- 
ple of other lands, people of other times, people of other 
lands and times (total of preceding 2 subtests) ; $175 
per 25; nontimed (50) minutes, (53) minutes if sep- 
arate answer sheets are used. À 
b) PART П, SOCIAL STUDIES П. 8 scores: geographic 
facts (U.S.), geographic facts (world), reading maps 
and geographic terms, effects of geography on man, 
total geography, food-clothing-shelter, transportation 
and communication, basic social processes (total „of 
preceding 2 subtests) ; $2 per 25; nontimed (60) min- 
utes, (64) minutes if separate answer sheets are used. 
C) PART II, RELATED SCIENCES. 7 scores: eating for 
health, other health information, safety information, 
total health and safety, the world about us, man's in- 
creasing control over nature, total elementary science; 
$1.75 per 25; nontimed (40) minutes, (43) minutes if 
separate answer sheets are used. 


Harry D. Bere, Associate Professor and Ex- 
aminer of Social Studies, Michigan State Col- 
lege, East Lansing, Michigan. 

This battery of tests is well conceived and 
expertly executed in all departments including 
selection and division of content, individual item 
construction, provisions for administration, and 
the establishment of norms. Within the limi- 
tations of their purposes, which are clearly de- 
fined by the authors in an unusually complete 
manual of directions, the tests should prove 
highly useful in the elementary school area 
grades 4 to 8. 

The chief purposes of the tests are (a) to 
measure pupil knowledge of a sampling of com- 
mon learnings, (b) to provide part scores which 
will be of value in analyzing the status of indi- 
viduals, classes, and schools to show their needs 
in specific areas and to provide a basis for plan- 
ning and guiding instruction, and, (c) to test 
understanding of relationships and processes 
and to require the student to apply facts and 
principles to new situations. While the above 
purposes are inclusive, important, and capable 
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of being achieved by the battery, there are areas 
which are not adequately covered. I have in mind 
particularly the basic social science skills, Ex- 
cept for an unusually well done section on map 
reading skills, the tests are devoted to evaluat- 
ing content mastery and reasoning involving, 
content, It would therefore be wise to use the 
tests in conjunction with such a program as the 
Iowa Every-Pupil Tests of Basic Skills. 

The battery consists of three test booklets, 
each containing two major parts which are in 
turn divided into a series of subparts. Attention 
should be called to this breakdown because it 
illuminates several important features of the 
battery: its comprehensive coverage, its diag- 
nostic possibilities through the use of part scores, 
and its adaptation to the unit organization of 
school work, for the tests follow units which 
are commonly offered in the elementary school. 
To make the extensive data which the tests are 
capable of yielding as usable as possible, the au- 
thors have provided both age-grade and per- 
centile norms. A percentile profile chart for each 
student is printed on the student’s own test 
booklet where it may be easily related to per- 
formance on parts and individual items. 

A few words should be said about the manner _ 
in which the tests were validated. Instead of 
merely going to textbooks the authors analyzed 
courses of study, made individual item analyses 
to determine the consistency with which item 
success increased with grade level, asked class- 
room teachers to rank items in the order of their 
importance, and gave tryout forms containing 
many times the number of items finally used to_ 
large groups of students. The absence of trivial 
items reflects the use of these procedures. In- 
formation on reliability for each test at each 
grade level is also given. Reliability varies from 
.73 for Test 1 at the fourth grade level to .96 for 
Test 4 at the eighth grade level. It is worth 
noting that all reliabilities were computed for 
single grade ranges thereby preventing them 
from being as high as they might have been had 
less homogeneous populations been used. 

The adverse criticisms of the test are compara- 
tively minor. A few technical flaws are to be 
found in individual items and in at least one case 
an item is given away by an immediately pre- 
ceding one. There is also a tendency to empha- 
size the retention of facts for their own sake. 
This last criticism is mitigated in part by the 
general importance of the facts selected. Some 
parts of the tests contain fewer reasoning items 
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than others. The geography areas seem to have 
an abundance and the history areas a paucity. 
In some cases, but not many, the vocabulary 
may be slightly advanced for fourth graders. 
Words which are incidental«o the items such as 
“philosophy” and “secede” would seem to be 
in this class. One might also criticize the use of 
true false in addition to multiple choice exer- 
cises. Most of the items are of the latter type and 
itis hard to see why this generally superior form 
was not used exclusively. In some tests as few 
as 8 true false items as compared with 27 mul- 
tiple choice are used. 

As previously stated, all of the above criti- 
cisms are comparatively minor. They should not 
be allowed to detract from the overall excellence 
of the tests. For those who wish diagnostic tests 
capable of yielding highly useful and usable data 
on content achievement in the social and related 
sciences at the elementary school level, this is 
a series of tests worthy of careful consideration. 


J. Raymonp Gerzericu, Director, Bureau of 
Educational Research and Service, and Profes- 
sor of Education, University of Connecticut, 
Storrs, Connecticut. 

I. Tue Test BATTERY. A. PURPOSE AND GEN- 
ERAL NATURE. The authors describe this battery 
as “subject-matter tests which are in harmony 
with the modern school curriculum.” They list 
four purposes: (a) to measure knowledge gen- 
erally accepted as important ; (b) to provide part 
scores for analyzing the status and needs of in- 
dividual pupils, classes, and schools; (c) to test 
understanding and the ability to apply facts and 
principles in new situations; and (d) to facili- 
tate-an interpretation of results through the 
provision of age, grade, and percentile norms. 

The battery consists of 6 tests in 3 booklets, 
Each test contains 2 to 4 subtests (called sec- 
tions) making a total of 17 subtests. The number 
of items in subtests varies from 20 to 40. The 
number of items per test are: Test r, The Amer- 
ican Heritage, 95; Test 2, Peoples of Other 
Lands and Times, 50; Test 3, Geography, 105; 
Test 4, Basic Social Processes, 75; Test 5, 
Health and Safety, 75; and Test 6, Elementary 
Science, 65. 

B. VALIDITY. Validity is justified in terms of 
analyses of textbooks and other instructional 
materials, expert opinion, and item analysis tech- 
niques. Courses of studies of 18 states and 11 
cities, 66 textbooks, and several curriculum re- 
search studies were used in the preliminary se- 
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lection of content. Mimeographed editions con- 
taining from three to four times as many items 
as were planned for the final edition were ad- 
ministered to pupils in five school districts, 
Results from the 3,786 such tests were analyzed 
in terms of item discriminating power and in- 
creases in percentages of correct answers from 
grade to grade. 

The authors then employed what they de- 
scribe as “an infrequently used but highly ef- 
fective technique" in validating items. Reactions 
of 117 teachers and 51 supervisors, presumably 
specialists at the appropriate grade levels and in 
the pertinent areas, were obtained to the items 
of the preliminary edition. No item was chosen 
for the final edition unless at least 60 per cent 
of the raters listed it as A or B (testing essential 
or important and significant information), as 
contrasted with C, D, or E (testing fairly signif- 
icant, comparatively unimportant or inappro- 
priate, and inconsequential or trivial informa- 
tion), and unless it also met the statistical 
criteria. 

C. RELIABILITY. One of the Kuder-Richardson 
formulas (which one is not specified) was used to 
estimate reliability coefficients for each of the 
six tests for each grade from 4 to 8. Reliability 
coefficients are not given for the subtests. The 
sizes of samples upon which the reliability co- 
efficients were based are not reported. The co- 
efficients range from .73 to .96, with two below 
.80, seventeen from .80 to .89, and eleven .90 or 
above. There is a rather consistent increase in 
mean coefficients by grades from the .83 for 
grade 4 to the .92 for grade 8. Mean coefficients 
by tests indicate that Tests 4, 3, 6, 1, 5, and 2 
rank in that order from high to low. Two of these 
means are above .90, two are between .85 and 
:90, and two are below .85. 

D. ADMINISTERING AND SCORING. The six tests 
are designed for administration either with or 
without separate answer sheets in three sittings 
of two tests each. Specified times are 50, 60, and 
40 minutes for the three parts when pupils re- 
cord their answers in the booklets and one fourth 
again as much time when pupils use separate 
answer sheets. The authors indicate that the 
battery consists of power rather than speed tests 
and that pupils should be allowed such time as 
is necessary for them to complete work on each 
booklet. 

Hand scoring keys are printed in columnar 
form for scoring marked test booklets. Machine 
scoring cutout stencils are available for use 1n 
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the hand scoring or machine scoring of separate 
answer sheets. Hand scoring requires the mark- 
ing of responses to 465 items in three booklets, 
with the answers appearing in 54 columns on 27 
pages for the entire battery. Machine scoring 
and hand scoring of answer sheets entail the use 
of six stencils, one for each side of the three an- 
swer sheets. The test scores are obtained by 
adding the section scores ; in machine scoring the 
test scores as well as the section scores are ob- 
tained in the scoring process. All scoring is in 
terms of the number of correct answers, as no 
correction for chance or conversion to derived 
scores is involved. 

Hand scored answers as they appear in the 
test booklets are primarily of three types: i5 
single-letter written responses to multiple choice 
items, alternating from a-b-c-d for one item to 
e-f-g-h for the following item, (5) encircled “Т” 
or "F" responses to true-false items, and (c) 
the counterpart of machine scored answer sheet 
vertical marks between pairs of dotted lines for 
many of the map identification items of the ge- 
ography test. 

E. NoRMS. The test norms were established on 
results from the administration of 17,000 tests 
to pupils in grades four to eight of representative 
schools in nine states, The mental ability levels 
and chronological ages of pupils in the norma- 
tive group were taken into account, but the man- 
ner in which such information was used is not 
specified. The norms provide for grade equiva- 
lents ranging from 2.4 to 12.0 and for age equiv- 
alents ranging from 93 to 206 months for each 
of the six tests. Percentile norms by thirds of 
grades are given for each of the 6 tests and 17 
subtests. The norms provide equivalents in terms 
of raw score ranges for areas represented only 
by the nine deciles and by percentiles of 1, 5, 95, 
and 99. A difference of one point in a subtest 
score frequently results in a change of 10 points 
on the percentile rank scale if the score is near 
the middle of the distribution, suggesting that 
the subtest scores, often based on small numbers 
of items, have such low degrees of reliability 
that they are of little significance. Even the per- 
centile ranks for test scores fail to provide for 
very precise ranking. 

F. REPORTS OF RESULTS. The cover page of 
each test booklet and the front side of each an- 
swer sheet provide positions for recording test 
and section scores, grade equivalents, and per- 
centile ranks for the designated grade as well as 
a profile chart upon which percentile ranks for 
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the appropriate grade may be represented graph- 


ically. The three class record sheets, one for each 
part, provide for listing the name, sex, and grade 
of each pupil, his grade placement on the tests, 
and his percentile ranks on the sections. Pro- 
vision is also made for class summaries of test 
scores by grade levels in halí-year intervals and 
of subtest scores by the above percentiles. Ap- 
parently no provision is made for a single record 
and profile on which results from all six tests 
can be summarized for each pupil or for a single 
class record sheet providing a ready overview of 
group results for all six tests. 

The percentile norm graphs are of doubtful 
value for two reasons : percentile norms are more 
appropriate to the high school and college levels 
than to the intermediate and advanced grade 
levels ; and, since the percentile norms are coarse, 
many test users will place more confidence in 
them than is warranted by their reliability, par- 
ticularly for the subtests. 

G. CONTENT. Although the test material is 
factual in nature, as is often true of achievement 
tests in the content areas, many of the items 
measure such higher level and less tangible out- 
comes as concepts, understandings, and applica- 
tions, while others, particularly in Test 5, deal * 
with attitudes. Time sequences, causes and ef- . 
fects, and other types of relationships receive 
emphasis in the history, basic social processes, 
and science tests. Factual place geography and 
also understandings, as measured in one section 
by use of a map of an imaginary country, receive 
emphasis in Test 3. Practically identical items 
appear in the two forms of a test on occasion, but 
such overlaps are apparently random rather than 
planned with the idea of developing forms having : 
paralleling content throughout. In general the 
test content per se seems to be appropriate to 
the grade levels for which the battery is designed 
and the instructional outcomes its authors seek 
to measure. 

H. ITEM TYPES AND CHARACTERISTICS. More 
than 60 per cent of the 465 items in the battery 
are of the 4-response multiple choice type and 
approximately 30 per cent are in true-false 
form. Small numbers of 3-response multiple 
choice and alternate response not of the true- 
false variety make up the remainder. Occasion- 
ally, 3-response and 4-response items appear to- 
gether in the same subsection. With the excep- 
tion of 37 map identification items in multiple 
choice form, nearly all of the items stand alone. 
Forms A and B parallel each other exactly in 
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numbers and types of items by test, section, and 
subsection. à 

Only one subtest of the battery, Section А of 
Test r, is made up of one item type and runs 
from the first to the last item without a break 
for instructions. Fairly typical for a section is a 
group of 5 to 18 true-false items, followed by 
7 to 27 4-response multiple choice items. Ap- 
proximately 40 sets of instructions to pupils 
appear in the entire battery. Some of these repeat 
instructions given for earlier sections and sub- 
sections. 

At least one pair of items appears to demand 
contradictory responses. In Test 5, Section A, 
Item 5 is keyed “T” for the statement “Digestion 
begins in the mouth," but Item 8 of Form B for 
the same test and section is also keyed “T” for 
the statement “Digestion begins in the stomach.” 
The “British Empire,” referred to in Test 2, 
Section A, Items 2 and 19 of Form A and Items 
3 and 16 of Form B presumably went out of 
existence when the British Commonwealth of 
Nations was established some years ago, al- 
though the distinction may be a rather fine one. 
Item 97, Test 3, Section D of Form A includes 
“grazing” as one of the alternative occupations 

` in which people may engage. The reviewer finds 
it strange to think of "eating" milk (Test s, 
Section A, Item 15 of Form A), to regard hot 
water as a “food” (Test 5, Section A, Item 24 
of Form A and Item 15 of Form B), and to 
describe such various organisms as ants, bees, 
butterflies, ducks, larks, moths, robins, spiders, 
and swallows as “animals” (Test 6, Section A, 
various items). Although broad dictionary defi- 
nitions lend support to some of these usages, one 
might believe, especially since the terms “in- 
sects” and “birds” are used in the stems of other 
items, that intermediate and advanced grade 
pupils should be able to discriminate rather more 
finely. 

Item 25, Test 1, Section B of Form A in- 
cludes a factual clause which gives the answer 
to Item 30 of the same test, section, and form. 
Item 51, Test 1, Section C of Form A clearly im- 
plies the correct answer to Item 58 of the same 
test, section, and form. Items 16 and 32, both 
in Test 4, Section A of Form A, test the same 
idea, although in one item by way of producing 
“vegetables” and in the other by “truck farm- 
ing." Items 42 and 44 of Test 3, Section B of 
Form B deal with the production and sale of 
tea by China in such manner that the right 
answer to both items is at least suggested by the 


[ 58 


occurrence of the words “China” and “tea” in 
each. 

There are some other deviations from com- 
monly accepted principles of item writing and 
test construction: personal pronouns are some- 
times used, the statement and question forms are 
intermingled in what appears to be random 
order in multiple choice sections, there are many 
subsections, some of them very short, and vary- 
ing item forms occur and recur very frequently, 

I. FORMAT. The format of the test is good but 
not superior. Some of the multiple choice items 
in the two-column arrangement used throughout 
the battery require as much as three inches of 
vertical space, with the responses consequently 
appearing to be quite tightly printed. This co- 
lumnar format and varying lengths of answers 
result in such varieties of spacing for the re- 
sponses to multiple choice items as a paragraph 
form, a block columnar arrangement, a stag- 
gered one-column arrangement, a staggered 
two-column arrangement, a four-in-a-row ar- 
rangement, and others. Occasionally the text of 
an item encroaches on the answer column. The 
“F” responses for true-false items are in some 
subsections unduly close to the identifying item 
numbers at their right, whereas in other sub- 
sections this type of spacing is appropriate. A 
somewhat smaller type size might well reduce 
the overcrowding and permit the use of greater 
consistency in the form of responses to multiple 
choice items. 

Typographical errors and omissions, at least 
one of them potentially serious, occur rather 
frequently. A verb is repeated in Item 12, Test 
3, Section A of Form A. No answer position 
and final item number are printed for Item 89, 


‚ Test 1, Section D of Form A. The “Т” and “Ей 


answers and final identifying number of the item 
do not appear for Item 42, Test 4, Section B of 
Form B. A misprint in the age equivalents col- 
umn of normative data in the manual shows 
“120” where either “115” or “116” should ap- 
pear. Occasionally, but not consistently, the ini- 
tial letters of first words, even when the words 
are articles, are capitalized in all four alternative 
answers to multiple choice items, although in 
general such initial letters аге in lower case ex- 
cept for proper nouns. 

П. Tur Test ВАТТЕВҮ ІХ Use. A. PUPIL 
GROUPS TESTED. Аз a check on certain functional 
characteristics of the battery, Form A of all 
three booklets was administered with separate 
answer sheets during April 1951 to 76 pupils n 
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the fifth grade of two graded elementary schools 
in a rural section of Connecticut—one attended 
quite largely by children of University of Con- 
necticut faculty members and the other more 
typical of rural elementary schools in eastern 
Connecticut. The time allotments in each school 
were those specified when separate answer sheets 
are used. No attempt was made to allow all pu- 
pils to complete work on each booklet. Stand- 
ardized tests with machine scored answer sheets 
had previously been taken by both pupil groups. 

B. SCORES IN RELATION TO ITEMS OMITTED 
AND UNATTEMPTED. Median grade equivalents 
for the six tests were: Test 1, 6.1; Test 2, 4.5; 
Test 3, 6.2; Test 4, 4.8; Test 5, 5.4; and Test 6, 
5.0. It is notable that the medians for Tests 1, 
3, and 5 are higher than the medians for the 
even numbered tests in the corresponding book- 
lets. The ability levels of these pupils are prob- 
ably underestimated on Tests 2 and 4, possibly 
even on Test 6. Item counts were made of un- 
attempted items, that is, those items occurring 
without answers after the last item for which a 
response was recorded, and of items omitted. 
On each of Tests 1, 3, and 5 the mean number 
of items omitted was at least twice as great as 
the mean number unattempted, while for each 
of Tests 2, 4, and 6 the mean number of items 
unattempted was at least twice as great as the 
mean number omitted. For Tests 1, 3, and 5 the 
means of omitted and unattempted items com- 
bined constituted 8.6 per cent, 12.7 per cent, and 
8.7 per cent respectively of the total numbers of 
items. Similarly, 32.8 per cent, 27.7 per cent, 
and 22.0 per cent of the total numbers of items 
were omitted or unattempted on Tests 2, 4, and 
6 respectively. It thus appears that approxi- 
mately one tenth of the items on the three first 
tests of each sitting were not answered, whereas 
for the following even-numbered tests approxi- 
mately one fourth of the items were not an- 
swered. It seems reasonable to conclude that 
less than an appropriate time allotment was de- 
voted to Tests 2, 4, and 6 by this group of fifth 
grade pupils and that their results were well 
below normal expectation on the first two if 
not on all three of these tests. 

It is the belief of the reviewer that test authors 
and publishers are obligated to recommend spe- 
cific time limits for achievement tests and to be 
sure that recommendations are sufficiently am- 
ple to permit 9o to 95 per cent of the pupils to 
complete the tests during the allotted time. Great 
administrative inconvenience is caused test users 


by extensions of test timing beyond specifica- 
tions, especially if the amount cannot be antici- 
pated in advance and may turn out to be greai 

C. CHANCE SCORES IN RELATION TO TEST 
Norms. The concept of the chance score merits 
use in the analysis of test results. The means of 
the chance score distributions were determined 
by the reviewer for the six tests of the Progres- 
sive battery. The means for Tests 1 to 6 re- 
spectively are 30.8, 20.5, 29.9, 25.3, 27.5, and. 
22.5. It is interesting to note that the corre- 
sponding grade equivalents for these chance 
scores are respectively 4.2, 4.6, 4.4, 3.9, 3.0, and 
3.3. Percentile rank equivalents for chance scores 
at the 4.34.6 level are 50, 50, 50, 40, 10, and 20. 
The chance score on Test 2 will even place the 
pupil at the tenth percentile for an 8.7-8.9 grade 
placement. It seems apparent, in view of this 
evidence, that at least the first four tests are 
unsuitable for use in grade 4. No account is here 
taken of the probable upper limits of individual 
chance scores in terms of the standard deviations 
of the chance score distributions because of the 
complexities involved in obtaining such stand- 
ard deviations for tests employing two or more 
item types with different chance patterns. It is 
certain that use of this more refined technique 
would place these tests in an even more un- 
favorable position for use in the lower grades. 

p. TEST RELIABILITIES. Reliability coefficients 
were computed by the reviewer for each of the 
six tests, using the results for the 76 fifth grade 
pupils who took the entire battery, by the split 
half and Kuder-Richardson (Formula 21) meth- 
ods. The split half correlation coefficients, 
stepped up by use of the Spearman-Brown 
formula, were .93, .94, .96, .94, .90, and .95, 
respectively for Tests 1 to 6. Comparable Kuder- 
Richardson estimates were .90, .92, .94, .95, .86, 
and .95. These two sets of estimates compare 
favorably with each other and both are in all 
instances higher than the comparable Kuder- 
Richardson estimates of .84, .76, .93, .89, .84, 
and .86, reported by the authors for Grade 5. 

The reviewer believes that both the split half 
and the Kuder-Richardson estimates are spuri- 
ously high because of the large number, of 
omitted and, more particularly, unattempted 
items, and indirectly because the standard devia- 
tions are greater than would have been obtained 
if the many low scoring pupils had been given 
sufficient time to complete the test and hence to 
raise their scores. The fact that percentages of 
the 76 pupils attaining scores below the means 
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of the chance score distributions were 19.7, 53.9, 
188, 38.2, 7.9, and 30.3 for Tests rito 6 re- 
spectively lends support to this h 

ПІ. Summary, In the judgment of the re- 
viewer, this battery embodies a good but not a 
superior level of workmanship. It seems to be 
well validated, and the test scores apparently 
ers sufficient reliability to serve in group com- 
parisons and probably in individual pupil guid- 
ance. The section scores, however, are probably 
too unreliable to serve any significant purpose 
and most of the battery seems unlikely to func- 
tion well at the grade 4 level. 

Test content seems in general to be appro- 
priate to the purposes outlined for the battery 
and at the level of pupil maturity for which it 
is designed. Many of the test items do not meet 
high standards for item writing, and there is a 
sufficient number of contradictions, overlaps, 
and inconsistencies among the items and also 
enough discrepancies in the printing to indicate 
that the final test copy could well have been 
edited more carefully, 

Recommendations for timing the test seem 
likely to present major problems to the user at 
the lower grade levels, and the reviewer believes 
that understatements of results for Tests 2, 4^ 
and 6 will occur on the many occasions when 
very lengthy testing sessions are not, and per- 

even cannot be, provided. The percentile 
norms are probably too coarsely grouped to be 
of great value, and they doubtless have very low 
reliability for section scores. It seems likely that 
grade norms, or age norms if desired, afford the 
most satisfactory and at the same time reliable 
basis for interpreting results from the six tests. 

When compared with the social studies and 
science parts of the intermediate and advanced 
levels of the Stanford Achievement Test and 
the Metropolitan Achievement Tests, this bat- 
tery does not seem to i 
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Srantey D, Nisser, Professor of Ed 
University of Glasgow, Glasgow, Scotland, 

This is a set of three attainment tests for 
year-olds, the school subjects tested being 
metic, reading, and spelling. They are desig 
for British school children and could not be 
fully employed outside Britain. 

Technically the tests are good. The use 
pictures, the layout and printing, and the is 
structions in the manual about incentives, @ 
mosphere, and breaks are all very sound, Th 
tables of norms are clear and easily read, fg 
spection of these tables shows that the raw seq 
have a good spread, and that the tests, 
fore, presumably have satisfactory discrimig 
ing power. The standardization is trustwort 
being based on a complete age group in th 
County of Northumberland. The general polig 
too, of producing tests for a single age group 
а good one, allowing finer discrimination thi 
would otherwise be possible, The only technie 
criticisms which I would make are: (а) R 
ability coefficients are not given in the manu 
(b) The passages in the reading test are po 
sibly not varied enough in subject matter 
give children of different interests an equ 
chance (there are too many descriptions of 
ture—birds, fruit, gardens, etc,—whereas ¢ 
would expect some of the many other spheres 
children's interests to be represented). (c) Th 
instruction in the manual that there should 
no exhortation to the children to do their best 
and that "however little a child has done, 
efforts should be praised" seem perhaps qu 
tionable, since probably more children of 
age do badly on tests because they are not t 
hard than because they are nervous. 

It is on more general grounds, however, 
I am most uneasy. Obviously the results of t 
in these formal subjects depend very much, 
pecially at this early age, on the educatio 
policy of the individual school. Some 
will introduce formal work at the age of 
others not till six, while others prefer to ре 
pone it till the child enters the Junior School 


would never expect their seven-year-olds ta 
to do multiplication and division of mon 
which 
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tests should never be used to mold the curricu- 
lum," extensive use of such tests, nevertheless, 
tends to have this effect. For this reason, tests 
devised and standardized locally to evaluate 
achievement in a community where recent cur- 
riculum improvements have been made are some- 
times superior to achievement tests standardized 
on a national basis. While this may be true in 
the skill areas to a limited extent only, it is par- 
ticularly true in the content areas. 

'This test is accompanied by carefully con- 
structed national norms based upon the test re- 
sults of 300,000 pupils in 173 communities in 
32 states. A commendable feature is that two 
types of norms are provided: (a) norms based 
on model age groups, from which accelerated 
and retarded pupils were eliminated and (b) 
traditional norms based on the total population 
tested. 

The coefficients of reliability of the subtests 
range from .71 to .95. The test is sufficiently 
reliable for individual diagnosis among the vari- 
ous school subjects but not, as the authors cau- 
tion, for diagnosis within a subject matter area. 
It is designed as a survey test, not as a diagnostic 
test. 

The subtests in the content areas (literature, 
social studies, and elementary science) are the 
least reliable of the subtests. It is questionable 
whether they should have been included at all, 
since much of the criticism of this test must be 
directed toward them. The partial batteries, con- 
taining only the tests in the skill subjects, are 
far superior to the complete batteries. Schools 
using the complete batteries should interpret 
scores in the content areas with great care. The 
content of the curriculum in these areas in a 
particular school in 1951 may be much different 
from that indicated by a 1940 analysis of courses 
of study developed prior to that date. 

It should be mentioned in this connection, 
however, that a major revision of the Stanford 
Achievement Test has been underway during 
the past year and new forms of the test will be 
available in 1952. The following changes are 
being made in the test: four batteries of tests 
will be available rather than three, a new subtest 
in study skills will be added, the spelling tests 
will be a multiple choice rather than a dictation 
test, the Intermediate and Advanced Batteries 
will be designed for machine scoring, and 50 
per cent of the items will be new. 

The Stanford Achievement Test was a pioneer 
in its field when it was first published in 1923. 


It has been one of the foremost tests since that 
date. With the appearance of the 1952 revision, 
it will undoubtedly retain its position as one of 
the finest achievement tests available. 


Vircu. E. Herrick, Professor of Education, 
University of Wisconsin, Madison, Wisconsin, 

The reader will want to examine the previous 
reviews of the Stanford Achievement Test by 
Walter W. Cook and Ralph C. Preston. No new 
revisions have been made in this test since the 
publication of these reviews in the Third Mental 
Measurements Y earbook. 

Test items in the Stanford tests can be de- 
fended on the basis of their ability to discrimi- 
nate between the achievement of individuals 
and different age groups, their comparability 
with similar items in duplicate test forms, and 
on the basis of their being judged representative 
of materials in present courses of study. They 
can be criticized, nevertheless, on the follow- 
ing bases. (a) Few workers in curriculum would 
accept courses of study as a basis for determin- 
ing either what is currently taught or what is 
considered educationally valuable. (b) Too great 
an emphasis is placed on learning at the factual 
level. In paragraph meaning and in arithmetic 
reasoning subtests, some attempt has been made 
to get the student to apply his knowledge and to 


infer from facts, but this is not a major emphasis. 


(c) An achievement battery designed for broad | 
general use must by its very nature be restricted 
to materials which are shopworn and representa- 
tive of the minimum (if not the worst) of com- 
mon practice. The reading paragraphs are loaded 
with "Long ago a king named Bruce had been 
defeated six times by another king....Once upon 
a time the Danes were trying to surprise a group 
of Scotch soldiers.."; (d) The elementary 
schools of this country are working hard to 
develop important language, intellectual, and so- 
cial processes. It would seem that more en 
phasis on these important learning processes ап 
on key concepts and generalizations, in addition 
to some factual materials, would improve the 
curricular validity of the test. i 
Examination of the word usage and spelling 
subtests in the primary, intermediate, and ad- 
vanced test booklets using the Thorndike-Lorge 
and the Rinsland word lists as criteria revea 
that the words included in these tests are well ' 
graded as to difficulty and that they are in the 
vocabularies. of children. The spelling test 18 
again developed on the basis of writing lists °% 
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words, a technique which is practiced in the 
usual spelling class but which has little to do 
with evaluating more functional spelling skills. 

Examination of the content range in para- 
graphs in the reading comprehension subtest 
indicated that some attempt had been made to 


include science materials and, in the primary — 


battery, social experiences close to the life of the 
child. Most of these subtests, especially those on 
the intermediate and advanced levels, are heavily 
loaded with fable, historical folk lore, and “tit- 
erary" materials. The Social Studies subtests I 
and II tend to stress factual and place informa- 
tion and to minimize the development and use 
of important historical and geographical concepts 
in dealing with common problems of human re- 
lationships. These subtests also fail to deal with 
skills involved in the historical and scientific 
method and in the use of historical and geo- 
graphic materials, maps, globes, and so forth. 
Perhaps the least useful subtest in the inter- 
mediate and advanced battery is the elementary 
science test. This test is one of factual informa- 
tion. There is some question whether baboons, 
evergreen trees, and panthers are very impor- 
tant subjects to use to survey the science under- 
standings and skills of elementary school chil- 
dren. The content of this test has little to do 
with what modern science programs are stress- 


ing. The arithmetic reasoning subtests stress the. 


use of important mathematical concepts in prob- 
lem situations. Some care has been exercised to 
get the content of these problems within the life 
experience of most children. The subtest on 
computation is the traditional sampling of the 
arithmetical processes on the various levels and 
is adequate for survey purposes. 

A number of technical questions might be 
raised. One question is how much responsibility 
should a single paragraph in the reading tests 
carry for providing opportunities for right or 
wrong responses. In general, one or two re- 
sponses are provided in each paragraph. In 
some tests, however, as high as five responses 
are developed for a single paragraph. Is there 
an unusual weighting for this group of re- 
sponses? A second question is the use of a single 
or small group of test items to make judgments 
about the adequacy of relatively large areas of 
work. In one subtest, the failing or passing of 
one item at a particular point would make a 
5-month difference in the grade placement of 
the individual in this subject. In another sub- 
test the passing or failing of an item would make 


TESTS & REVIEWS: ACHIEVEMENT BATTERIES 26 


no difference. Most of us would not want to 
evaluate a year's work in a subject area on the 
basis of the responses to from three to fifteen 
items. 

These above points raise serious questions 
about the use of any achievement test battery 
for the purpose of drawing broad sweeping gen- 
eralizations about the achievement of children, 
Certainly no school system would want to use 
the material in these tests as a basis for determin- 
ing norms for the represented subject areas. 
These tests are very valuable, however, if used 
for particular purposes and interpreted intelli- 
gently. The reviewer feels that the reading tests 
can be improved through better selection of 
content; the science and social study tests are 
very questionable; the word usage and spelling 
tests, within their limitations, are well-done; 
and the arithmetic tests are adequate. The Stan- 
ford Achievement Test represents some of the 
best practice in this field of testing. From the 
point of view of the curriculum of the elementary 
school, however, the content material in this 
test is not representative of sound, generally ap- 
proved practice. 


For a review of the literature test by Winifred 
Post, see 195; for a review of the arithmetic test 
by Robert L. Burch, see 419; for a review by 
James R. Hobson of the reading test, see 555; 
for reviews by Bertram Epstein and Paul E. 
Kambly of the science test, see 593; for reviews 
by Walter W. Cook and Ralph С. Preston, see 
3:18; for reviews by Margaret G. McKim of 
the reading test, see 3:503; for a review.by Ray 
G. Wood of the social studies test, see 3:505. 


[26] 

Tests of General Educational Development. High 
school, college; 1944-45; IBM; 2 levels; Form B 
(44); (Forms A, Y, and Z restricted to the Armed 
Forces; Forms C, D, and E restricted to the Veterans 
Testing Service program); no data on reliability ; 
manuals ('45); separate answer sheets must be used; 
8oé per 25 IBM answer sheets for any one test; 15° 
per scoring stencil for any one test; cash orders post- 
paid; 50% per specimen set of any one test, postpaid; 
worklimit (120) minutes per test; prepared by the 
Examinations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation; distributed by Cooperative Test Division, Edu- 
cational Testing Service. * 
d) HIGH SCHOOL LEVEL. 5 tests; $2.50 per 25 of any 
one of Tests 1-4; $2.25 per 25 of Test 5. 

т) Test 1, Correctness and Effectiveness of Expres- 

sion. } ЖГ 

2) Test 2, Interpretation of Reading Materials in 

the Social Studies. y } RU 

3) Test 3, Interpretation. of Reading Materials in 

the Natural Sciences. . ! ‘ 

4) Test 4, Interpretation of Literary Materials, 
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5) Test 5, General Mathematical Ability. 

b) COLLEGE LEVEL. 4 tests; out of print: Form B. 
т) Test т, Correctness and Effectiveness of Expres- 
sion. 
2) Test 2, Interpretation of Reading Materials in 
the Social Studies. м Re 
3) Test 3, Interpretation of Reading Materials in 
the Natural Sciences. D 
4) Test 4, Interpretation of Literary Materials. 
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Gustav J. Екоєнілсн, Assistant Director, Bu- 

reau of Institutional Research, University of Il- 

linois, Urbana, Illinois. [Review of Form B.] 
The Tests of General Educational Develop- 


- ment constitute a comprehensive achievement 


battery covering the major areas of the typical 
high school curriculum as well as the first two 
years of college work as generally offered in 
survey courses in the liberal arts. There are 
separate high school and college level subtests 
in Correctness and Effectiveness of Expression, 
Interpretation of Reading Materials in the Social 
Studies, Interpretation of Reading Materials in 
the Natural Sciences, and Interpretation of Lit- 
erary Materials. There is an additional high 
school level test in General Mathematical Abil- 
ity. 

"the test battery was developed for use in the 
educational and vocational guidance of veterans 
of World War II. More specifically, the high 
school level tests were to serve as a “high school 
equivalence examination," that is, as a basis for 
determining whether or not a veteran who had 
never attended or never completed high school 
had nevertheless acquired the equivalent of a 
general high school education. The college level 
tests were to provide a measure of the degree to 
which, the returning veteran was ready to do 
advanced college work, and the number of units 
of credit he should receive for introductory col- 
lege courses which he had never taken. The 
battery was originally prepared in two com- 
parable forms, A and B. Form A was intended 
for military use only, and Form B for general 
civilian use. z 

Form A of the high school level tests is still 
restricted to exclusive use in the armed forces. 
Two additional equivalent forms, Forms Y and 
Z, are also restricted to military use. Forms C, 
D, and E are reserved for exclusive use in the 
Veterans’ Testing Service program. Form А 
of the college level tests is used at the present 
time in both the armed forces and the agencies 
of the Veterans’ Testing Service. Form B of 
both the high school and the college level tests 
is available to high schools, colleges, and ойе 
responsible organizations for use with civilian 
students, At the present time this form is used 
primarily for purposes of general educational 
appraisal rather than for accreditation purpose 
Form B of the complete battery was made avail- 
able to this reviewer. 


6s ] 
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The Tests of General Educational Develop- 
ment, as the title implies, is designed to measure 
broad outcomes of learning—the acquisition of 
concepts, ideas, and generalizations, rather than 
the memorization of facts. The approach in the 
tests of the social studies, the natural sciences, 


and literature is through the reading situation. 


These subtests consist of a number of selected 
reading passages “taken from textbooks and 
other writings” in the area being tested. Each 
reading passage is accompanied by multiple 
choice questions which call for an interpretation 
and evaluation of the passage read. While it is 
obvious that test items of this kind must, in 
effect, constitute a reading test, a critical exami- 
nation of the reading passages actually used in 
Form B reveals that these tests are much more 
than tests of reading. The several passages are 
such that they can be comprehended readily, or 
interpreted and evaluated effectively, only by 
those whose background experiences and basic 
knowledge in these respective fields are rich and 
diverse. The college level test in social studies 
also has a number of marked multiple choice 
items which can be answered directly from the 
testees’ general knowledge. 

Correctness and effectiveness of expression is 
measured by means of a multiple choice spelling 
test and a series of multiple choice exercises in 
which the testees make a decision as to whether 
suggested changes or corrections will actually 
improve a theme containing representative sam- 
ples of errors and inept expressions characteris- 
tic of the writing of high school and college 
students. 

_ The high school level mathematics test con- 
sists of problem solving exercises, of a more or 
less practical type, with multiple choice answers. 
Included also are a few simple problems involv- 
ing high school algebra and geometry. 

The controversy over the validity of the Tests 
of General Educational Development continues. 
Unfortunately the test manuals are still devoid 
of objective evidence. Their discussion relative to 
validity continues to be in terms of face validity 
—that the GED tests do provide a procedure for 
assigning high school and college credit which 
is far superior to the alternative method of grant- 
ing blanket credit to returning war veterans. 
This position is subscribed to by most educators ; 
however, with the increasing popularity of the 
civilian battery (Form B) in adult education 
programs of nonveterans, the social utility upon 
which the earlier arguments for face validity are 


> 


based decreases in significance. The principal 
objection raised is that the reading-reasoning 
outcome measured by the GED tests—although 
a very important “general development” —is not 
the only one that should be considered. 

The sponsors of the GED tests recently rec- 
ognized {һе need for a comprehensive evalua- 


„tion of this test battery ; consequently the Amer- 


ican Council on Education was prevailed upon 
to sponsor such a project. The report on the 
project (38) indicates that a truly exhaustive 
study was made. The report concludes: "Quite · 
obviously, the passing of a test is not the same 
as spending four years in high school or taking 
a six semester-hour college course." But it con- 
tinues, “There can be no question that the total 
effect has been beneficial both in terms of im- 
mediate results and in terms of developments on 
which to build for the future." This conclusion 
is based primarily upon the following findings: 
(a) The professional caliber of the test authors 
is of the highest; their methods were thorough 
and painstaking. (b) The test items are highly 
discriminating when considered against the total 
test scores—internal consistency coefficients ap- 
proaching .50 were found for the high school 
battery, Form B. (c) The items are somewhat 
easier than is commonly desirable or usual in 
achievement tests—the average item ease ranges 
from 55.7 per cent for the mathematics test to 
70.0 per cent for the English expression test. 
This is defended on the basis that the emphasis 
is on the pass-fail point rather than on grading. 
(d) The GED program continues to be large. 
More than half of the states now provide for the 
issuing of high school equivalency certificates on 
the basis of GED test data. All branches of the 
armed forces accept successful completion of the 
GED test as being the equivalent of high school 
graduation. Many civil service programs accept 
the GED certificate in lieu of a high school di- 
ploma. On the other hand, acceptance by in- 
dustry and the professions has been varied, and 
colleges are much more ready to accept the GED 
high school equivalency certificate to meet the 
entrance requirement than they are to grant 
college credit on the basis of standing on the 


college level GED tests. 


No information about the reliability of the 
GED tests is to be found in the examiner’s 
manuals. This is unfortunate, especially since 
the high school battery has been standardized on 
a representative sample of more than 35,000 
public high school seniors, and the college level 
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norms were established by administering both 
high school and college level forms to the same 
large groups of representative college students. 
'The American Council on Education evaluation 
study summarizes a number of available limited 


reliability studies which yielded coefficients of. 


reliability of .85 to .94. On the basis of these, 


and recognizing the fact that the tests are rela- 


tively long, with generous time limits so that the 
inflationary effect of speed is almost entirely 
eliminated, it may be said that the reliability of 
the GED tests is adequate. 

Other bits of pertinent evidence uncovered by 
the American Council on Education evaluation 
study may be summarized as follows : (a) "GED 
admissions as a group were not quite as success- 
ful in college as the regular high school gradu- 
ate.’ (b) GED tests “were not specifically 
constructed for diagnosis or prediction (even 
though) they were used for such purposes with 
some success." (c) The peacetime GED pro- 
gram should contain an extension of the present 
examinations “їп both techniques and in content 
so as to cover other educational objectives." (d) 
The GED battery variance is "largely generated 
by a single common factor....these analyses show 
that (all) of the tests are essentially measuring 
one thing. This suggests that general educa- 
tional development as measured by these tests 
cannot be clearly separated into the four or five 
content areas." (e) “The GED tests are meas- 
uring psychological traits which are also meas- 
ured by tests of achievement, scholastic aptitude, 
and mental maturity." The extent of the inter- 
correlation, however, is not so high as to war- 
rant the conclusion that the GED tests are meas- 
ures of intelligence. 


Percentile norms for each test in the high 
school level battery are presented for the “total 
U.S." and for six regional groups. Such regional 
determination of norms is worthy of special 
mention; however, this reviewer would like to 
add that in a program as extensive as this one 
a further breakdown by states and by rural- 
urban regions within states would be desira- 
ble. 


For the college level tests percentile norms . 


are available for three types of institutions, classi- 
fied according to the mean score of their entering 
freshmen on the 1941 American Council on Edu- 
cation Psychological Examination, and for an 
unclassified group of colleges. The standardizing 
population for each subtest was selected from 
those students just completing a course similar 
to the given subtest. Again the availability of 
norms by types of institutions is commendable, 
even though the standardizing population is 
limited in several of the “types.” 

The Tests of General Educational Develop- 
ment are definitely superior tests of their type. 
They have been extremely useful in the postwar 
educational adjustment of returning war veter- 
ans. Their civilian use is increasing. Objective 
evidence on their validity and reliability is be- 
ginning to pile up, but much remains to be done 
in the way of basic and definitive studies de- 
signed to test the worthwhileness of this bat- 
tery. 


For reviews by Herbert S. Conrad and War- 
ren G. Findley, see 3:20; for a review by Char- 
lotte W. Croon of Test 1, College Level, see 
3:122; for reviews by W. E. Hall and C. Robert 
Pace of Test 2, College Level, see 3:528. 
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‘CHARACTER AND PERSONALITY 


Reviews BY Dwight L. Arnold, Frederic L. Ayer, Benjamin Balinsky, John E. Bell, Arthur L 
Benton, E. J. G. Bradford, Hubert E. Brogden, Donald T. Campbell, Launor F. Carter, Robert C. 
Challman, Kenneth E. Clark, Charles №, Cofer, Douglas Courtney, William J. E. Crissy, Lysle И. 
Croft, William M. Cruickshank, W. Grant Dahlstrom, D. Russell Davis, Robert G. Demaree, 
Albert Ellis, Hans J. Eysenck, Frank S. Freeman, N. L. Gage, Harrison G. Gough, Nelson G. 
Hanawalt, Philip L. Harriman, Dale B. Harris, Charles M. Harsh, George W. Hartmann, Ken- 
neth L. Heaton, William E. Henry, Robert R. Holt, Charles H. Honzik, Albert L. Hunsicker,, 
Harold E. Jones, E. Lowell Kelly, Seymour G. Klebanoff, Kate Levine Kogan, William S. Kogan, ^ 
Morris Krugman, Roger T. Lennon, Frank M. Loos, C. M. Louttit, Ardie Lubin, Louis L. Mc- 
Quitty, Ross W. Matteson, James Maxwell, N. W. Morton, C. Robert Pace, Albert I. Rabin, 
T. W. Richards, Ephraim Rosen, John W. M. Rothney, David G. Ryans, Theodore R. Sarbin, 
Helen Sargent, William Schofield, William Seeman, Helen Shacter, Laurance F. Shaffer, Edward 
Joseph Shoben, Jr., Verner M. Sims, William Stephenson, Naomi Stewart, L. Joseph Stone, 
Charles R. Strother, Percival M. Symonds, Florence M. Teagarden, Leona E. Tyler, Neil Van 
Steenberg, Howard R. White, and J. R. Wittenborn. 
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[27] 

XActivity Vector Analysis. Adults; 1948; 5 scores: 
aggressiveness, sociability, emotional adjustment, so- 
cial adaptability, total; I form; distribution is re- 
stricted to industrial firms using the Activity Vector 
Analysis Program of Human Relations and purchase 
arrangement with the publisher; 
nontimed 5(10) minutes; Walter V. Clarke; Walter 
V. Clarke Associates. * 


[28] М 
The Adjustment Inventory. Grades 0-16 adults А 
1934-38; ІВМ; 2 levels; 1 form; tentative norms; 156 
per specimen set; tissue paper stencil for hand scoring 
of test booklets supplied free; cash orders postpaid ; 
nontimed (25) minutes; Hugh M. Bell; Stanford Uni- 
versity Press. * 
a) STUDENT FORM. Grades 9-16; 10 
scores: home, health, social, emotional; 
manual [’34]. 
1) [Regular Edition.] $1.75 рег 25; separate answer 
sheets may be used; $1 per 50 IBM answer sheets ; 
фт per set of stencils for machine scoring of answer 
sheets; $1 per plastic stencil for hand scoring of test 
booklets. ES. 
2) [IBM Test-Answer Sheet Edition.] 2.50 per 50; 
$1.50 per set of stencils for machine scoring of test- 
answer sheets. А 
b) ADULT FORM. Adults; 1938; 5 adjustment scores: 
home, occupational, health, social, emotional ; manual 
[738]; prices same as for Regular Edition; separate 
answer sheets may be used; plastic hand-scoring stencil 
not available. 


1934; 4 adjustment 
2 editions; 
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Since 1934 the Student Form has been a popu- 
lar test in schools and clinics. Since 1938 an 
Adult Form has been available which adds oc- 
cupational adjustment to the original measures 
of home, health, social, and emotional. Just how 
widely the Adult Form is used and to what pur- 
pose, the writer has been unable to ascertain. 
Only four published studies specifically refer- 
ring to it have been found in our survey of the 
literature. Forlano and Kirkpatrick (68) com-, 
bined the Social Scale score from the Adult 
Form with the Alienation score of the Wash- 
burn Social Adjustment Inventory for the pur- 
pose of selecting radio tube assemblers. In their 
validating study involving 20 girls on the job, 
all those scoring above average on the test were 
rated “good” by their supervisor and all those 
scoring below average were rated “fair.” The 
supervisor had no knowledge of the test scores. 
The authors claim, but furnish no supporting 
data for the claim, that a composite of the social 
scores and intelligence ratings were effective in 
predicting the subsequent success of new tube 
mounters. Seagoe (70, 76) found that the Adult 
Form showed some predictive possibilities, es- 
pecially long range ones, in selecting teachers. 
Her studies were based upon correlations of 
test scores with ratings of success of student 
teachers and teachers in service. No other study 
has been found where a prediction of occupa- 
tional success has been attempted using the Bell. 

Data on the Student Form are more abun- 
dant. In general these studies convey a favorable 
attitude toward the test; but some find it un- 
suited for a particular purpose, some express no 
opinion, and a few express a negative opinion. 

Several surveys involving the inventory have 
been made. In 1936 Pallister * asked a number 
of psychologists to rate some vocational tests. 
Of the 38 psychologists responding, 14 rated the 
Bell: 11 rated it as efficient and 3 as not efficient. 
In 1946 Darley and Marquis ? made a survey of 
tests used in 51 contract clinics for veteran 
guidance. Sixteen personality tests were men- 
tioned by the respondents. The three most fre- 
quently mentioned were the Minnesota Multi- 
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phasic Personality Inventory, use index 55; 
Bell's Adjustment Inventory, use index 35; and 
Bernreuter's Personality Inventory, use index 
29. The same year Kornhauser? got replies 
from 67 well known test specialists to the ques- 
tion: "In the field of personality testing, how 
satisfactory or helpful for present practical 
use do you consider personality inventories 
and questionnaires (such as Bernreuter, Bell, 
Humm-Wadsworth, etc.) ?" The results were: 
highly satisfactory, 1.5 per cent ; moderately sat- 
isfactory, 13.5 per cent; doubtful, 36 per cent; 
rather unsatisfactory, 33 per cent; and highly 
unsatisfactory, 16 per cent. The clinical as com- 
pared to the nonclinical psychologists tended to 
rate the inventories more favorably. Only a 
slightly higher percentage of the raters consid- 
ered the Rorschach more satisfactory than the 
inventories. There was some suggestion that 
those who consider the Rorschach satisfactory 
tend to consider the inventories unsatisfac- 
tory. In the few comments regarding particular 
blanks, the Bell and the MMPI received favor- 
able mention. 

In the same year Ellis (72) published a com- 
prehensive survey of the literature concerning 
the validity of personality inventories and con- 
cluded that the evidence was largely negative. 
The Bell was one of four tests singled out for 
special study and made the poorest showing of 
the four. It is the writer's opinion that this study 
is misleading. Ellis assumed that the inventories 
were designed for the use to which the investiga- 
tors put them, and that the validating data were 
unquestionable. In some of the rr validating 
studies cited on the Bell by Ellis, the authors 
themselves pointed out the weakness of the vali- 
dating data, and in others they should have done 
so. Intelligence tests would look a little sick if 
they were evaluated in a similarly objective, 
though indiscriminate, manner. We have ana- 
lyzed the 11 studies cited by Ellis on the Bell 

. and are not much impressed by his objective 
rating of them. 

Also іп 1946 Painter and Painter * made a 
survey of entrance tests used by a sample of 
colleges. Of the 42 replying, 14.3 per cent re- 
ported that they gave personality tests to in- 
‘coming freshmen. Bell led the list of tests used 
for this purpose. Feder and Mallett (5) found 
, 8 Kornhauser, A. “Replies of Psychologists to a Short Ques- 
tionnaire on Mental Test Developments, Personality Inven- 
tories, and the Rorschach Test." Ed & Psychol Meas 5:3-15 
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that the inclusion of the Bell in the entrance bat- 
tery of tests left the prediction coefficient of 
scholastic success unchanged. Drought (9) and 
Griffiths (бо) have both shown that there is no 
correlation with scholarship, either with the 
total score or with relevant subscales. Batemen 
(94) used the Bell to check the hypothesis that 
high school students who work for pay are better 
adjusted. The results were negative. However, 
Smith (84) found that students who participated 
in extracurricular activities were better adjusted 
as measured by the Social Scale. He interpreted 
this as validating data for the test. 

A number of studies have checked the relia- 
bility coefficients of the Student Form (2, 3, 7, 
10, 48, 79) and have found them satisfactory. 
Greene and Staton (13) and Eckert and Keys 


(37)have both found that there is no correlation ' 


between the Bernreuter-Flanagan Sociability 
Scale and the Bell Social Scale. Watson (57) re- 
ported correlations between his Inventory of Af- 
fective Tolerance and the Bell which range from 
—.60 for Social to —.11 for Health: for the 
total Watson and Bell, —.56. Tyler (3) and 
Sperling (55) have both verified Bell’s inter- 
correlations of subscales and the total scale. The 
correlations reported by Bell in the manual 
appear to be well substantiated. Pallister (14) 
found that students in a Scottish University fell 
within the American norms, and Sukov and 
Williamson (30) found no difference in scores 
between Jews and non-Jews. Damrin (79) gave 
the test to 153 high school girls under signature 
and again anonymously but identified by an in- 
genious system. The correlations between the 
two administrations were high, ranging from 
.75 to .97; Damrin interpreted this as evidence 
both for the reliability of the test and the de- 
pendability of the answers. More responses on 
the anonymous tests were changed from poor 
to good adjustment than vice versa. 

Validity is always a difficult problem. Un- 
fortunately, only a few of the validating studies 
approach the care and scientific accuracy 0 
Bell's original validating data. One of the better 
ones is that of Turney and Fee (2) with high 
school students. The reliability of teachers’ rat 


ings and the test were both checked by having ' 


them repeated after six months. Both were found 


to be reliable. The correlations between the two 


ranged from .18 for Home to .42 for ‘Health. 


The second ratings correlated with the secon® ( 


Bell gave higher coefficients in three of the five 
measures, Traxler (48) found higher and lowe" 
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correlations than the above: .64 for Social and 
zero for Home. He concluded that high school 
teachers do not know enough about home ad- 
justments to rate them with any meaning. Clark 
and Smith (50) concluded that the test is not 
valid on the basis of teachers’ ratings. However, 
we cannot take their findings too seriously, for 
the reliability of the teachers’ ratings in the case 
of large departments was as low as .47. They 
found that the teachers tended to rate the stu- 
dents very much the same. Also the teachers' 
ratings were on more specific items than the 
Bell subtests. 

Pederson (15) compared Bell test scores with 
data combed from college application autobi- 
ographies. Students who revealed home malad- 
justment showed a reliable difference on the 
Home Scale when compared with those not 
speaking of home maladjustment. However, 
these data were not very helpful in predicting 
the cases which would turn up at the dean's 
office on account of home difficulties. The Health 
score was found to be indicative of poor health 
as revealed in the autobiography, personnel rec- 
ords, and the health department. Individuals 
rated maladjusted by the social advisor tended 
to show maladjustment on the Social Scale. 
Aside from health, the dean's records appear to 
be sadly deficient concerning adjustment of col- 
lege students and hence have limited value in 
validation studies. This is probably an important 
factor in Marsh's (60) conclusions that social 
and emotional maladjustment is best predicted 
from the Home Scale, and that this is not par- 
ticularly sensitive to maladjustment until the 
case becomes “critical.” This conclusion is based 
upon a comparison of 23 girls referred to the 
dean with 50 “normal” girls (i.e., girls not so 
referred). Keys and Guilford (6) found low but 
statistically significant correlations with prob- 
lem behavior for the total score and three of 
the subtests. Spoerl (61) found the test useful 
in analyzing emotional difficulties of bilingual 
students. ў 

Darley (4) found that roughly one third of 
the cases diagnosed clinically were picked up by 
the Social and the Emotional scales and that 
nearly one half of the cases diagnosed by the 
test as maladjusted were confirmed in clinical 
study. Darley found the subtest scores much 
more useful than the total score. Tuckman (77) 
found that self-ratings on a five point linear scale 
corresponding to the Bell subtests correlate high 
enough with the Bell scores that these ratings 
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can be substituted for the Bell if the time is 
limited. These self-ratings would appear to be 
validating data for the Bell. In general, correla- 
tions between the subtests and outside ratings · 
are highest where the raters are best qualified 
to rate (2, 15, 48, 50, 77). 

Several item analyses have been made. Gilkin- 
son and Knower (42) reported that good speak- 
ers as a group have better social adjustment as 
measured by the Social Scale; they found 19 
items which differentiated between good and 
poor speakers, 15 of them from the Social Scale. 
Altus and Bell (65) found 22 items in the Stu- 
dent Form which discriminated between ad- 
justed and nonadjusted illiterates in a special 
Army training center. These items, combined 
with some others, could be used to predict suc- 
cessful training. Marsh (60) found 14 items 
from the Home Scale which were useful in pre- 
dicting social and emotional maladjustment. 
Woolf (63) found 12 items in the Home Scale 
which best discriminate between satisfactory and 
unsatisfactory adjustment in college. 

Three studies have been found where an at- 
tempt has been made to use the Bell to discrimi- 
nate between delinquents and nondelinquents. 
Bartlett and Harris (18) compared 119 delin- 
quents with 148 school children and found no 
difference on the Social Scale but marked dif- 
ferences in Home and Emotional adjustment in 
the expected direction. Hathaway (34) found 
the test did not discriminate between his nine 
psychopaths (restricted usage) and the normal 
population. Wittman and Huffman (77), using 
the Adult Form, compared the scores of high 
school students, training school delinquents, and 
state hospital patients. Although the test showed 
rather marked differences on home adjustment 
and emotional adjustment, especially for the 
delinquent girls who were largely sex offenders, 
none of the differences were statistically signif- 
icant due to the large standard deviations. 

The Bell has been used in other special re- 
search projects. There has been some interest in 
measuring personality by means of interest tests. 
Tussing (56) found that he could predict the 
Bell Social scores by a rescoring of Strong’s 
Vocational Interest Blank, but not the other 
scales, Klugman (112), however, found no re- 
lation between the Kuder Preference Record and 
the Bell. Cowen and Thompson (116) found no 
relation between the Bell and “rigid personality” 
as measured by the Luchins water jar test. Van 
der Merwe and Theron (85), however, found a 
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substantial correlation between the Bell Emo- 
tional score and a finger plethysmograph record. 
This would appear to indicate that emotional 
stability as measured by the Bell has a physio- 
logical concomitant. 
In conclusion it appears that there is no doubt 
concerning the correlations published in the 
- manual of the Adjustment Inventory. The valid- 
ity apparently is as good as any of the paper and 
pencil adjustment inventories and better es- 
tablished than most of them. The subtests fur- 
nish valuable data concerning adjustment in 
four areas. It has proved to be a valuable instru- 
ment in research, in schools, and in clinical work. 
Apparently its popularity over the years is well 
justified. There is no indication in the literature 
that it can be depended upon to pick out of a 
group all maladjusted individuals, nor that all 
*maladjusted" according to the score are actu- 
ally maladjusted. Its greatest usefulness would 
appear to be as a guide to interview and as an 
aid in better understanding the individual. The 
quantitative scores furnish data of diagnostic 
value, but answers to specific items will often 
point the way to much more valuable data. In 
reviewing the literature one is impressed with 
the fact that the test results are very frequently 
more satisfying than the validating criteria. 
Some more studies involving the Adult Form 
would appear to be highly desirable. 


Тнкоровк R. Sargin, Associate Professor of 
Psychology, University of California, Berkeley, 
California. 

The Adjustment Inventory began as a set 
of a priori scales intended to isolate persons who 
were maladjusted in four areas: home, health, 
social and emotional. Later, an occupational 
scale was added to the adult form. The scales 
have satisfactory odd-even reliability coefficients, 
„Зо to .89. Only those items which differentiated 
between extremes of the distribution on the 
standardizing group were retained. Empirical 
validation was achieved by comparing scores 
with ratings of counselors, personnel workers, 
etc. Critical ratios are all significant —demon- 
strating that the scores differentiate persons 
rated by counselors as “well-adjusted” from 
those rated as “poorly-adjusted” in a particular 
area, e.g., emotional adjustment. 

A decade ago, this reviewer found the Adjust- 
ment Inventory useful in screening college stu- 
dents for mental hygiene interviews. The health, 

~ T home, social, and emotional categories served а 
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a frame of reference for the counselor in or- 
ganizing his impressions about the student, In 
the r930's, when reliable knowledge of per- 
sonality dynamics was in its infancy, these cate- 
gories had descriptive utility for untrained or 
semitrained student personnel workers. Today, 
such rubrics as health adjustment, home adjust- 
ment, etc., seem anachronistic. More appropri- 
ate are scales that probe for specific variables, 
such as type of ego-defense, internalization, so- 
matization, hysteria, etc. The Minnesota Mul- 
tiphasic Personality Inventory, for example, is 
currently more useful as an instrument for per- 
sonality diagnosis, especially when subject to 
coding, profile analysis, and configural scoring 
as advocated by С. S. Welsh.* 


For reviews by Raymond B. Cattell, John G. 
Darley, C. M. Louttit, and Percival M. Symonds 
of the Student Form, and reviews by S. J. Beck, 
J. P. Guilford, and Doncaster G. Humm of the 
Adult Form, see 40:1200; for a review by Aus- 
tin H. Turney of the Student Form, see 38:912 
(although three reviews are listed under 38:912, 
only the review by Austin Н. Turney is a re- 
view of Bell’s Adjustment Inventory ; the other 
two are reviews of Bell's School Inventory). 


[29] 
*Affectivity Interview Blank. Ages 7-12; I951; à 
measure of child feelings; individual; І form [51]; 
$1.40 per 25; 15¢ per specimen set; postpaid; (15) 
minutes; Elizabeth Mechem Fuller; Child Develop- 
ment Laboratories, University of Michigan. * 
REFERENCES 
т. Mzcuzw, Mary ErizasrrH. The Relationship о Affec- 


tivity to Various Measures of Growth im Children. Doctors 
thesis, University of Michigan (Ann Arbor, Mich.), 1941. Ab- 


stract: Microfilm Abstracts 3:82-4 no 2 '41. $ f 
2. Mrcuem, ErizAsETH. "Affectivity and Growth in Chil 
dren.” Child Develop 14:91-115 Je '4s. * (РА 17:3994) 
з. FULLER, ELIZABETH MECHEM, “How do the Children Feel 
About It?" Childhood Ed 23:124-32 N '46. * (PA 21 13963) 


Morris KrucMan, Assistant Superintendent in 
Charge of Guidance, Board of Education, New 
York, New York. 

In its present stage, this instrument merits 
attention only because “The Affectivity Inter- 
view Blank was devised at the University 0 
Michigan as a part of the child growth studies 
under the direction of Professor Willard C. 
Olson." Labeled an "interview blank," with the 
avowed purpose of contributing "to the prob- 
lem of describing the feeling aspects of chil 
dren's personalities,” this blank could be aC 
cepted as an experimental subjective device 

1 Welsh, George S. "An Extension of Hathaway's MM 
Profile MER SUR J Consult Psychol 12:343-4 5-0 4 


ж са 23:17 file 
elsh. George S. “Some Practical Uses of MMPI Pr? 
Coding." J Consult Psychol 15:82-4 F 's1. * 
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The author, however, after considerable dis- 
cussion of the tentativeness of the blank, pro- 
ceeds to supply “norms,” “evidence” of relia- 
bility, and scoring weights for each item, leaving 
the impression that we are dealing with an ob- 
jective measuring instrument. This is definitely 
not the case. ; 

The blank consists of 71 items of such diffuse 
nature that they do not seem, collectively, to 
make sense. Examples are: “Do you have a 
dog, cat, or other pet?” “Do you ever wish that 
you had never been born?” “Do you comb your 
own hair?” Each of these items has a weight of 
one. The 7-year-old (the 5-year-old also, ac- 
cording to the author) and the 12-year-old boy 
or girl achieves the same credit for saying “yes” 
to “Do you comb your own hair?" This is not an 
isolated example, but rather typical of the kind 
of “affectivity” measured by this blank. 

“Norms” in terms of percentiles for children 
aged 7 to 12 (and applicable as low as age 5, 
according to the author) are based on 65 chil- 
dren in the original experimental group and 30 
children added later. This is typical of the “stand- 
ardization” procedure. 

The author makes the following claim for the 
instrument: “In general, for the populations 
studied, the overt behavior of the children bore 
demonstrable relationship to their verbalized 
feelings about themselves.” The manual does not 
supply the evidence for this conclusion. 

In summary, the Affectivity Interview Blank 
does not give the impression of being an instru- 
ment that can measure affectivity in children, 
nor are the author’s claims that it does so sub- 
stantiated by the evidence presented in the 
manual. 


Verner M. Sims, Professor of Psychology, 
University of Alabama, University, Alabama. 
This blank is offered as a preliminary form 
of a structured interview blank from which it is 
possible to arrive at a quantitative expression of 
the “feeling aspects of children’s personality.” 
It is proposed for use with children prior to the 
mastery of reading, that is, from the ages 5 
through 12 years. No serious effort is made by 
the author to define what the blank measures, 
but the description of its construction and an 
examination of the items indicate that it under- 
takes to measure what is generally compre- 
hended under the term “personality adjust- 
ment.” In fact, one of the two criteria used for 
the inclusion of items was “а minimum number 


of five appearances in one or more....published 
personality tests, clinical findings, and textbook 
treatments of problems in the personality area." 

Evidence of validity for the blank is contained 
in the method of selecting the items (which al- 
most surely means some agreement with other , 
such instruments) ; in the fact that items were. 
retained only after an experimental tryout for 
*ambiguity or lack of understanding by the chil- 
dren” (the second criterion used in item selec- 
tion); in the reported agreement among 150 
adults, with advanced training in the psychol- 
ogy and education of young children, concerning 
the response to each question which was “‘indica- 
tive of a higher level of affectivity” (70 per cent 
or more of the judges agreed on all but 16 of the 
69 scorable items in the blank) ; and in the find- 
ing that “for the population studied, the overt 
behavior of the children bore demonstrable re- 
lationship to their verbalized feelings about 
themselves” (although no data to support this 
conclusion are reported). 

A split half reliability of .82 for 65 subjects, 
and a test-retest reliability of .70 for 44 subjects, 
with a school year intervening, suggest some 
stability for the quality or qualities measured by 
the test. The scoring of the blank is objective 
and relatively simple. Good suggestions for its 
administration and use are furnished in the 
manual. Percentile norms based on 65 ele- 
mentary school children (a sample which is 
recognized by the author as inadequate) are 
given. 

This interview blank seems to have promise. 
Accepted in the tentative spirit in which it is 
presented, it represents the beginnings of a 
worthwhile instrument. It is to be hoped that 
the author, or someone else, will undertake to 
carry on the work necessary if the blank is to 
become a tool useful in counseling and guidance. 
Particularly needed are more careful item analy- 
ses, better “outside” validation, and more ex- 
tensive and representative norms. 


[30] 
xThe Alcadd Test. Adults ; 1949; a test for the meas- 
urement of alcoholic addiction and identification of in- 
dividuals with alcoholic problems; 6 scores: regularity 
of drinking, preference for drinking over other activi- 
ties, lack of controlled. drinking, rationalization of 
drinking, excessive emotionality, total; 1 form; $2.75 
per 25; 35% per specimen set; postpaid; nontimed (5- 
15) minutes; Morse P. Manson; Western Psychologi- 
cal Services, Box 775, Beverly Hills, Calif. * 
REFERENCES 

1. Manson, Morse P. “А Psychometric Determination of 
Alcoholic Addiction,” Am J Psychiatry 106:199-205 S '49. * 
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Cuanrrs H. Honzik, Personal Counselor and 
Vocational Adviser, Veterans Administration, 
San Francisco, California. 

This test differs from the author's The Man- 
son Evaluation in three respects: (a) The Al- 
cadd Test items refer directly to drinking and 
drinking habits; (b) The Alcadd Test deter- 
mines more closely the degree of alcoholic ad- 
diction; and (c) it attempts an objective rec- 
ognition of such characteristics of alcoholic 
addiction as regularity of drinking, rationaliza- 
tion of drinking, etc. The existence of these two 
rather similar tests is valuable in large screen- 
ing programs, such as may exist in Veterans 
Administration, federal and state hospitals, and 
clinics for alcoholics, where slightly differing 
practical questions may be better answered by 
one test than by the other. For example, in the 
cases where alcoholism or even moderate drink- 
ing is denied, The Manson Evaluation would be 
more likely to make the desired differentiation, 
as well as to indicate some of the personality 
characteristics of the testee ; in those cases where 
. there is no question of alcoholic addiction, the 
degree of addiction and some of the dynamics of 
the addiction, may be determined by The Alcadd 
Test. 

The test consists of 65 items, 61 pertaining to 
drinking; examples are, "I drink only to join 
the fun," and *I often take a drink or two in 
the middle of the afternoon." Only 60 items are 
scored. An original battery of 160 such ques- 
tions, obtained from observation of alcoholic ad- 
dicts and írom the literature on the overt 
behavior, attitudes, feelings, and other charac- 
teristics of alcoholics, was constructed and ad- 
ministered to comparable groups of male and 
female alcoholics and nonalcoholics. By item 
analysis 60 items were selected, each of which 
had a critical ratio of 2.7 or higher, for both 
male and female groups. These 60 items were 
then administered to 123 male and female alco- 
holics, and 159 male and female nonalcoholics. 
Using a cutting score of 12 for the male groups, 
it was found that 97.6 per cent of the alcoholics 
were correctly predicted, while a cutting score 
of 14 for the females gave 97.5 per cent correct 
predictions. The reliability coefficient of the test 
was found to be .92 for the male group, .96 for 
the female group. 

A subjective analysis of the 65 items revealed 
five clusters which describe characteristics of 
alcoholic addiction and addicts. These are regu- 
larity of drinking, preference for drinking over 


other activities, lack of controlled drinking, ra. 
tionalization of drinking, and excessive emo. 
tionality. Separate scores may be computed for 
these five characteristics, the differences be. 
tween alcoholics and nonalcoholics on all the 
characteristics being statistically significant, In- 
teresting differences were found between male 
and female alcoholics, the females showing less 
control more rationalization, and more emo- 
tional immaturity than the males. 

The test is easily administered to groups or 
individuals and takes from 5 to 15 minutes. Scor- 
ing is simple and can be done in 2 to 3 minutes, 
A profile is provided which is filled in as the 
scores are computed. 

This test, like all questionnaires, is subject to 
the dangers of untrue scores through deliberate 
or unconscious falsification. A test in this area 
of human behavior would seem to be particu- 
larly vulnerable since many alcoholic addicts are 
known to deny stubbornly frequent drinking and 
other facts that are glaringly obvious to ob- 
servers. For such addicts the test would give 
false scores. On this point, an interesting ques- 
tion arises: why were 2.4 per cent of the male 
alcoholics and 2.5 per cent of the female alco- 
holics incorrectly placed by the test? The tend- 
ency to falsify is probably responsible. Never- 
theless, the reviewer believes that this test, 
discreetly used, can be a useful tool in screening 
programs and a valuable adjunct to clinical 
methods. 


ALBERT L. HuNsicKEn, Supervising Psycholo- 
gist, Galesburg State Research Hospital, Gales- 
burg, Illinois. 

This yes-no questionnaire was designed to 
facilitate rapid identification of alcoholic addicts 
and to locate possible areas of their maladjust- 
ment. The 60 questions, obviously directed 
towards the evaluation of drinking habits and 
with no attempt to camouflage their purpose 
were carefully selected from an original set of 
160 questions administered to alcoholics and 
“normals” of essentially equivalent age, educa- 
tion, and social status. 

The male scoring norms are based on results 


„from 83 alcoholics in treatment for alcoholism, 


61 social drinkers, and 17 abstainers. Female 
norms are based on 40 alcoholics in treatment, 
58 social drinkers, and 23 abstainers. The shorter 
approximation of the Kuder-Richardson formula 
produced a reliability coefficient of .92 for the 
males and .96 for the females. The scores of the 
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known alcoholic group and of the abstainers for 
either sex do not overlap, and very little over- 
lapping is found between known alcoholics and 
social drinkers. This reviewer administered the 


Alcadd Test to 11 known alcoholics from an' 


outpatient clinic and 18 hospital employees, who 
omitted all identifying information from their 
questionnaire. The results obtained were very 
similar to those found by the author. 

From a subjective analysis of the 60 ques- 
tions, the author finds five categories which are 
labeled (a) regularity of drinking, (b) pref- 
erence for drinking over other activities, (c) 
lack of controlled drinking, (d) rationalization 
of drinking, and (e) excessive emotionality. The 
subjective breakdown of the 60 questions into 
these five arbitrary categories supposed to reveal 
areas of maladjustment would probably not 
stand up under a careful cluster or factor analy- 
sis of the questions. 

When this questionnaire is used on known 
alcoholics in a treatment environment, results 
comparable to those discussed in the manual may 
be expected. However, could one expect the same 
results from the alcoholic addict if he were tak- 
ing this test as one of a series of screening tests 
preparatory to a job which he wants badly? Or 
would he, as the reviewer feels certain, conceal 
the seriousness of his addiction in most situations 
where he becomes just another member of a 
"normal" population who takes this test? 

The Manson Evaluation, by the same author, 
which is made up of 72 questions only remotely 
associated with drinking, will probably prove a 
better alcoholic screening test than this one. 
Still, this questionnaire remains a pioneer study 
in the field of alcoholic addiction and will prove 


useful to those institutions, clinics, and psycholo-. 


gists who are faced with the problem of evaluat- 
ing and working with alcoholic addicts. 


[31] 

Babcock Test of Mental Deterioration. Ages 7- 
24; 1930-40; title on test booklet is The Revised Exam- 
ination for the Measurement of Efficiency of Mental 
Functioning; 8 scores: easy questions, learning, repe- 
tition, motor, initial learning, easy continuous, total 
efficiency (total of six previous scores), verbal level; 
I form, '40; $13.75 per set of testing materials and 50 
record blanks; $3.75 per 50 record blanks; $3 per 
specimen set (includes manual and pictures of testing 
materials); postage extra; nontimed in part (7o+) 
minutes; Harriet Babcock and Lydia Levy; C 
Stoelting & Co. * 
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D. Russet Davis, Reader in Clinical Psychol- 
ogy, University of Cambridge; and Honorary 
Consultant Research Psychiatrist, United Cam- 
bridge Hospitals; Cambridge, England. 

As early as 1914, B. Hart and C. E. Spearman 
studied the performance of psychotic patients 
on a large number of mental tests and compared 
it with that of normal subjects, but their find- 
ings were of little practical or theoretical value. 
А very useful advance was made in 1930 when 
Babcock first published her methods of exami- 
nation, especially because she drew attention to 
the value of vocabulary tests as means of as- 
sessing the premorbid intelligence level of pa- 
tients and, hence, by comparison between scores 
on vocabulary and on other tests, of measuring 
the deterioration that can be attributed to an 
illness. The theoretical views with which Bab- 
cock has tried to explain her original and her 
more recent results have been subjected to dam- 
aging criticism. That they are not acceptable 
need not detract, however, from the value placed 
on the essential features of her methods. Indeed, 
it is now generally agreed that the scores ob- 
tained on vocabulary and kindred tests are rela- 
tively unaffected either by psychological or by 
organic disease processes, and this fact has been 
made use of in most modern methods of psycho- 
logical examination of patients. 

Yet, however useful it may have been in the 
1930's, Babcock's test battery is not now satis- 
factory for general use, and the present revision 
has done little or nothing to reduce the short- 
comings which the all-round improvement in 
test methods in the last decade has revealed. 

Four tests of the original battery have now 
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been omitted, seven new tests have been added, 
and changes have been made in eight tests. But 
the norms are inadequate by modern standards, 
and the manner of arriving at weighted scores 
and the efficiency index is an unnecessarily 
roundabout one. Some of the tests might still 
be useful for the many clinical purposes for 
which precise norms are not required and may 
survive for occasional use. Many of the tests are 
unsatisfactory, however, for too much emphasis 
is placed on speed in the scores; these scores are 
considerably influenced, therefore, by the “set” 
which the patient has been brought to adopt. 
Some of the material is inappropriate; for ex- 
ample, most examiners prefer to greet a patient 
by name at the first meeting and would dislike 
starting their testing with the words, “Now an- 
swer these questions as promptly as you can: 
what is your name?" In the reviewer's opinion, 
the counterparts in the Wechsler-Bellevue In- 
telligence Scale of the test of general informa- 
tion, substitution, and digits forward and re- 
versed are better in every case, although it is 
debatable whether the Wechsler-Bellevue vo- 
cabulary test is superior to the Terman-Merrill 
vocabulary test. 

The most useful tests in the battery are those 
of memory, which form an essential part of an 
examination when organic brain damage is sus- 
pected, and which the Wechsler-Bellevue scale 
lacks: especially useful are the paragraph re- 
production test, which is a modification of the 
Terman-Merrill reading and report, and the 
sentence repetition test, now slightly revised. The 
paired associates test has been lengthened to in- 
clude nine instead of seven pairs of words and 
is now too long. Also, learning is tested in two 
trials only, and there is now no chance of de- 
tecting failures to unlearn wrong responses—an 
interesting sign suggestive of an organic defect. 
The memory for designs test has something to 
be said in its favour, although many examiners 
may still prefer less formal and less standardised 
methods. The Knox cube test is a most suitable 
one for class demonstrations on patients. 


Seymour С. KrresANorr, Chief Psychologist, 
Franklin Delano Roosevelt Veterans Adminis- 
. tration Hospital, Montrose, New York. 

This examination consists of a group of sub- 
tests designed to measure three essential areas 
of mental functioning: learning ability, motor 
ability, and repetition ability. The final result 
is an obtained efficiency index which reflects the 
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difference between capacity level of intellectual 

functioning and current efficiency. 
Consideration of the principles underlying 

the construction of the test reveals certain sig- 


'nificant problems. As Wesley has shown, the 


traditional concept that vocabulary consistently 
serves as a measure of capacity level must be 
challenged. More recent studies of the intel- 
lectual process indicate that vocabulary scores 
show a definite tendency to decline with increas- 
ing age as well as in the presence of significant 
personality pathology. Further, the assumption 
of Babcock that normal functioning can be evalu- 
ated optimally by the use of normal subjects of 
less than 25 years of age indicates a failure to 
consider the chronology of the population for 
whom the test was intended. 

The revised Babcock test does not constitute 
a more novel or original contribution to the 
problem of intellectual deficit. It employs the 
longstanding thesis that discrepancy between vo- 
cabulary and recently acquired associations will 
serve as a measure of "deterioration." It appears 
that this clinical fact has been overly exploited 
in an effort to obtain a quantitative index of 
intellectual impairment and has impeded prog- 
ress in understanding intellectual deficit. Con- 
tinued corroboration of a clinical phenomenon 
does not advance knowledge of the intellectual 
process. It is time to direct energy toward quali- 
tative analysis of those intellectual functions 
which reveal deficit with the end of determining 
the conditions under which impairment occurs. 
Further, the conditions underlying the reversi- 
bility of intellectual deficit constitute variables 
for concerted study. 

In essence, the revision of the Babcock test 


approaches the problem of intellectual deficit - 


with the same theoretical basis formulated by | 
Wells and Kelley * in 1920. It remains, however, 
a useful technique when utilized by clinicians 
who are sensitive to its limitations. 


For related reviews, see 3:72. 


[32] 
*[The Baxter Group Test of Child Feeling.] 
Grades 1-8, 7-9 (separate edition available for paren 
and teachers of children in grades 1-9); 193574) 
c1935; a revision of Baxter Group Test of Child Fett. 
ings (see 40:1206) and Baxter Individual Tests 0) 
Child Feelings (see 40:1207); 2 levels; 2 editions: 
$1.50 per 25 of any one edition of any one level; 50" 
per manual for any one level of the children's tests; 
per specimen set of any one edition of any one 
level; postage extra; (30-40) minutes; Edna D. Bax- 
1 Wells, F. L., and Kelley, C. M. “Intelligence and Psycho 
sis." Am J Insanity 77:17-45 Л '20. * 
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ter; Baxter Foundation for Research in Education, 
Inc., 315 Central Park West, New York, N.Y. * 
а) CHILD'S TEST: BAXTER GROUP STORY-TEST OF CHILD 
FEELINGS. Grades 1-8; 1935; 23 scores: friendliness, 
responsiveness, generosity (jealousy), respect (sar- 
casm), dependability, daydreaming, obedience, com- 
pliance, self-control, fairness (domineering), courage, 
studiousness, creativity, concentration, temperance 
(boisterousness), carefulness (destructiveness), grace 
(awkwardness), unselfishness, honesty, courtesy, cheer- 
fulness, kindness, tidiness; 1 form; mimeographed 
manual ['35]. 
b) THE BAXTER TEST OF CHILD FEELINGS. Grades 7-9; 
1935; 25 scores; same as above plus sociality, appear- 
ance; I form; mimeographed manual ['35]. 
C) THE BAXTER PARENT-TEACHER TEST OF CHILD FEEL- 
тїс. Parents and teachers of children in grades 1—0; 
1935-46; 1 form, '46; no data on reliability and valid- 
ity; no manual; no norms. 

REFERENCES 


т. Baxter, Epna Dororny. “The Baxter Child Personality 
Test.” J Appl Psychol 21:410-30 Ag '37. * (PA 12:967) 

2. Baxter, Epna Dororny, “Baxter Group Test of Child 
Personality.” Abstract. Psychol В 36:629-30 О '39. * (PA 14: 
600, title only) 

[33] 


XClient Centered Counseling Progress Record- 
ing Chart. Adults and children undergoing psycho- 
therapeutic counseling; 1950; individual; 1 form; $1 
per 25; 356 per specimen set; postage extra; nontimed 
(20) minutes; Russell N. Cassel; the Author, 344 B 
St., Hayward, Calif. * 
[34] 
xThe College Inventory of Academic Adjust- 
ment. College; 1949; 7 scores: curricular adjustment, 
maturity of goals and level of aspiration, personal 
efficiency-planning and use of time, study skills and 
practices, mental health, personal relations, total; 1 
form; $2 per 25; 156 per specimen set; cash orders 
postpaid; nontimed (15-25) minutes; Henry Borow; 
Stanford University Press, Stanford, Calif. * 
REFERENCES 

т. Bonow, Henry. A Psychometric Study of Non-Intellectual 
Factors in College Achievement. Doctor's thesis, Pennsylvania 
State College (State College, Pa.), 1945. CAbstracts of Doctoral 
Dissertations....1945, 1946, pp. 65-73. í 

2. Borow, Henry. “The Measurement of Academic Adjust- 
ment." J Am Assn Col Кед 22:274-86 Ap '47. * 

з. Troxet, LEETHA LORETTA, Study E Three. Vocational 
Interest Measures: Preference Record, Academic Interest In- 
ventory, and Work Interest Analysis. Master's thesis, Univer- 
sity of Kentucky (Lexington, Ky.), 1949. 

Lyste W. Cnorr, Director of Personnel, Uni- 
versity of Kentucky, Lexington, Kentucky. 

The inventory is “designed to identify cer- 
tain attributes, apart from scholastic aptitude, 
which have been shown to be significantly re- 
lated to the college student’s academic perform- 
ance.” It consists of 90 items of the 3-choice 
type (yes, no, undecided), with from 12 to 21 
items in each of the following six sections : Cur- 
ricular Adjustment, Maturity of Goals and 
Levels of Aspiration, Personal Efficiency, Study 
Skills and Practices, Mental Health, and Per- 
sonal Relations. 

Percentile norms for the composite inventory 
are based on 237 male students at Pennsylvania 


State College and the University of Minnesota, 


and 454 women students at the Pennsylvania 
State College. The sampling procedure involved 
in selecting these normative characteristics of 
the two groups are specified. Means and stand- 
ard deviations are given in lieu of percentile 
norms for each of the six sections. 

Split half reliabilities, corrected by the Spear- 
man-Brown formula, of .90 and .92 for groups 
of 155 men and r30 women students, respec- 
tively, are reported for the composite score of 
the inventory. The corrected test-retest relia- 
bility over a span of то days for a group of 130 
women students is given as .92. 

Items for the inventory were selected on the 
basis of their power to differentiate between 
groups of “under-achievers” and "over-achiev- 
ers.” The discussion in the manual of the va- 
lidity of the inventory is based mainly upon the 
fact that section and total scores discriminate at 
the 1 per cent level between a group of 81 “under- 
achievers” and 67 “over-achievers.” The author 
also reports the partial correlation between total 
scores on the inventory and the two-semester 
grade average is .32 with scholastic aptitude 
scores partialed out. 

Considering the data presented in the manual, 
the inventory does not fulfil the reviewer's spec- 
ifications for a useful diagnostic aid in counsel- 
ing, although this is the expressed purpose of 
the inventory. In the first place, there is no 
direct evidence that cross-validation studies 
were made. In describing the various phases of 
the research, the author does not reveal the ex- 
tent to which the population samples were in- 
dependent. Unless the fact is established that 
the validity data were obtained on groups other 
than those used in the item selection process, the 
validity figures are severely open to question. 
In the second place, the evidence that the scores 
on the inventory discriminate significantly be- 
tween groups of achievement deviates, and that 
the partial correlation of the composite scores 
with grade averages is .32, is not strong enough 
to justify the use of the inventory in individual 
counseling situations. “Cost and utility” data are 
more important than critical ratios and correla- 
tion coefficients for the uses for which this in- 
ventory is designed. 

In summary, the author’s claim that this in- 
ventory is a useful “diagnostic aid in counseling” 
is not supported by the data presented in the 
manual. Even the use of the inventory as a check 
list for locating the problem areas of the scholas- 
tic “under-achiever” is not justifiable until it 
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is shown that the items have been cross vali- 
dated. 

This reviewer does not like the idea of the 
women overweighing the men in relation to the 
number of subjects. Follow-up research projects 
have not been accomplished, and it is rather 
doubtful that the results received by the author 
will follow in other institutions of higher learn- 


ing. 


Harrison С. Соосн, Assistant Professor of 
Psychology, University of California, Berkeley, 
California. 

This is a 9o-item inventory yielding part 
scores in each of six logically-derived categories. 
The test was developed at Pennsylvania State 
College by item analyzing the responses of over- 
and under-achievers (identified by means of re- 
gression line deviations) to a specially written 
pool of 400 items. From those which survived 
the statistical analysis, a further choice was made 
using logical and subjective criteria. Each item 
included in the final version of the scale revealed 
differentiating power in both male and female 
samples, and satisfied requirements pertaining 
to clarity, meaningfulness, and consistency. 

The main purpose of the test is to assess 
scholastic interest and potentiality, independ- 
ently of intelligence and aptitude. The separate 
Scores can also be used diagnostically for evalu- 
ating academic functioning in the areas specified. 
In a cross validating sample of 155 college men 
the following correlations with 2-semester grade 
averages were obtained: (a) curricular adjust- 
ment, .30; (b) maturity of goals and level of 
aspiration, .31; (c) personal efficiency: plan- 
ning and use of time, .41; (d) study skills and 
practices, .22; (e) mental health, .27; (f) per- 
sonal relations, .16; (g) total score, .36. Correla- 
tions for women were very similar, although 
somewhat lower. 

Moreover, the test also shows almost no cor- 
relation with intellectual measures. This en- 
hances its value as a predictor of academic 
achievement because it is tapping a domain not 
assessed by intellectual measures. Furthermore, 
the experienced counselor can examine the con- 
figurations of the six part scores to help gain an 
"understanding of the reasons for academic ac- 
complishment or difficulty. 

The research methodology underlying the 
test’s development impressed this reviewer as 
being basically sound. However, some doubts 
were raised concerning the criteria used for re- 
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jecting items that had satisfied the statistica] 
requirements. The 9o selected items were culled 
from a pool of 246. The search for clarity and 
meaningfulness might have led the test devel- 
oper to reject some highly valid, but subtle, 
items. Such an outcome would indeed be un- 
fortunate for this kind of item is often the most 
significant product of an empirical inquiry, us- 
ually represents new knowledge, does not put a 
respondent on his guard, and is the most diffi 
cult kind of item to fake. 

The language and style of the items is dis- 
turbingly cumbersome and awkward. The gen- 
eral flavor is of old fashioned written language 
with very little pacing and spontaneity. One of 
the properties of a good personality inventory 
item is its general resonance—the degree to 
which it sounds like “something people say." 
Such a quality encourages a response concerned 
with the question asked and the issues raised. 
The farther an item departs from this verisimili- 
tude the greater is the opportunity for extrane- 
ous and complicating response sets to manifest 
themselves. In the extreme case an item may 
be consistently rejected merely because of the 
way it is worded rather than because of the 
question asked. 

The reviewer also had some misgivings about 
the test directions. The first paragraph contains 
phrases like "high mental ability," "no better 
than average,” “intellectually superior students,” 
etc. Some students may be offended or annoyed 
by an introduction of this kind, and others may 
be made anxious. In any case, it would appear 
that an introductory paragraph could be writ- 
ten which did not put so much emphasis on st- 
perior and inferior achievement. One method 
would be merely to state that the test would in- 
quire into study habits, personal reactions 
to college work, and general academic satisfac- 
tion. 

The manual is thorough and systematic but 
suffers somewhat from intricacy and complex- 
ity. It is a good report оп the test but not enough 
of an aid to a person who wants to use the test. 
No individual case material is presented, al- 
though the fundamental conception of the test 
recommends it for individual counseling and 
diagnostic application. No explanation is aC 
vanced for the use of weighted scores, аз ОР 
posed to simpler methods. Some users may 0” 
ject to the labor involved in calculating weighted 
Scores; a reference to the test's efficacy if un- 
weighted scores are used would be helpful. The 
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pages in the manual are not numbered, which is 
a mild annoyance to the reader. 

The test does appear to fulfill a useful func- 
tion—that of evaluating and specifying some of 
the nonintellectual factors contributing to aca- 
demic achievement. It should be a useful device 
for college counselors. The limitations of the 
test do not appear to be critical, and can prob- 
ably be overcome to a great extent in future edi- 
tions. 


[35] 
Concept Formation Test. Normal and schizophrenic 
adults; 1940; individual; 1 form; no data on reliability 
and validity; no norms; mimeographed manual ['40] ; 
$13 per set of testing materials, postage extra; (10-60) 
minutes; Jacob Kasanin and Eugenia Hanfmann; 
C. H. Stoelting Co. * 


REFERENCES 

1-19. See 3:27. 

20. HANFMANN, EUGENIA. "Analysis of the Thinking Dis- 
order in a Case of Schizophrenia," Arch Neurol & Psychiatry 
41:568-79 Mr '39. * 

21. FosnerG, Irving A. ‘Multiple Solutions to the Vigotsky 
ae Abstract. Am Psychol 1:280 J1 '46. * (PA 20:3930, title 
only 

22. Cook, CuarLes НАххАРОВР. The Relation of Certain 
Factors of the Concept Formation Test to Scholastic Aptitude 
and to Scholastic Achievement. Master's thesis, Clark Univer- 
sity (Worcester, Mass.), 1947. (Abstracts of Dissertations... 
1947, pp. 78-9.) 

23. Des LAURIERS, AUSTIN, AND HALPERN, FLORENCE. “Psy- 
chological Tests in Childhood Schizophrenia.” Am J Orthopsy- 
chiatry 17:57-67 Jl °47. * (PA 21:2226) А К 

24. FosmERG, Invinc ARTHUR, “А Modification of the Vigot- 
sky Block-Test for the Study of the D Thought Processes.” 
Am J Psychol 61:558-61 O '48. * (РА 23:4126) 4 

25. ScHarer, Roy. The Clinical Application of Psychological 
Tests: Diagnostic Summaries and Case Studies. Foreword by 
David Rapaport. The Menninger Foundation Monograph Series 
No. 6. New York: International Universities Press, Inc. 1348; 
Pp. 346. * (London: George Allen & Unwin Ltd., 1949.) (РА 
23:77 "nm 

26. Fisner, SEYMOUR, Patterns of Personality Rigidity and 
Some of Their Determinants. American Psychological Associa- 
tion, Psychological Monographs: General and Applied, Vol. 64, 
ashington, D.C.: the Association, Inc., 


No. Es hole No, 397. rye 
1950. . v, 48. Paper. 24:625 
27. Norman, RaLPH D.; BAKER, Gaiaces А.; AND DOEH- 


RING, Donatp С. “The Hanfmann-Kasanin Concept Formation 
cC DR Ad c ee cea 2a) 
8256) ы ус! 365-9 5! 

Kare Levine Kocan, Clinical Psychologist, 
6034 44th Ave., N.E.; and WitL1AM S. KOGAN, 
Chief Psychologist, Veterans Administration 
Hospital; Seattle, Washington. 

'This technique continues to be best charac- 
terized as an experimental device rather than 
a test method. The problem posed often serves 
to elicit certain forms of thinking disorder, 
which, when present, can be considered pathog- 
nomonic for schizophrenia. Other failures 
demonstrate interference with conceptual be- 
havior whose exact character is not quite so 
clearly defined. It is generally agreed that poor 
performance on the test may be associated with 
both cortical lesions and the schizophrenias. 
Whether other groups do poorly in the same 
way has not been fully investigated. In all cases 
the judgment depends upon qualitative and sub- 


jective interpretation of the behavior and 
responses observed, and there are no quantita- 
tive norms or ratings upon which the inexperi- 
enced examiner may fall back. 

None of these features detracts from the 
method's usefulness in clinical case studies 
where time permits the examiner to pursue in- 
vestigations beyond the fewest number of most 
fully informative tests in the briefest time. The 
task which the test presents is sufficiently dif- 
ficult to be challenging to subjects of the higher 
intellectual brackets and hence may highlight 
a breakdown in conceptual thinking which is 
still hidden behind the patient's facade of well- 
learned responses for dealing with more familiar 
problems. Moreover, because of its difficulty, it 
introduces certain elements analogous to “frus- 
tration" and “level of aspiration" experiments ; 
in many instances one is permitted to witness 
the dynamic interplay of problem solving and 
emotional tensions as the subject proceeds to- 
ward solution. It might even prove to be worth- 
while research to record and analyse verbatim 
productions in the test situation. 

In essence, then, this concept formation test 
makes its contribution through its ingenious 
material and procedure which provide the ex- 
aminer with a sample of behavior whose im- 
portance to him will be determined by his in- 
terests, the nature of the clinical problem, 
supporting data from other tests, and many other 
factors. It cannot be recommended for routine 
psychometric practice. It can be suggested as 
providing possible enrichment for full psycho- 
diagnostic study. ^ 


For a review by О. L. Zangwill, see 3:27; for 
related reviews see 3:28. 


[36] 
*Contemporary Problems. Grades 7-9, 10-12; 
1951-52; 2 levels; revised preliminary manual ('52) ; 
no norms; $5.25 per 35 sets of test and answer sheet 
of any one level; separate answer sheets must be used ; 
75¢ per 35 answer sheets; 756 per specimen set; post- 
paid; nontimed (20-40) minutes; itizenship Educa- 
tion Project, Teachers College, Columbia University. * 
a) JUNIOR HIGH SCHOOL. Grades 7-9; Form R (’51). 
b) HIGH scHooL. Grades 10-12; Forms C (51), D 
(51). 
[37] 

XCornell Index. Adults; 1944-49; a revision, for 
civilian use, of the Cornell Selectee Index Form N and 
the Cornell Service Index; title on test sheet is eL; 
psychosomatic and neuropsychiatric symptoms; Form 
N2 [45]; revised manual (740) ; $2.25 рег 50; 356 per 
specimen set; postpaid ; nontimed (5-15) minutes; Ar- 
thur Weider, Harold G. Wolff, Keeve Brodman, Bela 
Mittelmann, and David Wechsler; Psychological Cor- 
poration. * 
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Hans J. Eysenck, Director, Psychology De- 
partment, Institute of Psychiatry, Maudsley 
Hospital, London, England. j 

The Index has тот Yes-No questions and is 
scored for general abnormality only. It has ? 
Kuder-Richardson Reliability of .95. Regard- 
ing its validity, it may be said to discriminate 
between known neurotics and known normals 
with about the same degree of success as do most 
similar questionnaires. It is a trifle disingenuous 
of the authors to say, with respect to their Table 
3, which shows the percentage of normal indi- 
viduals and neurotics identified as rejects 
various cutoff levels, that “since the Index % 
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intended to identify the neuropsychiatric unfit, 
these tables are primary evidence of the instru- 
ment's validity." Presumably, the purpose of the 
Index is not to identify neurotics already diag- 
nosed as such, but to predict neurotic break- 
down in groups still psychiatrically undiagnosed. 
On this crucial point in the use of any psychologi- 
cal test for screening, no evidence is given. The 
fact that the Cornell Index correlates with the 
Minnesota Multiphasic Personality Inventory 
scales at a level of about .6 for the neurotic and 
psychotic triads, and .3 to .5 for individual scales, 
is presented as additional evidence of validity. 
Whether such correlations can be regarded as 
proof of validity is a doubtful question; the 
present writer would emphatically discourage 
this practice. Except for the use of “stop” ques- 
tions this test contains little that is new and in 
some ways represents a return to the nonana- 
lytic scales current in the 1920’s. The Index’s 
main use is probably the screening of potential 
neurotics for the Army, the field in which it was 
most extensively used. Even there, however, 
much shorter questionnaires have been shown to 
give equally good discrimination, for example 
the 40-item Maudsley Medical Questionnaire. 
Four uses are suggested for the Cornell In- 
dex: (a) “Neurologic and Psychiatric Wards 
and Out Patient Departments.” Here it seems 
certain that the Minnesota Multiphasic Per- 
sonality Inventory would be a more useful and 
much better validated test. (b) “Medical and 
Surgical Wards and Out Patient Departments.” 
Here again, evidence favours the MMPI; or 
if only a simple maladjustment quotient is re- 
quired a much shorter type of index would be 
sufficient. (c) “Industry.” In my experience, 
questionnaires are not really suitable instru- 
ments for use in industry; and the evidence 
quoted by the authors of this questionnaire is 
not sufficient to convince a critical reader that 
this view is not correct. (d) “Research.” Any 
use of questionnaires in research altogether is 
of doubtful value; I do not feel that this par- 
ticular questionnaire is an exception to that rule. 


Netson С. Hanawatt, Associate Professor of 
Psychology, New Jersey College for Women, 
Rutgers University, New Brunswick, New 
Jersey. Й | 
The purpose of the Index is to furnish a list 
of psychological symptoms “which would serve 
as a standardized psychiatric history and a guide 
to interview; and which, in addition, would sta- 


tistically differentiate persons with serious per- 
sonal and psychosomatic disturbances from the 
rest of the population.” As an adjunct to inter- 
view, this test is probably as good as any other 
and has the added advantage of speed and sim- 
plicity in scoring. As to differentiating persons 
with serious personality difficulties from the 
normal population, it appears to be, like other 
inventories of its kind, not very valid for the 
individual case. The Index has been standard- 
ized on male military recruits. Under these con- 
ditions it has high reliability and good validity. 

The list of 21 references in the manual con- 
tains only one based upon Form N2. Since there 
is quite a difference in the number of items and 
in the content of the various forms used in these 
studies (Form N, Service Index, Selectee In- 
dex), the extent to which the validating data 
can be transferred to the present Form N2 is 
unknown. The manual is not very helpful in 
enabling one to determine which form of the 
test was used for a particular reference. For 
instance, it is claimed that the Index is useful 
in hospitals, clinics, industry, and research, 
Thirteen references are given to buttress this 
claim, none of which report findings for Form 
N2. Another difficulty is the fact that most of 
these references are to military situations—a 
very special type of situation and quite different 
from the usual civilian situation. On the whole, 
the military reports are very favorable for the 
early forms of the Index. One exception is the 
report of Weinstock and Watson (18) who 
found the Selectee Index to be of little use in 
psychiatric screening at a Naval Induction Sta- 
tion. They found that the total score was prac- 
tically valueless and that the stop questions, al- 
though “somewhat more valuable, resulted in 
very small additional identification.” Your re- 
viewer participated in a similar study at another 
naval station (unpublished), using the same 
test, with results more in line with those referred 
to in the manual. In this study the Selectee In- 
dex identified about 80 per cent of those even- 
tually discharged and cut down the number of 
men who had to be interviewed to about one 
third (79). 

If Form N2 turns out to be as good as the 
earlier forms (and there are good reasons to be- 
lieve that it will), it should prove to be of great 
value in situations suggested in the manual. The 
authors’ cautions for limitations of use should be 
very carefully followed. It was devised as “an 
adjunct to interview; not as a substitute unless 
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an interview is impractical.” In case an inter- 
view is impractical, one should be very careful 
in using the Index. In such a case an individual's 
fate should never be determined without addi- 
tional corroborating data. The Index has been 
found to be better adapted to the man on the 
street than to the college graduate in an indus- 
trial situation (33). 

The items on Form N2 are arranged accord- 
ing to various diagnostic categories. The authors 
do not suggest, and one should never attempt, 
a diagnosis on the basis of the test results alone. 

During the past year we have done some re- 
search on Form N2 using 45 college women, 
mostly sophomores. Our results strengthen the 
suggestion in the manual that the test is prob- 
ably useful for women although the standardiza- 
tion was based upon male military personnel. 
We found an r of .81 between Form № and 
the total score on Bell's Adjustment Inventory. 
Neither the Bell nor Form N2, however, showed 
a significant correlation with self-rating of ad- 
justment on an 8-point rating scale. For Form 
N2 the mean was 6.62, the SD 6.26. This is lower 
than Mann's results (see manual) for freshman 
men and quite a bit lower than the average for 

"freshman women. Our group was more selected 
and contained only five freshmen. We repeated 
the test after several weeks with the instructions 
to answer the questions in as favorable a light 

. as possible, assuming that getting a job de- 
pended upon a low score. Under this condition 
the mean dropped to 1.80, the SD 3.04. This 
underlines the authors’ warning that the Index 
should not be used for screening prospective 
employees. The girl with the highest score on 
the first testing, 28, reduced her score to zero 
under the second condition. This girl, by the 
way, was not seriously maladjusted. Her high 
score was partly due to a throat operation of 
some years past. The test also failed to pick out 
the most serious psychological case—the only 

-one of the group who had to leave college be- 
cause of her condition: her score was 12 with 
no stop answers. 

In summary the reviewer would like to em- 
phasize that Form N2 is an open questionnaire 
concerning clinical symptoms. It can be used 
effectively only under conditions where it is to 
the individual’s advantage to report truthfully 
such as in a Clinical or a military situation or 
for research purposes where the investigator has 
the full cooperation of the testee. The validating 
data are based upon military recruits. Some of 


the early forms of this test have proven their 
worth in clinics, industry, etc. The civilian use 
of this test should be checked for validity and 
usefulness in the particular situation. Where the 
individual is concerned, it should be used as an 
adjunct to the interview, not as a substitute for 
it. It fails to pick up certain types of maladjust- 
ment (see manual), and not all persons who are 
rated as seriously maladjusted by the test are 
really so. 


А 
Laurance F. $НАРЕЕВ, Professor of Educa- 
tion, Teachers College, Columbia University, 
New York, New York. 

The Cornell Index, Form N2 is a revision, 
intended for civilian use, of a psychiatric screen- 
ing questionnaire used widely during World 
War IL Its 100 yes-no items refer to neuro- 
psychiatric and psychosomatic symptoms. The 
language is nontechnical, and most examinees 
with grade school education can answer it. Five 
to 15 minutes are required, depending mainly 
on the literacy of the examinee. 

The reported Kuder-Richardson reliability is 
a satisfactory .95. The validity is not so readily 
established. In the construction of the original 
64-item military form, items were analyzed 
against a criterion of neuropsychiatric discharge 
versus acceptance at an induction station after 
a brief psychiatric interview. Retained items 
had critical ratios exceeding 2.5. The manual 
states that the additional 36 items of Form N2 
were "validated on civilian groups," but the 
nature of the groups and of the criteria is not 
specified. The only validation data presented in 
the manual spring from military sources. In view 
of the accumulated evidence that questionnaires 
were more effective under the controls and mo- 
tivations of military life, the lack of evidence of 
validity for civilian applications is a serious fault. 

Three methods are suggested for interpreting 
the scores of the Index. Method A, with a cutoff 
score of 23 unfavorably answered items, detects 
50 per cent of psychiatric “rejects,” while screen- 
ing only 4 per cent of healthy persons. Method 
B, screening those who answer 13 or more items 
unfavorably, detects 74 per cent of the “rejects 
but also falsely identifies 13 per cent of normal 
persons. 

Method C for the interpretation of the Index 
uses the "stop questions" which were a nove 
feature of the military form. The stop questions 
(e.g., *Have you ever had a fit or convulsion? 
were supposed to indicate such extreme pathol- 
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ogy that an unfavorable response even to one 
of them would justify referring the examinee 
{ог further psychiatric diagnosis. Method C 
screens in terms of 13 or more ünfavorable an- 
swers or one or more stop questions. In spite 
of the logical appeal of the stop questions, data 
in the manual show clearly that they are of no 
statistical value. At no cutoff point does the 
acceptance-rejection validity of Method C ex- 
ceed that of Methods A or B which pay no 
special attention to the stop questions. Any 
critical reader of the manual may wonder why 
the authors still recommend the use of the stop 
questions, contrary to the evidence. 

The manual contains percentile norms for 
1,298 freshmen, 836 males and 462 females, at 
one college. There is also a table showing that 
the Index has moderate correlations, mainly 
from .30 to .50, with various scales of the Min- 
nesota Multiphasic Personality Inventory when 
applied to a college population. 

The available data do not permit a compre- 
hensive evaluation of the Cornell Index. On the 
favorable side, it seems as thoughtfully compiled 
as are other instruments of its kind. Evidence 
shows that it does fairly well in predicting the 
acceptance-rejection behavior of psychiatrists at 
military induction stations. Its main shortcom- 
ing is a lack of validation against performance 
criteria in civilian situations. Educational or 
industrial applications should be preceded by 
searching experiments to test its value for the 
specific use proposed. 


[38] 

*The Cowan Adolescent Adjustment Analyzer: 
An Instrument of Clinical Psychology. Ages 12- 
18; 1935-49; a revision of Cowan Adolescent Person- 
ality Schedule (see 40: 1217); 9 scores: fear, family 
emotion, family authority, feeling of inadequacy, non- 
family authority, immaturity, escape, neurotic, com- 
pensation; Form 2 ('49—same as Form І copyrighted 
in 1946 except for slight changes in profile chart and 
format); manual ('49); $2.65 per 25; 40€ per speci- 
men set; postpaid; nontimed (10-30) minutes; Edwina 
A. Cowan, Wilbert J. Mueller, Edra Weathers (test 
only), and Bentley Barnabas (manual); Bureau of 
Educational Measurements, Kansas State "Teachers 
College of Emporia. * 


REFERENCES 
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1-3. See 40:1217. Ber 3 F 

4. ZAKOLSKI, F, C. “Studies in Delinquency: I, Personality 
Structure of Delinquent Boys.” J Genetic Psychol 74:109-17 
Mr ’49. * (PA 23:4925) 

For reviews by Harold H. Abelson and Wil- 
liam U. Snyder, see 3:30; for a review by Good- 
win Watson of an earlier edition, see 40:1217; 
for a review by Harold E. Jones of an earlier 


edition, see 38:918. 


[39] 

C-R Opinionaire. Grades 11-16 and adults; 1935; a 
disguised measure of conservatism and radicalism; 
Forms J, K; booklets containing both forms are also 
available; $1.50 per 25 double-form booklets; $1 per 
25 single-form copies; postpaid; (20 or 40) minutes ; 
Theodore F. Lentz and colleagues ; Character Research 
Association, 946 Goodfellow, St. Louis 12, Mo, * 


REFERENCES 

1-5. See 40:1212. 

6. Dexter, Ему S. “Personality Traits Related to Con- 
muy em aad Radicalism.” Char Pers 7:230-7 Mr '39. * 

13:3747. 

7. ALPERT, R., AND Sarcent, S. S. “Conservatism-Radicalism 
Measured by Immediate Emotional Reactions.” J Social Psychol 
14:181-6 Ag '41. * (PA 1631050) 1 

8. SAPPENFIELD, Bert R. "ideological Agreement and Dis- 
agreement Among Religious Groups.” J Abn & Social Psychol 
38:532-9 О '43. * (PA 18:82 

9. Brower, DANIEL, “The elations of Visuo-Motor Conflict 
to Personality Traits and Cardio-Vascular Activity." J General 
Psychol 38:69-99 Ja '48. * (PA 22:3383) 

то. Ктозток, О. P. The Relation of Serial Position on the 
Lents Conservatism-Radicalism Scale to Affirmative Answers to 
General Domestic Acts. Master’s thesis, University of North 
Dakota (Grand Forks, N.D.), 1948. 

Сковсе W. Hartmann, Professor of Psychol- 
ogy and Chairman of the Department, Roosevelt 
College, Chicago, Illinois. we 

For a “social attitudes” measure developed in 
the Upper Mississippi Valley a score of years 
ago, this instrument contains remarkably few 
signs of obsolescence, a major peril from which 
such scales suffer when concrete contemporary 
controversial issues are incorporated as test, 
items. If a specific quantification of so elusive 
a variable as political and economic “liberalism” 
is desired, better devices (cf. Thurstone) are 
available, but for a comprehensive estimate of an 
individual’s disposition to welcome or resist ra- 
tionally-grounded proposals for change in such 
diverse areas as science, technology, religious 
beliefs and practices, education, sex and family 
life, national and international organization, 
etc., this well seasoned Washington University 
product probably supplies as convenient and 
useful a composite score as any existing alterna- 
tive. 

Each available form consists of 60 sympto- 
matic propositions, survivors of 437 original en- 
tries, such as "Cleanliness is à more valuable 
human trait than curiosity," and "It is not fit- 
ting that a statue of Einstein should occupy a 
niche in Dr. Fosdick’s Riverside Drive Church.” 
The respondent's agreement is then checked 
with a key that was validated beyond the prima 
facie stage as follows: “Experienced judges 
agree with regard to the key. People who call 
themselves conservative or middle-of-the-road 
and who have not changed their church or who 
did or would vote for Hoover or Smith or 
Roosevelt, or who are enrolled in the small de- 
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nominational colleges, make respectively higher 
conservatism scores on the test than those who 
rate themselves as radical or who have changed 
their church or who would or did vote for Nor- 
man Thomas or who are enrolled in large uni- 
versities.” (Page 4 of Lentz's extensive and 
entertainingly-written manual.) 

Among the advantages possessed by this test 
are its brevity, self-administering character, un- 
pretentious but adequate format, percentile 
norms from 580 college students ( Table I), ade- 
quate reliability with a self-r of .83, and a 
uniquely commendable Column 6 in Table II 
giving the substantial percentage of "reaction 
reversals" when retested. Noticeable limita- 
tions are: the questionable disguised nature of 
the variable, the apparent assumption that radi- 
calism merely demands drastic and speedy in- 
stitutional reconstruction stressing goals to the 
neglect of procedure, the paucity of items re- 
ferring to war and violence (now a central cul- 
tural issue), a doubtful separation of the 
“acquiescent” or conformity tendency from con- 
servative convictions as such, and in a very few 
propositions a failure to allow for the distinc- 
tion between pseudo- or partial-radicalism and 
authentic or more thorough-going radicalism, 
thereby paradoxically giving the latter a con- 
servative weight by default! Eliminating the 
ten weakest or most debatable items in each 
form would almost certainly strengthen both 
the inherent technical merit and the pro- 
fessional utility for “applied” social science 
inquiries of the remaining “streamlined” 
fifty. 

In sum, this modest Opinionaire continues to 
serve a defensible and worthwhile function in 
those teaching and research situations where its 
content is relevant and mature interpretation is 
assured, 


For a review by Goodwin Watson, see 4o: 
1212; for a review by Н. Н. Remmers, see 
38:899. 


[40] 
*The Empathy Test. Ages 13 and Over; 1947-51; 
Forms A (47), В (’51); manual ('51), 2 mimeo- 
graphed supplements ('51); $2.25 per 25; postage 
extra; $1 per specimen set; postpaid; nontimed (15) 
minutes; Willard A. Kerr and Boris J. Speroff ; Psy- 
chometric Affiliates, Box 1625, Chicago 9o, Ill. * 


[41] 
Every-Day Life: A Scale for the Measure of 
Three Varieties of Self-Reliance. High school; 
1041; 3 scores: independence, resourcefulness, respon- 
sibility; IBM; 1 form; $175 per 25, postage extra; 
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50€ per specimen set, postpaid; separate answer sh 
may be used; 50€ per 25 IBM answer sheets; $3.50 06 
set of stencils for machine scoring of answer sheets: 
$3.50 per set of stencils for hand scoring of answer 
sheets; postage extra; nontimed (30) minutes; Leland 
H. Stott; Sheridan Supply Co. * 


REFERENCES 
1-6. See 3:38. 


Hanorp E. Jones, Professor of Psychology, and 
Director, Institute of Child Welfare, U. niversity 
of California, Berkeley, California. 

This is a test of three aspects of “self-reliance,” 
derived through a factor analysis of a prelimi- 
nary list of items (patterned after those used 
by Maller in his Character Sketches), and an 
item validation of a series of subsequent forms, 
The test material as published consists of 150 
questions such as “Is it hard to find something 
really interesting to do during your spare time?” 
and “Do you do extra work or unassigned read- 
ing in your courses?” Five alternative answers 
are given for each question, signifying a range 
in frequency from “always” to "never." A hand 
scoring key is provided for three scales: (a) 
independence in personal matters (autonomy, 
independence of decision) ; (b) resourcefulness 
in group activities (social initiative) ; and (c) 
personal responsibility (dependability, conscien- 
tiousness). “Centile norms” are given for each 
of these scalés, but since they are based only on 
380 cases from a single high school (in Lincoln, 
Nebraska) they must be regarded only as local 
norms, to be utilized elsewhere chiefly for the 
purpose of research comparisons. The estimated 
reliability of the scales, based on several de- 
terminations, ranges from .84 to .94, and their 
intercorrelations from .42 to .64. The latter val- 
ues may be unexpectedly high, in view of the 
fact that they are regarded as representing fac- 
tors which constitute an orthogonal system of 
dimensions. No correlations are reported with 
age, although it should be expected that a mean- 
ingful measure of self-reliance would exhibit 
age changes in the grade range (9 to 12) for 
which the test is designed and for which norma- 
tive data are presented. A number of commend- 
able features may be noted in the development 
of this test, but the manual provides no evidence 
of further standardization or of validation stud- 
ies during the past decade. Additional research 
is needed to show whether the validity regarded 
as implicit in the three scales (resulting from 
the method of scale construction) is maintained 
against other criteria. Until such evidence is 
presented, together with more adequate norms, 
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the instrument will be serviceable chiefly for 
a rather limited range of research purposes. 


For a review by Albert Ellis, see 3:38. 


[42] 
Examining fog Aphasia and Related Disturb- 
ances. Ages 8 and over; 1946; diagnostic ratings of 
4 types of disturbances: agnosias, receptive aphasias, 
apraxias, expressive aphasias; individual; 1 form; no 
norms; $6.25 per manual and 50 record forms; $4.50 
per 50 record forms ; $2.75 per manual; postpaid; (30- 
120) minutes; Jon Eisenson; Psychological Corpora- 
tion. 
REFERENCES 


1. BLATT, BENJAMIN. The Problem of Language Localization 
Into Specific Brain Areas: Paychologicn Tests as a Means of 
Localizing Brain Lesions in Patients With Aphasia. Doctor's 
thesis, New York University (New York, N.Y.), 1949. (Micro- 
film Abstracts...., 1950, pp. 145-7.) (PA 25:454, title only) 


D. Russert Davis, Reader in Clinical Psychol- 
ogy, University of Cambridge; and Honorary 
Consultant Research Psychiatrist, United Cam- 
bridge Hospitals; Cambridge, England. 

The careful testing of language and related 
functions may form an essential part of neuro- 
logical examination or, a diagnosis having been 
made, may be necessary for the evaluation of a 
dysphasia in order to plan a course of retrain- 
ing or to assess progress in retraining. Lan- 
guage disorders, however, vary so widely in 
degree and kind that, for whatever purpose they 
are required, tests have to be flexible. Rigidly 
standardised tests have no point. Scores are not 
required. As the author properly remarks, clini- 
cal judgments and not psychometric results are 
important, and this is not a field for the routine 
tester. In the reviewer's opinion, the author has 
gone about as far as is desirable in standardising 
the testing procedure. 

The author's aims have been modest, but he 
has succeeded in assembling useful materials for 
a wide range of tests. Most of those who work 
with cases of aphasia will have assembled simi- 
lar materials for themselves; Henry Head, as 
well as other more recent authorities, provides 
adequate instructions in his monograph for so 
doing. Yet, the author's contribution is useful, 
for he has provided a degree of standardisation 
and his materials are unobjectionable, whether 
the tests are applied to British or American pa- 
tients, They contain no novel features and, per- 
haps, they are to be recommended as much for 
their convenience as for any other reason. 

The tests cover a wide range of language func- 
tions and are intended more for the planning of 
retraining than for diagnosis. By their use an 
adequate extensive survey of the language func- 
tions can be carried out, but they do not form 


a complete examination in themselves; they do 
not excuse a more intensive and thorough study 
of the defects revealed in the survey. Again, 
they should only be resorted to after other less 
formal methods of examination. They can con- 
veniently be used in conjunction with such other 
tests as the vocabulary test of the Wechsler- 
Bellevue Intelligence Scale, other vocabulary 
tests, and tests of abstraction. 


J Nerv & Mental Dis 109:89 Ja’49. N(olan) 
D. C. L(ewis). * The method of evaluation and 
of recording the results is particularly useful in 
research work in this field, according to the 
reports the reviewer has had from those actively 
engaged in the study of aphasia. 

Nerv Child 6:343 Jl '47. This is a well pre- 
pared test for aphasia, with directions for its 
use. 


For a review by С. R. Strother and an excerpt 
from a review, see 3:39. 


[43] 
*Fels Parent Behavior Rating Scales. “For the 
use of the trained home visitor in appraising certain 
aspects of parent-child relationships”; 1937-49; 30 
scores: adjustment of home, activeness of home, dis- 
cord in home, sociability of family, coordination of 
household, child-centeredness of home, duration of con- 
tact with mother, intensity of contact with mother, 
restrictiveness of regulations, readiness of enforcement, 
severity of actual penalties, justification of policy, de- 
mocracy of policy, clarity of policy, effectiveness of 
policy, disciplinary friction, quantity of suggestion, 
coerciveness of suggestion, accelerational attempt, gen- 
eral babying, general protectiveness, readiness of criti- 
cism, direction of criticism, readiness of explanation, 
solicitousness for welfare, acceptance of child, under- 
standing, emotionality toward child, affectionateness 
toward child, rapport with child; 1 form (’37~47) ; 
manual (’49—see reference 14 below); $1 per set of 
scales; $1.50 per manual; postpaid; manual by Alfred 
L. Baldwin, Joan Kalhorn, and Fay Huffman Breese; 
Horace Champney; Fels Institute, Yellow Springs, 


Ohio. * 
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Toward Three-and-Nine-Year-Old Children." J Personality 15: 
143-65 D '46. * (PA 21:3296) 4 J 
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1949. Рр. YE вре А Factorial Study of the Fels Parent 
Эң Scales." Child Develop 20:29-45 Mr '49. * (PA 24: 
Date B. Harris, Professor, Institute of Child 
Welfare, University of Minnesota, Minneapolis, 
Minnesota. 

Child development research has long needed 
a more adequate device than the case report for 
assessing parent behavior toward the child and 
psychological aspects of the home environment. 
Such a tool also should be of great help to clini- 
cians, social workers, and others concerned with 
the description and evaluation of a home’s im- 
pact on a child. The Fels scales go a long way 
toward supplying these needs. 

Formulated by Champney as a result of an 
extensive analysis of the psychological aspects 
of child rearing, the 30 rating scales now offered 
have been carefully selected, revised, and proved 
over a period of 10 years. A considerable body 
of sound research is available to support their 
usefulness. The present scales require the rater 
to review systematically various aspects of the 
parent’s relationship to a particular child and 
to record his estimates in terms of linear rating 
scales. A general description of each character- 


istic is offered, to set it apart from other char- 


acteristics. The rater registers his impression 
along a linear scale, aided by from five to seven 


verbal descriptions or "cue points" distributed 


along the continuum. А set of ratings ordinarily 
would be completed after two home visits and 
interviews of approximately two hours each. 
'The author's effort to make the scales objec- 
tive and reliable may be seen in several addi- 
tional features of the rating procedure. Space to 
rate то children appears on each sheet, so that 
comparisons from child to child as well as to 
the defined points of the scale may increase the 
meaningfulness of estimates. For each child the 
rater is required to indicate on the linear scale 
the limits within which he would accept the 
judgment of another rater as essentially agree- 
ing with his own. He is further asked to estimate 
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the limits of the parent's ordinary variation in 
behavior along the continuum indicated. In each 
case, he also rates the certainty of his judgment, 

The manual furnishes a satisfying amount of 
evidence on validity and reliability. For the 30 
scales, two sets of ratings by a relatively inex- 
perienced rater on the same sample of approxi- 
mately тоо families, separated by a six months 


interval, correlate from .53 to .85, half of the 


scales yielding correlations above .70. This same 
rater, after several years of experience in evalu- 
ating families, produced results which, over a 
period of six months, correlated from .62 to .9o, 
nearly half of the correlations now exceeding 
.8o. Two different raters evaluating independ- 
ently a series of families during simultaneous 
visits agreed to the extent of correlations vary- 
ing from .26 to .88 for the 30 scales; 22 of the 
30 correlations exceeded .65. Means and stand- 
ard deviations calculated from raw scores are 
gratifyingly similar among different raters, fur- 
ther indicating that some common basis of judg- 
ment is apparently operating. 

Raw score norms are not available. The orig- 
inal research purpose for which these scales were 
devised embraced a longitudinal study of oyer 
100 families, each to be rated twice a year. The 
authors of the manual suggest that wher te 
scales are used to compare families, the ratings 
be accomplished by one person and the raw 
scores be reduced to standard measures in terms 
of the mean and standard deviati-.)s of raw 
scores for each scale separately. 1 :us, results 
from the several scales can be directly compared. 

It should be clear that these scales are useful 
only in the hands of a skilled interviewer. It 
must be further pointed out that no interviewer 
or home visitor, no matter how skilled, can un- 
dertake these ratings without careful study of 
the manual and the scales themselves to familiar- 
ize himself with the cue points and to consoli- 
date his “philosophy” of home evaluation. Fur- 
thermore, research evidence shows that a rater 
gradually shifts his interpretations and estimates 
of the several dimensions of family environment 
over a period of years—added evidence of the 
need for periodically reducing raw scores 0 
standard scores. 

АП arguments point toward the soundness of 
such a standardized approach to the evaluation 
of complex phenomena. Research evidence 11" 
dicates that, as the clinician systematizes his 
judgments, they improve in accuracy and value. 
Plenty of room remains for clinical art; the 
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rater is encouraged to note down all qualifica- 
tions, subjective impressions, and other evalua- 
tions which occur to him to clarify the basis for 
his judgment. Several research papers listed in 
the attached bibliography provide material on 
the interrelations of these dimensions of parent 
behavior and supply shrewd clinical appraisals 
of the impact of various "syndromes" of factors 
on child behavior. 


The 30 scales for the rating of important as 


pects of parent behavior toward children afford 
not only important research tools but also useful 
clinical devices wherever a worker contacts a 
population of parents in order to describe or to 
evaluate the psychological “atmospheres” of 
homes. The scales are only devices for sys- 
tematizing judgments—they are not tests or 
precise measures. Within the recognized limits 
of the rating method, they afford an exceedingly 
useful adjunct to the case report. While the scales 
record judgments essentially relative to the 
standards of a particular rater, research data 
suggest that they have fairly general meaning 
within a sample of lower and middle class occu- 
pational groups in the Midwest. This reviewer 
would like to see such devices used by child 
placement agencies and child guidance clinics 
M um as in researches on child development. 


^ [44] 

*Gardner Behavior Chart. Mental patients; 1939; 
ratings on 15 aspects of behavior; 1 form; $1 per 100, 
postpaid; specimen set not available; (5-10) minutes; 
Paul H. Wilcox; the Author, 526 West 1oth St., Tra- 
verse City, Mich. * 
i REFERENCES 

1. Wincox, Paur Н. “The Gardner Behavior Chart.” Am J 


Psychiat 3874-80 My '42. * 

E EE f n i AND Rosinson, Mary FRANCES. “An 
Evaluation of Prefrontal Lobotom Through Ward Behavior. 
J Abn & Soctal Psychol 40:61-9 Ja 545. * (PA 19:1956) 


[45] 
*General Goals of Life Inventory: General Edu- 
cation Series. College; 1942-50; 20 scores, each rep- 
resenting a “life goal”; 1 form, ’50; no data on relia- 
bility and validity in manual (for data presented 
elsewhere, see 6 below) ; manual (^50) ; norms ['42] ; 
separate answer sheets must be used ; $2.50 рег 25; 
per 25 answer sheets; cash orders postpaid; 256 per 
specimen set, postpaid; nontimed (50) minutes; devel- 
oped by the Cooperative Study in General Education 
(Ralph W. Tyler, Director); Cooperative Test Di- 
vision, Educational Testing Service. * 
REFERENCES Mus andi 
Ч Ө nventory of Students" ie: 

Genie of Lite?” Ea yt tpe 4157-95 su 44. * (PA 19: 
467 ИЗ 

^ Sister. “Ап Investigation Into the 
Life Goal of College Student” Cath Ed К 45:475-82 О "47. * 


‚ Can $ The Use of the Inventory in the Philoso- 
hy Ca pp. 55-62, In Harold Baker Dunkel's General 
Nd it in the Humanities. Washington, ,D.C& pai 
Council on Education, 1947. ae xix, 325, (РА 22434 3L 
4. Cooperative STUDY IN GENERAL EDUCATION, XECUTIVE 
COMMITTEE, RALPH . Tyrer Drrecror. "First (project: 
General Goals of Life,” pp. 83-98. In Cooperation in Gene: 
Vi 
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Education, Washington, D.C.: American Counci i 
1947. Pp. xvii, 240. * (РА 22:432) Pn Исанов, 

5. DUNKEL, Нахор BAKER, General Education in the Hu- 

manities, pp. 21-78, 108-120, 267-89. Including sections by 
С. W. Cannom (pp. 55°62) МЕНЕ Walters (рр. 63-70); 
Pauline R. Hoeltzel, Gladys К. rown and Dell Park Me er- 
mott (pp. 70-5); and Charles F. Sawhill Virtue (pp. 278-89). 
Foreword 5 ph W. Tyler. Washington, D. American 
Council on Education, 1947. Pp. xix, 323. * (РА 22:434) 
6. Dunxet, „HaroLD Baxer. Chap. 2, "Student's General 
Goals of Life,” pp. 21—54, 05-8, 267-77. In his General Edu- 
cation in the Humanities, Washington, D.C.: American Coun- 
cil on Education, 1947. Pp. xix, 323. * (PA 22:434) 

7. HOELTZEL, PAULINE R.; Brown, GLanys К.; лмо McDzn- 
MOTT, DELL PARK, “The Use of the Inventory by the Depart- 
ment of English,” Pp. 75-8. In Harold Baker Dunkel’s General 
Education in the. wmanities, Washington, D.C.: American 
Council on Education, 1947. Ер, xix, 323. * (PA 22:434) 

8. JONES, Jameson M. “The Use of Inventories of Life- 
Goals and of Religious Concepts," pp. 107-20. In Harold Baker 
Dunkel's General Education in the Humanities. Washington 
YR Council on Education, 1947. Pp. xix, 323. Li 

1434 
‚ 9. VIRTUE, CHarLES Е, SAWHILL. appends B, “The Valid- 
ity of the Inventory of General Life-Goals in Correlation With 
Other Inventories,” pp, 278-89. In Harold Baker Dunkel’s 
General Education in the Humanities. Washington, | D.C.: 
American Council on Education, 1947. Pp. xix, 323. * (РА 


22:434, 

то. WALTERS, ANNETTE. “The Use of the Goals of Life In- 
ventory," pp. 295 In Harold Baker Dunkel's General Edu- 
cation in the Humanities. Washington, D.C.: American Coun- 
cil on Education, 1947. Pp. xix, 323. * (РА 22:434) 


C. Ковект Pace, Chairman, Department of 
Psychology, Syracuse University, Syracuse, 
New York. 

Twenty statements of life goals arranged in 
190 paired comparisons constitute the substance 
of this inventory. The number of times each 
statement is preferred to all others is the score 
for that goal. The answer sheet contains a simple 
profile chart on which the student can sum- 
marize his own responses; the chart shows 
graphically how his rankings compare with the 
median and middle 50 per cent of 2,248 students 
in 16 colleges who took the inventory at the time 
when the Cooperative Study in General Edu- 
cation was in progress. Yt. 

No statistics on reliability of the inventory are 
given in the manual. The reader is referred to 

Appendix A of Dunkel's General Education in 

` the Humanities (6), where he will find test- 
retest coefficients on one group of 16 students 
and another group of 27 students. These relia- 
bilities are generally in the .80’s and .go’s. 
Kuder-Richardson reliabilities of similar mag- 
nitudes are also reported. The reviewer's ex- 
perience with paired comparison tests would 
lead him to conclude that students’ responses to - 
the inventory are probably quite reliable. 

It is probable that the most appropriate use 
of the inventory is not so much as a testing 
instrument to compare one group with another 
but rather as a teaching device for promoting 
self-understanding among students. This use is 
clearly recognized by those who constructed the 
profile chart whereby each student can sum- 
marize and interpret his own rank order of pref- 
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erence, gaining self-insight in the process. The 
extent to which the inventory may be used in 
various colleges around the country will prob- 
ably be small if it is thought of as a test; its use 
could be widespread and beneficial if professors 
would take the point of view that teaching and 
learning and evaluation are intimately related 
to one another and that the process of evaluation 
is in itself a significant teaching method and 
an effective learning experience. The inventory 
makes a useful contribution to the need for and 
importance of evaluating some of the more in- 
tangible goals of education. 


Leona E. Туге, Associate Professor of Psy- 
chology, University of Oregon, Eugene, Oregon. 

This inventory was developed by the Co- 
operative Study in General Education. In con- 
nection with the evaluation of work in the hu- 
manities, it was thought desirable to develop an 
objective instrument which would accomplish 
with more precision the purpose which is served 
by having students write essays on their philoso- 
phies of life. The authors hoped that it would 
thus be possible to assess changes in students' 
philosophies growing out of educational influ- 
ences and to compare various groups of students 
with each other. The inventory consists of 190 
paired comparisons by means of which 20 life 
goals are ranked according to their importance 
for the individual. 

The first question that might be raised is with 
regard to the choice of goals. Dunkel states only 
that the makers of the test drew heavily upon 
papers already written by students and on the 
experience of teachers, and that they included 
both statements of traditional philosophical and 
religious positions and expressions of “cracker- 
barrel" philosophy (6). It seems doubtful that 
an' adequate representation of actual goals of 
life for college students can be obtained by these 
methods. А 

The best evidence presented as to the relia- 
bility of the inventory is based on two small 
groups of students, one class of 16 that was re- 
tested after two days and a class of 27 retested 
after 20 days. In general, the correlations would 
indicate that the reliability is high with the two- 
day interval, moderate with the longer lapse of 
time. Some students show much more consist- 
ency than others. 

The case for the validity of the inventory is 
made on the basis of the correspondence between 
what it reveals and what shows up in other tests 
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developed for the study, the Inventory of Be. 
liefs About Postwar Reconstruction, and the 
Inventory of Religious Concepts. The material 
as presented makes it difficult to judge just how 
much relationship there is, and the fact that the 
two other measuring instruments are at least as 
uncertain as the one being validated leaves the 
whole issue doubtful. 

The most serious criticism of the inventory, 
however, is its unanalyzed assumption that we 
can accept these stated life goals at their face 
value without investigating the personality dy- 
namics underlying them. Any one of these philo- 
sophical positions or any combination of them 
could be determined largely by rationalization, 
projection, identification, reaction formation, or 
any one of a number of well known psychologi- 
cal processes. Is any useful purpose to be served 
by obtaining these verbal statements by them- 
selves? It would seem that the objectivity and 
comparability at which the authors aimed is a 
dubious gain, when we consider the greater flex- 
ibility of the essay type of expression the inven- 
tory was designed to replace. Perhaps the pro- 
vision at the end of the inventory for a brief 
statement by the student is for the purpose of 
retaining some of these advantages. ' 

The illustrations Dunkel gives of the use of 
the inventory in educational practice do not 
convince this reviewer that this test has a use- 
ful function in general education. Its wide use 
in colleges having clearly stated philosophies 
of their own could tend to produce in stu- 
dents hypocrisy rather than sincerity, con- 
flict and anxiety rather than stability and in- 
tegration. 


[46] 
[Generalized Attitude Scales.] Grades 7-16; 1934- 
38; each scale has space for indicating attitude toward 
five attitude variables; 13 scales; Forms A, В; mimeo- 
graphed; no data on reliability and validity in manuals 
(for data presented elsewhere by authors, see refer- 
ences) ; 2€ per scale; 156 per specimen set of any one 
scale; postpaid; 3(5) minutes per attitude variable; 
directed and edited by H. H. Remmers; Division ot 
Educational Reference, Purdue University. * 
а) A SCALE FOR MEASURING ATTITUDE ANY AD- 
VERTISEMENT. 1938; Form A, only; Ruth E. Henion. 
b) A SCALE FOR MEASURING ATTITUDE TOWARD ANY DIS- 
CIPLINARY PROCEDURE. 1936; V. К. Clouse. / 
C) A SCALE FOR MEASURING ATTITUDE TOWARD AN! 
HOME-MAKING ACTIVITY. 1934; Beatrix Kellar. 
d) A SCALE FOR MEASURING INDIVIDUAL AND GROUP 
"MORALE". 1936; Laurence Whisler. 
€) A SCALE FOR MEASURING ATTITUDES TOWARD ANY 
PLAY. 1935; Mildred Dimmit. 
f) A SCALE FOR MEASURING ATTITUDES TOWARD ANY 
PRACTICE. 1034; Н. W. Bues. %; 
9) A SCALE FOR MEASURING ATTITUDES TOWARD ANY 
PROPOSED SOCIAL ACTION. 1935; Dorothy M. Thomas. 
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h) A SCALE FOR MEASURING ATTITUDE TOWARD RACES 
AND NATIONALITIES. 1934; Н. Н. Grice, 

i) A SCALE FOR MEASURING ATTITUDE TOWARD ANY 
SCHOOL SUBJECT. 1934; Ella B. Silance. 

]) A SCALE FOR MEASURING ATTITUDE TOWARD ANY IN- 
STITUTION. 1934; Ida B. Kelly. 

k) A SCALE FOR MEASURING ATTITUDES TOWARD ANY S0- 
CIAL SITUATION. 1938; Elna Huffman. 

1) A SCALE FOR MEASURING ATTITUDE TOWARD ANY 
TEACHER. 1935; L. D. Hoshaw. 

т) A SCALE FOR MEASURING ATTITUDES TOWARD ANY 
VOCATION. 1934; Harold E. Miller. 
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. TERRY, AND JENSEN, Јонм A. "Interrela- 
and Ethnic Attitudes in Selected Southern 


Donatp T. Campsett, Assistant Professor of 
Psychology, The University of Chicago, Chi- 
cago, Illinois. 

These tests were reviewed in The тодо Mental 
Measurements Yearbook. They are being re- 
viewed again primarily because they seem ќо be 
the only social attitude tests being published and 
sold since the Thurstone scales were allowed to 
go out of print. This should not be taken to 
mean, however, that they represent the most 
used tests. On the contrary, the present reviewer 
is aware of only one recent published use (46). 
In the flourishing area of research on attitudes 
toward minority groups, the general practice 
seems to be for the researcher to reduplicate all 
or parts of uncopyrighted tests, or to develop his 
own. Much used are the Bogardus Social Dis- 
tance Scale, the Likert test of attitudes toward 
Negroes,* and, recently, the Anti-Semitism and 
Ethnocentrism scales developed by Adorno, 
Frenkel-Brunswik, Sanford, and Levinson.* 

Any person selecting attitude tests for re- 
search purposes should be aware of McNemar’s * 
scathing review of the literature, even though 

he may not in the end agree with all of Mc- 
Nemar's evaluations. With regard to the Rem- 
mers scales, McNemar has this to вау: 

It is the writer's opinion that the comparability of 
scales constructed by the method of equal appearing 
intervals has not been sufficiently well demonstrated to 
permit the direct comparison of means based on differ- 
ent scales. Remmers and followers have been the most 
persistent in this practice. * 

The Thurstone scaling method as used by him and 
his students has always involved scaling for some par- 
ticular attitude. Remmers and Silance proposed in 
1934 that the method could be adapted for the purpose 


-of constructing "generalized attitude scales." * This as- 
sumes that a series of statements can be scaled as ap- 


1 Bogardus, Emory S. Immigration and Race Attitudes. 
Foreword by Jerome Davis. Boston, Mass.: D. C. Heath & Co., 
1928. Pp. xi, 268. * (PA 2:3223) 

2 Likert, Rensis. A Technique for the Measurement of Atti- 
tudes, Archives of Psychology, No. 140. New York: Archives 
of БАТ Ооу, Columbia University, 1932. Pp. ss. Paper. (РА 
7:1885 

з Adorno, T. W.; Frenkel-Brunswik, Else; Levinson, Daniel 


].; and Sanford, R. Nevitt; in collaboration with Betty Aron, + 


Maria Hertz Levinson, and William Morrow. The Authorita- 
rian Personality. New York: Harper & Brothers, 1950. Pp. 
xxxiii, одо. * (PA 2435796) H 4 

+ McNemar, Quinn. “Opinion-Attitude Methodology." Psy- 

‚ chol В 43:289-374 J1'46. * (PA 20:4703) 


plicable to a class of objects or phenomena, such a; 
races, or school subjects, or proposed social changes * 
Thus one sorting and subsequent scaling provides а 
set of statements which serve for measuring attitude 
toward, for example, any institution such as war, mar- 
riage, communism, Sunday observance, and divorce, 
The advantage of the generalized scale is supposed to 
reside in the fact that rap attitudes can be measured 
without great expenditure of time otherwise required 
for sorting and scale construction. 

To make the scale applicable to any one of the phe- 
nomena in a class, the respondent is simply told that he 
is to keep in mind the particular attitude object as he 
checks the statements. That this procedure can become 
quite ridiculous is illustrated by the following items: 
“[war or marriage] does not consider individual dif- 
ferences”; “[marriage] gives too little service”; 
"[marriage] satisfies only the most stupid with its 
services"; "[marriage] offers opportunity for indi- 
vidual initiative"; “[war] is too conservative”; "[war] 
is too changeable in its policies"; [marriage] appeals 
to man's lowest nature." 

In order to establish the validity of the generalized 
attitude scaling technique, one of Remmers' students 
constructed a scale specifically for attitude toward 
teaching and then correlated the scores therefrom with 
scores based upon the generalized attitude toward any 
occupation checked for teaching. The correlation cor- 
rected for attenuation was only .58, but this is said to 
"show a satisfactory validity" for the generalized scal- 
ing scheme even though the validity of the specific scale 
is unknown. Remmers and his students have practically 
placed attitude scaling on a mass production basis, and 
have proceeded to use the scales as tools for research 
on a variety of topics. Individual citation of these stud- 
ies would add more than 40 titles to a bibliography on 
attitude research. The fact that the dozens of reliability 
coefficients reported tend to have a median value of 
about .70 with values as frequently below .50 as above 
8o, and ranging down to .07, would indicate that a 
great deal of uncritical energy has been expended, 
These reliabilities are lower than those usually found 
for the Thurstone scales even though from 50% to 
100% more items are included in the generalized scales. 
Evidently the latter scales contain dead timber. 


In all fairness it must be said that McNemar's 
discussion, while accurate, accentuates the nega- 
tive side of the picture. In the five instances re- 
ported in which Thurstone scales were studied 
in conjunction with Remmers scales, the relia- 
bilities of the Remmers scales were higher in 
four (14, 18). For attitudes toward communism, 
the Remmers reliability was .89 and the Thur- 
stone was .78. For other attitude topics the cor- 
responding reliability coefficients were as fol- 
lows: War, .77 and —.05; Sunday Observance, 
98 and .90; the Negro, .84 and .87 ; the Chinese, 
.77 and .55 (14, 18). (Note the —05 interform 
reliability coefficient obtained for the Thurstone- 
Peterson attitude toward war scale.) The low 


‘reliabilities which contribute the bulk of Mc- 


Nemar's tallies come mainly from grade and, 
high school students, and their parents, where 
literacy and motivation might be low. They аге 


also associated with topics about which attitudes | 
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may not exist, and for which any method might 
give low reliabilities. Such topics include atti- 
tudes toward vocations, attitudes toward various 
aspects of government regulation of farmers, 
etc. The two lowest values, .07 and .08, are for 
the attitudes of mothers and fathers toward the 
public school (40). 

Most of the scales are now over 15 years old. 
In the interim, emphasis in test construction has 
shifted from the precise specification of item 
intensity to the more important problem of in- 
ternal consistency (unidimensionality or single 
factoredness). Intensity scoring has tended to 
drop out as it became realized that the gross 
number of items endorsed in a favorable direc- 
tion was functionally equivalent to the extremity 
of the endorsed items. These trends are epito- 
mized in the Likert ? and Guttman ° scale con- 
struction methods. In Thurstone’s full method 
of equal appearing intervals, internal consist- 
ency is taken care of by the coefficient of ir- 
relevance.® However, this tedious procedure was 
rarely used even by Thurstone’s own students, 
and there is no evidence that it was ever applied 
in the construction of any of the Remmers scales. 
Another important consideration which recent 
research has provided is the problem of response 
sets.” In these terms Remmers’ procedure of 
reusing the same statements in measuring dif- 
ferent attitudes might be expected to enhance 
reliability and correlations by adding consistent 
sources of variance irrelevant to the tester’s 
purpose. In the most recent addition to the gen- 
eralized scales, on “attitude toward any adver- 
tisement,” another opportunity for irrelevant 
response sets is added through the use of seven 
levels of endorsement (from A! to D!) to each 
statement. 

A more general question remains: Are any 
generalized attitude scales likely to be of use? On 
the one hand, general statements of liking and 
disliking are appropriate to almost any social 
object and can be found in almost every attitude 
test. These make appropriate ingredients for a 
generalized rating scale, or test, although con- 


5 Stouffer, Samuel A.; Guttman, Louis; Suchman, Edward 
A.; Lazarsfeld, Paul F.; Star, Shirley A.; and Clausen, John 
A. Studies in Social Psychology in World War II, Vol. 4. 
Princeton, N.J.: Princeton University Press, 1950. Pp. x, 756. 
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does Lee J. “Response Sets and Test Validity." Ed & 
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stant repetition of variations on this theme makes 
for dullness by the time one gets up to 20 or 40 
items. Social distance preference statements are 
probably in this same category, and the Bo- 
gardus Social Distance Scale seems to be a mean- 
ingful generalized scale. The remainder of the 
items found in social attitude tests deal with 
stereotypes of a specific, quasi-factual nature. 
Such stereotypes have a way of aggregating 
around social objects in an opportunistic fash- 
ion, and attitude scales which use the public 
range of comments about the social object in ` 
question reflect this specificity. Thus an item 
about morality or intelligence is a much more 
useful item in a test of attitudes toward Negroes 
than in a test of attitudes toward Jews, and vice 
versa for an item dealing with sharp business. 
practices. There is, however, one type of prob- 
lem for which generalized scales are essential ; 
this is the sociological problem of the relative 
reputational standing of social objects. In the 
Remmers series, this use is illustrated in the 
studies of attitudes toward school subjects, vo- 
cations, teachers, household activities, and eth- 
nic groups. However, for this problem, one or 
a few questions, rather than 40, are probably 
sufficient, as shown in the work of Bogardus * 
and Thurstone For this purpose also, topical 
purity or specificity is extremely important— 
much more important than in the problem of 
measuring individual differences among the 
holders of the attitudes. For example, while the 
Negro has the most unfavorable position among 
U.S. minority groups when social distance items _ 
are used, Jews are most apt to be blamed for 
national difficulties and accused of misusing their 
power.? Such considerations limit the value of 
whole scale comparisons of ethnic group stand- 
ings, as reported by Grice (24), and more re- 
cently by Prothro (46). Item by item сотрагі-* 
sons would undoubtedly show exceptions to the 
overall picture. ‹ 


Kennetu E. Crank, Associate Professor of 
Psychology, University of Minnesota, Minne- 
apolis, Minnesota. І { { 

The generalized attitude scale is a special type 
of Thurstone scale, designed to reduce the 
amount of labor ordinarily required to develop 
the latter type scale, and thus to make more 


* 


. readily available for general use instruments for 


> 8 Thurstone, L. “Ап Experimental, Study of National 
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o Campbell, Donald T. The Generality of a Social Attitude. 
Doctor's! thesis, University of California (Berkeley, Calif.), 
1947. 
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the measurement of attitudes. The assumption 
is made, and demonstrated to be reasonable, 
that a single set of items may be scaled by 
Thurstone methods and then used in the meas- 
urement of attitudes towards an entire class of 
objects. Thus, rather than develop a separate 
scale for each of hundreds of occupations, Rem- 
mers and his associates have developed a gen- 
eralized attitude scale that may be used with any 
specified occupation. 

The reduction in labor is the major advantage 
of the Remmers approach over the original 
Thurstone approach. In addition, however, Rem- 
mers includes a larger number of items, thus in- 
creasing reliability. He also ranges items in 
order of their scale values, which simplifies scor- 
ing at an unknown cost in reduced validity. 
Scores obtained through his methods do not 
always correlate highly with scores obtained on 
similar Thurstone scales (about .50 in one in- 
stance, for example). One disadvantage of his 
method—that all items must be interpreted in 
relation to a title added at the top of the sheet — 
does seem to reduce the value of the method. 
Response sheets, in addition, provide spaces for 
not one but five separate titles or subjects to be 
rated. The writer's experience is that only the 
most sophisticated students can make more than 
one or two sets of ratings on the same sheet 
without experiencing difficulty and confusion. 
Another difficulty arises when attitudes are 
measured with a generalized scale on two ob- 
jects which differ in characteristics as much as, 
for example Communism and Marriage. Some 
items in Kelly's Scale for Measuring Attitudes 
Toward Any Institution (“Gives too little serv- 
ice,” “Does not consider individual differences,” 
"Is cordially hated by the majority for its smug- 
ness and snobbishness") obviously have entirely 
different meanings and implications in these 
two contexts, and hence either modify the scale 
value assigned the items or introduce irrelevant 
variance in the total score. 

The need for generalized scales is not nearly 
as apparent today as it was in the middle 1930's. 
More reliable measurements may be obtained 
with less labor using Likert methods. A smaller 
number of more nearly homogeneous items may 
yield results more useful for many purposes 
through application of some of the contributions 
of Guttman. Certain of the generalized attitude 
scales (as those for vocations and school sub- 
jects) still retain their unique usefulness and 
will probably continue to be used. In general, 


however, more recent developments which stress 
homogeneity of.content and ease of Scoring, 
show greater promise for most users of attitude 
measures. 


For reviews by W. D. Commins and Theo- 
dore Newcomb, see 40:1202; for a review by 
Stephen M. Corey, see 38:897; for related re- 
views, see 40:B1050, 36:B215, and 36:Be16. 
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*The QGuilford-Martin Inventory of Factors 
GAMIN, Abridged Edition. Grades 9-16 and adults; 
1943-48 ; supersedes original edition ; 5 scores: general 
activity, ascendance-submission, masculinity-femininity, 
inferiority feelings, nervousness; IBM; 1 form, 43; 
manual ['48]; $2.50 per 25, postage extra; 60€ per 
specimen set, postpaid; separate answer sheets may be 
used; 26 per IBM answer sheet ; $3.50 per set of sten- 
cils for machine scoring of answer sheets ; $2.50 per set 
of stencils for hand scoring of answer sheets; 2@ per 
profile chart (also for tests 48 and 59) ; postage extra; 
nontimed (30) minutes; J. P. Guilford and H. G. 
Martin; Sheridan Supply Co. * 
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Husert E. Brocpen, Personnel Research Sec- 
tion, The Adjutant General's Office, Depart- 
ment of the Army, Washington, D.C. 

In commenting on Guilford’s An Inventory 
of Factors STDCR (see 59), this reviewer sup- 
ported, in general, the development and use of 
a factor score inventory, but expressed doubts 
as to its usefulness for selection until more va- 
lidity information became available. The need 
for control of over and under self-evaluation was 
also mentioned. These comments apply equally 
to the Guilford-Martin Inventory of Factors 
GAMIN, although somewhat more validity in- 
formation is made available for this inventory. 

The validity information provided relates to 
supervisory proficiency. High validities are re- 
ported for M and N respectively for.a small 
group (N = 26) of branch managers and execu- 
tives. However, these two scores gave negligible 
validity in predicting success of supervisors 
(N’s of 289 and 120). A usefully high validity 
(eta) is reported for M when curvilinearity is 
taken into account. While these and other find- 
ings are interesting, they cover a rather narrow 
area and cannot yet be regarded as an adequate 
basis for the general usefulness of the inventory. 
As the authors themselves point out, users of 
the tests should undertake validity studies in 
order to establish which, if any, of the factor 
scores are useful for their particular selection 
purpose. ? 

Тһе intercorrelations of the several factor 
Scores are, in some cases, higher than would 
seem desirable. However, it appears evident 
enough from examination of the reliabilities that 
each of the scores is sufficiently unique to allow 
independent contribution in a prediction prob- 
lem. Since many of the factors basic to the 
Scores are orthogonal to each other, the correla- 
tions between the scores is very likely due to 
the summated impurities of the individual scored 
items. Reduction of such intercorrelation is a 
difficult problem. 

In spite of the several difficulties mentioned, 
the GAMIN inventory is likely to prove a use- 
ful tool. It is more likely to be useful to research 
workers willing to undertake validation for their 
own purposes than to individuals seeking an 
inventory for immediate application. 


For reviews by R. A. Brotemarkle and H. jJ. 
Eysenck, see 3:43 and 3:45. 
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The  Guilford-Martin Personnel Inventory. 
Grades 9-16 and adults; 1943; 3 scores: objectivity, 
agreeableness, cooperativeness; IBM; 1 form; $2.50 
per 25, postage extra; 50¢ per specimen set, postpaid; 
separaté answer sheets may be used; 2¢ per IBM an- 
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scoring of answer sheets; 2¢ per profile chart (also for 
tests 47 and 59); postage extra; nontimed (30) min- 
cis & Ji Р. Guilford апі Н. С. Martin; Sheridan Sup- 
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eral’s Office, Department of the Army, Wash- 
ington, D.C. 

The stated purpose of the inventory is to 
assist industrial, business, and similar establish- 
ments in obtaining evaluations of present and 
potential employees along a “paranoid” con- 
tinuum. If fully effective, the test could be used 
to prevent the hiring of troublemakers and to 
spot those already employed in advance of overt 
difficulties, 

The inventory consists of 150 statements in 
question form. To these the person investigated 
is instructed to answer by encircling one of three 
possible answers printed at the end of the ques- 
tion (an IBM sheet is enclosed and can be used 
instead). A sample is: “Do you get upset rather 
easily? Yes? No.” The answers are indicative 
of certain attitudes toward and evaluation of 
jobs, society, human nature, and discipline to- 
gether with some self-appraisal. The questions 
are not of such a nature as to cause a normal 
individual to be embarrassed by giving a truth- 
ful answer. The first six items are not scored 
but serve as orientation. 

The concept of a “paranoid” space is derived 
from factor analysis and clinical studies. The 
space is defined by four factors, but in the pres- 
ent study, two of these were not sufficiently dif- 
ferentiated to be considered separately. The 
three factors used are named after the socially 
desirable poles of the continua as Objectivity, 
Agreeableness, and Cooperation. Each of the 
three factors is identified by a separate score. 

Scoring is done with the aid of an overlay key 
of transparent paper on which the selected an- 
swers are indicated together with their scoring 
weights. A table on the “Direction and Norms” 
sheet enables the investigator to convert these 
to a “С” scale based on percentage of the ex- 
perimentally investigated population achieving 
various raw scores. It enables the investigator 
to compare scores on the three factors with one 
another and with those of similar inventories. 

Two hundred items were constructed together 
with a preliminary key based on “the best sta- 
tistical and clinical evidence at hand." The in- 
ventory was then administered to 400 subjects 
chosen on a priori basis to represent a diversely 
employed population. The test of internal con- 
sistency was applied to each item, and 150 of 
them were found to be sufficiently differentiat- 
ing between the highest and lowest fourths of 
scores to be retained. 

The item construction, sampling of subject 


case, and development of scoring keys are all 
vaguely outlined, and in a negative way prevent 
any criticism; the users of the tests are forced 
to take the word of the inventory constructors, 

On the other hand, the development of re. 
liability is well detailed and regardless of 
whether any makeshift methods were involved 
in the construction, shows that the reliabilities 
of the finished products are very acceptable: 83 
for O (Objectivity), 80 for Ag (Agreeable- 
ness), and .91 for Co (Cooperativeness), found 
by scoring 100 cases similar to those used for 
item selection, correlating scores on one random 
half against the other, and estimating reliabili- 
ties of the total score by the Spearman-Brown 
technique. The traits are not completely inde- 
pendent; the correlation between O and Ag is 
ба, between О and Co is .55, and between Ag 
and C is .63. 

In assaying the validity of the inventory the 


‘story is not so favorable. The manual reports 


an attempt to differentiate between members of 
an industrial workers group of 51 cases, a num- 
ber of whom had previously been rated as 
troublemakers and discontents. The criterion 
chosen for prediction of undesirables was arbi- 
trarily chosen as below the median score on 
two of the traits. The results were as follows: 


Predicted as Predicted as Total 
Unsatisfactory Satisfactory 


Rated as Unsatisfactory 16 6 22 
Rated as Satisfactory ELS EO E 
Total 26 25 st 


(Equivalent to a tetrachoric r of .55 for 51 cases) 


How good this discrimination is cannot be 
known without knowing more about the com- 
position of the group. If the group is composed 
of the highest and lowest quarters of a popula- 
tion classed along a troublemaker continuum it 
means one thing, if it is an unselected group 1t 
would mean something else. The authors do not 
state how the experimental group was selected. 


In evaluating the inventory, the general im- - 


pression is that the items were carefully worked 
out and put together, and that the authors were 
satisfied when they found that the scores were 
very reliable. When 400 cases were used if 
selection of items and adjustment of the key; 
and тоо cases used for the establishment 0 


coefficients of reliability, it would appear that à _ 


greater effort could have been exerted at vali- 
dating the inventory. ү 
The trouble would be опе of criterion. The 


inventory will undoubtedly pick, better than - 


chance, those who are grossly discontented with 
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their work; among the potential employees, it 
will pick those who are likely to become discon- 
tented. It will not cull out those who, though 
they have a strong paranoid streak, are intelli- 
gent enough to know that their natural impulse 
to answer a question may not coincide with the 
socially accepted one. On the other hand, they 
may also be intelligent enough to stay out of 
serious trouble; thus from a practical point of 
view they should not be eliminated. (If they 
were, who would later develop into executives, 
superintendents, labor leaders and the like?) 

Assuming that a large proportion of all mal- 
contents could be spotted and eliminated, would 
it be desirable from a management point of 
view? The only way to find out is by correlating 
the scores on the inventory with an overall еї- 
fectiveness on the job. ` 

The inventory is probably as good for its 
stated purpose as one could be made with the 
present limited knowledge of personnel charac- 
teristics. It does not claim too much for itself, 
and when used in conjunction with other meas- 
ures, it will tend to round out the estimation of 
the personality Gestalt of an employee, present 
or potential. As the inventory becomes more 
generally used, it can be hoped that a more 
extensive validation evaluation will be published. 


For reviews by Benjamin Shimberg and R. A. 
Brotemarkle, see 3:44—5. 


[49] 

*The Guilford-Zimmerman Temperament Sur- 
vey. Grades 9-16 and adults; 1949; a revision and 
condensation of Inventory of Factors STDCR (’34- 
40), Guilford-Martin Inventory of Factors GAMIN 
(43), and Guilford-Martin | Personnel Inventory 
(43); 10 scores: general activity, restraint, ascend- 
ance, sociability, emotional stability, objectivity, friend- 
liness, thoughtfulness, personal relations, masculinity ; 

M; т form; norms ('49); separate answer sheets 
must be used; $3.75 per 25; 75€ per 25 IBM answer 
sheets; $2 per set of machine scoring stencils; $2 per 
set of hand scoring stencils; 256 per manual; 50¢ per 
25 profile charts for ne pot per 25 profile charts for 
women; postage extra ; per specimen set (does not 
include stencils), postpaid; nontimed (50) minutes; 
J. P. Guilford and Wayne S. Zimmerman; Sheridan 
Supply Co. * 
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DAD Shans e Ree УЫ КЫ АМЕ oh 
WiLLiAM STEPHENSON, Visiting Professor of 
Psychology, The University of Chicago, Chi- 
cago, Illinois. 

This Survey puts into one schedule the 10 
major “traits,” GRASEOFTPM, that Guilford 
and others have variously identified by factor 
analysis, and which have hitherto been included 
in separate inventories: Nebraska Personality 
Inventory (SEM), Guilford-Martin Inventory 
of Factors GAMIN, Guilford-Martin Personnel 
Inventory I (OAgCo), and Inventory of Fac- 
tors STDCR. It consists of 300 items, 30 for 
each of the “traits,” each responded to with a 
yes, ?, or no. The responses are weighted only 
о or т, and the layout lends itself to the standard 
IBM answer sheet. Only two stencils are needed 
for hand scoring. 

The "traits" themselves are (G) general ac- 
tivity, (R) restraint, (A) ascendence, (S) socia- 
bility, (E) emotional stability, (O) objectivity, 
(F) friendliness, (T) thoughtfulness, (P) per- 
sonal relations, and (M) masculinity respec- 
tively. The reliability with which each of the 
traits is assessed is shown to be of the order .80; 
and their intercorrelations are, as the authors say, 
"gratifyingly low," the implication being that 
all are approximately orthogonal in factor terms, 
that is, that “unique traits" are involved. Some 
consideration is given to possible biases which 
detract from the dependability of the schedule 
—everyone, for example, is apt to give himself 
favorable scores, although not on all the “traits.” 
Finally, profile charts are provided for men and 
women for each of the то “traits.” 

Аз one would expect from Professor Guil- 
ford’s laboratory, the Survey, its data and sup- 
porting norms are all adequate, thorough, and 
factually orientated. For the purposes for which 
these inventories are used it is probably bet- 
ter than most. We could raise, however, two 
methodological issues about it. Given 10 un- 
correlated “traits,” if each is used to indicate 
only two grades (above average for the trait, 
and below average), 21° = 1,024 possible classes 
of temperament can be indicated. This should 
either provide scope enough for almost anyone 
who believes in the relative uniqueness of tem- 
peramental qualities, or be the despair of those 
who find it difficult to believe that temperament 
can be as complex as all this! Moreover, given 
the premise of 10 uncorrelated traits, there must 
on the whole be an equal number of persons for 
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each of the 1024 possibilities: is that not so? 
Thus, amongst 2048 college students there would 
have to be two of each class. These are interesting 
geometrical speculations, deductions from the 
Survey’s nature. No doubt something is wrong, 
perhaps, to think of temperament as so nicely 
and geometrically allotted amongst college stu- 
dents, but so it must be if this Survey is as 
sound scientifically as it appears to be. 

One wonders, however, whether someone 
could have a theory about temperament, some- 
how, and put this to test, if only to free us from 
a feeling that mankind is not ordered like squares 
on a chessboard with respect to its tempera- 
mental qualities, Temperament traits, after all, 
are in the happy position of being such as per- 
haps correspond to our behavior as observed by 
others—when we are sad, we feel it, and also 
others can see it in our demeanors. But when 
we save a life and are called brave by observers, 
we might feel in fact quite frightened and hor- 
rified and certainly not brave-feeling. There are 
those who wish to look at personality (and every- 
thing else) from the “internal frame of refer- 
ence,” and others who look at it only from the 
“external frame”: temperament traits probably 
look the same both ways, or imply the same. 
Might not these, then, in relation to others of 
conduct as externally observed, provide interest- 
ing food for experimental speculation? 


Меп, VAN STEENBERG, Research Psychologist, 
Personnel Research Section, The Adjutant 
General's. Office, Department of the Army, 
Washington, D.C. 

The stated purpose of this survey is to com- 
bine the findings of the Guilford series of per- 
sonality inyentories into a single battery and 
thus obtain scores on то personality traits from 
the administration of a single booklet. 

This 8-page booklet contains instructions for 
marking and 300 items. The items are in the 
form of statements which the examinee marks 
as true, false, or indifferent; e.g., “You are a 
carefree individual. Yes ? No.” (The fact that 
the sequence of numbers on the IBM sheet runs 
from left to right, instead of up and down as 
has become usual, is immediately noted or com- 
mented upon by test-technicians—it will prob- 
ably never bother the average examinee because 
he has not acquired that particular reaction pat- 
tern.) 

Overlay keys are provided for handscoring. 
Only one answer for each statement is keyed, 


and all keyed items are given equal weight, By 
spacing the statements so that every fifth one 
on the answer sheet gives a score on one particy. 
lar trait, all the elements of the total score of each 
trait are in one column of the IBM sheet. This 
clever arrangement greatly facilitates hand scor- 
ing. There are то scores, one for each trait 
assayed, to be obtained with a maximum of 3o 
on each. A high score is in the direction of what 
is in general the socially desirable pole. 

Previous inventories developed by the authors 
provided the raw materials from which the items 
were selected or reworked. By scoring each item 
on only one key, greater facility in scoring is 
achieved, though by limiting the possible score 
to 30 on any one trait, this facility has been 
achieved at some sacrifice of reliability of in- 
dividual scores. 

Norms for the survey were obtained for 9 of 
IO traits on a college population of 523 men 
and 389 women. Norms for the remaining trait 
“T” were derived from a population of high 
school seniors and their parents. The authors 
state that various estimates of reliabilities were 
made, and a table is given showing the con- 
sensus. The coefficients vary between .75 and 
.85. The mean scores and the SD are given sepa- 
rately for men and women, but except on the trait 
of masculinity there are no substantial (certainly 
no significant) differences between the sexes. 

The intercorrelations between the traits are 
generally small, one is as high as .61, some 
others are of the magnitude about .40, but most 
of them are small enough that there does not 
seem to be any question as to the existence of 
ten separate dimensions. 

The validity of the scores is principally based 
on the factor analytic studies in which the traits 
were isolated. It is further indicated that a prac 
tical validation study has been carried out, but 
the details are not given. 

The raw scores on the то traits are to be con- 
verted into “С” scores, which in turn are base 
on the percentage of the scores attained by the 
experimentally investigated population. 

Interpretation of the 10 scores per individual 
is of course the real difficulty in a broad inven- 
tory of this kind. To assist the administrator 0 
the survey, a very detailed and lucid descriptio 
of the traits and clinical syndromes is givet 
in the manual. But since evaluation is not only 1 
terms of the то factors but also (rightly) i 
their interactions, the task demands a consider 
able amount of clinical sophistication on the part 
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of the investigator if it is to be done ade- 
quately. The. optimal scores for individuals 
who show supervisory promise are given in the 
manual. 

There are some instructions given in the 
manual to which exceptions must be taken. It 
is stated that more than three question marks 
on an answer sheet is a probable indication either 
that the examinee does not know himself very 
well or that he has little confidence in whatever 
ideas he has about himself. The remedy recom- 
mended is to call the examinee into a conference 
and after orientation suggest that he change a 
substantial number of question marks to either 
“Yes” or “No.” A number of mature psycholo- 
gists in the office of this reviewer "took" the in- 
ventory and in most cases came up with more 
than three question marks on most traits. If a 
forced choice was desired by the authors, the 
question marks should not have been included 
among the choices. 

The authors also indicate that there is in gen- 
eral a total bias toward the favorable pole. This 
is not surprising, and the personnel technician 
heeding the general warning given is able to 
adjust his evaluation by relating it to configura- 
tion of the profile rather than to individual 
scores. In this connection, the authors state that 
on the basis of probability theory the expecta- 
tion of the score of a particular individual should 
not deviate widely from the mean in the same 
direction on many trait scores, since the traits 
are independent or almost so. In other words, 
if eight or nine of the ten traits are above (or 
below) the median, the whole set of ratings 
should be suspect. This line of reasoning would 
be valid if the traits were sampled independ- 
ently but, of course, they are not. It is well 
known that favorable traits are positively cor- 
related in individuals (a positive manifold), and 
it would be a source of surprise if an individual 
who was rated above normal in eight traits was 
not also found to be superior in the remaining 
two. f 

The survey gives a very favorable impression 
of a well rounded, carefully worked out method 
of evaluating an important portion of the total 
personality. It is easy to administer and to score, 
and if interpretation of the obtained measures 1s 
difficult, it is a function of the complexity of 
personalities rather than a function of the sur- 
vey. It will be particularly useful in counseling 
at the upper high school and college levels as 
well as in organizations employing a large num- 
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ber of young people and having in their employ 
psychologists with enough experience and com- 
petence to interpret the results. 


J Consult Psychol 14:162 Ap "50. Laurance 
F. Shaffer. * As the outstanding omnibus in- 
strument based primarily on factor analyses, the 
Survey will have usefulness for screening, rapid 
evaluation and research. 


[50] 
*Heston Personal Adjustment Inventory. Grades 
9-16 and adults; 1949; 6 scores: analytical thinking, 
sociability, emotional stability, confidence, personal re- 
lations, home satisfaction; IBM; 1 form; $2.40 per 
25; 30€ per manual; postage extra; 35¢ per specimen 
set, postpaid; separate answer sheets may be used; 
$1.10 per 25 IBM answer sheets; 40€ per set of sten- 
cils for scoring answer sheets; postage extra; nontimed 
i rto. minutes; Joseph C. Heston; World Book 
о. 
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т. Heston, Јоѕерн C. “A Comparison of Four Masculinity- 
Femininity Scales,” Ed & Psychol Meas 8:375-87 au '48. * 

2. M CAEDIS Joun U., AND Tvrrm, Fren T. “Diagnostic 
and Predictive Value of the Heston Inventory Used in Student 
Teaching.” J Teacher Ed 1:40-3 Mr 'so. * 

ALBERT Erus, Consulting Psychologist, 56 
Park Ave., New York 16, New York. 

Compared to other personality inventories, 
this inventory is a better than average instru- 
ment which possesses most of the advantages 
which its author and publisher claim for it. It 
independently measures—or at least provides a 
score for—traits of personal adjustment; “it af- 
fords adequate statistical reliability of measure- 
ment” ; it has been validated by several methods ; 
it is usable at both the college and high school 
level ; it provides fairly stable and representative 
norms; and “it is simple to administer and con- 
venient to score." Having duly noted these ad- 
vantages of the test, it is now in order to examine 
its main, and somewhat less well publicized, 
shortcomings : 

a) It is claimed that "clinical experience with 
many individual cases during the last three years 
has demonstrated the value of the inventory ap- 
proach," but no confirmatory evidence is given 
in the extensive test manual. Apparently, more- 
over, no attempt is being made to check, experi- 
mentally, this “clinical experience.” The writer 
suspects that it is the same kind of “clinical ex- 
perience" which has been offered, for many 
years, in support of orthodox Freudianism, di- 
anetics, the Szondi Test, and many other as yet 
experimentally unvalidated hypotheses. This 
type of "clinical experience" normally consists 
of making a tremendous fuss about one case 
which supports the hypothesis in question—and 
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conveniently forgetting about the other nine 
cases which do not. " 

b) The six "personality traits" scored on the 
Heston inventory differ widely in clinical usa- 
bility. One of them, analytical thinking, is hardly 
a personality trait at all, but belongs more with 
descriptions of mental processes. Three of the 
other traits—sociability, personal relations, and 
* home satisfaction—are largely the effects of per- 
sonality traits, rather than the traits themselves. 
Moreover, they are fairly easily determined by 
almost any type of inventory or, much better, 
by a briet psychological interview. The only 
truly important personality traits that the in- 
ventory appears to measure are confidence and 
emotional stability which turn out to have an 
intercorrelation coefficient of .73. 

c) When outside criteria were employed to 
test the validity of the scales, low coefficients of 
correlation were generally found between the 
inventory scores and the ratings of acquaint- 
ances. Thus, 7's were found of only .24 for emo- 
tional stability and .36 for confidence. 

d) The author unfortunately uses many of 
the vaguely worded, confusing questions from 
some of the oldest personality inventories, in- 
stead of constructing unequivocally worded new 
ones. Questions like, “Do you daydream often ?,” 
which have time and again been found to be of 
the meaningless armchair variety, have been 
retained. And many questions to which all truly 
honest and conscientious respondents would 
necessarily have to answer yes have often been 
included. For example: “Do you ever wish you 
were more attractive?" “Do you ever feel too 
self-conscious ?" “Do you sometimes become an- 
gry?” “Are your nerves ever raw or on edge?" 
"Are there some people you dislike?" A »o 
answer to most of these questions is, according 
to the scoring sheet, distinctly rewarded, which, 
apparently, means that only those who lie to 
themselves or to others are considered to be 
well adjusted in our society. 

е) The author writes that “it is the writer's 
firm persuasion that such tests [as the Heston] 
do have valuable practical utility and that most 
forward-looking counselors would rather use a 
tool with some admitted difficulties than rely 
solely on subjective judgment." If this is a cor- 
rect belief, then counseling is certainly in a sad 
state today. While it may be true that the ma- 
jority of contemporary counselors still favor 
old-fashioned inventory techniques instead of 
getting down to modern clinical methods of per- 
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sonality evaluation, it is a little hard to believe 
that most forward-looking counselors may also 
be included in this diehard group. Personality 
inventories may of course have real value a; 
screening instruments, to be used prior to coun. 
seling or psychotherapeutic processes; but it js 
discouraging to believe that well trained, for. 
ward-looking clinicians would for a moment 
think of spending valuable time giving and scor- 
ing inventories for use with disturbed individuals 
who have come for some form of counseling or 
treatment. 

In sum: the Heston Personal Adjustment In- 
ventory, while as good as most of the similar 
tests of this type, seems to have only a screening | 
usefulness; and it is depressing to see counsel- 
ors being encouraged to use any instrument of 
this sort in dealing with individual counselees 
who have gone beyond a screening process. All 
contemporary personality inventories, includ- 
ing this one, have dubious individual diagnostic 
value. Any counselor who, in a 15- or 20-minute 
interview with a counselee, cannot obtain more 
concrete and valid information than can be ob- 
tained with these instruments had better re- 
turn to tests and measurements or other more 
academic applications of psychology. 


Hans J. Eysenck, Director, Psychology De- 
partment, Institute of Psychiatry, Maudsley 
Hospital, London, England. 

This inventory claims to test six basic com- 
ponents of an individual’s adjustment: ana- 
lytical thinking, sociability, emotional stability, 
confidence, personal relations, and home satis- 
faction. Norms are provided for high school and 
college groups but not for adults. The inventory 
comprises 270 questions in all. Scores are con- 
verted into percentiles and may be represented 
on a profile chart. Split half reliabilities of the 
individual scales are reasonably satisfactory; 
ranging from .80 to .91. Validity is claimed on 
three grounds: (a) Internal Consistency. This 
is based on item analysis, not as might have 
been preferable, on factor analysis. (5) Psycho- 
logical Meaningfulness. This is difficult to aP- 
praise in view of the differences between 
individual judges as to what precisely is ps)” 
chologically meaningful. (c) Independent С 
teria. Correlations are presented here showing 
moderately sized correlations between inventory | 
traits and ratings by counselors and other psy 
chologically trained persons. Differences wet 
found between college men and reformatory 1" 
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mates, but not between high school boys and 
boys in a juvenile correction institution. 

Intercorrelations are given between the scales 
which are high enough—.73, .61, .58, .44, .39, 
‚38, .38, 32) :3155120,503; 01) 7:08, len tis vane 
—.18—to make it quite clear that the six traits 
measured have no separate entity. A factorial 
analysis is necessary before these six traits can 
be regarded as independently useful. There is 
little in this scale to set it apart from the many 
others on the market, and from the point of view 
of intercorrelation of scores, it is clearly in- 
ferior to the other scales. 

It is difficult to know the extent to which this 
type of questionnaire can be used because there 
is no evidence presented that the alleged areas 
measured have any meaningful existence or, if 
they do, that they can be measured by question- 
naires of this type. The path of research in per- 
sonality measurement is much more thorny and 
difficult than is suggested by the authors of in- 
ventories and questionnaires; and, until they 
face more realistically the problems of taxonomy 
and validity, their products will be of doubtful 
scientific value. 


E. Lowett Кешу, Professor of Psychology, 
and Director, Bureau of Psychological Services, 
University of Michigan, Ann Arbor, Michigan. 

This is a self-report inventory of 270 yes-no 
items designed to evaluate six aspects of an in- 
dividual’s personal adjustment. Like other self- 
report inventories, it is administered without a 
time limit: most persons complete the form in 
40 to 50 minutes. Scoring is facilitated by the 
use of a separate answer sheet. Percentile norms 
are provided for college men (884), college 
women (597), high school senior men (216) 
and high school senior women (251). The re- 
verse side of the answer sheet contains a profile 
chart for use in preparing a profile of the indi- 
vidual scores. As a means of emphasizing the 
greater significance of percentile differences as 
the score deviates from the mean at the extremes 
of the scales, the percentile scale used on the 
profile chart has been made to correspond to 
z-score units. 

Test items were selected from a pool of 450 
items on the basis of internal consistency : each 
item used has a minimum discrimination index 
of 30; the mean index for the items on each scale 
is 45 or higher. In addition, the author used the 
criterion of face validity: regardless of its dis- 
crimination index, an item was not included in 


a scale unless the term was “psychologically 
meaningful" for the variable. Unit scoring 
weights are used and no item contributes to 
more than one of the six scores. Corrected split 
half reliability coefficients for the six scores are: 
-86, .91, .86, .84, .80, and .87. The intercorrela- 
tions between the six scores are low with the 
exception of those between emotional stability, 
confidence, and personal relations (.73, .61, and 
.58). All show correlations of .50 to .75 with the 
MMPI scales for depression, psychasthenia, 
and schizophrenia. The author regards the dif- 
ferences between the reliabilities and intercor- 
relations of these scales great enough to justify 
separate scoring keys and separate reporting of 
scores. However, these three scores are plotted 
sequentially on the profile chart. Scores on ana- 
lytical thinking (thinking introversion) corre- 
late .31 with ACE scores and .36 with first- 
semester college grades. 

Evidence of the validity of the several scores 
includes: reasonably good agreement between 
the scores and the judgments of counselors based 
on more extended contact; correlations of .26 
to .59 between scores and trait ratings of ac- 
quaintances ; correlations of .31 to .63 with self- 
ratings ; and the fact that for five of the six traits, 
the correlation of the scores with both of these 
criteria is higher than the intercriterion corre- 
lation. 

The inventory impresses this reviewer as 
being a good instrument for the purposes for 
which it was designed: to assist the busy coun- 
selor in identifying students in need of atten- 
tion and to provide a preliminary assessment of 
the student. The author is to be commended for 
the generally superior and unusually complete 
manual. It reflects not only a considerable 


.amount of research but thoughtful consideration 


of the problems underlying the design and use 
of any personality inventory. In this connection, 
it is noted that the author states that the relia- 
bility coefficients “аге all large enough for very 
satisfactory use in group situations," yet the 
inventory is designed and recommended for 
use in the evaluation of individuals. The manual 
warns the user of the test to be cautious in the 
interpretation of the scores of individuals, but 
this word of caution may not be sufficient. 

The author is quite insistent that whatever 
correlations exist between the several scales 
represent true relationships between the vari- 
ables, since each is "measured entirely inde- 
pendently, by a separate set of items.” Whether 
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or not one accepts this contention is of course 
a matter of the meaning attached to the phrase, 
"measured entirely independently." To be sure, 
the scores are based on different items, but some 
of the items appearing in the emotional stability 
scale would certainly have high enough discrimi- 
nation indices to have been included in one of 
the other scales with which emotional stability 
is highly correlated. Just because intelligence 
may be measured by two tests with no common 
items would not lead to the conclusion that the 
two tests are measuring different variables. The 
author apparently chose to build a test with six 
scales. Three of these—analytical thinking, so- 
ciability, and home satisfaction—are relatively 
independent of each other and of whatever is 
being measured by the other three scales which 
have high intercorrelations. 

In addition to detailed instructions for the use 
of the inventory, the manual contains all essen- 
tial technical data, makes but modest claims for 
the instrument, and offers wise counsel with 
respect to its appropriate use in student per- 
sonnel programs. 


J Consult Psychol 13:224 Je '49. Laurance 
F. Shaffer. * Although there are many objec- 
tions to the questionnaire approach to the study 
of personality, Heston's contribution is not “just 
another" inventory, but a workmanlike job with 
a number of areas of potential usefulness, in- 
cluding the selection of students for more indi- 
vidual guidance. 


[51] 
Hunt-Minnesota Test for Organic Brain Dam- 
age. Chronological ages 16-70 and mental ages 8 and 
over; 1943; 16 tests grouped in 3 divisions : the vocabu- 
lary test of Revised Stanford-Binet Scale, interpolated 
tests, learning and recall; individual; 1 form; no data 
on reliability and validity in manual (for data presented 
elsewhere by the author, see 7) ; tentative norms; $5 
per set of testing materials and 25 record blanks ; $1.25 
per 25 record blanks; postage extra; (30) minutes; 
abbreviated test (15) minutes; Howard F. Hunt; Uni- 
versity of Minnesota Press. * 
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Seymour С. Kresanorr, Chief Psychologist, 
Franklin Delano Roosevelt Veterans Adminis- 
tration Hospital, Montrose, New York. 

This test was designed to detect and measure 
intellectual impairment as seen in patients with 
organic brain damage. Hunt utilized the general 
clinical observation that recently acquired as- 
sociations are impaired in the presence of or- 
ganic brain dysfunction while old habits show 
relatively little deficit. The test thus consists 
of the 1937 Stanford-Binet vocabulary, which 
should reveal little impairment ; a group of timed 
verbal and nonverbal tests involving learning 
and recall, which should be sensitive to organic 
brain damage ; and a group of interpolated tasks 
which serve the purposes of providing uniform 
intervals between tests of recall and also of 
detecting significant personality disturbances. 

The test is statistically standardized so that 
T scores may be obtained. These constitute pre- 
dictions of the maximal level of learning as based 
upon age and vocabulary scores. Hunt studied 
the significance of difference in scores between 
a standardizing group of brain damaged cases 
and a matched control group of patients without 
brain damage. This permitted the determination 
of a T score cutoff beyond which the diagnosis 
of organic brain damage may be made. 

The literature reveals definite weaknesses as- 
sociated with the test which indicate the need 
for further investigation. A primary difficulty 
involves the relatively limited number of pa- 
tients employed in the initial standardizing 
group. Further standardization utilizing a large 
number of patients with organic brain damage 
seems indicated. In addition, there i$ a need 
for comparison between normal and psychiatric 
groups. This would serve to clarify the problem 
of “false-positive” selection which is a frequent 
criticism of this technique as pointed out by 
Malamud (8), Meehl and Jeffery (9), and 
Juckem and Wold (16). The last two have 
demonstrated that in a normal population of 
superior vocabulary level, 60 per cent of the 
subjects obtain T scores indicative of organic 
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brain damage. Extreme caution must therefore 
be exercised in the use of this test with superior 
individuals. Further, the basic difficulty of dif- 
ferentiating organic deficit from types of im- 
pairment due to chronic psychogenic illness per- 
sists in the strictly quantitative use of this 
technique. 

Strict adherence to the quantitative aspects 
of the test findings can produce spurious inter- 
pretation. However, the test remains a valuable 
tool in the evaluation of intellectual efficiency 
despite weaknesses in the determination of the 
etiology of the intellectual impairment. The test 
would appear to be of greatest value when uti- 
lized by an experienced clinician with emphasis 
upon the qualitative aspects of the test results. 


For reviews by Margaret Ives ата О. L. 
Zangwill, see 3:49. 


[52] 
*[Illinois Opinion Inventories.] Grades 6-8, 9-12, 
adults; 1948; a survey form for use by school adminis- 
trators in sampling parent and student opinion on 
schools; 4 inventories (only 2 published as separates) ; 
1 form; no manual ; no data on reliability and validity ; 
directions and the complete inventories are presented 
in What People Think about Their Schools (see 1 be- 
low) ; Harold C. Hand, Gilbert C. Finlay, and Ardwin 
J. Dolio; World Book Co. * 
а) ILLINOIS INVENTORY OF PARENT OPINION. Adults; 
$1.80 per 25, postage extra; 20€ per specimen set, post- 
paid; (50) minutes. 
b) ILLINOIS INVENTORY OF PUPIL OPINION; SECONDARY 
SCHOOL FORM. Grades 9-12; $1.40 per 25, postage extra ; 
20€ per specimen set, postpaid; (30-50) minutes. 
C) ILLINOIS INVENTORY OF PUPIL OPINION: UPPER ELE- 
MENTARY SCHOOL FORM. Grades 6-8; may be produced 
Ру Kadiv school systems with permission from pub- 
asner, 
d) ILLINOIS INVENTORY OF TEACHER OPINION. Teach- 
ers; may be reproduced by individual school systems 
with permission from publisher. 
REFERENCES 

1. Hanp, Hamorp Curtis. What People Think About Their 
Schools: Values and Methods of Public-Opinion Polling as 
Applied to School Systems. Yonkers, N.Y.: World Book Co., 
1948. Pp. iv, 219. * 


KennetH E. Crank, Associate Professor of 
Psychology, University of Minnesota, Minne- 
apolis, Minnesota. 

These inventories are designed to provide: 
(a) school superintendents and boards of edu- 
cation with information about parents' attitudes 
towards their children's schools; (b)*teachers 
and administrators with information about stu- 
dents’ attitudes and opinions toward their 
schools; and (c) administrators with informa- 
tion about teachers’ attitudes towards their 
schools and jobs. These are types of opinions 
"which school-board members, superintendents, 
principals, and teachers must secure if they are 


to know instead of guess (1) how well they are 
succeeding in their public-relations or personnel 
practices, (2) at what specific points they are 
succeeding or failing, and (3) what their con- 
sequent top priorities actually are." 

Each of the four separate inventories is re- 
ported to have been developed and carefully pre- 
tested, used in city wide polls, and modified on 
the basis of experience in its use. The final forms 
presented for use give evidence of this work. 
The generally well phrased questions cover very 
well the important issues and look as though 
they would give useful results. Some questions 
do show a need for additional work, however, 
being either too long and involved, or failing to 
distinguish between the differences in opinions 
of informed and uninformed respondents. Sam- 
ple findings for one school system are presented. 
Neither normative data nor evidence of the 
sensitivity of the measures for reflecting changes 
in school practices over a period of time are pre- 
sented, although these would seem to be rather 
important points. The lack of evidences of relia- 
bility and validity is less serious since the mean- 
ing of these concepts for an inventory of this 
sort is obscure. 

Two of the four forms of the inventories (the 
parent form and the secondary school form) are 
printed and available from the publisher. The 
other forms must be reproduced locally by the 
institution wishing to use them. When used for 
nonprofit purposes and with the publisher’s per- 
mission, the two printed inventories may also 
be reproduced locally without charge, Thus the 
vast amount of work ordinarily required in 
preparing surveys of opinions about schools is 
greatly reduced by the easy availability of these 
inventories. The school administrator or teacher 
who is interested in surveying the attitudes of 
his associates or students will do well to con- 
sult Hand’s monograph, What People Think 
About Their Schools. 


For a related review, see 53. 


[53] 
[Re Illinois Opinion Inventories.] HAND, Har- 
or» C. What People Think About Their Schools: 
Values and Methods of Public-Opinion Polling 
as Applied to School Systems. Yonkers, NY.: 
World Book Co., 1948. Pp. iv, 219. $2.52. * 


El Sch J 49:477-8 Ap ’49. John Withall. Are 
you a school administrator? Are you interested 
in getting fairly accurate information about 
parents’, teachers’, and pupils’ attitudes toward 
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the school and its administrations? Since a 
“уез” to the first question makes an affirmative 
reply to the second almost inevitable, Harold 
C. Hand's book, W/hat People Think about 
Their Schools, may be of value to you. The 
publication presents techniques for procuring 
more reliable and valid data than are customarily 
available to school administrators with respect 
to the satisfactions and dissatisfactions of the 
three groups mentioned above. Primarily a man- 
ual, the book describes the rationale behind 
the Opinion Inventories presented by the author 
and enlarges in detail on the methodology of 
administering the inventories and of processing 
and interpreting the resulting data. One of the 
major themes of this slim volume (the book 
comprises 149 pages of text proper) is the de- 
lineation of the dangers confronting the adminis- 
trator who depends on guesswork and hearsay 
for assessing the likes and dislikes of any of the 
groups of persons with whom he works. In 
Hand's opinion, guesswork and the pitfalls it 
brings to administrative practice can be elimi- 
nated by careful polling of the school's patrons 
and teachers. This polling can be done through 
the use of the Illinois Opinion Inventories * In 
the main, the inventories are composed of 
strength-of-feeling check items and a few 
write-in answers. * The questions in the inven- 
tories are unusually clear and straightforward. 
There seems to be only one item which evidences 
some ambiguity. This item revolves around the 
parents' and pupils' interpretations of the phrase 
*social problems." The question is aimed at as- 
sessing their opinions of the effectiveness of the 
school's efforts in helping pupils with problems 
of personal adjustment. Will the phrase “Social 
problems" connote adjustive problems to them ? 
Furthermore, judging from the amount of atten- 
tion and space devoted to the two problems, 
assessing the parents' and pupils' opinions about 
the value of the school's guidance and counseling 
program appears to be of less moment to the 
authors than obtaining reactions to the problem 
raised by the exclusion of some pupils from 
extra-curriculum activities because of the pro- 
hibitive cost. The authors of the inventories 
seem to be on secure ground in all instances in 
which they ask for an individual's attitudes or 
pe on the existing school situation. In Part 
[T of the parents’ inventory, however, they leave 
their more secure pastures by attempting to 
plumb parents' attitudes about matters such as 
federal aid for education; extending the formal 
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period of education upward ; society's responsi- 
bility for insuring basic needs—food, shelter 
clothing; health services—which will render edi. 
cation effective; the operational reality of ейи. 
cational opportunity for all; and so on. The 
language throughout the book is refreshingly 
clear and simple. In some respects it may seem 
to the reader that the author sets out in too 
much detail suggestions regarding sampling 
procedures, types of data sheets, steps and pro- 
cedures in processing the data, and even the 
types of tables to be used. If one realizes, how- 
ever, that the book is primarily a manual to 
accompany the Illinois Inventories of Opinion, 
the meticulous descriptions of ways of collect- 
ing and interpreting data seem justifiable. 


[54] 
*Industrial Subjective Fatigue and Euphoria 
Scales: Present Feelings. Ages 14 and over; 1944; 
Forms A, B; manual ('44—see r below); $2 per 25; 
$1 per Soren set; postpaid; (1) minute; Willard A. 
Kerr; sychometric Affiliates, Box 1625, Chicago 00, 


Ш. 
REFERENCES 


1, Mippieton, W. C.; Fav, P. J.; Kerr, W. A.; AND Ами, 
F. “The Effect of Music on Feelings of Restfulness-Tiredness 
and Pleasantness-Unpleasantness." J Psychol 17:299-318. '44. * 


(PA 18:3451) 
[55] 


xIndustrial Subjective Fatigue Scale: Retro- 
spective Work Curve Feelings: National Re- 
search open on Employee Feelings at Work. 
Ages 14 and over; 1950-51; Form A [рп mimeo- 
graphed; manual ('50—5ее г below); $2 per 25; $i 
per specimen set; postpaid; (2) minutes ; Willard Kerr 
and Ivan a Psychometric Affiliates, Box 1625, 
Chicago оо, Ill. * 
REFERENCES 

1. GarrrirH, Jous W.; Kerr, WirLARD А.; Mayo, THOMAS 
B., Jr.; AND Торлы Joun К. “Changes in Subjective D 
and Readiness for Work During the Fight-Hour Shift.” J pp 
Psychol 34:163-6 Je 'so. * (PA 25:4014). 


[56] 
xInteraction Process Analysis. Groups of from 2 
to 20 people (ages 4 and over) ; 1948-50; a method of 
analyzing group character and processes; $3 per 250 
interaction scoring forms ['50]; $6 per manual, Inter- 
action Process Analysis: A Method for the Study of 
Small Groups ('50—see 2 below) ; postage extra; * y 
Interaction Recorder, an optional instrument W 1С 
facilitates the recording of observations in sequence is 
described in т below; Robert F. Bales; Addison 
Wesley Press Inc. * 
REFERENCES 

т. Bares, Ковевт F., AND Gersranps, Henry, “The ‘Inte 
action Recorder’: An Apparatus and Check List for Sequen! ^ 
TE Asalyels of Social еН Human Relations 1' 
45 no 4 '48. È 

2. Вааай Rosen F. Interaction Process Analysis: A Method 
%, the Study of Small Groups. Cambridge, Мазз.: Addiso 

eny Press, Inc., 1950. Рр. xi, 203. * (РА 24: 53) Turo 

3. BALES, Rosert F; Ѕтвортвеск, FreD L.; MILLS, ctf 
port M.; Амр RosgsoroucH, Mary E. "Channels of Comm" 
cation in Small Groups." Am Sociol R 16:461-8 Ag '51. 


Launor F. CARTER, Director of Research, Army 
Field Forces Human Research Unit No. 2, Fort 
Ord, California. 7 
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Bales’ form is printed on an 8 by 11% inch 
sheet of paper and is used for recording the be- 
havior of individuals in a group situation. On 
the left-hand side are 12 categories for classify- 
ing individuals’ behavior and the remaining area 
is ruled into boxes for tallying the frequency of 
occurrence of each behavior. There are 12 rows 
to correspond to the categories and 20 columns 
which can be used for successive time intervals. 
Extensive instructions for the use of the form, 
detailed definitions of the categories, and sug- 
gestions for training observers are contained in 
the book, Interaction Process Analysis. 

One of the serious problems of interaction re- 
cording is the nature of the categories to be 
used. Bales’ categories have been evolved both 
from theoretical considerations and as a result 
of considerable empirical refinement. The 12 
categories are, in order, (1) shows solidarity, 
(2) shows tension release, (3) agrees, (4) gives 
suggestion, (5) gives opinion, (6) gives orienta- 
tion, (7) asks for orientation, (8) asks for 
opinion, (9) asks for suggestion, (10) disa- 
grees, (11) shows tension, and (12) shows an- 
tagonism. In Bales’ theoretical formulation cate- 
gories 6 and 7 have to do with problems of 
communication, 5 and 8 with problems of evalu- 
ation, 4 and 9 with problems of control, 3 and 
10 with problems of decision, 2 and 11 with ten- 
sion reduction, and т and 12 with reintegration. 
Thus different groups can be compared with 
respect to the relative proportion of time devoted 
to these various activities. An extensive the- 
oretical discussion of the meaning and implica- 
tion of the various categories is contained in 
Bales’ book. 

While it is contended that this system of 
categorizing is widely applicable, it is this re- 
viewer’s experience that it is best adapted to 
discussion groups and is not entirely appropriate 
for groups engaged in work on tasks requiring 
manipulation of material. 

The question of the reliability and validity of 
interaction scoring procedures usually refers to 
the total process of the recording technique and 
the performance of the observers. One chapter 
of Bales’ book is devoted to the problem of ap- 
praising observer reliability. In general, an at- 
tempt to demonstrate reliability is made by com- 
paring the frequency of observed behavior for 
a given time interval recorded as falling into 
each category when observations of the same 
group are made by independent observers. In 
this reviewer’s opinion Bales’ demonstrations 


of reliability are unsatisfactory, since they are 
largely based on an inappropriate use of chi- 
square. However, using different statistical tech- 
niques, others have shown that such observa- 
tions can be adequately reliable. In a sense the 
validity of the technique is self-evident since it 
allows the recording of immediately perceived 
behavior; whether the behavior categories re- 
corded are appropriate for a particular study de- 
pends on the problem being investigated. 

The Interaction Scoring Form and Interac- 
tion Process Analysis represent pioneer efforts 
in a complicated but rapidly growing field. The 
scoring form will prove valuable as a convenient 
device for scoring interaction in discussion 
groups and for the introductory training of ob- 
servers. For some group behavior studies more 
complex techniques will be needed such as are 
involved in the use of the Interaction Recorder, 
the Interaction. Chronograph, or the modified 
stenograph. 


For related reviews, see 57. 


[57] 
*[Re Interaction Process Analysis.] Bares, Ros- 
ert F. Interaction Process Analysis: A Method 
for the Study of Small Groups. Cambridge, Mass. : 
Addison-Wesley Press, Inc., 1950. Pp. xi, 203. $6.00. * 
(PA 24: 4553) 

Am Sociol R 15:693-4 О ' 5o. Ernest Green- 
wood. * [The book’s] chief contribution lies in 
demonstrating the utility of a system of twelve 
basic, general-purpose categories whereby small 
group interaction, regardless of setting, might 
be observed, recorded and analyzed * represents 
a faltering but encouraging step toward the 
development of an experimental social psychol- 
ogy worthy of that name. Studies of....[this 
type] promise eventually to crack open some of 
the heretofore imposing obstacles blocking the 
development of experimentation in our disci- 
pline. Two such problems, to which the author. 
himself all too briefly alludes, are relevant here, 
viz. how to translate crudely derived a priori 
generalizations into operational hypotheses, and 
how to standardize observations over time and 
space. We already know so much about why we 
behave as human beings and still we know so 
little about how to control ourselves. We shall 
persist in our inability to shape the macrocosm 
until we have learned to manipulate the micro- 
cosm. Therefore principles of group behavior 
that we already have discovered must be made 
concrete, anchored to observable and manipu- 
latable referents. The twelve general-purpose 
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categories developed by the author represent an 
attempt to impart handles to group interaction 
so that we might get hold of and see it in settings 
sufficiently circumscribed to be manageable. The 
other of our difficulties has been our inability to 
render our observations additive, to impart to 
them the cumulative property which is the sine 
qua non of scientific progress. Thus we have had 
no way of guaranteeing comparability among a 
series of observations of the same observer of 
the same group or among the observations of 
many scattered observers of similar groups. The 
author's suggested minimal set of interaction 
categories, with his scheme for their observation 
and recording, is a step toward a long sought for 
standardization. The author is apparently well 
aware of the crudity of his attempts and invites 
critical appraisal of them. Whatever may be the 
shortcomings of his efforts, they deserve encour- 
agement. It is from such beginnings that we shall 
transform social science from a catalogue of 
mere truisms into a kit of workable tools. 

J Abn & Social Psychol 45:786-8 О ' 5o. Al- 
vin Zander. * a stimulating progress report * 
The format is good and the writing style is in- 
teresting and clear. There is no doubt that this 
volume is creating a great deal of interest. It is 
a stimulating and provocative aid for some types 
of research on groups which many are sure to 
find helpful. One wonders if its appearance will 
encourage descriptive research similar to that 
which blossomed not many years ago with the 
first appearance of psychological tests. If so, to 
what end? This reviewer knows of several group 
practitioners who have tried to use the observa- 
tion schedule with their groups and then won- 
dered what to do with their data. Similarly, 
what will be done with an accumulation of group- 
descriptions made in terms of standard-catego- 
ries? Apparently the author is ambivalent 
concerning the range of applicability of this ob- 

'servation scheme. At one point he states that 
his aspiration is to develop a standard set of 
general-purpose categories which should be able 
to encompass every act by anyone in a group. 
In another place he asserts that the categories 
might best be used as a supplement to other 
research methods or observation categories. 
Only experience will show whether or not this 
short set of observation categories, built in terms 
of a very definite theoretical system, will be a 
useful supplement for studies employing quite 
a different theoretical framework. This reviewer 
is skeptical about its ready integration into many 
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studies, especially experimental research, Skepti- 
cal, because the standard nature of the cate- 
gories will almost certainly be changed under 
the influence of different experimental situa- 
tions. The significance and interpretations of 
“asking for opinions” or “showing tension re- 
lease,” for example, might be quite different 
when varying leadership styles as compared to, 
say varying group homogeneity of membership, 
The author states that he hopes this approach 
will make it possible for him to develop more 
adequate theories about full-scale social systems, 
His theoretical framework which proposes that 
each act of a group member is part of an inter- 
action system gives promise that small group 
behavior may be found to be isomorphic to the 
behavior of larger social systems. We hope that 
later revisions, which are promised from time to 
time, will report progress on this problem. At 
present the book is mainly concerned with the 
behavior of members. There is relatively little 
on group phenomena, and almost none on the 
influence of the institutions of which the group 
is a part, or the society into which the institu- 
tion fits. Certainly much of the behavior of the 
group or the members can best be understood 
by some measurement of these influences ex- 
ternal to the group. We will wait with interest 
for reports on improvements in this method- 
ology—and with even greater impatience for the 
findings of research studies using these pro- 
cedures. In the meantime, students of person- 
person interaction whether they be clinical psy- 
chologists, social psychologists, sociologists, or 
anthropologists, will find this a most worthwhile 
and informative discussion of ап intriguing 
methodology. 

J Appl Psychol 35:220-1 Je "51. Wesley Os- 
terberg. * The task which the author took upon 
himself in writing this book was probably по 
less complex and difficult than the problem— 
objectifying and quantifying behavior—itself. | 
He is to be commended for his attempt, but not, 
however, for his presentation. Excursions dow! 


quent. As one example, the relationship the at- 
thor attempts to establish between his concepts 
and those of psychoanalysis (p. 46) is extremely 
tenuous and apparently irrelevant. In addition, 
much of the material is difficult to read, and un- 
necessarily so. Sentences with over sixty words 8 
are commonplace; some have more than 016 - 
hundred. The reliability design, which involvé | 
a sequence of six phases and a number of tests 
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is illustrated by a chart which itself presents an 
interesting challenge to the reader in the way 
of interpretation. Even when the design and 
results are understood, conclusions regarding 
reliability cannot be drawn because of the way 
in which Chi Square is used. For example, 
when the expected frequency contributes one- 
half of the variance of the observed frequency, 
as in the example cited of two observers, a limit 
is imposed on the Chi Square value obtained. 
Furthermore, there is no indication that the 
Scores recorded are of the same behavior. Al- 
though Observer A and Observer B both re- 
corded five scores under category 10 during a 
twelve minute recording (Table 2), there is 
no assurance that they were observing the same 
behavior when they did so. Interaction process 
analysis is applicable to a wide variety of groups 
and inter-personal situations where an objective 
method of behavior analysis is badly needed. 
Because of the presentation, however, the use 
of the method here will probably be limited— 
even among other researchers in the field, to 
whom the book is addressed. This is regrettable 
because the author has provided at least a 
foundation for further development, and possibly 
considerable superstructure. 

J Consult Psychol 14:235 Je ’50. Laurance 
F. Shaffer. ^ detailed description of the methods 
used by the Laboratory of Social Relations at 
Harvard University for the recording and analy- 
sis of the interactions in small groups. Itself a 
product of interdisciplinary contacts between 
sociology and psychology, the method of analy- 
Sis is one of the most promising tools for the 
advancement of the study of many interpersonal 
areas: psychotherapy, education, social action, 
and politics. No recent development has shown 
more clearly the rewards that follow the quan- 
tification of observations that previously have 
been made only qualitatively and crudely. 

Pub Opinion О 14:588-90 fall ’50. Daniel 
Lerner. There is a style of talk which makes ex- 
tensive use of dichotomies and dilemmas. Its 
staple ingredient is the forced choice. Some- 
times enlightening, as when paired comparisons 
are used to elicit hypothetical preferences, the 
technique is more often obscurantist in actual 
situations. There may be no earthly reason, in 
"real life,” to choose one of a given pair—e.g., 
if both can be had or neither is wanted. Few 
dilemmas are inescapable and many dichotomies 
are false. Against this sterile mode of discussion 
in social science, Dr. Bales is a powerful ally. 


His book shows up the vacuity of the Theorist- 
Empiricist dichotomy which too often allows 
scientific controversy to degenerate into mere 
controversialism. His contribution is the more 
valuable because it quickly passes beyond pious 
exegesis of why it is virtuous to integrate theory 
and research, and gives us a usable demonstra- 
tion of how to do it. It will be suggestive to the 
student of opinion and content analyst, as well 
as to the empirical sociologist. The core of Bales' 
method for studying interaction as a process in 
small groups is “а way of classifying direct, face- 
to-face interaction as it takes place, act by act, 
and a series of ways of summarizing and analyz- 
ing the resulting data so that they yield useful 
information” (pp. 5-6). Methodology here is 
emphasized in the sense of conceptualized tech- 
nique, the empirical focus within a theoretical 
context. Technique thus becomes an instrument 
of knowledge, a better way of getting answers 
to meaningful questions. Although his method 
makes use of elaborate new gadgetry—one-way 
mirrors, concealed microphones, Interaction Re- 
corders—Bales is not ridden by his machines. 
He says: “By memorizing the categories and 
the numbers he assigns to subjects, [a trained] 
observer might even dispense with prepared 
forms and require only a pencil and paper” 
(p. 5). The most striking feature of the book 
is its carefully articulated conception of a method 
for stabilizing observation and objectifying in- 
terpretation in complex observational situations. 
It deals first with the inevitable problem of elimi- 
nating “observer bias” from the observational 
situation. Three standard ways of doing this are: 
(1) to observe without letting the observed 
know ; (2) to handle the observed so that being 
observed makes no difference in their behavior ; 
(3) to find out the difference and account for 
it in evaluating the data. The author apparently 
uses all three, although we are left in the dark 
between the second and third method when he 
tells us only that “it appears that the knowledge 
of observation is not a particularly disrupting 
factor” (p. 1). He eliminates observer bias from 
the gathering and processing data by content 
analysis—i.e., by specifying the verbal and non- 
verbal items of behavior which are to be coded 
and the categories under which they are to be 
arranged. This represents a jpromising appli- 
cation of content analytic technique to obser- 
vational problems of small-group interaction. 
(Curiously, he does not mention the wartime 
work of Lasswell, the pioneer in content analysis, 
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and his associates on the Library of Congress 
project. The development of the equations in 
Chapters V and VI surely would be aided by the 
preliminary work of the swell group on a 
“coefficient of imbalance,” reported by Janis 
and Fadner in Language of Politics, and on “re- 
liability” reported by Kaplan, Goldsen and oth- 
ers in the same volume.) To preside over the 
wedding, where a technique heretofore mainly 
applied to verbal behavior in mass media is 
adapted to the observation of problem-solving 
behavior in small groups, Bales introduces the 
concept of sequential analysis (the temporal 
succession of events, not the statistical method 
recently developed by Wald). This conceptual 
innovation justifies Dr. Bales’ claim that he is 
interested in analyzing the process of interac- 
tion, rather than merely in arranging selected 
characteristics by frequency of occurrence or 
other such criteria. The concept is worked into 
a research tool by means of a new and interest- 
ing apparatus—the Interaction Recorder. * The 
...two chapters, on training observers and ap- 
praising their reliability, will be useful to those 
interested in trying out the Bales method or 
in applying what he has learned to other content 
analytic methods involving variant readings of 
the same data. * The final chapter, on problems 
of analysis and interpretation, is а sustained 
effort to present a synthesis of the method : *how 
to give operational content to the referents of 
our theoretical variables, and how to give the- 
oretical meaning to the empirical uniformities 
in our data" (p. 116). Asa final statement, this 
does not quite come off. There are several excel- 
lent points: e.g., his exposition of the “flip-flop” 
problem (pp. 117ff)—the temptation, when re- 
sults appear to demonstrate the reverse of the 
original hypothesis, simply to reverse the hy- 
pothesis to conform with the data—and his con- 
cise statement of the logical role of inference (p. 
120). These should be read by all who work with 
statistically controlled data. The technical pres- 
entation is straightforward and competent (al- 
though the equation for Generalized Status In- 
dex on page 168 is likely to prove so misleading, 
if taken at all seriously, that it should be aban- 
doned and no statistical treatment of this index 
attempted until some reliable weighting can be 
worked out of accumulated empirical data). 
What weakens the presentation is the final sec- 
tion (pp. 172-6). As axiomata media, the hy- 
potheses here formulated are not inspiring. In 
part, they suffer from the malady that afflicts all 
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such general-purpose categories as Bales here | 
proposes: they appear too obvious—not in the 
vulgar sense that everybody "knows" these hy- 
potheses to be true, but in the policy sense that 
they do not seem in urgent need of investigation, 
Also, at.this final point, the author gives us 
technical propositions where we want (and have 
a right to expect) behavioral hypotheses, The 
listed assertions, all beginning with the phrase 
""There is.a fluctuation about an unstable balance 
between tendencies toward..." (pp. 173-5), are 
not genuine hypotheses and they are not about 
behavior. The outcome of such propositions, 
even when indexed and tested, will be to show 
that it is technically feasible to define "unstable 
balance" so that "fluctuations," when measured, 
will turn out to be within a given area of the 
graph. At the very end, Bales does propose sev- 
eral interesting hypotheses by specifying certain 
regularities that occur in the "fluctuations about 
an unstable balance": (1) as the group grows 
older, (2) as its size increases, (3) as its mem- 
bership changes (mixing old and new members), 
(4) as its membership becomes more heteroge- 
neous, (5) as it confronts increasingly difficult — 
problems of adapting to the external situation. 
But these are the last five sentences, and one is 
left to wait for clarification and data in the next 
book. In fairness to the author, it must be said 
that he has asked us to read this book as “a 
working manual, not a finished product.” The 
above comments are intended to suggest possible 
directions, or redirections, of effort that may be 
useful in developing the finished product. Mean- 
while, the book serves as a progress report on 
efforts to date. Viewed this way, it gives us à 
fine demonstration of how to wed theory and 
research in concrete situations. We look forward 
to the next report, when sufficient data on 
the interactive sequences of “Adaptive-Instru- 
mental" and "Integrative-Expressive" behavior _ 
in small groups will have been accumulated. j 

Psychiatric Q 24:620 ЛІ ^50. * The book. - 
is well organized and is written in a way to cap- 
ture and excite the esoteric reader. The subject 
matter is worthy of consideration by social sc 
ence and psychiatry. À 

Social Forces 29:320-30 Mr '51. Howard Ё 
Jensen. Ever since Le Play a continuing tradi- 
tion in sociology has stressed the intensive study 
of small groups as an indispensable inductive 
basis for generalizations about the structure and 
process of society at large. But such studies have 
been based upon no consensus as to what tlt | 
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unit phenomena of. social interaction аге, and 
they have lacked a practicably workable system 
of simple and clearly defined categories їп terms 
of which the units could be observed and effec- 
tive techniques by which observations ‘could be 
quickly and accurately recorded. The results 
have consequently been lacking in precision and 


comparability, and have contributed: much less. 


than their earlier promise to a cumulative body 
of dependable social science. Dr. Bales’ volume 
represents a notable attempt to supply this lack. 
He and his colleagues....have devised. a method 
for breaking up social interaction into unit acts 
which can be recorded separately, have formu- 


lated a small number of categories into which . 


they can be classified, and have constructed an 
instrument by means of which each unit act can 
be recorded as it occurs, and properly attributed 
to both its originator and the.person or persons 
to whom it is directed. The aspects of social 
interaction with which the method can deal are 
admittedly limited. Only its formal structural 
and functional features can be taken into account. 
Any reference to the nature, significance and 
concrete content of the behavior, or any inter- 
pretation based on the temporal sequence of its 
constituent units, is ruled out. The concentration 
of the observer upon recognizing, categorizing, 
attributing and recording the unit acts as they 
occur is so intense that he often misses the 
larger implications of the activity as а whole 
and is at the end "not able to give a very co- 
herent account of what went on." The impor- 
tance of these aspects of activity, which the au- 
thor calls “the ideosyncratic (sic) or ‘topical’ 
content," is, however, not ignored, but is re- 
corded by sound or typescript for supplementary 
analysis. In the last chapter the author writes 
convincingly of the wide'range of possible appli- 
cations of the method. “Many of the most 
important generalizations of social science," he 
thinks, "have already been discovered." The 
most important future advances will probably 
come, not from "brand new, high level generali- 
zations,” but from discovering the obscure im- 
plications of the generalizations already known, 
from identifying more specifically the concrete 
referents of their terms, from delimiting the 
range of their application and of the psycho- 
logical, cultural and structural conditions under 
which they are valid. As the foregoing quotation 
implies, the theoretical orientation of the re- 
search is not new. It builds solidly upon the 
work of such pioneers as Dewey, Mead, Cooley, 


"Thomas, and others, supplemented by the better 
established results of more recent work in social 
psychology and psychiatry. It is clear gain that 
sociology, after wandering in the wilderness of 
barren factfinding, should return to the germinal 
insights of its founders, which alone can give 
significance to facts and direction to their ac- 
cumulation. But it is regrettable that the sections 
dealing with these general notions have not been 
done with the lucidity which characterizes the 
descriptive and interpretative sections. These 
sections read as if they were the work of an- 
other hand. That both the culture and structure 
of the group arise out of the interaction of its 
members; that once in being they provide the 
patterns which condition further interaction; 
that individuals in social interaction do not re- 
spond to one another's acts as momentary and 
isolated units; that they do not consider them- 
selves as homogeneous group members but are 
conscious of themselves as differentiated per- 
sonalities and react to one another as such ; that 
their acts are related to both the past and the 
future and either contribute to a behavioral se- 
quence leading to satisfactions or disrupt such 
а sequence; that the collective behavior of a 
group is impaired if its members do not share 
a sufficient body of common thoughts, feelings 
and intentions—all this and much more has often 


` been briefly, simply and clearly stated. It gains 


nothing in scientific precision and rational in- 
telligibility by being restated in an esoteric lan- 
guage that goes far to justify the time-worn jibe 
that "sociology is what everybody knows told 
in language that nobody understands." Mead's 
treatment of role-taking as the method whereby 
the self acquires the group patterns, for example, 
is not helped by restatement as a process in 
which the "elements are regarded as present 
‘inner’ surrogates of the structured influence of 
factors which also are, or were, ‘outer.’” In 
these days of little books at big prices, the reader 
would be better served were these sections re- 
written in a third their present compass, were 
numerous repetitions eliminated (page 9 reap- 
pearing as page 59 down to the last comma), 
were a list of charts and tables added to the 
Table of Contents, and were the analyses and 
interpretations in Chapter 5 cross-referenced to 
the pages in Chapter 1 which contain the raw 
data discussed. 

Sociol & Social Res 34:392 My-Je’50. Bruce 
M. Pringle. * Instructions for training observ- 
ers and for determining and improving relia- 
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bility are....not so clear or so detailed as might 
be desired by those who wish to use the method 
* A well-developed rationale for the twelve cate- 
gories is given, together with a number of hy- 
potheses which can be tested by the method. * 
The book is noteworthy as an example of the 
rapprochement between theory and research 
that has been so much talked about recently. 
Southw Social Sci Q 31:57-8 Je ' 50. Paul B. 
Foreman. * a major contribution to the small 
fund of concise research reports—the studies of 
Dorothy Swaine Thomas, Kurt Lewin, Ronald 
Lippitt and others—which have sought close- 
checked empirical knowledge about immediate 
group relations * What marks this progress re- 
port as essential for any student of inter-personal 
relations is that it invites further study—study 
which with little cost can be conducted any- 
where by anyone with some ingenuity and mas- 
tery of the basic statistical techniques necessary 
for this sort of research design. Following the 
leads expressed in this monograph or focused 
by it would seem to be much more acute use of 
time than devoting a similar amount of energy 
to more attitude tests and more sociograms. If 
ps suggestion is the essential why behind pub- 
ication of this progress report, it is probably true 
that interested researchers will—as they should 
anyway—have to orient work largely in terms 
of their own theory. It is more than pleasing to 
note Bales' acknowledgment to Jameson of Ore- 
gon for arousing interest in measuring social 
interaction. Jameson for at least a score of years 
has been maintaining that sociologists should 
measure when they have something to measure 
and that meanwhile the quest for the what is 
equal in importance to knowing how. 
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"Interest Index: General Education Series. 
Grades 7-13; 1938-50; same as Interest Index: Test 
$.2a (30) which was a revision of Test 8.2 (38) ; 
original tests published by Evaluation in the Eight- 
Year Study, Progressive Education Association; 30 
scores: "like" and "dislike" scores for English, foreign 
languages, mathematics, social studies, biology, physi- 
cal sciences, music, fine arts, industrial arts, business 
courses, home economics, sports, manipulative, reading, 
total; 1 form, ’50; interpretation leaflet Сз) ; no data 
on validity in manual (for data presented e sewhere, see 
5 below); manual ('50); separate answer sheets must 
be used; $225 per 25; $1 per 25 interpretative leaflets 
containing answer sheet and profile chart; cash orders 
postpaid; 506 per specimen set, postpaid; nontimed 
(40) minutes; Evaluation Staff (Ralph W. Tyler, 
Director) of the Eight-Year Study of the Progres- 
sive Education Association; published in 1950 by Co- 
operative Test Division, Educational Testing Serv- 
1ce. 
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An Inventory of Factors STDCR. Grades 9-16 
and adults; 1934-45; 5 scores: social introversion- 
extraversion, thinking introversion-extraversion, de- 
pression, cycloid tendencies, rhathymia; IBM; 1 form, 
'40; revised manual ['45] ; $2.50 per 25; 506 per speci- 
men set, postpaid; separate answer sheets may be used; 
2€ per IBM answer sheet ; $3.50 per set of stencils for 
machine scoring of answer sheets; $2.50 per set of sten- 
cils for hand scoring of answer sheets; 2 per profile 
chart (also for tests 47 and 48) ; postage extra; non- 
ens (30) minutes ; » P. Guilford; Sheridan Supply 
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Новевт E. Вкосрех, Personnel Research Sec- 
tion, The Adjutant General's Office, Depart- 
ment of the Army, Washington, D.C. 

The several inventories of Guilford and Mar- 
tin are among the first based on factorial tech- 
niques. They should, because of their basic ra- 
tionale, have certain advantages. The traits 
measured should be more basic than those con- 
ventionally evaluated through personality ques- 
tionnaires ; the scores should be relatively inde- 
pendent of each other; and by avoiding the 
overlap resulting from minor variations in or 
redefinitions of traits, they should provide the 
beginnings of a conceptually simpler system for 
a systematic understanding and treatment of 
validity data in the personality area. 

The question as to whether the advantages 
expected in theory will be borne out in practice 
can only be decided through extended empirical 
tests of the factorial as opposed to other ap- 
proaches. This reviewer is of the opinion that 
the factor score approach is well worth a trial. 

Questionnaire measurement, as a technique, 
has long been regarded as questionable. Until 
other practicable measures of personality traits 
yielding higher validity are developed, much of 
such criticism does not seem profitable and will 
not be attempted here. It is pertinent to note, 
however, that techniques have been developed 
which appear successful in minimizing the ef- 
fect of tendencies of the respondent to over- 
evaluate or underevaluate himself. While it 
seems inappropriate here to review the evidence 
on such techniques, both forced choice tech- 
niques and use of suppressors (as in the Min- 
nesota Multiphasic Personality Inventory) ap- 
pear promising. 

Little validity information against external 
criteria is available in the manual. For this 
reason, An Inventory of Factors STDCR can 
hardly be regarded as competing with the ques- 
tionnaires such as the MMPI if considered for 
clinical diagnosis. Both the validity information 
provided on the MMPI and the control of ex- 
aggeration by suppressor keys seem to give a 
clear advantage to the Minnesota Multiphasic 
Personality Inventory for use in clinical diag- 
nosis. 

One technical point on factorial basis of the 
scores is worthy of comment. The intercorrela- 
tions of the scores show high correspondence be- 
tween D and C scores. The use of both of these 
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scores is, on this account, questionable. Review 
of the original factor studies suggests to this 
reviewer that the two factors in question were 
not, in any event, established as independent 
primary traits. 

The general advantage of the Inventory of 
Factors STDCR seems to be based, then, on 
the future promise of the general factorial ap- 
proach. Regardless of the general promise of 
this approach, data are not yet available showing 
the practical utility of this inventory. 


For reviews by R. A. Brotemarkle and H. J. 
Eysenck, see 3:45 and. 3:55. 
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*Inventory of Personal-Social Relationships: 
General Education Series. High school and college; 
1941-50; a revision of the instrument developed in 1941 
and described in Student Personnel Services in Gen- 
eral Education by Paul J. Brouwer; 24 scores in each 
of 2 sections: activities and interests, concerns and 
difficulties; 1 form, ’50; no data on reliability and 
validity in manual (for data presented elsewhere, see 
2 below); no norms—publishers recommend the use 
of local norms; manual ('50); $2.50 per 25; 75¢ per 
25 individual interpretative sheets; cash orders post- 
paid; 25¢ per specimen set, postpaid; nontimed (50) 
minutes; developed by the Cooperative Study in Gen- 
eral Education Ralph W. Tyler, Director) ; Coopera- 
tive Test Division, Educational Testing Service. * 


REFERENCES 

т. Cooperative STUDY IN GENERAL EDUCATION, EXECUTIVE 
СоммтттЕк, ЁАрРн W. Tyrer, DimECTOR. Chap. 8, “Major 
Projects in Student Personnel," PP. 158-79. In Uo eration in 
General Education. Washington, .C.: American 
Education, 1947. Pp. xvii, 240. * (РА 22:432) 

2. Brouwer, PAUL J.; with the collaboration of John L, Berg- ` 
stresser. “Appraising the Personal-Social Needs of Students: 
First Project, The Self-Inventory of Personal-Social Relations,” 

р. 173-216. In Paul J. Brouwer's Student Personnel Services 
in General Education, Washington, D.C.: American Council on 
Education, 1949. Pp. xix, 317. * (PA 23:3440) 

N. L. Gac, Associate Professor of Education, 

University of Illinois, Urbana, Illinois. 

This inventory consists of two lists, the first 
of тоо activities, and {һе second of тоо diffi- 
culties which a student may experience. The 
student is asked to mark each activity in the 
first list according to whether he participates 
in it (A), does not participate in it but would 
like to (U), or does not participate and is in- 
different to it (D). The second list is to be 
marked in terms of whether the difficulty is of 
important concern to him, of mild concern to . 
him, or of no concern to him. The inventory, like 
the Mooney Problem Check List and the SRA 
Youth Inventory which it resembles, is essen- 
tially intended to facilitate the counseling process 
through setting up a standardized channel of 
communication from the client to the counselor. 
Its underlying theory seems to be sociological 
rather than psychological; the items thus fall 
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into such categories as “opposite sex,” “family,” 
“school or college faculty,” etc. 

The inventory is intended as an aid to coun- 
selors, who are to interpret the results to their 
students, and is not to be used by students with- 
out guidance. The scoring, which consists of 
counting the number of each of the three kinds 
of responses to each of the eight categories of 
items in both parts, has a logical, a priori, basis. 
Whether the scores thus defined would hold up 
under the application of techniques for estimat- 
ing internal consistency is a question that critical 
‘users of the inventory might want answered be- 
fore they paid much attention to these scores. 

The form of item used is likely to elicit re- 
sponse sets, so that the replies a student makes 
may reflect overall tendencies to check A, U, 
or D more than they reflect the psychological 
states with which the items are ostensibly con- 
cerned. Such response sets would result in a 
spurious kind of internal consistency ; they might 
also indeed be related to the psychological states 
at which the inventory is aimed, but empirical 
studies are needed to establish any such hy- 
pothesized relationships between number of A, 
U, or D responses and actual interests and con- 
cerns, 

The inventory is also recommended for use 
in describing groups of students or the social 
environment of activities, interests, concerns, 
and difficulties in which an individual student 
finds himself when associated on a given campus 
with a group of fellow students. Here the in- 
ventory would yield percentages of students in 
a given group responding in various ways to 
the individual items and categories of items. 
Using the inventory in this way involves the 
interpreter of group results in dangerous as- 
sumptions concerning the absolute significance 
of such percentages unless he has the results 
either from a sample of other comparable high 
schools or colleges or from a series of such sur- 
veys in a single school. In other words, the in- 
terpretation of group results needs to be en- 
hanced with norms in the same way that the 

interpretation of achievement or intelligence 
test responses needs norms. The manual, and 
Brouwer's supporting text (2), provide no cau- 
tioning influence on this matter ; users of group 
results from the inventory can easily find them- 
selves attaching psychological or social signif- 
icance to results which should more properly be 
considered reflections of the particular wording 
of the items. 

*. 
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The inventory was developed in full aware- 
ness of the nondirective or “permissive” ар. 
proach to counseling. How practicing counselors 
of the nondirective school view inventories of 
this kind is, however, a major question, It seems 
likely that such counselors, opposed to аз cing 
direct questions or formulating by themselves 
the significant questions that need to be raised, 
would view the inventory as a violation of the 
essence of their approach to counseling, Th 
mere administration of the inventory implie 
that the counselor, by inspection of the responses. 
and patterns of scores, will identify possibl 
needs of the student. The student may expect 
that, since a diagnosis has been made, a pre- 
scription will be forthcoming. This is exactly | 
the expectation that nondirective counselors 
want to combat from the very start. Conse- 
quently, it is difficult to see how the inventory | 
can be used in nondirective counseling. Brouwer | 
sees a place for both permissive and “prescrip- | 
tive" methods and recognizes that most faculty 
members typically take the prescriptive ap- | 
proach to counseling. If we are willing to grant 
the desirability in some cases of the prescriptive | 
approach, this inventory may be useful in estab- 
lishing rapport, revealing student's problems, | 
and providing a basis for counseling interviews. 

A. word should be said concerning the items 
themselves. Any list of activities, interests, con- 
cerns and difficulties must inevitably be a sample 
which reflects the biases of the authors. This list | 
was developed by the central staff of the Co- | 
operative Study in General Education and per- | 
haps reflects the biases of "student personnel | 
workers" in small liberal arts colleges. The 
flavor of the items, the general impression they 
create, is one in which the well adjusted student 
is “well rounded,” a “good Joe" of a wholesome, 
“decent,” congenial sort—in short, a middle 
class adolescent bent on self-improvement and 
unafflicted with divine discontent, passionate | 
intellectual thirsting or any similar imbalance 
There are colleges and high schools, and even 
more certainly individual high school or college 
students who might be disturbed by finding 
themselves so poorly embraced by the items 0 
this inventory. And yet it is not impossible 
the leaders of the significant intellectual an 
social movements of our time might be the Ve! - 
same students. In short, students who do not 
the middle class, well rounded, wholesome sten 
otype set up by the items of this inventory mig! 
be needlessly disturbed by it. 
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THEODORE A. SamnrN, Associate Professor of 
Psychology, University of California, Berkeley, 
California, 

This inventory, in common with others that 
have been constructed on a rationalistic basis, 
suffers from the absence of empirical validity. 
The inventory purportedly samples students’ 
“activities and interests” and “concerns and dif- 
ficulties.” The items were selected on the basis 
of two preliminary inventories administered to 
a group of students, The results were “exhaus- 
tively studied by an intercollegiate committee 
whose members went through the inventories 
item by item, eliminating some, adding others, 
revising many * The basic assumptions regard- 
ing human behavior that were implicit in the 
selection of items were questioned, analyzed, 
and clarified.” (2) Such a procedure seems 
inexcusable in the light of experience with hun- 
dreds of inventories constructed on a priori lines 
beginning with Woodworth’s Personal Data 
Sheet. What does any score on this inventory 
mean? Only that a student’s answers to ques- 
tions were combined according to the “basic as- 
sumptions about human behavior” of an inter- 
collegiate committee. 

Procedurally, the student answers the one 
hundred items in Part I (activities and interests) 
in one of three ways: participation in activity 
(coded “А”) ; nonparticipation in an activity 
“but would like to,” an unfulfilled interest 
(coded “U”); does not participate and is in- 
different to the activity in question (coded *D"). 
Thus a total score is derived in which the num- 
ber of A, U, and D responses are summed. The 
same procedure is followed for Part II (con- 
cerns and difficulties) except that each question 
is answered as an important concern or diffi- 
culty, a mild concern or difficulty, or no concern 
or difficulty. 

The 100 items in each part of the inventory 
are divided among eight categories of logically 
related items. For Part I these are opposite sex, 
family, faculty, belonging, social skills, intellec- 
tual and esthetic, social service, and leadership 
and initiative. For Part II the categories are 
opposite sex, family (home), faculty, belong- 
ing, social skills, physical well-being, likeness- 
to-others, and social experience. The mere list- 
ing of these categories raises questions for the 
reviewer as to the independence of the categories, 
their meaning in terms of actual behavior, and 
their relative weighting in a total adjustment 
index. 


The author of the test recommends the use of 
local norms and making cautious interpretations 
on the basis of significant deviation from the 
median score for any subcategory. This pro- 
cedure only means that a student has been 
singled out for attention because he has checked 
certain items more: or less frequently than the 
median student in his group. It does not neces- 
sarily mean that his deviation from the mean is 
a sign of maladjustment. Although the author 
warns against regarding deviation and malad- 
justment as synonyms, there is an implicit as- 
sumption here: the deviant on this rationally 
derived scale is somehow bad or maladjusted. 
The recommended approach to the interpreta- 
tion of the inventory is reminiscent of unstand- 
ardized projective tests. The counselor is advised 
to analyze responses to specific items and to 
pay attention to the individual patterns of 
responses. Then why bother with scores at 
all? 

The inventory will probably be used by guid- 
ance officers in high schools and smaller colleges 
where psychologically trained personnel are not 
available and where there is a basic philosophy 
of adjustment by conformity. 


[61] 
*The IPAT Humor Test of Personality. Grades 
0-16 and adults; 1949-52; title on test booklet is The 
IPAT Humor Test (A Humor Rating Quiz); for- 
merly called The C-L Humor Test (А Humor Rating 
Quiz); 10 scores: debonair sexual and general unin- 
hibitedness vs. anxious considerateness, good-natured 
play vs. dry wit, tough self-composure уз. reassurance 
in embarrassment, gruesomeness vs. flirtatious playful- 
ness, hostile derogation vs. urbane pleasantness, resig- 
nation vs. impudent defiance of decency, cold realism 
уз. theatricalism, ponderous humor vs. neat and light- 
hearted wit, whimsical retort vs. damaging retort, mis- 
treatment humor vs. cheerful ipe cte IBM; 
Forms A ('52), B (’52); [revised] manual ['s2]; 
$3.50 per 25 of Form А; $3 per 25 of Form В; $6 per 
25 sets of both forms; 50€ per manual; 856 per sample 
set of every form; separate answer sheets may be used; 
$1.50 per 50 IBM answer sheets for any one form; de- 
sign for machine scoring stencils included in manual ; 
so¢ per set of stencils for hand scoring of answer 
sheets; cash orders postpaid; (30-40) minutes; R. В. 
Cattell and L. B. Luborsky; Institute for Personality 
and Ability Testing. * 
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W. Grant Юаніѕтком, Visiting Assistant 
Professor of Psychology, State University of 
Iowa, Iowa City, Iowa. 

The authors of this test feel that repressed 
material is as important diagnostically as that 
produced by any other ego-defensive mecha- 
nism, such as projection or rationalization. The 
“action” in a joke which a subject finds humor- 
ous presumably mirrors the response trends of 
that individual. The authors do not specify how 
complete a picture of a particular personality 
is obtained in this sample ; conceivably, some im- 
portant characteristics which have not under- 
gone repression would not appear within re- 
sponses to the test. This instrument does not 
provide a measure of any trait relating to sense 
of humor except indirectly in Form B in terms 
of number of items rated as funny. 

The authors use two ways of sampling humor 
appreciation. The subject rates the humor in each 
joke on a 2-point scale in Form В. In Form A, 
the subject expresses preferences for items in a 
paired comparison design. Although the authors 
are aware that incidental factors may distort 
these measures, they do not seem to have given 
them proper consideration. Most importantly, 
they underevaluate the effect of novelty of the 
item. Within their own research they find im- 
portant changes in a subject’s ratings upon re- 
exposure to the same jokes; it would seem de- 
sirable to provide some check on the variation 
attributable to this source of distortion. 

Since the authors’ aim is to measure stable 
traits, this question of reliability is crucial. The 
information for evaluating the reliability of the 
instrument is somewhat contradictory. The man- 
ual mentions a test-retest study with a lapse 
of a month, but no precise summary of the 
findings is offered. The coefficients ranged from 
ТО to .60 for the various clusters ; presumably, 
the small number of items in each contributes 
to this low stability. An estimate of the equiva- 
lence cannot legitimately be made from a study 
of the two forms since the subject makes quite 
different types of responses to the two tests. The 
lack of agreement between the two forms has 
led the authors to recommend the use of both 
forms in evaluating a particular individual. The 
-authors seém to have avoided the error of weight- 
ing both directions of the response to one item— 
a procedure which could act to force artificially 
low scores in some components when a few 
high component scores are obtained. 


Evaluation of the validity of this instrument 
rests upon the adequacy of the closely related 
program of research in personality measurement 
carried out by the Institute for Personality and 
Ability Testing. The most important result of 
these analyses has been the Sixteen Personality 
Factor Questionnaire. The Humor Test has been 
validated against this scale, the Guilford-Martin 
Inventory of Factors GAMIN and the Inven- 
tory of Factors STDCR, and some ratings and 
psychiatric diagnoses. However, neither the arti- 
cles by Cattell and Luborsky (2, 3), to which the 
test purchaser is referred nor the manual itself 
provides sufficient information on which to judge 
the practical value of the present humor scales, 
For instance, Cluster ro (Silly Good Humor vs. 
Dry Comment) is said to have separated suc- 
cessfully the mean of a group of psychotics 
from the mean of a group of normals, but no 
particulars of this study are provided. In regard 
to validity in general, the authors consider pub- 
lication of the test as only one stage of a long 
research program to accumulate data relating 
to both its theoretical implications and its prac- 
tical applications. 

Similarly, only sketchy information is given 
about the norm groups on which the modified 
standard score tables were constructed. Pre- 
sumably, the authors advance this material as 
representative of a cross section of the general 
population who have adequate intelligence and 
reading ability to respond to these items. For 
instance, they report that a little less than 10 
per cent of their 1,044 cases could be regarded as 
neurotics, which proportion they consider close 
to the existing ratio. Actually, all of their sub- 
jects were university undergraduates. 

The basic mechanics of the test in its present 
form are deficient. The test is on green paper 
and lacks attractiveness, legibility, and conven- 
ience of response. The scoring (until an IBM 
form becomes available) involves destruction of 
the test booklet, which greatly hinders filing for 
research purposes. To apply the scoring stencil, 
the pages must be restapled, and even time- | 
consuming care will not insure a proper fit. The _ 
stencil itself is mimeographed and so arrang 
that it is confusing and likely to generate errors | 
їп accumulating the component scores. 

This test must be considered to be of research _ 
interest only. If the user is in a position to ad- 
minister it and learn about its possible contribu- | 
tions, it should be of definite interest. If the 
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purchaser needs an instrument to gather in- 
formation, it is not sufficiently well developed to 
be dependable. 


ArDIE Lusin, Statistical Psychologist; and 
Frank M. Loos, Research Psychologist; Insti- 
tute of Psychiatry, Maudsley Hospital, London, 
England. 

“Psychologists, from smoking-room amateurs 
to Freud, have long been aware that some of 
the more profound aspects of a man's personality 


may be revealed by observing the things at which ' 


he laughs" (2). Cattell and Luborsky, stimu- 
lated by the work of T. G. Andrews and the 
Freudian theory of humor,’ have attempted to 
devise a test of humor which will “reveal deeper 
aspects of personality.” (They believe that only 
“relatively superficial personality data” can be 
obtained from other paper and pencil personality 
tests.) The authors recommend their test for 
clinical diagnosis and personnel work because: 
(a) it does not depend “upon the insight and 
honesty of the subject,” (b) the humorous con- 
tent makes it attractive to the subject, and (c) 
it is easy to administer and to score. 

Form A includes 91 pairs of jokes of which 
the subject is asked to choose the funnier. Each 
of the 112 jokes making up Form B is to be 
classified as funny or dull. The 11 scores obtain- 
able from each form are based on 11 clusters 
of jokes gathered from three successive re- 
searches. “The jokes put together in one cluster 
have had to meet the criterion of correlating 
above .25 with every joke therein, on at least 
two sample populations.” The manual states that 
both forms measure “the same eleven personality 
dimensions.” This equating of clusters (rather 
than factors) to dimensions is in contradiction 
to Cattell’s previous work and to the usual defi- 
nitions given in factor analysis. 

The actual reliabilities for each cluster are not 
quoted, but it is stated that the correlations be- 
tween Forms A and B “over a one month inter- 
val range from .10 to .60.” This unsatisfactory 
result leads the authors to conclude that “the 
present test must therefore be used tentatively 
with that combination of measurement and clini- 
cal insight necessary with all clinical or misper- 
ceptive (‘projective’) tests today.” 

Figures for correlations between cluster scores 


1А ў . . “A Factorial Analysis of Responses 
to the Comte'as a Лр Personality." J General Psychol 28: 
209—224 Ap '43. * Й Ё 

2 Freud, Si d. Wit and Its Relation to the Unconscious. 
New vod SERM Yard & Co., 1016. Pp. ix, 388. * 
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and various “source traits" scores are given for 
three clusters; actual figures are not presented 
in the handbook for any other clusters. Not men- 
tioned in the handbook is the authors’ discovery 
of near zero correlations between 13 cluster 
scores from a 69 item joke test and 10 factor 
scores from the Guilford-Martin Inventory of 
Factors GAMIN and the Inventory of Factors 
STDCR (3). The only suggestion that the test 
has diagnostic validity comes from the statement 
in the handbook that one cluster “both on the A 
and B forms has been shown to distinguish psy- 
chotics from normals at the one percent level 
of confidence.” It is stated that only one other 
cluster tends to distinguish psychotics from nor- 
mals. Although it is indicated that the ЈРАТ 
Humor Test was designed in accordance with 
the Freudian postulates which experimental evi- 
dence obtained by use of the test tends to con- 
firm, we cannot find any factual evidence that 
this is true. 

In summary we may note the following 
points. It is not possible to rely on the published 
articles (2, 3) for information about the Humor 
Test because at most only 69 jokes of those men- 
tioned in these articles are included in the 294 
jokes making up the test. The handbook gives 
little useful information on construction, stand- 
ardization, and validity. However, the vague 
statement about reliabilities (quoted earlier) 
shows, if the usual standards of reliability are. 
accepted, that the two forms are not measuring 
the same variables. There is no evidence that the 
test can be used for the prediction of human be- 
havior. 

The test might serve as a novel basis for in- 
teresting research in personality, but at the 
present stage of its development, it is not defi- 
nitely known to be of any practical use whatso- 
ever. 


J. К. WITTENBORN, Research Associate in Psy- 
chology, Yale University, New Haven, Con- 
necticut. 

This test is related to a “Freudian hypothesis 
that the ‘tendency’ in wit is an expression. of 
needs normally repressed in everyday life.” The 
jokes which comprise the two forms of the Hu- 
mor Test are each relevant to 1 of 11 clusters of 
jokes. Evidence for the clusters is based on three 
successive researches but the samples of jokes 
and subjects employed in these researches are 
not specified. The validity claims for some of 
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the cluster scores are based on correlations with 
certain of the basic personality factors which 
provide the scores for Cattell’s Sixteen Per- 
sonality Factor Test scores. The manual does 
not provide such validity evidences for all of the 
humor cluster scores, however, and for the pro- 
spective examiner the possible validity of most 
of the humor clusters remains a matter of specu- 
lation. 

The evidence for cluster score reliability is 
not altogether reassuring. Retest correlations 
vary from cluster score to cluster score and 
range from .10 to .60. Whether this is a result 
of an intrinsic unreliability or evidence that 
humor expressed by appreciation of jokes is 
an unstable phenomenon is not known. If this 
aspect of humor is not stable, however, the 
question arises as to how scores for clusters of 
jokes may best be used in the evaluation of per- 
sonality, if they may be so used at all. It is re- 
ported in the manual that there is some variation 
in ability to recognize and that in a given situa- 
tion this ability may vary inversely with the 
period of time spent reading jokes (adaptation). 
This possibility has certain implications for the 
manner in which the test may best be used. The 
score for Form A of the test is based on a com- 
parison of jokes whereas the score for Form B 
of the test is based on a judgment as to the 
funniness of the jokes. Accordingly, Form B 
permits a determination of a score for general 
readiness to rate jokes as funny. Simce there is 
an adaptation factor, it would seem desirable to 
specify whether Form A or the Form B should 
be given first. Otherwise, the Form B score on 
general readiness to rate as funny may not have 
a uniform significance. 

The norms provided are based on undergrad- 
uates and the test is considered to be appropriate 
for the general population. The scoring pro- 
cedure involves counting items from different 
pages and combining them on the basis of a 
path marked on a stencil. Such use of a stencil 
is, of course, convenient and familiar; unfor- 
tunately, however, the paths for all the humor 
cluster scores are indicated on the same stencil, 
and this saving of paper may prove expensive in 
terms of the energies of the scorer. An additional 
handicap was experienced by the reviewer who 
found that the stencil was prepared on a smaller 
scale than the answer sheet. 

In general, the: possibility of assessing per- 
sonality by an employment of jokes is pleasant 
to contemplate. Perhaps, as the authors claim, 


the subject does not know the personal signif- 
icance of his sense of humor but the possibility 
exists that the subject’s judgments will be in- 
fluenced by his familiarity with the jokes. Quite 
often there is something decidedly unhumorous 
about a hackneyed joke. This is no idle critj- 
cism because if jokes are related to personality, 
the subject’s familiarity with jokes may suffer 
from a selective bias as a result of his personality 
and, accordingly, the use of familiar jokes on a 
test may yield scores which are not truly rep- 
resentative of the subject’s preference for jokes, 

Despite the rather discouraging lack of relia- 
bility, the fact that the manual does not indicate 
the degree to which the cluster scores are cor- 
related, and the fact that the evidence for va- 
lidity is incomplete and consists of occasional 
correlations with other factors, the idea of a 
humor test for personality is a good one and the 
clusters appear to be plausible (the only im- 
plausible aspect of the clusters is that they are 
all of a bipolar nature suggesting that every 
form of humor sampled by the jokes has a singu- 
lar and specifiable antithesis). In conclusion, it 
seems distinctly possible that this approach can 
elicit clinically relevant factors which may not 
be forthcoming by an employment of question- 
naire or rating procedures, 
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Johnson Temperament Analysis. Grades 12-16 and 
adults; 1941-45; 9 scores: nervous-composed, de- 
pressed-cheery, active-quiet, cordial-cold, sympathetic- 
hard, subjective-objective, aggressive-submissive, criti- 
cal-appreciative, self-mastery-impulsive; IBM; Form 
A (41); Form B (’44) discontinued; manual (^44); 
separate answer sheets must be used; $2 per 25; 2¢ per 
response record sheet; 1¢ per revised analysis profile 
(745) ; 60€ per set of unweighted scoring stencils ; $1.75 
per set of weighted scoring stencils; 35¢ per specimen 
set, postpaid; 2¢ per IBM answer sheet ; postage extra; 
nontimed (40-50) minutes; Roswell Н. Johnson ; Cali- 
fornia Test Bureau. * 
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ALBERT Ertis, Consulting Psychologist, 56 
Park Ave., New York I6, New York. 


Aside from the fact that the Johnson Temper- 
ament Analysis is no worse than most conven- 


` 
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tional personality inventories, there is little that 
can be said in its favor. Its shortcomings, some 
of the most important of which are listed as 
follows, come much easier to mind than its ad- 
vantages : 

а) The language and indirect phrasing of the 
questions make difficult reading and answering. 
A typical question reads: "Is S very eager to 
have his own business, or be an independent 
professional man, or if in an organization to 
be in a position to give orders rather than to 
take them?” 

b) The temperament traits which the test 
purportedly measures are hardly of uniform use- 
fulness or importance, and include vague traits 
like active, cordial, sympathetic, and subjective. 
Moreover, the traits are all stated in bipolar 
terms ; according to recent researches of Thomas 
N. Jenkins and his associates, this type of di- 
chotomization of traits has dubious validity. 

c) The reliabilities of the nine scores obtained 
on the test range from .57 to .78, rather low for 
this type of inventory. 

d) The latest (1944) manual for the test 
contains no adequate validity reports. The one 
validity study using an independent criterion 
and reported in detail was done with groups who 
had voluntarily come for marriage counseling 
and who, unlike unselected groups of subjects, 
could be expected to give relatively honest test 
protocols, Even with this type of subject, the test 
reliably distinguished unhappily married from 
about-to-be-married individuals only in extreme 
(and presumably few) instances. 

€) According to the author's published data, 
respondents tend to overrate themselves, in an 
appreciably biased manner, on seven of the nine 
traits tested. 

f) The test manual shows no comprehension 
of modern clinical psychological methods or of 
the psychodynamics of normal or exceptional 
human behavior. The manual belittles clinical 
interviewing, cavalierly attempts to give a 3- 
page course in personal and marital counseling, 
glosses over the serious shortcomings of all per- 
sonality inventories, implies wide-ranging diag- 
nostic accuracy which the test most certainly 
does not seem to possess, and makes special 
claims for industrial uses of the test which are 
highly unrealistic. 

In summary, the Johnson Temperament 
Analysis and its accompanying manual attempt 
to show, in-all seriousness, (a) that personality 
inventories now constitute one of the best tech- 
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niques for the deep exploration of human per- 
sonality and for the uncovering and treatment 
of serious emotional problems, and (5) that the 
Johnson Temperament Analysis, in particular, 
is one of the most reliable and valid instruments 
of its type. In neither of these claims or impli- 
cations does the author appear to approximate 
even remotely factual reality. 


For a review by Н. Meltzer, see 3:57. 
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*Jurgensen Classification Inventory. Adults; 
1947-50; 1 form, '47; norms and scoring keys are not 
provided as each user is expected to develop keys and 
norms to fit his own situation; manual (’50) ; separate 
answer sheets must be used; $5 per 25; $1.90 per 100 
answer sheets; postage extra; 50¢ per specimen set, 
postpaid; test materials will be sold only to members 
of the American Psychological Association; (20-40) 
minutes; Clifford E. Jurgensen; the Author, 4741 El- 
liot Ave. South, Minneapolis 7, Minn. * 


REFERENCES 

т. JurcENsEN, Сілғғовр E. "Report on the ‘Classification In- 
ventory,’ a Personality Test for Industrial Use." J Appl Psychol 
28:445-60 D '44. * (РА 19:1320) | i 

21 Арлмѕ, Ёрулвр B. An Investigation of the Validity of 
the Jurgensen Classification Inventory as a Predictor о Aca- 
demic Achievement of Purdue University Students. Master's 
thesis, Purdue University (Lafayette, Ind.), 1948. Pp. 24. 

3, PRED, Соврох D. A Comparison of the Test Performance. 
of “Good” and “Poor” Industrial Supervisors. Master’s thesis, 
Purdue University (Lafayette, Ind.), 1948. 

4. BOSWELL, CHARLES Azan. The Consistency of Responses to 
Forced-Choice Items in a Preference Inventory. Master’s thesis, 
State University of Iowa (Iowa City, Towa), 1949. Pp. iv, 42. 

5. KIRKPATRICK, JAMES J. Validity of the Jurgensen Classi- 
fication Inventory as a Predictor of College Achievement, Mas- 
ter's thesis, University of Tennessee (Knoxville, Tenn.), m e 

6. Кхлоғт, EnwiN В, “A Selection Battery for Bake Sho 
Managers.” J Appl Psychol 33:304-15 Ag '49. * (PA 24:2850) 

7. Laws, C. Н. “How Can We Pick Better Supervisors?” 
Personnel Psychol 2:69-73 sp ’49. * (PA 23:5071) k 

8. Orson, MamjonrE. Validation o the Forced-Choice Tech- 
nique in Personality Measurement, octor's thesis, University 
of Minnesota (Minneapolis, Minn.), 1949. Ер X, 143. ў 

9. Mats, Rosert Date. A Study of the Effectiveness With 
Which the Jurgensen Classification Inventory Counteracts Falsi- 
fying. Master’s thesis, University of Colorado (Boulder, Colo.), 


. 1 
[ue Krrxpatrick, James J. "Cross-Validation of a Forced- 
Choice Personality Inventory." J Appl Psychol 35:413-7 D 
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Бовккт G. БЕМАВЕЕ, Assistant Professor о] 
Psychology, and Louis L. McQurrry, Profes- 
sor of Psychology, University of Illinois, Ur- 
bana, Illinois. 

DESCRIPTION AND PURPOSE OF THE TEST. This 
inventory, first reported in 1944, represents one 
of the first, if not the first, personality inven- 
tories which utilizes the forced-choice technique 
as it is known today. The inventory was devised 
explicitly for situations such as personnel selec- 
tion where an incentive exists for an individual 
to depict himself favorably. 

The test consists of 72 triads, each composed 
of three descriptive phrases from which the sub- 
ject selects the phrase most and least descriptive 
of him, and 36 paired items from which the sub- 
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ject makes a preference and indicates whether 
it is strong or weak. 

Forty-five triads refer to irritating types of 
persons; 17 triads refer to reputations, and the 
remaining 10 triads refer to personal likes and 
dislikes. Twenty pairs require choices in activi- 
ties and the remaining 16 pairs refer to types of 
disliked persons. 

The inventory is self-administering and no 
time limits are specified. 

The test booklet is stapled together, and has 
the disadvantage that it will not lie flat when 
opened. However, when the answer sheet is 
inserted into the rear of the test booklet, the 
right-hand margin of the page of items is con- 
veniently matched by the column on the answer 
sheet into which responses are to be placed. 

The instructions for the first part do not 
specify whether the subject is to use himself as 
a referrent in making choices, or whether he 
should respond in the way he conceives that 
some group, stich as his acquaintances, would 
respond. Choices must be made, in each triad, 
of the type of person which the subject thinks 
is “most irritating” and “least irritating.” Irri- 
tating to whom? The available interpretations 
of the latter by the respondent is a feature of 
this part of the inventory which deserves ex- 
plicit consideration. 

SCORING. The test presumes that those who 
use it will develop weighted scoring keys ap- 
propriate to the particular situations. 

The manual offers practical procedures for 
item analysis and validation which are quite 
commendable, except for two points. Contrary 
to a statement in the manual, “Level of signif- 
icance" is not equal to the “Chances of a true 
difference.” Instead, it is the times in a hundred 
that chance alone would produce a difference as 
large as the one obtained when the true dif- 
ference is zero. Another error is the listing of 
levels of significance appropriate to unidirec- 
tional differences without warning that this is 
appropriate only when the direction of the dif- 
ference can be specified in advance. The re- 
sponses to the item by the criterion groups must 
be specified on an a priori basis. This does not 
appear possible in the case of most items. 

RESULTS IN THE USE OF THE TEST. (a) Re- 
liability. Several studies have evidenced relia- 
bilities adequate for the uses proposed for this 
test. Test-retest reliability (N = 23) was .78 
in one study. (b) Validity. Excluding studies 
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without cross-validation, the test showed some 
validity for selection of bakeshop managers (cor- 
relation of .39 for 33 cases with a composite 
criterion, significant at the 5 per cent level of 
confidence), but failed to distinguish between 
176 "good" and "poor" industrial supervisors 
in 45 plants. 

Tailor-made scoring keys revealed significant 
differences between engineering students and 
journalism students. Studies using items from 
this test for predicting academic achievement 
have given negative results. 

OVERALL EVALUATION. This inventory offers 
an approach with more promise in personnel 
selection than the conventional personality in- 
ventories. In many situations, however, it would 
seem to be preferable to build items especially 
adapted to the situation, rather than restrict 
oneself to the items of this inventory. 


WirriAM J. E. Crissy, Assistant Professor of 
Psychology, Queens College; and Lecturer on 
Applied Psychology, Fordham University; New 
York, New York. 

This personality questionnaire samples the 
following aspects of the examinee’s adjustment: 
the kinds of persons he finds most and least ir- 
ritating, the ways in which he would most prefer 
and least prefer to be thought of by others, per- 
sonal likes and dislikes, preferred activities or 
modes of behavior, and, finally, the kinds of 
persons he dislikes. The instructions for ad- 
ministration are contained in the test booklet 
proper and are clearly and explicitly written ; the 
test may be self-administered. 

The test does not purport to measure specific 
personality traits, nor does it claim general va- 
lidity. The author intends that each prospective 
user try it out on his own group or groups, evolve 
his own scoring key based upon an item analy- 
sis against available criteria, and cross validate 
the test on a second appropriate group. In sum- 
mary, it is Jurgensen’s thesis that in the em- 
ployment situation specific validity is the prime 
consideration. Therefore, each user must deter- 
mine whether this test is appropriate for the 
solution of his particular testing problems. 

The author furnishes a miscellany of highly 
useful aids to facilitate the experimental use o 
the test. Specifically, he furnishes easy-to-under- 
stand directions and work sheet layouts for do- 
ing various kinds of item analyses and statistical 
tables for various levels of confidence. These 


117 ] 


aids are well within the comprehension of any- 
one with an elementary knowledge of test theory 
and statistical methods. 

Although no validity or reliability data as 
such are reported in the manual, Jurgensen and 
others have elsewhere reported such informa- 
tion. The validity data show promise for the 
test, especially for use with sales groups. In the 
case of one study (т) using various scoring keys, 
the obtained validity coefficients range from .77 
to .81 for a sales group and from .67 to .72 for 
a graduate student group. However, these co- 
efficients are computed on the same sample as 
was used for constructing the scoring keys used 
and are therefore open to question as to stability. 
Jurgensen has reported to the reviewer that in 
a sequel study involving additional small groups, 
the validity coefficients remained relatively stable 
(.71 for the sales personnel, .64 for the students). 
The reliability data are reported in terms of 
modified, stepped-up, split half coefficients, They 
range from .90 to .97 on the sales group and 
from .8o to .98 on the student group. While the 
writer does not report standard errors of meas- 
urement, the reviewer has estimated these from 
the dispersions reported in the paper and they 
are relatively small and generally satisfactory. 

The test appears to have potential usefulness 
in the screening and selection of employees for 
all white collar positions. Each prospective user 
must determine experimentally whether this is 
a good test for his purposes. The individual items 
in the test appear to have “face validity," that 
is, they seem to be sampling the aspects of ad- 
justment the prospective employer would ton- 
sider relevant in connection with almost any 
job. The more usual personality inventories 
which purport to measure specific traits have 
not proved very valid in industrial usage. It re- 
mains to be seen whether a test that makes no 
such pretensions will prove to be a more efficient 
kind of measure. At this writing, the validity 
data are too sparse to warrant drawing general 
conclusions with regard to the applicability and 
usefulness of the test. 


[64] 
*KD Proneness Scale and Check List. Grades 7- 
12; 1950; also called Delinquency Proneness Scale; 
2 parts; 1 form; distribution is restricted and purchase 
1s made by special arrangement with the publisher; 
William C. Kvaraceus; World Book Co. * " 
а) KD PRONENESS SCALE. Nontimed (15-25) minutes. 
b) кр PRONENESS CHECKLIST. Ratings made by teach- 
ers; nontimed (6-25) minutes. 
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DoucLas Courtney, Program Diréctor, In- 
stitute for Research in Human Relations, Phila- 
delphia, Pennsylvania. 

This test consists of two parts: a scale of 75 
multiple choice questions designed to detect “de- 
linquency proneness” in children in grades 6— 
12 to be answered by the child himself, and a 
58-item checklist to be filled in by the social 
worker or other professional person evaluating 
the behaviour of a group of children. The items 
concern those aspects of the person, his home, 
and his school which appear to be related to 
delinquency. Each item on the checklist is an- 
swered by checking yes, no, or ?. 

The scale is well printed; the items appear to 
be well and thoughtfully constructed. The man- 
ual is prepared with the usual thoroughness of 
the World Book Company. In a letter to the 
reviewer, the World Book Company indicated 
that the present materials are being used on a 
limited basis only. The company indicated that 
they were “awaiting the outcome of several 
studies now in process to obtain additional 
normative data.” Since the authors and pub- 
lishers are apparently taking every precaution 
in the preparation of this scale and checklist it 
would seem neither appropriate, helpful, nor 
prudent to be overly critical at this time. 

One or two considerations might be borne in 
mind by both publishers and potential users as 
they work toward further validation of the meas- 
ure, especially the scale. 

a) It would seem appropriate to balance more 
closely the various criterion groups. There is 
no indication whether the “Public School Boys” 
represent a sample from the same public schools 
that produced the “Delinquent Boys.” If they 
are not from the same population the interpreta-__ 
tion derived from these comparative data (Table 
т in the manual) would appear to be heavily 
“loaded” in favor of the desired predictions. It 
may well be that only three or four of the 98 
“Delinquent Boys" (Table 1) could have been 
removed from the 156 “Public School Boys" on 
the same table. Where those delinquent boys 
actually fell in relation to the I 56 other boys 
would have a lot to do with our interpretation of 
the validity of the KD Scale as a discriminating 
predictor in a real situation. 

b) The section entitled *Correlations with 
other measures" with its accompanying Table 
3, has the same elusive statistical quality. Тһе 
correlations all appear to be done with one or 
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the other end of the KD distribution. If this 
proves to be true the negative intercorrelations 
between segments of the KD distribution and 
the Otis Self-Administering Tests of Mental 
Ability, for example, are very high and would 
be much higher if the entire distribution were 
to be used. The test of this assumption would 
be to obtain an intercorrelation between the 
Otis and the KD scale for all of the persons in- 
volved pooled into a single group. It is an old 
trick of test makers to expand the range when 
they wish to push up intercorrelations. This is 
a new one directed at pushing down the inter- 
correlation. In saying this the reviewer is im- 
puting neither goodness nor badness to the au- 
thor or publisher. He is merely saying that 
interrelationships should be computed at the 
level at which predictions are to be made and 
used. If we wish to discriminate between boys 
who are in public school and boys who are in 
“reform” school, we need only look at the roster 
of the two schools without the use of a predictor. 
If we wish to discriminate within a neighbor- 
hood between those children who are going to 
reform school and those who are not, we need 
validity coefficients derived from neighborhood 
data. The author and publisher are aware of this 
and state in the manual that studies are in prog- 
ress to determine the instrument’s predictive 
value for unselected groups. It has been neces- 
sary to use a much rougher validating technique 
at this stage in the development than might be 
wished for. 

The area of delinquency predictions is a com- 
plex, socially valuable area of research. The out- 
comes of the studies on this scale and checklist 
are awaited with keen interest for they appear 

to be a promising development in a sparse field. 


Date B. Harris, Professor, Institute of Child 
Welfare, University of Minnesota, Minneapolis, 
Minnesota. 

The first item, the KD Proneness Scale, taps 
children’s attitudes and interests and presumably 
indicates trends toward delinquent behavior, The 
nature of the instrument appears to be suffi- 
ciently concealed. Although a sophisticated per- 
son could affect his score in the direction of 
socially acceptable answers, this is not likely to 
happen in the junior high age range. While the 
concepts included in the Scale are well within 
the understanding of children in the seventh 
grade and above, the vocabulary of the Scale 
may present some difficulties for poor readers, 


One distinct advantage of this test is that it 
is applicable to girls as well as to boys. How- 
ever, data contained in the manual indicate that 
the test does not discriminate as successfully 
between known delinquent girls and unselected 
girls as it does between delinquent and nonde- 
linquent boys. 

The one criticism which must be made is that 
the manual fails to report cross validation ma- 
terial. Tests from the same delinquent group 
were used both for obtaining the score distribu- | 
tions reported in the manual and for selecting 
the original items. However, data supplied to 
this reviewer by the author indicate that in 
further studies, a biserial correlation of the 
order of .85 was obtained between test scores 
and a dichotomous grouping composed of de- 
linquent boys and boys with high moral repu- 
tations. The comparable figure for girls is .46. 
In still other groups, correlations between 5- 
point ratings of behavior (undefined) and scale 
scores run between .то and .49, values being 
higher for boys. It is clear from these data, too, 
that mean scores are significantly higher (i.e. 
indicative of delinquency) for younger than for 
older children. 

The criterion groups of delinquents and non- 
delinquents were apparently not matched with 
respect to such factors as age, socio-economic 
status, and intelligence. Published research on 
delinquency shows that differentials between 
delinquent and nondelinquent groups on atti- 
tude type material frequently are minimized 
when such factors as socio-economic status and 
intelligence are controlled. The manual reports 
that for a variety of groups the correlation of 
scale scores with results of а group intelligence 
test run between —.26 and —.42. Thus, intelli- 
gence is not unassociated with results of the 
measure. One correlation of —.24 is reported 
with the. Personal Index, a scale for the detec- 
tion of potential behavior problem cases (this 
scale is otherwise unidentified). я 

The author states in the таппа1 that, ^It is 
to be noted that no ‘norms,’ in the customary 
sense are furnished for interpreting scores on 
the Scale, nor are any needed for the use of 
results here proposed." He provides a threshold 
or "cutting score" presumptive of delinquency. 
However, he has published several distributions 
of scores which the discriminating test user is 
likely to consult to give meaning to his work. 
Even though the legal code considers delinquency 
to exist or not to exist in terms of presence of 
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absence of specific acts, most school people are 
trained to think of maladjustment as a variable, 
existing in greater and lesser degrees. Undoubt- 
edly this thinking will apply to obtained scores 
on this test, even though a "cutting score" is 
offered as the most meaningful way of inter- 
preting results. 

A second instrument, the checklist, provides 
a list of descriptive characteristics which have 
apparently been drawn from research literature 

„as being significantly associated with delin- 
quency. It has obvious "face validity" to any- 
one familiar with the sociological and psycho- 
logical literature of delinquency, but the manual 
affords no real knowledge of how the items were 
selected. Certain items are especially designated 
as having “е greatest bearing on potential de- 
linquent behavior"; again no basis for the au- 
thor's selection is afforded. The checklist is to 
be marked by a teacher or professional worker 
on the basis of his knowledge of a particular 
child. Ranges of scores presumed to indicate 
conditions of "vulnerable" and "extreme ex- 
posure" to delinquency are tentatively offered. 
While the author recommends that the two in- 
struments be used together, he gives no informa- 
tion concerning the correlation of the measures 
except to say, "there will not always be com- 
plete agreement between the two instruments in 
identifying a given youngster as probably de- 
linquent, but even children for whom the scale 
and check list do not agree should receive further 
attention from the appropriate professional 
worker.” 

From research data available, it would ap- 
pear that this test is comparable or perhaps 
somewhat superior to several other paper and 
pencil instruments purporting to predict de- 
linquency. All such instruments suffer from the 
poor definition provided by the socio-legal basis 
of the criterion group. The KD Scale, used with 
considerable caution, will indicate some of the 
children needing the special attention of guid- 
ance workers. The checklist will probably do as 
much as the scale, by systematizing facts already 
known about the child, and will not require the 
child’s time to obtain a record of his attitudes. 
The author and publishers are to be highly com- 
mended for restricting the distribution of the 
test. Such limitation should be used more widely 
to prevent poorly considered judgments being 
passed on children, especially in cases where 
an instrument is still in process of develop- 
ment. 


*Kuder Preference Record—Personal. Grades 9- 
16 and adults; 1948-49; 6 scores: sociable, practical, 
theoretical, agreeable, dominant, verification; IBM; 
2 editions ; 1 form, '48; preliminary manual, second re- 
vision ('49) ; separate answer pads or answer sheets 
must be used; 556 per .25 profile sheets for men and 
women or boys and girls; 756 per specimen set of 
either edition; cash orders postpaid ; nontimed (40-45) 
minutes; С. Frederic Kuder; Science Research Asso- 
ciates, Inc. * 
4) FORM AH (HAND SCORING EDITION). 40€ per test and 
answer pad; $2.15 per 25 answer pads. 
b) FORM AM (MACHINE SCORING EDITION). 30€ per test ; 
$2.90 per 100 IBM answer sheets ; $4 per set of scoring 
keys. 

REFERENCES 

1, Mosier, Mary F., AND Kuper, С. Freperrc. “Personal 
Preference Differences Amon; Occupational Groups." J Appl 
Psychol 33:231-9 Je 4 * (PA 24:1926) 

2. Вівсе, WiLLIAM К. “Preferences and Behavior Ratings of 
Py d Ed & Psychol Meas 10:392-4 au '5o. * (РА 25: 
2987. 

з. бозновх, Ммохан MARJORIE. A Study of the Relation- 
ships Between the Kuder Preference Record—Personal and Cer- 
tain Sociometric Ratings. Doctor's thesis, Indiana University 
(Bloomington, Тойду 1950. (Thesis Abstract Series....1950, 
1951, рр. 37-43) (PA 25:7107, title only) 

4. Kuper, б. Freperic. "Identifying the Faker.” Personnel 
Psychol 3:155-67 su 'so. * (PA 25:1791) 


J Consult Psychol 13:67 F '49. Laurance F. 
Shaffer. * A wide use may be predicted for this 
convenient and soundly-developed question- 
naire. * 

QN 
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Lewerenz-Steinmetz Orientation Test: Concern- 
ing Fundamental Aims of Education. High school 
and college; 1931-35; 10 scores: physical and mental 
health; education-general, education-specific, worthy 
home membership, vocation, civic education (govern- 
ment), civic education (industry and world), worthy 
use of leisure, ethical character, total; 1 form, '35; 
manual (35); $2.50 per 25, postage extra; 25€ per 
specimen set, postpaid; nontimed (60) minutes; Alfred 
S. Lewerenz and Harry C. Steinmetz; California Test 
Bureau. * 


REFERENCES 
т. Rorrg, J. F. “The Measurement of Teaching Ability: 
Study Number Two." J Exp Ed 14:52-74 S '4s. * (PA 20: 


1268 
Dx Коѕткев, L. E. "The Measurement of Teaching Ability: 
Study Number One." J Exp Ed 14:6-51 S '45. * (PA 20: 


6: 
Herd Resnick, }озЕРН. “A Study of Some Relationships Be- 


tween High School Grades and Certain apace of Adjustment,” 
J Ed Res 44:321-40 Ja '51. * (PA 25:8286) 


ЕВЕРрЕВ1С L. Ayer, Head, Division of Evalua- 
tion and Research, Citizenship Education Proj- 
ect, Teachers College, Columbia University, 
New York, New York. 

The purpose of this test is to measure “beliefs 
likely to condition attitudes." The general pat- 
tern is a list of statements ranging from super- 
stition to fact; the respondent is asked to mark 
those statements which are absolutely true. High 
reliability is reported in the manual. The validity 
report may be summed up in these words of 
the authors: “The Orientation Test appears to 
be valid in that with individuals the test results 
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seem to check with subjective expectations." 
This statement is based on the fact that indi- 
viduals of different educational attainment made 
sharp differences in scores. Standardization was 
done on separate samples of college instructors, 
graduate students, public school teachers, col- 
lege upperclassmen, miscellaneous tradesmen, 
city policemen, city jail prisoners, and 372 white 
adults who were "probably above the average 
of a typical population." These groups, with 
the exception of the last, are listed in the rank 
order of their median scores. It is the opinion of 
this reviewer that this test is a reasonably good 
measure of educational attainment which does 
not necessarily measure superstition and dogma. 
This opinion is supported by such items as the 
following, which constitute more of a vocabulary 
test than an attitude questionnaire: "Revolt 
against equitable authority, if an habitual re- 
sponse, is a manifestation of a pathological 
mental condition"; “Inevitably involving re- 
nunciation, marriage is essentially tragic, though 
not necessarily unhappy"; and “Most cine- 
matographic performances have a deleterious еї- 
fect on adolescent minds." Contrast those with 
items like these: “A square jaw is a sign of will 
power" ; "In divorce the man is always most to 
blame"; and “Motion picture plays vary in 
merit." 

True to the claims of the authors, high school 
students (in the suburban area surrounding 
New York City) did not attain as high scores 
as Columbia University graduate students who, 
being older, had had time to develop more preju- 
dices than the younger students. Although 
the test is intended for use in high schools 
and colleges, this reviewer doubts whether 
the test would serve any purpose in a high 
school and probably only limited purpose in 
college. 

Even if reasonably well informed college stu- 
dents do not know the difference between super- 
stition and fact, they would be able to figure out 
the scoring pattern after the first two pages if 
they are at all testwise, since the "absolutely 
true" responses are in an identical pattern on 
each page. This makes for ease in scoring, it is 
true, but also may leave the truthfulness of the 
responses open to question. 

In the opinion of this reviewer, the test lacks 
validity as an attitude test, and is not sufficiently 
comprehensive to make an efficient test of edu- 
cational achievement. 


Rocer T. Lennon, Director, Division of Test 
Research and Service, World Book Company, 
Yonkers, New York. 

This test is intended “аз a measure of beliefs 
likely to condition attitudes.” The beliefs are 
those having to do with “fundamental aims of 
education.” The test consists of 475 items, or- 
ganized into seven subtests, two of which are 
divided into two parts. Thus, nine scores are 
derived as indicated in the description above, 
together with a total score. The number of items 
per subtest varies from 25 to 75. 

The items include statements of fact (“A tit- 
mouse is a small bird”) ; common beliefs either 
false or unsupported by adequate evidence 
(“The genius is likely to be unbalanced” ; “It 
is better to tell your troubles to your friends 
than to keep them to yourself”) ; and value judg- 
ments (“Hard work deserves a high school- 
mark"). The examinee is directed to mark those 
statements which he thinks are “absolutely 
true.” Two scores are derived for each subtest: 
a “True” score, which is the number of state- 
ments keyed as true and marked true by the 
subject; and a “Doubtful” score, which is the 
number of statements keyed as doubtful but 
marked as true by the subject. On the whole, the 
statements are clear, although the phrasing of 
a few of them might be criticized—e.g., Section 
IV, Item 21; V, 50; VI, 25; VII, 7. The re- 
viewer did not, in every instance, find himself in 
agreement with the keying. The “absolute” 
truth of statements such as I, 21 and 46; II, 51; 
III, 28; IV, 21 may be questioned. 

Information on validity is confined to the 
statements that the test “appears to be valid in 
that with individuals the test results seem to 
check with subjective expectations,” and that 
there are “sharp differences in score existing 
between certain groups of individuals of varying 
educational attainment.” These latter data (in- 
cluded as part of the normative material) are 
scarcely unambiguous evidences of validity. The 
nine areas into which the test is organized are 
represented as a classification of “commonly ac- 
cepted objectives of education,” a not unreason- 
able claim; but there is no indication of the 
manner in which the individual items in the sub- 
tests were constructed, selected, or validated. 

Estimates of reliability of the total test re- 
ported in the manual include a coefficient of 89 
(N=152 adults, otherwise undefined), de- 
rived by the “split-half method on average per- 


121] 


TESTS & REVIEWS: CHARACTER—NONPROJECTIVE 


67 


centile score," and corrected by the Spear- 
man-Brown formula; and corrected odd-even 
coefficients of .951 for True scores and .958 for 
Doubtful scores (N = “тоо cases of college stu- 
dents and adults"). These latter coefficients are 
described, rather puzzlingly, as indicating the 
“agreement between odds and evens of items 
answered correctly." No reliability data are pro- 
vided for subtest scores, a serious omission in 
view of the fact that use of subtest scores of 
individual examinees is encouraged both in the 
manual and by the provision on the test booklet 
of a profile chart on which these scores are to 
be charted. No data are furnished concerning 
subtest intercorrelations. 

Materials for interpretation of scores include 
tables of percentile ranks and adjective ratings, 
for subtest and total scores, for a mixed adult 
population and for a population of high school 
seniors and college freshmen. For adults, the 
percentile rank and the rating for each subtest 
is derived from a combination of the Doubtful 
score and the True score. Occasionally, the com- 
binations lead to peculiar results: e.g. in Sec- 
tion I, for a given Doubtful score, an examinee 
with a True score of 14 has a percentile rank 
of6 and is rated as Inferior, whereas an examinee 
with the presumably better True score of 15 
receives a percentile rank of 3 and is rated Very 
Inferior. For high school and college students, 
percentiles are calculated separately for the 
Doubtful and the True score on each subtest ; 
these two percentiles are then summed and this 
sum of percentiles is itself converted to a per- 
centile rank. For both students and adults per- 
centile ranks for all nine subtests are totaled 
and divided by 9 to yield an average percentile 
rank for the entire test, which is converted to 
still another percentile rank and rating. The well 
known lack of equivalence of percentile units 
appears not to have discouraged the authors 
from adding and averaging percentiles ; but the 
reviewer regards this system of interpretation 
as needlessly crude and inexact, and not to be 
defended on grounds of apparent simplicity. 
Further, presumably because of the small stand- 
ard deviation of subtest scores, very small dif- 
ferences in raw score are frequently associated 
with extremely large differences in percentile 
rank, 

The adult “standards” are based on a popula- 
tion described as “almost entirely made up of 
superior adults” and “largely....university gradu- 


ates" but otherwise unspecified. Percentile ranks 
for average percentile scores are also furnished 
for seven different groups of adults (N from 19 
to 119) ; it is not clear from the manual whether 
the adult standard population included any or 
all of the cases in these latter groups. The popu- 
Jation from which the high school senior and 
college freshman norms were derived is not 
described. No distributions of scores, means, or 
standard deviations of either subtest or total 
scores are presented. 

From the standpoint of format, the test is 
adequate. Administration is simple, though 
likely to require somewhat more than the 60 
minutes suggested. Scoring is by means of a 
strip key. Suggestions for “diagnosis” and use 
of the results are limited to brief generalities. 

The authors, in developing this test some 20 
years ago, apparently had in mind the provision 
of an instrument intended to get at some of the 
important attitudinal outcomes of education, 
certainly a praiseworthy aim. The reviewer is 
disposed to feel, however, (a) that the nature of 
the items and of the task set for the examinee 
are such as to make the test very largely a meas- 
ure of achievement or knowledge in the various 
areas rather than an attitude measure; and (b) 
that in the absence of data beyond that provided 
in the manual, particularly with respect to valid- 
ity, reliability, intertest correlations, and norms, 
the test is of but limited usefulness for the pur- 
poses it was designed to serve. 


[67] 
*The Life Adjustment Inventory. High school ; 
1951; 14 scores: adjustment to curriculum, reading and 
study skills, communication and listening. skills, social 
skills and etiquette, boy-girl relationships, religion- 
moral-ethics, functional citizenship, vocational orienta- 
tion and preparation, physical and ‘mental health, fam- 
ily living, orientation to science, consumer education, 
art appreciation and creativity, use of leisure time; 
1 form; $2.25 per 25; 50@ per manual ; 75% рег ѕресі- 
men set ; postage extra ; nontimed (25) minutes ; edited 
by J. Wayne Wrightstone ; Ronald C. Doll; Acorn 


Publishing Co. * 


Jonn W. M. ROTHNEY, Professor of Education, 
University of Wisconsin, Madison, Wisconsm. 

This is just another one of those toss-a- 
circle-around-the-symbol-for-your-problem in- 
ventories which holds no promise of contribut- 
ing any more to our understanding of pupils’ 
problems than the scores of others which repre- 
sent the lowest form of the testmaker’s art. In 
25 minutes the student encircles numbers at the 
end of 180 problems to indicate whether they 
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are big, medium or small ones to him. The stu- 
dent is asked to give his “honest and sincere 
opinions”; this request should be warning 
enough that not much can be expected from the 
instrument. Many studies have shown that an- 
swers to this type of question can be faked con- 
sciously or unconsciously, and no check is pro- 
vided in this inventory as to the sincerity and 
honesty of the students’ opinions. The very fact 
that the student is asked to be honest and sincere 
may even suggest to those who had not thought 
about being insincere or dishonest that they 
could answer in that way. 

The authors state, “The validity of an in- 
ventory of this sort is difficult to establish con- 
cretely.” Then they say, “The present Inventory 
has been validated by administering it to thirty- 
four random-selected freshmen, sophomores, 
juniors, and seniors in a public high school and 
then making case studies of the thirty-four pu- 
pils.” Note the number. A total of 34 cases spread 
over four grades! It is well established statisti- 
cally and scientifically that validation on so few 
cases is entirely inadequate. After showing that 
the inventory and case study results are some- 
what similar, the authors conclude that the “early 
attempts at validation appear, therefore, to have 
brought encouraging results,” and they give the 
usual promise to determine and substantiate the 
validity of the inventory some time in the future. 
This reviewer would not recommend purchase 
of the inventory until we have seen the results 
of the determination and substantiation proc- 
esses. , 

The usual long, detailed, impressive but al- 
most meaningless tables of norms commonly 
provided for adjustment inventory scores are 
given. When the 2,005 cases are broken down 
by sex and grade, the groups range in number 
from 212 to 307. Fortunately the authors do 
not urge the user of the inventory to employ 
the norms extensively. It is suggested that 
norms may be used as general guides but that 
attention be focused on the local situation be- 
cause “each local school community has.its own 
peculiar conditions which may affect the way 
pupils respond.” Then, why not use a locally 
constructed inventory which can be adapted to 
those peculiar conditions? Drawing up lists of 

‚ statements similar to those used in this inventory 
could be done by a faculty committee as com- 
petently as it is done here. ; 

3 Since this reviewer sees no merit whatever 
in the procedure used in this inventory and such 
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similar devices as the Mooney Problem Check 
Listand the SRA Youth Inventory, comparisons 
between them seem valueless. It is hoped that 
we will soon pass that naive stage in educational 
and psychological thought in which we expect 
to get at important problems by a hurried mass 
approach. Perhaps these inventories will find 
their best use in bonfires celebrating our emer- 
gence from the ruts that the personality and ad- 
justment testers, ably abetted by high pressure 
salesmen, have carved out for us. , 
Caveat emptor. 


HELEN SHAcTER, Consulting Psychologist; and 
Lecturer in Psychology, Northwestern Uni- 
versity; Chicago, Illinois. 

As defined by the author, the purpose of the 
Life Adjustment Inventory is broad and the 
uses to which it may be put are very varied. А 
fourfold aim is the stated objective: (a) to sur- 
vey the extent to which a high school curriculum 
meets the needs of its pupils; (b) to assist 
teachers toward better understanding of pupils’ 
problems, needs, and interests ; (c) to aid guid- 
ance workers in individual counseling ; and (d) 
to carry out research in both guidance and cur- 
riculum problems. 

It would be naive to believe that so diversified 
a goal could indeed be achieved through one 
paper and pencil approach. While the inventory 
can probably contribute in some degree to parts 
of its broad objective if it is used judiciously 
and if its findings are considered in an investiga- 
tion which utilizes the instrument as one of à 
battery, the reviewer believes that the most 
promising utilization of the inventory would be 
in a guidance program of individual counseling. 

The inventory is presented in the form of 180 
statements to which the respondent is asked to 
react by encircling 1, 2, or 3 after each state- 
ment, indicating that he considers the item either 
a BIG problem, a problem of MEDIUM im- 
portance, or one of SMALL or NO importance. 
There is no time limit ; 20 to 25 minutes are said 
to be sufficient to complete the inventory. 

The items are classified into 14 areas of learn- 
ing considered significant because of emphasis 
“to greater or lesser extent by youth commits- 
sions, curriculum committees, and individual e i 
ucators, as well as by young people themselves. 
(None of the commissions or committees 15 
identified.) Fourteen scores are obtained from 
reactions to 24 items relative to curriculum ad- 
justment, and to 12 items each for 13 other 
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problem areas as listed in the bibliographic en- 
try above. 

The wording of the statements is simple and 
brief; a few examples can illustrate the general 
tone: 

Td like to quit school now. 

Га like to know how to cook. 

I often wonder just how well I’m doing in my 

school work. 

I seem to have trouble in working with other 
people. 

T'd like to know more about dating. 

I wonder how to select a college. 

I know too little about local and state government 

to become a good voting citizen. 

Not nearly so simple, however, are the implica- 
tions of the reactions to these items. The manual 
accompanying the inventory differentiates in- 
terpretation for survey purposes and for indi- 
vidual diagnosis. A scoring sheet is provided 
and also standardization data giving (a) the 
average number of BIG problem items by areas ; 
(b) the percentage of pupils, identified by sex 
and grade, marking each item in each area; and 
(c) the rank order of learning areas in which 
the standardization group felt the curricula of 
their schools most inadequate. 

The standardization group numbered 2,005 
pupils in 8 high schools in 5 states of the East 
and Middle West. It is not noted whether urban 
or suburban or rural populations (or all three) 
were included. Nor is it stated whether public or 
private schools (or both) were studied. 

Reliability coefficients of .91 for boys and .92 
for girls are quoted for the inventory as a whole. 
These findings may be misleading, for they are 
based on responses of only 85 boys and 95 girls 
to whom the inventory was readministered six 
weeks after the original administration. These 
were unselected pupils from the three upper 
classes of a four-year high school. 

The manual states frankly that the validity of 
an inventory such as this is “difficult to establish 
concretely,” but adds that “encouraging results” 
have come from early attempts at validation 
through making nondirective studies of 34 high 
school pupils to whom the inventory had been 
administered. However, results are detailed rela- 
tive to three only of the total of 14 problem areas 
considered. It seems to the reviewer that more 
comprehensive and extensive data would be de- 
sirable, 

The Life Adjustment Inventory does not, ap- 
pear to be well named nor well established sta- 
tistically. Neither does it seem well adapted to 
the fourfold goal sought through its use. It is 
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highly questionable that high school students dif- 
ferentiate between their personal needs and 
dearth in curriculum provisions. It is debatable 
that high school teachers would have the oppor- 
tunity, even if there were the inclination, to 
utilize data in 14 areas of pupil response con- 
cerning problems, needs and interests. 
Nevertheless, the inventory does offer an in- 
teresting extension to studies of student view- 
point relative to curricula and guidance in the 
high school, although findings should probably 
be considered suggestive rather than specific. It 
provides, too, an approach which can be utilized 
in individual guidance as an additional instru- 
ment of investigation which is relatively brief 
to administer, interesting to record, and pro- 
ductive of a variety of reactions in several areas 
of importance in the adjustment of high school 
boys and girls. For such use it can serve the 
counselor both in direct pupil contacts and also 
in teacher contacts concerning individual pupils. 
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*The Manson Evaluation. Adults; 1948; a test for 
the identification of alcoholics, potential alcoholics and 
severely maladjusted adults; 8 scores: anxiety, depres- 
sive fluctuations, emotional sensitivity, resentfulness, 
incompleteness, aloneness, interpersonal relations, to- 
tal; 1 form; $2.75 per 25; 356 per specimen set; post- 
paid; nontimed (5-15) minutes; Morse P. Manson; 
Western Psychological Services, Box 775, Beverly 
Hills, Calif. * v 
REFERENCES 
т. Manson, Morse P. A Psychometric Differentiation of Al- 

i К is, University of 
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. “А Psychometric 9Diiterentiation of 
EUR From Non: Alcoholics?" Q J Studies Alcohol 9:1757 
4 1384 
s: ADM MoRsE В, “A Psychometric Analysis of Psycho- 
neurotic and Psychosomatic Characteristics of Alcoholics.” 
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4. Manson, Morse Р. fist Sou chometite ponni of Psycho- 
pathic Characteristics of Alcoholics.” J Consult Psychol 13: 
111-8 Ap '49. * (РА 23:4893) 

СнАВІЕЅ Н. Honzix, Personal Counselor and 
Vocational Adviser, Veterans Administration, 
San Francisco, California. 

The test author states that, although it is not 
difficult for the practitioner or therapist to make 
the clinical diagnosis of alcoholic addiction, a 
brief, objective, paper and pencil test for the 
determination of alcoholism has the advantage 
of saving valuable time of clinically trained peo- ` 
ple through the avoidance of time-consuming 
consultations, histories, and examinations and 
has the further usefulness of obj ectively reinforc- 
ing the clinical diagnosis of alcoholic addiction. 
The Manson Evaluation consists of 72 ques- 
tions of the type used in personality tests, such 
as the Minnesota Multiphasic Personality In- 
ventory, all remotely associated with drinking. 
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The test is well standardized and validated and 
probably differentiates the alcoholic from the 
nonalcoholic as effectively as any test of this 
sort could be expected to do. 

The author states that four methods of vali- 
dation were used. Of these, only the method 
that determined the effectiveness of the test in 
separating alcoholics from nonalcoholics solely 
through test scores is of practical importance to 
the prospective test user. This was done by 
presenting the percentages of correct predictions 
as follows: male alcoholics, 79 per cent; female 
alcoholics, 80 per cent; male nonalcoholics, 79 
per cent; and female nonalcoholics, 85 per cent. 
This means that incorrect predictions run for 
four groups from 15 to 21 per cent, the poorest 
prediction being for male alcoholics and male 
nonalcoholics. In other words, the test differen- 
tiates alcoholics from nonalcoholics 29 to 35 
per cent better than chance. The practical ques- 
tion confronting the prospective test user is 
whether for his purposes it is worth while to 
give the test for such an amount of additional 
assurance. 

The test is easily given and scored. An inter- 
esting and potentially valuable feature of the 
test for clinical purposes is the provision of 
scores which indicate the extent of an alcoholic’s 
psychoneurotic and psychopathic traits ; that is, 
how closely he resembles the neurotic or the 
psychopathic individual: The traits of anxiety, 
depressive fluctuations, and emotional sensitiv- 
ity, for which differentiating scores may be ob- 
tained, are designated psychoneurotic ; feelings 
of resentment, failure to complete social objec- 
tives, feelings of aloneness, and poor interper- 
sonal relationships are called psychopathic. 

This carefully validated test can be a useful 
diagnostic tool in clinical work with alcoholics. 
It must, of course, be used with discretion and 
with awareness of its limitations. Since, as noted 
above, incorrect predictions run from 1 5 to 21 
per cent, some alcoholics will be placed in the 
class of nonalcoholics, and vice versa. Since one 
never knows from the test alone which particu- 
lar individuals have been so misplaced, further 
checking through clinical methods is essential, 


ALBERT L. HUNSICKER, Supervising Psycholo- 
gist, Galesburg State Research H ospital, Gales- 
“burg, Illinois. : 

This questionnaire of 72 yes-no questions is 
designed to identify (a) individuals addicted 
to alcohol, (5). individuals possessing the per- 
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sonality structure characteristically found in 
alcoholics, (c) areas of personality disorder fre- 
quently associated with alcoholism. The ques- 
tions, only remotely associated with drinking, 
were selected from a series of 470 questions ad- 
ministered to 126 alcoholics and 157 nonalco- 
holics. An attempt was made to match groups 
for age, education, and social status; one set of 
scoring keys with separate norms for males and 
females is offered. The shorter approximation 
of the Kuder-Richardson formula yielded a re- 
liability of .94 for both the male and the female 
groups. Correct prediction of known alcoholics 
in treatment is claimed for 79 per cent of the 
male group and 89 per cerit of the female group 
studied. This reviewer administered the ques- 
tionnaire to 11 known alcoholics in therapy for 
alcoholism at an outpatient clinic and to 18 hos- 
pital employees, who omitted all identifying in- 
formation; he, like the author, found five over- 
lapping scores. 

The author reports that a subjective analysis 
of the 72 questions resulted in establishing seven 
neurotic or psychopathic traits. These are (a) 
anxiety, (b) depressive fluctuations, (c) emo- 
tional sensitivity, (d) feelings of resentment, 
(e) failure to complete social objectives, (f) feel- 
ings of aloneness, and (g) poor interpersonal re- 
lationships. 

It has been this reviewer's experience that 
the results from questionnaires of this type will 
vary with the circumstances under which they 
are taken. When administered to a known clini- 
cal group in treatment for failure to adjust be- 
cause of the clinical symptoms, one can almost 
always expect more valid results. These subjects, 
in the treatment situation, are willing to be self 
critical and to record this self criticism. How- 
ever, these same subjects when taking the same 
test in a situation where their symptoms and 
failures are unknown (e.g. when applying for a 


_ job) can be expected to perform more like the 


control (normal) group than like the experi- 
mental (alcoholic) group. The author's sugges- 
tion that this test may prove useful in identifying 
alcoholics or potential alcoholics prior to em- 
ployment or training is one that will have to 
be carefully evaluated, for it is not yet demon- 
strated. Furthermore, the subjective breakdown 
of the 72 questions into seven categories will 
probably not stand up under a careful cluster 
or factor analysis or under the test of differen- 
tiating psychiatric groups. However, unlike the 
Alcadd Test by the same author, which consists 
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of бо frank drinking questions, this test may 
prove an excellent research tool for screening 
alcoholics or possible alcoholics. More work 
needs to be done with known alcoholics in test- 
ing situations where they believe themselves to 
be considered members of the "normal" popula- 


tion. 
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xMemory-for-Designs Test. Ages 8.5 and over; 
1946-48; a test for the presence of brain damage; indi- 
vidual; 1 form, '46; no data on reliability and validity 
in manual (for data presented elsewhere by the au- 
thors, see references below); manual ('46); supple- 
mentary manual (’48) ; $2 per set of testing materials, 
postpaid; (3314-15) minutes; Frances Graham and 
Barbara Kendall; Department of Neuropsychiatry, 
Westen University School of Medicine, St. Louis, 

о. 
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The Minnesota Inventory of Social Attitudes. . 


College; 1937; 2 tests; $1.50 per 25 of any one test; 
356 per specimen set; postpaid; (15-20) minutes per 
test; E. С. Williamson and J. С. Darley; distributed by 
Psychological Corporation. * 

а) MINNESOTA INVENTORY OF SOCIAL BEHAVIOR. Form 


b) MINNESOTA INVENTORY OF SOCIAL PREFERENCES. 
Form P. 


REFERENCES 

1. Dartey, Jonn С. An Analysis of Attitude and Adjustment 

Tests: With peciat Reference to Conditions of Change in At- 

titudes and Adjustments. Doctor’s thesis, University of Minne- 
sota (Minneapolis, Minn.), 1937. 

2. DanLEv, Joun С. “Scholastic 

Maladjustment.” J Appl Psychol 21:485-93 


2099) 
Joun С. “Tested Maladjustment Related to 


з. DARLEY, 
Clinically Diagnosed Maladjustment.” J Appl Psychol 21:632- 
G. “The Measure- 


42 А37. * (PA 12:3031) 

4. WILLIAMSON, E. G., AND DARLEY, "E a 
ment of Social Attitudes of College Students: I, Standardiza- 
tion of Tests and Results of a Survey.” J Social Psychol 8:219- 
29 My 37. * (PA 11:5277) 

s. WittraMson, E. G., AND Dartey, J. С. “The Measure- 
ment of Social Attitudes of College Students: II, Validation of 
Two Attitude Tests." J Social Psychol 8:231-42 My '37. я 


'A 11:5278) р 
6. DAnLEY, Joun С. “Changes in Measured Attitudes and 
9:189-99 My '38. * (PA 12: 


Adjustments,” J Social Psyc 
3 k 

7. McNamara, WALTER J., AND DARLEY, Joun С. “A Factor 
Analysis of Test-Retest Performance on Attitude and Adjust- 
ment Tests.” J Ed Psychol 29:652-64 D '38. * 

8. DARLEY, JOHN à. AND McNamara, Warrer J. “Factor 
Analysis in the Establishment of New Personality Tests.” J Ed 
Psychol 31:321-34 My '40. * (PA 15:922) af 

9. GILKINSON, HOWARD, AND KNower, FnANKLIN Н. А 
Study of Standardized Personality, Tests and Skill in Speech. 

Ed Psychol 32:1612175 Mr "41. * (PA 15 73888 é К 

I0. JOHNSON, P. The Prediction of Scholastic Achieve- 
ment for Freshman Engineering Students at Purdue University. 
Purdue University, Division of Educational Research, Studies 
in Engineering Education II. Lafayette, Ind.: the Division, 
May 1942. Pp. 22. Paper. * (PA 16:5020) 


Achievement and Measured 
O '37. * (PA 12: 


TESTS & REVIEWS: CHARACTER—NONPROJECTIVE 


70 


Stet aN A. P. The Relationship of Test Scores to 
ыу ЕЛ Fey und 
University (Lafayette, Ind.), АР, чокан екы цесце 

12. BENDER, I. E., AND Hastorr, A. Н. “The Perception of 
Persons: Forecasting Another Person’s Responses on Three 
Personality Scales.” J Abn & Social Psychol 45:556-61 Л ’50. 
* (PA 25:988) " 
Verner M. Sims, Professor of Psychology, 
University of Alabama, University, Alabama. 

Judged in terms of the conventional stand- 
ards, the Minnesota Inventories would appear 
to rank high. Considerable effort at validating 
the inventories was made, their reliability is 
high, they are simple to administer and score, 
the manual is adequate, and norms based upon 
a relatively large college population (although 
all from the University of Minnesota) are re- 
ported. : 

There are, however, a few questions concern- 
ing the inventories which the reviewer would 
like to raise. Although the directions given the 
testee state there are no right or wrong answers 
to the statements, actually for each possible an- 
swer, score values (ranging from one to five) 
are printed on the inventory along with the 
answer. This is done for convenience in scoring, 
but it seems to be an open invitation to the 
bright, “high-score” conscious subject to make 
a “good” score. Thus it would demand un- 
necessarily high rapport with the testee if the 
answers are to be valid. Some other scoring ar- 
rangement should increase the usefulness of the 
inventories. 

Again, the effort to fit the response to all 
questions into a standard 5-choice pattern (al- 
most always, frequently, occasionally, rarely, al- 
most never) often results in clumsy sentence 
structure which seems unnecessary and may well 
reduce the validity of the inventories particularly 
if they are used with subjects who are not too 
verbally facile. In the case of nine items, for ex- 
ample, the choice "almost never" results in a 
double negative, such as, “Does not like to take 
the initiative in making friends” and “almost 

"never." Other items are almost as difficult to 
interpret: “Has about the average fear of mak- 
ing blunders before people" and the choice 
"rarely" ; “Neither seeks nor avoids social con- 

“occasionally.” More nat- 


tacts” and the choice y 
ural question forms would seemingly lead to 


better understanding on the part of many sub- ` 
jects. # 

The authors do not seem to have studied or . 
even considered the possible interdependence of 
the items within either inventory. Inspection 
shows cases where there would appear to be 
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high interdependence. In Form B, for instance, 
Item 4 reads, "Is reluctant to meet important 
people," Item 28 reads, “Наз no hesitancy about 
meeting important people" ; in Form P, Item 16 
reads, "Prefers being with other people rather 
than being alone," Item 18, “Prefers to be alone 
as much as possible." Such cases (and the reader 
who will examine the inventories can find several 
others) appear to be no more than repetitions 
of the same questions. To the extent that this 
is true, the effectiveness of the measürement is 
reduced. 

A. final, and perhaps more serious, question 
has to do with the two "personality traits" which 
the authors claim the inventories measure: 
“preferences. in the extent and type of desired 
social relationships," and “ап individual's esti- 
mates of his own behavior and feelings in social 
situations." In the reviewer's opinion, the au- 
thors have not even established the existence of 
two such unitary traits, let alone proven that 
their inventories measure them. They report un- 
corrected coefficients between the two forms of 
-47 for men and .44 for women, and inspection 
of the items in the two forms show many cases 
of great similarity. For example, Item 30 in Form 
B reads, “Наз a fairly good time at most parties,” 
while Item 3, Form P, reads, “Наз the time of 
his life at social affairs" and Item 40 in the same 
form, “Is annoyed by social activities." Tt is 
difficult to accept as a fact that the first of these 
items is indicative of one personality trait while 
the other two indicate another trait. The reader 
who cares to investigate will find other striking 
illustrations. In fact, five items in Form B (10, 
16, 25, 30, and 39) ask directly for preferences 
in social relationships, and at least three items 
in Form P (3, 8, and 32) inquire directly con- 
cerning. overt social behavior. It is possible, of 
course, that more careful investigation would 
establish the fact that "social preferences" and 
“self-estimate of social behavior" are different 
traits, but there is considerable doubt in the 
reviewer's mind that there is any essential dif- 
ference in what these two inventories measure. 

In spite of | ie questions we have raised, scores 
on the inventories should be helpful in counsel- 
ing students where some information on social 

‘adjustment is needed. These criticisms, if valid, 
indicate only need for further work with the 
inventories, not that they are without worth as 
they stand. 


For reviews by J. P. Guilford and George W. 
Hartmann, see 38:900. 
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*Minnesota Multiphasic Personality Inventory, 
Revised Edition. Ages 16 and over; 1042—51; 13 
scores: hypochondriasis, depression, hysteria, psycho- 
pathic deviate, masculinity and femininity, paranoia, 
psychasthenia, schizophrenia, hypomania, question, lie, 
validity, K; individual and group forms; manual 
C51) ; postpaid; Starke R, Hathaway and J. Charnley 
McKinley; Psychological Corporation. * 

а) INDIVIDUAL FORM ("THE CARD SET"). $23 per set of 
testing materials including 50 recording sheets; $2.90 
per 50 recording sheets; nontimed (30-90) minutes. 
b) GROUP FORM (“THE BOOKLET FORM"). IBM; for En- 


ineers Northwest scoring service and Hankes Answer 

heets, see 466; separate answer sheets must be used ; 
$5.50 per 25; $1.50 per manual; $3.50 per 50 IBM an- 
swer sheets and case summary cards; $4.50 per set of 
machine scoring stencils with manual; $4.50 per set of 
hand scoring stencils with manual; nontimed (40-90) 
minutes. " 
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The major development in recent years with 
respect to this instrument has been the intro- 
duction of the K score (130). This score is not 
intended to be a "clinical" index although it 
appears that it may have interesting clinical 
correlates, Rather it is a validating device de- 
signed to assess "test-taking attitude" along 
a “defensiveness-frankness” continuum, the 
extremes of the continuum being frank simu- 
lation at the one end and deliberate dissimu- 
lation at the other.” The clinical applica- 
tion of the K score consists in its use as a cor- 
rection to some of the Inventory scale scores 
(Hs, Pd, Pt, Sc, Ma) with the aim of augment- 
ing the discriminative value of these scales and 
of the Inventory in general. Attempts to verify 
this purported usefulness of the K score have 
yielded generally negative results (124, r25, 
126). However, there is evidence suggesting 
that an index involving both the F and K scores 
possesses some merit in detecting simulation of 
mental abnormality ( 124, 212). This lead should 
be followed up in view of both the practical need 
for instruments designed to disclose malinger- 
ing in clinical and military settings and the the- 
oretical importance of an understanding of ma- 
lingering as a personality reaction. 

The Inventory continues to enjoy extensive 
and interesting investigation of its psychological 
implications by the Minnesota group, with the 
new Hathaway-Meehl Atlas (263) being the 
most significant single product. There is an ob- 
vious tendency in this recent investigative work 
to get away from the simple empirical validation 
against clinical diagnoses based upon official psy- 
chiatric nosology which was demanded by the 
names of the scales and the initial claims made 
for their efficacy in classifying patients. In the 
new Atlas, for example, the scales are referred 
to by number rather than by name (a practice 
which its users are advised to follow) and the 
comment that “it would probably have been bet- 
ter to have used numbers...from the first" is 
made. 

As reviews of the Inventory in The Third 
Mental Measurements Y earbook indicated, these 
scale scores (or combinations of them in “рго- 
file" form) demonstrated little utility in dif- 
ferential diagnosis. A more recent validation 
study (r26), employing more "sophisticated" 
criteria, has yielded similarly negative results. 
In this respect, the statement in the new manual 
that “a high score on a scale has been found to 
predict positively the corresponding final clini- 
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cal diagnosis ог estimate in more than 60 рег 
cent of new psychiatric admissions," while no 
doubt factually true in the sense that a majority 
of patients in the various nosologic categories 
will show high scores in the corresponding scales 
(and in many а noncorresponding scale), is, in 
the reviewer's opinion, definitely misleading in 
its implication that a true differential prediction 
can be attained in this manner. 

Much exploratory work on the application of 
the Inventory to a variety of psychosomatic, 
neurological and psychosocial problems has been 
done in recent years. Whether the results of 
this work will survive cross-validation is still 
largely an open question but the initial findings 
have been in some instances most provocative. 


For reviews by Arthur L. Benton, H. J. 
Eysenck, L. S. Penrose, and Julian B. Rotter 
and an excerpt from a review, see 3:60; for re- 
lated reviews, see 72. 


[72] 
*[Re Minnesota Multiphasic Personality Inven- 
tory, Revised Edition.] HATHAWAY, STARKE R., 
AND Megat, PauL E. An Atlas for the Clinical Use 
of the MMPI. Minneapolis, Minn.: University of 
Minnesota Press, 1951. Pp. xliv, 799. $9.75. * (London: 
Oxford University Press. 78s.) (PA 25: 7468) 

J Clin Psychol 7:387 О ’51..* Students and 
clinical psychologists will find this а valuable 
source book for use with the MMPI. 

J Consult Psychol 15:355 Ag '51. Laurance 
F. Shaffer. The Atlas is a most unusual volume, 
consisting of 968 short histories of a wide variety 
of psychiatric cases, arranged by their coded 
profiles on the Minnesota Multiphasic Personal- 
ity Inventory. Each case summary is mainly 
factual, describing the patient as he entered the 
hospital, his pertinent history, and how he ap- 
peared to the therapist or hospital staff. There 
isa minimum of dynamic interpretation. The 
cases were taken without reference to MMPI 
profiles, except for a few selected in the last 
stages of the preparation of the volume to pro- 
vide examples of rare patterns. Although the 
authors, quite wisely, make no claim that the 
case studies validate the MMPI, reading the 
cases does suggest that the profiles group them 
according to their main behavioral trends. The 
aim of the Atlas is to provide supplementary 
clinical training for users of the Inventory. It 
assumes that the reader already has some knowl- 
edge of the MMPI and of the research under- 
lying its construction. The Atlas makes readily 
available a wider variety of case pictures than 


most workers would meet in their clinical ex- 
perience, The volume has shortcomings that the 
authors recognize, including the brevity of the 
case citations and their limited source, mostly 
from one hospital. As a new experiment in 


clinical training, however, the Atlas will be of 
wide interest. a 
[73] 
*Mooney Problem Check List: 1950 Revision. 
Grades 7-9, 9-12, 13-16, adults; 1941-50: earlier edi- 
tion published by Ohio State University Press; 4 lev- 
els; no norms—authors recommend the use of local 
norms; manual ('50); $1.65 per 25 of any one level; 
356 per specimen set of any one level; postpaid; Ross 
L. Mooney and Leonard V. Gordon (College and Adult 
Forms) ; Psychological Corporation. * 
а) JUNIOR HIGH SCHOOL FORM. Grades 7-9; 1942-50; 
7 scores : health and physical development, school, home 
апа family, money-work-the future, boy and girl rela- 
tions, relations to people in general, self-centered con- 
cerns; Form J (750); (35-50) minutes. 
b) HIGH SCHOOL FORM. Grades 9-12; 1941-50; II 
scores: health and physical development, finances- 
living conditions-employment, social and recreational 
activities, social-psychological relations, personal- 
psychological relations, courtship-sex-marriage, home 
and family, morals and religion, adjustment to school 
work, the future-vocational and educational, curricu- 
lum and teaching procedures; Form H ('50); non- 
timed (35-50) minutes. 
C) COLLEGE FORM. Grades 13-16; 1941-50; II scores: 
same as for High School Form; Form C (^50) ; non- 
imed (35-50) minutes. 
d) Bo Sek Adults; 1950; 9 scores: health, eco- 
nomic security, self-improvement, personality, home 
and family, courtship, sex, religion, occupation ; Form 
А ('s0) ; nontimed (20-30) minutes. 
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зо, Ѕетревѕ, Сивевт R. W. А Study of the Academic, Social 
and Personal Needs of Students of the Pennsylvania State Col- 
lege for the Years 1948-1950 Through an Analysis оў Instru- 
ments Administered in Education 105. Doctor's thesis, Pennsyl- 
vania State College (State College, Pa.), 1950. (Abstracts of 
Decem Moe d 1951, pp. 289-97.) (PA 26:2405, 
title only: 


Навор E. Jones, Professor of Psychology, and 
Director, Institute of Child Welfare, University 
of California, Berkeley, California. 

In its present form, the Mooney Problem 
Check List is based on the analysis of brief 
statements written by 4,000 high school stu- 
dents describing their personal problems, and 
on a number of other sources including case 
records and counseling interviews. Repeated re- 
visions have utilized data from various parts of 
the country. The authors state that in the choice 
of items they have attempted to select problems 
which are either commonly checked, or which 
are "serious enough to be important in an indi- 
vidual case." This latter criterion would seem to 
have very little restrictive value. Illustrative 
items (High School Form) follow : being over- 
weight, being talked about, being treated like a 
child at home, not getting along with a teacher, 
wanting to be more popular, and giving in to 
temptations. 

Inthe Junior High School Form, 30 items are 
presented in each of 7 “problem areas” : Health 
and Physical Development; School; Home and 
Family; Money, Work and the Future; Boy 
and Girl Relations ; Relations to People in Gen- 
eral; and Self-Centered Concerns. The High 
School and College Forms are expanded to 11 
areas and 330 items. A form for adults is also 
provided, with a separate manual. 

The Check List is designed primarily as an 
aid.to counseling: to acquaint the counselee with 
topics that may be discussed, to help him in re- 
viewing his own problems, and also to give 
counselors some preliminary notion as to matters 
of concern. In using the list with college stu- 
dents Marsh (3) reports: “This check list was 
felt to be valuable at Stephens because it helped 
to locate areas of student problems, gave a quick 
overview of students' felt difficulties, and offered 
a good basis for an opening conference." Similar 
advantages are mentioned by Congdon (7) who 
points out also that the absence of scores, clas- 
sificatory labels, and threatening psychological 
terms is an aid in establishing a good relation- 
ship with students. 

The Check List, in various forms, has had 
such extensive use that it should be possible to 
give more information than is now available as 


to its value in counseling programs. Under what 
conditions, and for what level of counseling ex- 
pertness, is it most serviceable? Does this abrupt 
and peripheral approach to problems sometimes 
set up a barrier to effective counseling ? 

Since the list is not designed to produce 
*scores" and no normative or correlational data 
are supplied, it cannot be assessed with regard 
to the usual concepts of reliability and validity. 
Chief attention is directed to the individual items 
as the significant data. Persons who mark an 
exceptionally large number of items may be 
likely candidates for counseling, but the selection 
of such cases rests upon local conditions and not 
on adjustment norms, which the authors regard 
as inapplicable. Similarly, problem area data 
(the term "category score" is avoided) are de- 
scribed as useful in the referral of cases to spe- 
cial programs, but not as having diagnostic sig- 
nificance. 

The authors are to be commended for the 
limited claims made for this instrument. They 
caution that it should not be expected to dis- 
close problems that the individual is unwilling 
to admit, either to himself or others. Some in- 
consistencies, however, occur in statements about 
the Check List, as in the instructions on the Adult 
Form: “Mark the list honestly and sincerely 
and you will obtain a representative inventory 
of your problems." It should be added that such 
an inventory can only be representative of selt- 
perceived and self-reported foci of difficulty, and 
that the manifest report is often likely to provide 
a distorted as well as an incomplete representa- 
tion of underlying sources of conflict. 

In an application of the College Form, Cong- 
don (7) found that the most frequent reported 
problems were in the area of adjustments to 
college. Low frequencies were obtained in areas 
involving courtship, sex, marriage, home and 
family. It may be questioned if these data closely 
reflect the actual distribution of problems which 
would be elicited by other approaches. 

The use of the Check List for surveys (un- 
signed) is advocated as a means of discovering 
topics for discussion, to throw light on educa- 
tional needs, and to study problem clusters, 
trends, and differences among groups. If the list 
is to be used for research purposes, it would be 
desirable to supply item frequencies for various 
samples ; the interpretation of group differences 
is by no means simple, but the availability of data 
would at least assist in forming hypotheses. The 
authors may be fully justified in withholding 
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normative data from counselors, in view of the 
possibly misleading nature of "adjustment" 
scores and of scores in pseudo-diagnostic cate- 
gories. For other purposes, however, it is ap- 
propriate to look for some degree of statistical 
organization of results from schedules which 
have been so extensively used. 

If the Mooney Problem Check List is of genu- 
ine value in counseling, it is legitimate to hope 
that the data which it furnishes can in some way 
be utilized for serious research purposes. If this 
is not possible without destroying the service 
value of the instrument, it may be questioned 
whether the instrument will stand the test of 
continued practical use. 


Morris Krucman, Assistant Superintendent 
in Charge of Guidance, Board of Education, 
New York, New York. : 

The Mooney Problem Check List is not a 
test, and therein lies its strength. Psychologists, 
guidance workers, industrial personnel workers 
and educators have been searching for years for 
a simple way to evaluate personality, but simple 
ways in a complex field have a disagreeable habit 
of yielding little. Most бї the pencil and paper 
tests of personality have been extremely disap- 
pointing. The Mooney Problem Check List has 
not been in that category because it promised 
little and produced much. 

It is refreshing to find authors of a psycho- 
logical instrument whose claims are modest to 
the point of understatement. They say, 

At all times the counselor must keep in mind that the 
Problem Check List is not a test. Tt does not yield 
scores on traits or permit any direct statements about 
the adjustment status of the person who made the re- 
sponses. Rather, the Problem Check List is а form of 
simple communication between the counselee and coun- 
selor designed to accelerate the process of understand- 
ing the student and his real problems. 

The check list is not considered a substitute 
{ог the interview or for the clinical approach, 
but an aid to them. It functions well under some 
conditions, and not under others. Its strengths 
and limitations are well déscribed in the manuals, 
and a brief section on counseling with the check 
list in the manual is clinically sound. 

Although validity, reliability, and norms are 
discussed in the manual, the authors are careful 
to point out that the usual criteria for these do 
not obtain for this type of instrument. This 
makes good sense, since the authors do not em- 
phasize objectivity in this instrument. Instead, 
the uses to which it can be safely put are de- 


scribed: facilitate counseling interviews; con- 
duct group surveys leading to plans for indi- 
vidualized action; serve as a basis for group 
guidance; increase teacher understanding ; and 
conduct: research on problems of youth and 
adults. 

The present forms are revisions of the check 
lists that have been in use for ten years. The 
sound suggestions for the use of this instrument 
are based on these years of experience. 

The Mooney Problem Check List is a valuable 
aid in guidance for adjustment if used in con- 
junction with the interview. As an independent 
instrument it has value mainly for research 
purposes. 


For reviews by Ralph C. Bedell and Theodore 
F. Lentz, see 3:67. 


[74] 
*Northampton Activity Rating Scale. Mental pa- 
tients; 1951; Form D; no normative data; manual 
[’51]; $5.50 per 25; 30€ per specimen set; postpaid ; 
Isidor W. Scherer; Meed Scientific Apparatus Co., 
P.O. Box 658, Springfield, Mass. * 


[75] YR, 
Personal Audit. Grades 9-16 and adults; 1941-45; 
2 editions, 1 form, '45; manual (745); 50€ per specimen 
set of any one edition; cash orders postpaid; Clifford 
R. Adams and William M. Lepley; Science Research 
Associates, Inc. * 

а) FORM LL (LONG FORM). 1041; emotional adjustment 
in 9 areas: seriousness, firmness, frankness, tranquil- 
lity, stability, tolerance, steadiness, persistence, con- 
tentment; $3.65 per 25; nontimed (45) minutes. 

b) ковм ss (SHORT FORM). 1941; emotional adjust- 
ment in 6 areas: seriousness, firmness, frankness, tran- 
quillity, stability, tolerance; $2.65 per 25; nontime 
(35) minutes. 
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WirLIAM Seeman, University of Minnesota, 
Minneapolis, Minnesota. R3 

This structured personality inventory has the 
advantage that it is self-administering in char- 
acter, is easily and quickly scored, and (in the 
use ofa profile pattern rather than a single score) 
takes some cognizance of the “Gestalt” proper- 
ties of personality. The odd-even reliability is 
stated as .00 or above and the test-retest relia- 


bility .90 to .97. There are nine parts to the long 
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form: seriousness-impulsiveness, firmness-inde- 
cision, tranquility-irritability, frankness-evasion, 
stability-instability, tolerance-intolerance, stead- 
iness-emotionality, persistence-fluctuation, con- 
tentment-worry. The short form consists of the 
first six parts. 

It seems probable, on the basis of the item 
analysis, that high scores and low scores on the 
inventory differentiate individuals on the basis 
of something, but one may have serious misgiv- 
ings about what useful information is revealed 
by such differences and by profile differences. It 
is by no means clear that the 18 labels for the 
nine parts actually constitute dimensions along 
which meaningful measurements may be made. 
Definition of the criterion in each part is either 
not attempted or is left in the vague terms of 
popular language. This reviewer is frankly puz- 
zled, also, about the meaning of the statement 
that the consensus of judgements of 30 psycholo- 
gists "coincides with the descriptions which have 
been prepared for the various traits." No infor- 
mation is given to indicate how this consensus 
was determined nor how it was decided that the 
consensus "fit" the descriptions. Incidentally, 
since such a method of consensus is a form of 
"face" validity, one wonders how "deception 
has been nearly eliminated." (No evidence is 
submitted to support this statement.) The point 
is that such a method virtually eliminates “sub- 
tle" items which alone defy deception. 

Another feature which puzzled this reviewer 
lies in the correlations presented in Tables 1 and 
4 of the manual, in the section on validation. The 
correlations in Table 1 are regarded as “evidence 
10r validation by negation, since the nine parts 
do not overlap." In other words, this set of cor- 
relations is interpreted as showing that the nine 
parts are different. Table 4 presents correlations 

-with the Bernreuter which are supposed to show 

how parts of the inventory are like another per- 
sonality scale. Yet, by the chi-square test, these 
two sets of correlations may be said to be ho- 
mogeneous. It does not seem to be misrepre- 
senting the state of affairs to say that two rela- 
tively homogeneous sets of correlations are 
interpreted in opposite directions. 

In summary, one may say that this is an easily 
administered, easily scored, inexpensive struc- 
tured inventory which yields a profile rather than 
a single score. However, there is some doubt 
about the nature of the dimensions and hence 
about the value of the information which the test 
yields. It would be desirable to have more ade- 


quate empirical or experimental definitions of 
the criteria for the various scales. 


For a review by Percival M. Symonds, see 
3:64. 


[76] 

*The Personal Preference Inventory: Student 
Form. College; 1947-49; 3 scores: economic back- 
ground, social attitude, masculinity-femininity; 1 form, 
'47; manual ['49]; tentative norms—author recom- 
mends use of local norms; $2 per 25, postage extra; 
15€ per specimen set, postpaid; nontimed (10-20) min- 
utes; Hugh M. Bell; Pacific Books, P.O. Box 558, 
Palo Alto, Calif. * 


E. LoweLL Кк, Professor of Psychology, 
and Director, Bureau of Psychological Services, 
University of Michigan, Ann Arbor, Michigan. 

This inventory is composed of 9o yes-no-? 
items printed on the inner pages of a four-page 
folder. Еасһ:о the three scales is composed of 
30 items, scored with unit weights with the aid 
of a semitransparent stencil. As with the Bell 
Adjustment Inventory, each of the items is 
preceded by a letter designating the scale to 
which the item belongs. The items are arranged 
in a cyclical order, that is, every third item 
belongs to the same scale. 

This inventory is said to provide three sepa- 
rate measures of students' attitude and adjust- 
ment: (a) Economic Background: “The indi- 
vidual who scores high tends to feel that others 
have been more fortunate than he in their eco- 
nomic backgrounds." (b) Social Attitude: 
“High scores indicate that a student possesses 
a strong critical attitude toward people....is cau- 
tious and self-defensive.” (c) Masculinity-Femi- 
ninity: “For men, high scores indicate mascu- 
linity of interests * for women, low- scores 
indicate marked femininity of interests.” 

Corrected split half reliabilities of the three 
scales are reported as .80, .85, and .84, based on 
99 freshmen and sophomores, presumably of 
both sexes. Selection of the items for each of the 
scales was based on “the degree to which they 
differentiated between extreme groups”; the 
specific procedure used is not reported. The 
three scales show relatively low intercorrelations 
(—.23 to +.36) ; similarly low correlations are 
reported with the four scales of the author’s Ad- 
justment Inventory. 

The only evidence of the validity of the scales 
is a statement that in using the scale in counsel- 
ing 650 students, “marked agreement” was 
found between the scores and the judgment of 
both students and counselors. Separate norms 
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are provided for college men (186) and college 
women (144). 

This instrument is obviously an attempt on 
the part of the author to construct a supplement 
to his Adjustment Inventory for use in the coun- 
seling of college students. For some reason, he 
has chosen to publish the new scales as a sepa- 
rate inventory rather than to add the items to 
the other inventory. Furthermore, he has chosen 
to publish the instrument with what appear to 
be decidedly inadequate norms, based as they 
are on relatively few students at one college in 
California. The criticism seems especially rele- 
vant with respect to the variable, economic back- 
ground, although there would seem to be little 
excuse for not obtaining more representative 
normative data for all scales. The manual admits 
that the inventory has not been used with high 
school students, and that "nothing is known 
concerning its effectiveness" with them. 

This inventory was designed for use in col- 
lege counseling, and its validity rests solely on 
the counselors! report of its usefulness. Has any- 


one demonstrated that the counselors do a better . 


job with the instrument than without it? The 
counselors’ feeling about this matter may be 
significant, and it may even be correct, but this 
reviewer would be happier if there were some 
relevant evidence. In the same vein, if it is true 
that the scores are in "marked agreement" with 
the student's evaluation of himself on these vari- 
ables, why should we not simply ask the student 
to rate himself directly on the variables con- 
cerned ? 

This reviewer is also concerned about the ex- 
tremely provincial nature of this instrument. The 
brief manual has no bibliography and reflects no 
awareness on the part of the author of related 
developments in the measurement of adjustment 
variables. The only other test mentioned and the 
only intercorrelations reported are with the au- 
thor's other inventory. Is masculinity-femininity 
as measured by this scale the same as measured 
by Strong's Vocational Interest Blank, the 
Guilford-Martin Inventory of Factors GAMIN, 
or the Minnesota Multiphasic Personality In- 
ventory? One is left with the impression that the 
author not only does not know the answers to 
stich questions, but worse still, does not care. 


C. M, LourrIT, Assistant to the Provost, Uni- 
versity of Illinois, Urbana, Illinois. 

This inventory is intended to provide informa- 
tion for the clinical counselors on the college 


student's attitudes toward his economic back- 
ground, his attitude toward people, and his posi- 
tion on the masculinity-femininity continuum. It 
follows the usual format of inventories in pre- 
senting 9o yes-no-? questions. Inasmuch as the 
inventory may be taken individually or in a 
group and seldom requires more than 15 min- 
utes, it is a convenient way of exploring these 
particular areas of the students' attitudes. While 
tentative norms based on 186 men and 144 
women are presented, the author advises that 
norms be developed in each local situation. Co- 
efficients of reliability based on 99 cases are .80 
for economic background, .85 for social attitude, 
and .84 for masculinity-femininity. Intercorrela- 
tions among the three sections of the Scale are 
low as are the correlations between the three 
sections and the four parts of Bell’s Adjustment 
Inventory. It is questionable whether numerical 
scores on this instrument can be taken as of too 
great meaning. The reliabilities would seem to 
be rather low for use in individual prediction; 
however, the author is quite explicit in consider- 
ing the instrument as an aid in exploring certain 
aspects of the college student client. Used as a 
means of securing information with economy of 
time, it promises to have a clinical usefulness. 


[77] 

The Personality Inventory. Grades 9-16 and adults ; 
1931-38; 6 scores: neurotic tendency, self-sufficiency, 
introversion-extroversion, dominance-submission, con- 
fidence, sociability; IBM; 1 form, '35; manual (35) 5 
tentative norms (’38) ; $1.75 per 25; 26 per norm sheet ; 
406 per specimen set; separate answer sheets may be 
used; $1 per 50 IBM answer sheets; $1 per тоо Hankes 
answer sheets (see 466); $2.50 per set of stencils for 
machine scoring of IBM answer sheets; 256 per 25 
individual report blanks ; postage extra; nontimed (25) 
minutes; Robert G. Bernreuter (scales measuring so- 
ciability and confidence were prepared by John C. 
Flanagan); Stanford University Press. * 
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used by educational or vocational counseling 
agencies. Similarly it could be used in an out- 
patient clinic, but not in a mental hospital situa- 
tion where patients are attempting to appear 
well in order to be released. The Bernreuter 
worked fairly well as a screening technique in 
certain of the armed forces programs; this was, 
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however, largely because men in the armed serv- 
ices are more willing to admit their maladjust- 
ments in order to avoid unpleasant duties. 

In the second place, the test identifies general 
personality inadequacies better than it evaluates 
an individual's suitability for particular jobs or 
life situations. Evidence accumulates that, al- 
though it is possible to score the blank for six 
traits, the nature of the inter-correlations is such 
that actually only two characteristics are being 
measured, those which Flanagan (28) named 
self-confidence and solitariness. Since the first 
of these correlates highly with the Bernreuter 
scale for neuroticism, it would perhaps be more 
meaningful to think of it as having to do with 
general emotional stability. The second deals 
with the sociability or the self-sufficiency of the 
individual. While there has been some work 
showing significant group differences on Bern- 
reuter scales between salesmen and nonsales- 
men, campus leaders and nonleaders, successful 
and unsuccessful practice teachers, etc., none of 
these differences are large enough to support 
decisions about individual cases. Correlations 
with various indicators of educational and vo- 
cational success have been too low to furnish a 
basis for any individual prediction. It is to be 
noted also that the “unfavorable” scores furnish 
more evidence for maladjustment than the ap- 
parently “good” scores furnish for superior ad- 
justment. 

In the third place, the test as now scored is 
designed to distinguish between the degrees of 
adjustment within the normal range. There is 
empirical evidence that psychiatric patients an- 
swer many of the items in ways that are in an 
opposite direction to that indicated by the scor- 
ing weights. (This phenomenon has also been 
noted in connection with the development of the 
Minnesota Multiphasic Personality Inventory, 
where empirical differences formed the basis for 
item weights.) It is true that in spite of this, 
groups of patients do obtain average scores devi- 
ating significantly from the norms, but there are 
numerous individual exceptions. Thus, educa- 
tional and guidance situations seem to be the 
places in which a test of this type is of most 
value. When used in such settings, only the 
anxious withdrawing types of maladjustment 
are assessed by this test. There is considerable 
evidence that behavior problems and psycho- 
pathic tendencies are not identified by the Bern- 
reuter. The large amount of overlapping which 
validity studies have consistently shown be- 
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tween adjusted and maladjusted groups should, 
of course, always be kept in mind when studying 
the individual case. 

With these limitations and qualifications, The 
Personality Inventory can still be a useful in- 
strument. It would be still more useful if it could 
be restandardized with only two keys and a 
simplified scoring system. There is ample sta- 
tistical justification for these changes. 


For reviews by Charles I. Mosier and Theo- 
dore Newcomb, see 40:1239; for related re- 
views, see 38:B358 and see 36:Br08. 


[78] 

*Personality Record. A rating scale for pupils in 
grades 7-14; 1941; 7 ratings: seriousness of purpose, 
industry, initiative, influence, concern for others, re- 
sponsibility, emotional stability; 1 form; no data on 
reliability and validity; no manual; 1-99, 56 each; 100- 
499, $2 per 100, 500-999, $1.50 per 100; also available 
in combination with the Secondary-School Record; 
cash orders postpaid; National Association of Second- 
ary-School Principals. * 


REFERENCES 
1. Erıcker, PauL E. "Record Forms for Secondary Schools,” 


B Nat Assn Sec Sch Prin 31:39-48 N '47. * 
VERNER M. 51м, Professor of Psychology, Uni- 
versity of Alabama, University, Alabama. 

The Personality Record is one of a series 
of forms "designed by Committees of School 
Administrators for Secondary Schools" and 
"adopted by the National Association of Sec- 
ondary-School Principals" for use in maintain- 
ing permanent records of high school pupils. It 
is a standard form on which teachers record for 
each pupil confidential information concerning 
the pupil's personal characteristics, with space 
for recording teacher judgements of the pupil's 
school activities, interests and abilities, and phys- 
ical, social, and mental limitations. 

Before considering the Record, let us raise a 
question concerning whether the Association 
should sponsor such an undertaking. Consciously 
or unconsciously, the committee responsible for 
the Personality Record has tried to define for 
thousands of teachers the ideal personality for 
high school pupils. In other words, through 
identifying seven aspects of personality (no 
more, no less) and setting up for each of these 
a hierarchy of five levels of behavior, the com- 
mittee has decided for American education that 
the ideal American youth is “purposeful,” is in- 
dustrious to the point where he “seeks additional 
work," is "actively creative"; in influence, is 
"strongly controlling" but "deeply and gen- 
erally concerned for others"; “assumes much 
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responsibility"; and, is "exceptionally stable" 
emotionally. These all seem good, middle class 
American virtues; but the reviewer has con- 
siderable doubt that a committee of high school 
principals, however able they may be, should be 
vested with power for making such decisions. 

The Record itself is clearly a rating scale of 
the absolute type, where, for each of the traits 
included, the pupil is to be matched with the 
most appropriate of several described levels of 
performance. The authors, however, deny this 
fact and claim that these "are not ratings." The 
only satisfactory explanation which the reviewer 
can find for the failure of the authors to recog- 
nize the Record as a rating scale is their lack of 
familiarity with rating scales and their construc- 
tion. This ignorance may also account for 
the fact that they have managed to violate 
most of the rules for the construction of such 
scales. 

The descriptive material accompanying the 
Record reports that it was adopted “after ex- 
tensive research and tryout in school systems," 
but if any effort was made to validate the scale, 
or to test its objectivity and reliability, it is not 
reported. Inspection of the scale suggests that 
had such investigation been carried out, the au- 
thors would probably have been sorely disap- 
pointed. 

Three basic rules for constructing such scales 
are: (a) carefully define the aspects of per- 
sonality which are to be rated; (b) describe 
the levels of performance in terms of objectively 
observable behaviors; and (c) develop direc- 
tions for using the scale which will help the 
user to standardize the procedure. None of 
these rules are observed in the Record. Intangi- 
ble aspects of personality such as industry, in- 
fluence, responsibility and emotional stability 
go undefined. Levels of performance are de- 
scribed in such vague terms as purposeless, con- 
scientious,’ somewhat socially concerned, us- 
ually dependable, hyperemotional, and well- 
balanced. And, finally, there are no instructions 
for users of the scale. 

Recording such information as this instru- 
ment yields on the permanent records of pupils 
may well do many of them permanent harm. The 
reviewer would recommend that the Association 
of Secondary-School Principals withdraw its 
support of the Personality Record until such 
time as, first, the Association has consciously 
faced the issue of whether it should sponsor 
such an enterprise, and secondly, should the 
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answer above be yes, has reconsidered whether 
it cares to sponsor a scale of such doubtful worth. 


[79] 
*The Personality Survey. Grades 7-9; 1048; 9 
scores: age, intelligence, reading comprehension, pupil 
questionnaire, teacher rating, sociometric rating, ab- 
sence, school marks, total adjustment; scores in intel- 
ligence and reading comprehension must be obtained 
from Traxler Silent Reading Test for Grades 7 to 10; 
1 form ['48] ; no data on reliability ; manual [248]; au- 
thor recommends use of local norms; 2€ per pupil's 
report; 2€ per teacher's report; 2¢ per table for trans- 
muting scores; 26 per class record sheet; 12¢ per man- 
ual; postage extra; 20€ per specimen set, postpaid; 
Percival M. Symonds; Public School Publishing Co. * 
REFERENCES 


т. SvwoNps, PencIVAL M., AND SHERMAN, Murray. “А Per- 
sonality Survey of a Junior High School." Sch R 55:449-61 O 
?47. * (PA 23:1460) 

2. SvMoNpS, PrncCIVAL M., AND SHERMAN, Murray, “Ретѕоп- 
ality Survey of a Junior High School," PP. 23-50. In The 
Measurement of Student Adjustment and Achievement. Edited 
by Wilma T. Donahue, Clyde H, Coombs, and Robert М. W. 
Travers. Ann Arbor, Mich.: University of Michigan Press, 
1949. Pp. xiv, 256. * (PA 23:6416) 


DoucLas CourTNEY, Program Director, Insti- 
tute for Research in Human Relations, Phila- 
delphia, Pennsylvania. 

The author states that the Personality Survey 
is 
designed to furnish evidence regarding basic personal- 
ity traits and personal attitudes from several sources— 
tests of proficiency taken by the pupil himself, and testi- 
mony from teachers, fellow pupils, and from each pupil 
himself * is designed to provide a profile and also a 
single measure of the pupil's adjustment * is designed 
to indicate which pupils in a school are in greatest need 
for specialized help, not only with regard to school 
progress, but also with regard to their social relations 
and their personal adjustment * is also designed to re- 
veal those who are making the best adjustments in 
school; those with the best academic record, with the 
greatest capacity for leadership, with the most accept- 
able social attitudes and relationships and with emo- 
tional stability and freedom from feelings of inferiority 
and inadequacy. Such a survey should reveal those with 
the greatest promise on whom the school can well con- 
centrate its efforts to produce leaders. 


Eight "factors" are used in the Survey: (1) 
chronological age, (2) intelligence, (3) read- 
ing ability, (4) pupil self-rating, (5) teacher- 
rating, (6) rating by associates, (7) attendance, 
and (8) school marks. These "factors" are 
really variables in accepted research usage. 

Variable 1 is chronological age. A table is 
provided to simplify the computation of chrono- 
logical age. Each child is given a decile ranking 
from т to то. The author suggests that these 
decile rankings be based on half grades. On page 
IO of the manual, however, he uses the words 


"your class" as though he were talking to an: 


individual teacher. In a reported application of 
the Survey (r), Symonds uses distributions 
based on half-grade ranges across the junior 


high school. Four different normative bases are 
suggested to or inferred by the reader; the use 
of a shifting normative base is quite indefensible 
if the average teacher is to come out with any- 
thing but a hodgepodge. 

Variable 2, named intelligence by the author, 
would be more appropriately named reading 
vocabulary since it is the vocabulary section of 
the Traxler Silent Reading Test. A table is pro- 
vided to convert these vocabulary scores into 
mental ages based on correlations obtained with 
61 children who had been given both the vo- 
cabulary tests and the Wechsler-Bellevue In- 
telligence Scale, a scale for the measurement of 
adult intelligence. Such a conversion is quite 
inappropriate but since it is not necessary for 
the use of the Survey, the table can be ignored 
safely. 

Variable 3, named reading ability, is the Test 
of Paragraph Comprehension, Part 3 of the 
Traxler Silent Reading Test. 

Variable 4 is a 30-item questionnaire in which 
the children answer questions about themselves. 
Certain answers, based apparently on the a 
priori judgment of the author, are considered 
favorable. The form is not suitable for girls. 
Decile ratings are based on decile norms for the 
entire school rather than either the classroom or 
the grade level. 

Variable 5 is a rating by teachers of those 
children in the school, not just in their own 
homerooms, who indicate certain stated physical 
and personality assets and handicaps. The posi- 
tive mentions are balanced against the negative 
mentions, and the algebraic sum for each child 
is converted into a decile ranking based on school 
wide norms. 

Variable 6 is a sociometric rating in which 
each child matches certain descriptions with the 
names of other children chosen from the class- 
room. The children also make certain social 
choices. А score is derived in the same manner 
as for the teacher rating. There is no form for 
girls. 

Variable 7 is a record of absences based on 
the entire school over a given period of time. 

Variable 8 is that of school marks. They are 
averaged for each pupil and a decile ranking 
obtained. For some reason not specified, the ex- 
tremely important areas of physical education, 
music, and the arts are excluded. 

The author suggests the computation of three 
composite scores for each child: (a) median of 
the eight scores, (b) median of the three scores 


79 THE FOURTH MENTAL MEASUREMENTS YEARBOOK 
TITULARES a i0 


on variables 1-3, and (c) the median of the 
five scores on variables 4-8. Symonds suggests 
that the last two composite scores indicate prom- 
ise or child potential and fulfillment or achieve- 
ment, respectively ; the author refers to the dif- 
ference between these scores as the gap between 
potential and fulfillment. He suggests that the 
children with much greater potential than ful- 
fillment (a high minus difference score) “are 
those on whom the school might well turn its 
searchlight of individual study, for these are the 
pupils who have the greatest promise but who 
have failed to meet expectations." 

The manual refers to an article (т) by the 
author for data on the validity of this technique. 
The validity data presented in the article con- 
sists of five cases chosen at random and clinically 
described. Here we bump into certain semantic 
difficulties. These eight variables have been 
called criteria by the author. The further “cri- 
teria" which are chosen to validate the first eight 
are a series of unspecified clinical judgments of 
five cases chosen at random. Only the most un- 
sophisticated reader would accept such evidence 
although it is provocative and could be used as 
the basis for more rigorous research. 

The Personality Survey is a very good tech- 
nique, in one way, in that it offers a method 
whereby a teacher or administrator may look 
at the differences that exist among his pupils. 
This is good because it provides a base for in- 
dividual instruction. Provision for individual 
differences in the school situation is about the 
only sensible resultant that can come from an 
extensive consideration of individual differences. 
All eight variables appear to be important vari- 
ables in the study of children. The methods sug- 
gested for measuring these important variables 
appear to be sensible and appropriate. The only 
notable exception is the absence of forms appli- 
cable to girls for variables 4 and 6. 

The recommended statistical manipulations of 
these variables appear to involve statistical tech- 
niques which are not appropriate to the underly- 
ing assumptions. There are four different norma- 
tive bases from which decile rankings are derived, 
One is the individual classroom; one is the 
half-grade range within the classroom; one is 
the half-grade range within the school; and one 
is the entire school (3-grade range). The fact 
that different teachers know and are rating dif- 
ferent and overlapping groups of children pro- 
vides an unknown but suspected large number 
of additional normative bases. The reviewer 
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suggests that this is unnecessary. If the indi- 
vidual classroom is the actual stage on which 
children compete in school, it is on this stage 
that all comparisons should be made. If the 
teacher would take the variables suggested by 
Symonds, measure her 30 or 40 children on 
each of them and then merely rank the children 
in her own classroom she would be operating on 
a single base. Median ranks could then be ob- 
tained for a total composite score as well as the 
difference scores which Symonds suggests. This 
reviewer feels that the composite scoring will 
prove less useful in actual practice than a study 
of the separate variables in relation to each other 
in a given child. 

The reviewer disagrees with the philosophy 
implied in the author's interpretation of the 
Personality Survey. The author's suggestion 
that children with high minus differences should 
be the focal point for individual study implies a 
philosophy that is repugnant to the reviewer. 
A survey designed to outline individual differ- 
ences should result in each child's receiving equal 
attention from the school to make the most of 
his capacities. Neither the fool, nor the genius, 
nor the genius acting like a fool should receive 
undue attention at the expense of the others. 


Joun W. M. Roruney, Professor of Education, 
University of Wisconsin, Madison, Wisconsin. 

This instrument proposes to furnish evidence 
about basic (!) personality traits and personal 
attitudes from tests, testimony from teachers, 
fellow pupils, and from the pupil himself. The 
manual states that it is intended for use in grades 
7 through 9 and, although no data are given for 
other grade levels, it is suggested that it may 
be used in the sixth grade and the senior high 
school. When you are using the shot gun ap- 
proach, you may just as well spread the shot to 
get any game within reach. 

The test is designed more specifically to reveal 
those pupils who are "making the best adjust- 
ments in school: those with the best academic 
records, with the greatest capacity for leader- 
ship, with the most acceptable social attitudes 
and relationships and with emotional stability 
and freedom from feelings of inferiority and in- 
adequacy." АП this is to be done by assembling 
data on eight factors at the rate of 17 minutes 
per pupil. The millennium has arrived. Now, at 
last we will have time to work with pupils! The 
assembly line is out of the factory and into à 
social institution. But wait. 
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Factor 1, chronological age, is used on the 
assumption that, other things being equal, the 
older the pupil in a junior high school the less 
well adjusted he is. 

Factors 2 and 3, intelligence and reading 
ability, are obtained from scores on the Traxler 
Silent Reading Test. Both factors are obtained 
from a reading test. Just why you should use the 
Personality Survey up to this point is not clear. 
Give the Traxler and look up the pupil's age 
and you have factors 1, 2, and 3. 

Factor 4, pupil self-rating, is another version 
of the common, “Are your feelings easily hurt?” 
type of item. The author believes (but presents 
no evidence) that a battery of 30 such questions 
“will discriminate effectively between boys and 
girls who are happy, adequate and well-adjusted 
socially and those who are not happy." Many 
persons of course will not believe it. A distribu- 
tion of scores on this factor for 770 pupils in a 
New York junior high school is given. In the 
manual it is stated that, “It is not known how 
representative this distribution is of junior high 
school pupils in general" Until such data are 
presented the norms are of no value. 

Factor 5, teacher rating, is obtained by having 
teachers write names of pupils under such head- 
ings as, “insolent, smart alecky" and “sensitive, 
touchy, hurt by criticism.” The invitation to 
call pupils names is there for those who like to 
do so. The author points out difficulties in use 
of ratings but adds, “such discrepancies in rat- 
ing will occur to no greater degree by this method 
than by some other method." No evidence is of- 
fered. 

Factor 6, sociontetric ratings, is a combina- 
tion of the common “guess who" and sociometric 
procedures. The combination is not a happy one. 

Factor 7, absence, is taken from the school 
register on the theory that regularity in attend- 
ance is one index of good adjustment. 

Factor 8, school marks, is taken from school 
records and is the average of the pupil's marks 
in 4 or 5 major subjects because, “It is generally 
recognized that school marks represent an im- 
portant index of a pupil's adjustment to his work 
in school,” 

Decile positions on the eight factors are to be 
computed locally. 

Note that this Personality Survey, designed 
to get evidence about basic Personality traits, is 
obtained from your own records by noting age, 
absences, and marks, by giving one reading test, 
by using questionable ratings, by asking the 
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pupils to answer 30 questions, and by speeded 
up guess who and sociometric procedures. 

In the reviewer’s opinion this is the worst of 
all the very bad ways of studying pupils so far 
devised. 


[ 80] 
*Pre-Counseling Inventory. Ages 8-14; 1949; in- 
dividual; 2 scores: adjustment, tension; Form R; $1 
per 20; 35¢ per specimen set; postpaid; (15-30) min- 
E Alfred Schmieding; Concordia Publishing 
ouse. 


REFERENCES 
т. ScHMIEDING, ALFRED. Parent-Child Relationships in the 
Christian Home. Foreword by Herbert H. Gross. River Forest, 
Ill.: Lutheran Education Association, 1949. Pp. vii, 22. Pa- 
per. 


Cuartes H. Honzrx, Personal Counselor and 
Vocational Adviser, Veterans Administration, 
San Francisco, California. 

This inventory is designed "to bring to the 
surface the feelings, emotions, anxieties, and 
tensions of children often difficult to isolate or 
to define by ordinary observation or by the 
commonly used methods of interview." It is in- 
tended as a precounseling device to uncover the 
possible causes of maladjustment and behavior 
difficulties of children. The basic idea of the in- 
ventory is a comparison of responses given to 
statements about an imaginary boy or girl (the 
Story Projection) and responses given to the 
same statements directed in question form to 
the boy or girl being interviewed (the Question- 
naire). There are 30 such statements, chosen 
from a larger battery by trial and error for 
their diagnostic value. Examples from the Story 
Projection are: “John Sumner likes to play with 
other boys and girls" and “John likes his father 
and mother about the same." These are repeated 
later in the Questionnaire as questions : “Do you 
like to play with other boys and girls?" and “Do 
you like your father and mother about the same ?” 

The child's affirmative responses are given а 
value of one ; negative, zero. This allows a com- 
parison of responses in the Story Projection as 
against responses to the same items in the Ques- 
tionnaire. The number of disagreements gives 
what is termed the Tension Score. The number 
of favorable responses minus the number of dis- 
agreements is called the Adjustment Score. 

The validation of the inventory, as described 
in the manual, and its theoretical foundations 
do not appear adequate. It is stated that, "In a 
tabulation of 77 cases the Adjustment Score was 
correlated with judgment. The judgment of the 
child's adjustment was in each case secured in- 
dependently by a competent judge." But how 
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and under what circumstances the independent 
judgments were obtained is not described. 
"Though the Tension Score is the distinctive fea- 
ture of the inventory and is considered valuable, 
the correlation between it and "independent 
judgment" was only .55. The author cautions 
against its unqualified use but states that “the 
Tension Score is of such value that it has been 
included if for no other reason than experimental 
purposes.” On the theoretical side, the signif- 
icance of disagreements between responses in 
the Story Projection and responses in the Ques- 
tionnaire is not made clear, nor is it clear that 
the term “tension” adequately describes the pre- 
sumably complex psychological factors that 
cause the differences in response. 

An interesting feature of the inventory is the 
emphasis on religious attitudes. Five of the 30 
items refer to religious matters, such as, “Do 
you often feel that you have many unforgiven 
sins?” The author states, “The Christian coun- 
selor will from time to time calmly and objec- 
tively call the child’s attention to the great source 
of help the Bible offers him.” It is evident that 
this feature will limit the general use of the in- 
ventory since the religious atmosphere and in- 
doctrination implied by these items are not com- 
mon to many schools and communities. The 
author remarks that frequently the religious 
items “serve the purpose of opening up some 
maladjusted religious emotions, which may then 
be explored more fully, to the benefit of -the 
child." And, “Occasionally, a child would indi- 
cate that Jesus may have saved others from sin, 
but he was not sure that this applied to him.” 
The impression one gets is that maladjustment 
has been manufactured by unfortunate religious 
attitudes and indoctrination, and more religion 
is to be used to undo the maladjustment. There 
may be some truth in this, but in this reviewer’s 
opinion a much more fundamental and radical 
revision of religious education is required than 
is implied by the author’s comments. 

Aside from the Adjustment and the Tension 
Scores, the author believes that each item has its 
own value in revealing emotions and attitudes 
that may be the sources of conflict and malad- 
justed behavior. In this respect, the Inventory 
may bea distinctly helpful tool. Though the Ten- 
sion Score is not statistically reliable the dis- 
agreements in the child’s responses may be sig- 
nificant and exploration may reveal the source 
of the conflict, this being the first necessary step 
in the job of resolving the conflict. 
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*Problem Check List: Form for Rural Youth, 
Ages 16-30; 1946-48; an adaptation of the Mooney 
Problem Check List; 10 scores: health and physical, 
relationship with people, citizenship, education, voca- 
tion and economic, morals and religion, personal tem- 
perament, courtship-sex-marriage, social and recrea- 
tional, home and family; 1 form, '46; manual ('48); 
$1.50 per 25; 20€ per specimen set; postage extra; non- 
timed (30-50) minutes; Ralph E. Bender, Mary Alice 
Price (manual), and Ross L. Mooney (manual) ; Ohio 
State Unversity Press. * 


REFERENCES 


т. Benver, RaLPH E. “The Development of a Problem Check 
List and a Demonstration of Its Use in Planning Rural Youth 
Programs." Agric Ed Mag 20:116-7 D '47. * 

2. Benver, Клірн E. The Development of a Problem Check 
List and a Demonstration of Its Use in Planning Rural Youth 
Programs Unpublished doctor's thesis, Ohio State University 
(Columbus, Ohio), 1947. 


[82] 

*Problem Check List: Form for Schools of 
Nursing. Student nurses; 1945-48; an adaptation of 
the Mooney Problem Check List; 13 scores: health 
and physical development, finances and living condi- 
tions, social and recreational activities, social-psy- 
chological relations, personal-psychological relations, 
courtship-sex-marriage, home and family, morals and 
religion, adjustment to school of nursing, the future- 
professional and educational, curriculum and school 
program, adjustment to human relationships in nursing, 
adjustment to administration of nursing care; 1 form, 
'45; manual ('48) ; $1.50 per 25; 20€ per specimen set; 
postage extra; nontimed (35-50) minutes; Luella J. 
Morison, Mary Alice Price (manual), and Ross L. 
Mooney (manual); Ohio State University Press. * 


REFERENCES 


т. Morison, LuzLLA J. “A Problem Check List: Its Use in 
Student Guidance.” Am J Nursing 47:248-51 Ap '47. * 


[83] 
X*The Purdue Rating Scale for Administrators 
and Executives. Administrators and executives; 
1950-51; 3 levels; 1 form, '50; manual (’51) ; 3¢ per 
scale; 56 per profile chart for any one level (^51) ; 25¢ 
per manual; 356 per specimen set; postpaid; Н. H. 
Remmers and R. L. Hobson; Personnel Evaluation Re- 
search Service, Division of Educational Reference, 
Purdue University. * 
a) REPORT FORM A. College administrators. 3 scores: 
fairness to subordinates, administrative achievement, 
democratic orientation. ү 
b) REPORT FORM В. Business executives; 2 scores: social 
responsibility for subordinates and society, executive 
achievement; no norms for part scores. 
с) REPORT FORM С. School administrators. 


REFERENCES { 

т. Hosson, Ronert L. "Some Psychological Dimensions 0 
Academic Administrators." Abstract. Proc Okla Acad Sci 28: 
131 '48, * (PA 25:1288, title only) x A d f 
2. Hosson, Rozert L, Some Psychological Dimensions 0) 
Academic Administrators. Doctor's thesis, Purdue University 
(Lafayette; Ind.), 1948. » f 
з. Hosson, Ковевт L. “Some Psychological Dimensions 0 
Academic Administrators," pp. 7-64. In Further Studies i" 
Attitudes, Series XVIII. Edited by H. H. Remmers. Purdue 
озер Division of Educational Reference, Studies, in 
Higher Education, [No.] 73. Lafayette, Ind.: the Division, 
[1950]. Pp. os Paper. * (РА 25:6490) 5 d 
4. Kirk, Bruce. A Study of Subordinates’ Attitudes Towar 
Public School Administrators. Master's thesis, Purdue Univer- 


sity (Lafayette, Ind.), 1950. ў i. 
5. КОРЕ, Jesse C. Some Psychological Dimensions of Виз” 
ness and Industrial Executives. Doctor’s thesis, Purdue Univer 


sity (Lafayette, Ind.), 1950. (PA 24:4933, title only) 5 

. Rupe, Jesse C. “Some Psychological Dimensions of Busi- 
ness and Industrial Executives,” pp. 65-95. In Further Studies 
in Attitudes, Series XVIII. Edited by E їн: Remmers. Purdue 
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Wn Division of Educational Reference, lies. 
igher lucation, {No.] 73. Lafayette, Ind.: the Division, 
M rig Е È aihen Workers Bre the Boss.” Personnel 
Psychol 4:271-89 ащ 's1. * (PA 26:3637) 

Kennetu L. Heaton, Richardson, Bellows, 
Henry and Co., Inc., Philadelphia, Pennsyl- 
vania. 

The scale is designed for use in the rating of 
administrative or executive personnel in busi- 
ness, industry, school or college. It presents 36 
traits or characteristics on which the adminis- 
trator is to be rated by his subordinates. A va- 
riety of items are included : “Possesses specific 
knowledge in his own field,” “Is emotionally 
poised and calm,” “Makes plans carefully and 
adequately,” and so forth. 

The authors claim that “the extensive statis- 
tical researches” done on this scale support the 
conclusions: “(1) There are basic aspects of 
administration common to all administrative 
work, and this scale measures at least some of 
the important aspects. (2) The scale is satisfac- 
torily reliable and valid. (3) The scale does not 
have a serious general halo effect.” 

Reliability coefficients for each item were ob- 
tained by a modified split half method for busi- 
ness and industry and for colleges, and by the 
Horst technique for schools. By the split half 
method the range is from .54 to .98 with few 
items below .75. By the Horst method there was 
a range from .40 to .80 for public school ad- 
ministrators. 

The authors assumed that there was no better 
method of validation than to compare the ratings 
of one subordinate with those of another subor- 
dinate, as indicated on the scale itself. Thus, they 
assumed that the reliability and validity co- 
efficients were identical. These assumptions are 
surely open to serious question. 

Normative data are based on 823 ratings of 
54 administrators in nine colleges and universi- 
ties, 702 ratings of 113 business and industrial 
exectitives, and 1,153 ratings of 88 superintend- 
ents and principals in 25 public schools. 

Factor analysis has been applied and norms 
developed for the general factors “Fairness to 
Subordinates,” “Administrative Achievement,” 
and “Democratic Orientation” for university 
and college administrators. 

A carefully prepared manual discusses mat- 
ters of administration, scoring, and interpreta- 
tion. Separate tally sheets and profile charts are 
provided for each of the three groups of execu- 
tives. 

If used by the administrator to encourage 


Studies in 
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criticism on the part of his subordinates and to 
prepare the way for frank discussion of ad- 
ministrative weaknesses and failures, this can 
be a helpful instrument. It should often help the 
administrator to locate the general areas in 
which there is greatest dissatisfaction with his 
functioning. However, to use the scale as a diag- 
nostic instrument would be unfortunate. Because 
of its very nature, it can not be expected to pro- 
vide the specificity of information which the ad- 
ministrator must secure before he knows what 
is wrong and what improvements can be made. 


[84] 
The School Inventory. High school; 1936; 1 form; 
tentative norms—author recommends the use of local 
norms; $1 per 25; 106 per specimen set; 60€ per plas- 
tic scoring stencil; cash orders postpaid; nontimed 
(10-15) minutes; Hugh M. Bell; Stanford University 
Press. * 
REFERENCES 

AS See 40:1252. 

s. Perers, Epwin F. The Construction of an Adjustment 
Inventory on the Basis of a Critical Analysis of the Factors as 
Measured by the Bell Inventories. Doctor’s thesis, New York 
University (New York, N.Y.), 1941. Pp. 156. (Abstracts of 
Theses....[School of Education] 1941, pp. 173-9. 

6. Ryans, Davi G., AND Peters, Epwin F. “School Satis 
faction Among First-Year Students in a Women’s College.’ 
Sch & Soc 53:157-9 F 1 '41. * (PA 15:2383) 

. Ryans, Davin G., AND Peters, Epwin Е, “Factors Affect. 
ing the School-Satisfaction of Students in a Women's College." 
Sch & Soc $5:26-8 Ja 3 '42. * (PA 16:1691) 

Ross W. Matteson, Counselor, Michigan State 
College, East Lansing, Michigan. 

This inventory, used in high schools for the 
past 16 years, consists of 76 questions aimed at 
discovering what things about their school stu- 
dents may or may not consider satisfactory. 
Space is provided on the last page for the listing 
of any specific suggestions a student may have 
for the improvement of his school. 

Approximately two thirds of the items seek 
to determine the student’s opinions as to teacher 
personality and efficiency and teacher-student 
relationships. The remaining questions have to 
do with school subjects, classes, grades, and such 
general factors as school organization, disci- 
pline, physical facilities, and relationships with 
fellow students. 

Students are instructed to answer each ques- 
tion by drawing a circle around yes, no, or A 
The instrument is practically self-administering ; 
however, as suggested in the manual, it prob- 
ably should not be administered until a “feeling 
of cooperation among the students” has been 
established. 

Scoring the inventory consists simply of su- 
perimposing a transparent stencil on the answer 
columns and counting the number of items indi- 


cated. The higher the score the more “unsatis- 
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factory” the attitude toward the school. Since 
the items are not classified, this score in itself 
gives no indication as to sources of dissatisfac- 
tion. 

Norms for the inventory, based on a limited 
number of high school students, provide descrip- 
tions ranging from "excellent" to “very unsatis- 
factory” for the various score ranges. Perhaps 
the most valid use of the questionnaire lies in 
the information about the individual student that 
a detailed analysis may provide for subsequent 
counseling interviews. 

Although the student is assured that his an- 
swers “will be treated with the strictest confi- 
dence,” caution is definitely indicated both in 
the instrument’s administration and in its inter- 
pretation. Scores of different students are more 
likely to reflect comparative outspokenness than 
comparative lack of adaptation to the school. 
The inventory’s chief value appears to lie in 
its use, where indicated, with individual students 
and in connection with the counseling process. 


For reviews by Robert G. Bernreuter and J. 
B. Maller, see 40:1252; for related reviews, see 
40:B842, 38:B309, and 36:B30. 


[85] 

Selective Vocabulary Test. Ages 13 and over; 
1944; a disguised test of masculinity-femininity; 1 
form; no data on reliability; 2s. 6d. per 25; 3d. per 
single copy; 3s. per manual; purchase tax (British 
purchasers only) and postage extra; nontimed (15-30) 
CAN Patrick Slater; George G. Harrap & Co. 
td. 


REFERENCES 

I. SLATER, PATRICK. “Interpreting Discrepancies.” Brit J 
Med Psychol 19:415—9 pts 3-4 "4s. * (PA 17:3980 
Assessment of отаю брат, Bie fed Зиду of the 
74 S 18 '47. * (PA 22:1238) 
James MaxwzLL, Principal Lecturer in Psy- 
chology, Moray House, Edinburgh, Scotland. 

This test is based on the principle that the 
different interests of men and women lead to 
their acquiring different types of vocabulary ; 
hence masculine and feminine tendencies can be 
differentiated by vocabulary. The test requires 
the subject to define 8o words, 40 of which 
represent masculine interests, and 40 feminine. 
No words, of course, are exclusively the prop- 
erty of one sex, and the selection of words for 
the test is made on the correlation between 
knowledge of the word and sex, these correla- 
tions ranging from .34 to .99. No further statisti- 
cal data are given. ў 

` The scoring is not entirely objective, but a 
number of specimen definitions are given for 
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each word. At times the scoring appears rather 
arbitrary. For the word “muff” for instance, 
only the meaning of a hand warmer is accepted ; 
and of the то definitions of “fillet” in the Oxford 
Dictionary, only one is accepted, the definitions 
as a headband or as a supporting strip of wood 
not being mentioned. By its nature the test re- 
flects the interests of the group on whom it was 
validated, and it is rather English in bias, Of 
the sports mentioned, cricket is represented by 
two items, Rugby football, golf, and polo by one 
each. The feminine words are of more general 
application. 

The test is primarily intended as a personality 
test and as a pointer to abnormalities in sexual 
development and attitudes. Though the author, 
in a rather discursive manual, appears to have 
the clinical applications of the test mainly in 
mind, a fairly comprehensive set of norms is 
provided for children aged 13 and 15. For each 
sex, for each score in the vocabulary appropriate 
to the sex, are given the 1, 2, 5, 10, 30, 50, 70, 
90, 95, 98, and 99 percentiles for the scores in 
the opposite-sexed words. As the average num- 
ber of cases on which each of the four sets of 
norms is based is about 8o, this high degree of 
discrimination is probably not justified. 

There is no doubt that further development of 
the test is required before it can be considered as 
soundly established. Of this the author is aware, 
and he states that he has published the test to 
enable further data to be obtained. To what ex- 
tent the publication of an admittedly incomplete 
test, with comprehensive norms, etc., is justified, 
is a matter of opinion. 


For reviews by Jack W. Dunlap and Starke 
R. Hathaway, see 3:93. 


[86] 

XSelf Analysis Inventory. Adults; 1045; title on 
test booklet is “How’m I Doin’? : an interviewing aid 
for locating maladjustment in 37 problem areas; 1 
form; no data on reliability and validity; $9.60 per 
100; 54¢ per specimen set, postpaid; 2¢ per leaflet of 
remedial suggestions for any one area; 48¢ per com- 
bined booklet of remedial suggestions for the 37 areas; 
postage extra; nontimed (30-60) minutes; Harry J. 
Baker; Public School Publishing Co. * 


[87] 
*The Sixteen Personality Factor Questionnaire. 
Adults; 1949-50; formerly called The 16 P.F. Test; 
16 scores: cyclothymia уз. schizothymia, general in- 
telligence vs. mental defect, emotional stability vs. gen- 
eral neuroticism, dominance уз. submission, surgenc 
vs. desurgency, positive character vs. immature’ depen 
ent character, adveriturous cyclothymia vs. inherent 
withdrawn  schizothymia, emotional sensitivity vs. 
tough maturity, paranoid schizothymia vs. trustful ac- 
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cessibility, bohemianism vs. practical concernedness, 
sophistication vs. rough simplicity, worrying suspi- 
ciousness vs. calm trustfulness, radicalism vs. conserva- 
tism, independent selí-sufficiency vs. lack of resolution, 
will control and character stability, nervous tension; 
IBM; Forms A ('49), В (49); authors recommend 
the use of both forms; manual [’50] ; $3 per 25; $5.50 
per 25 of both forms; 85¢ per specimen set of every 
form; separate answer sheets may be used; $2 per 50 
IBM answer sheets; design for machine scoring sten- 
cils included in manual; $2 per set of stencils for hand 
scoring of answer sheets; 00@ per scoring key; cash 
orders postpaid; (30-40) minutes for any one form; 
R. B. Cattell and G. Stice; Institute for Personality 
and Ability Testing. * 
REFERENCES 


т. Catrett, RaymMonp B. “The Description of Personality: 
Principles and Findings in a Factor Analysis." Am J Psychol 
58:69-90 Ja А, (РА ач тї) 

2. CATTELL, RAYMOND В. *Interpretation of the Twelve Pri- 
mary Personality Factors." Char & Pers 13:55-91 S '45. * (PA 


s. CarreLL, RAvMOND B. “Primary Personality Factors in 
the Realm of Objective Tests." J Personality 16:459-87 Je '48. 


4. 

. 6. CATTELL, Raymonp В, "The Primary Personality Factors 
in Women Compared With Those in Men." Brit J Psychol, Stat 
Sect 1:114-30 Jl '48. * (РА 2 Ио) 

7. CarTELL, RAYMOND B. ne ain Personality Factors in 
Questionnaire Self-Estimate Material.” J Social Psychol 31:3- 
38 Е 'so. * (PA 24:5722) 

8. CarTELL, R. P. AND SAuNDERS, D. В. “Inter-Relation and 
Matching of Personality Factors From Behavior Rating, Ques- 
tionnaire, and SMS Test Data." J Social Psychol 31:243- 
бо My 'so. * (PA 25:2902) 

Cuartes M. Harsu, Research Psychologist, 
U.S. Navy Electronics Laboratory, San Diego, 
California.* 

The authors claim that, among personality 
tests, this is as pure a product of factor analysis 
as can be found, in that each item has appreciable 
saturation by one of the 16 “source traits” of 
ability, temperament, and character integration. 
Twelve of the factors were isolated by factor 
analysis of observers’ ratings of traits, the other 
four were found in questionnaire responses. The 
printed test directions are brief, with freedom for 
amplification by the administrator. Items are of 


two types, each with three alternate answers: 


Do you tend to get angry with In 
people rather easily? ..... Yes Between No 
Would you prefer the life оѓ: 

(a) an artist? 

(b) a Y.M.C.A. secretary? (a) Uncertain (b) 
Many of the items look familiar to users of older 
personality and interest inventories, but the 
manual does not indicate the method of deriva- 
tion of items or the method whereby observer- 
rating factors were translated into questionnaire 
items, Many of the items have the objectionable 
ambiguity which was criticized in older ques- 
tionnaires, thus arousing a rather uncooperative 
attitude among college students, to whom this 


reviewer has administered the test. In general 
the items seem less subtle and penetrating than 
items of the Guilford-Martin Inventory of Fac- 
tors GAMIN or of the revised Kuder Prefer- 
ence Record. This is only a superficial judg- 
ment, of course, but it points to the need for 
objective evidence of the superiority of The 16 
P.F. Test. 

The parallel Forms A and B have identical 
rotation of 187 items, with 3 buffer items and 
IO or I3 items measuring each factor. On a 
general population sample of 200, the corrected 
split half reliabilities of the factor scores from 
Forms A and B combined range from .50 to 
88 (Ма = 71). No reliabilities are reported 
for the separate forms, but the manual implies 
that they are too unreliable for most purposes 
since it recommends that both forms be given 
as a single test. Hand scoring by transparencies 
involves repositioning and stapling the sheets 
of the test booklet. A separate answer sheet can 
be machine scored with six sets of keys which 
fit either Form A or B. Positions on the answer 
sheet do not line up with items on the test book- 
let. Raw scores are converted to standard scores 
on a IO-point scale, based on the responses of 
293 college students plus 50 other adults. 

'The manual claims that the items represent an 
even sampling from the personality sphere with 
a minimum of overlapping of factor scores, but 
no objective evidence is presented to support 
these claims. The manual gives no information 
concerning either the factor saturations of the 
items or the intercorrelation of factor scores.: 
Neither is there any indication of the correlation 
of factor scores with other personality measures. 
It is implied that no other measures are pure 
enough to use for validating the 16 personality 
factors, which are merely described verbally and 
defended on the grounds that they, or appar- 
ently closely related factors, have shown up in 
many different factorial studies of personality 
and even in clinical studies. To this reviewer it 
seems very desirable to know how factor B (gen- 
eral intelligence) is related to recognized meas- 
ures of intelligence, how factor E (ascendance- 
submission) is related to the Allports' 4-5 Re- 
action Study, how Q; (radicalism-conservatism ) 
is related to other radicalism tests, and how the 
other factors are related to somewhat similarly 
named factors in the tests of Kuder, Guilford, 
Thurstone, and others who have assembled fairly 
“pure factor” subtests. Such information will 
neither validate nor invalidate The 16 P.F. Test, 
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but it will help to orient psychometrists who wish 
to evaluate the alleged distinction between the 
16 source traits and the personality factors pre- 
viously revealed in other factor studies. 

The 16 P.F. Test is presented as a research 
instrument with possible applied psychology 
uses. In its present form it seems unlikely that 
it can give an assessment of personality much 
superior to that of other multifactor paper tests. 
Cattell has suggested the factorial determinants 
of various life situations, and the test may be 
useful in investigating hunches concerning the 
composition of psychometric classifications or 
clinical syndromes. But, in general, the utility of 
the 16 factor scores remains to be demonstrated, 
and the neutral observer will probably want 
better evidence of the purity of the factor scores 
before he starts using The 16 P.F. Test as an 
analytical research instrument. 


ARDIE LuBiN, Statistical Psychologist, Insti- 
tute of Psychiatry, Maudsley Hospital, London, 
England. 

This personality questionnaire contains items 
such as “Do you think that most of us have so 
many faults that unless people are charitable to 
one another life would be intolerable? Yes. In 
Between. No.” The subject picks one of the 
three alternatives. 

The split half reliabilities for scores on the 
two forms combined based on 200 cases range 
from .50 to .88 for the various factors. The 
standardisation given in the handbook is casually 
described as “based on 293 men and women col- 
lege students, averaging 22 years of age, plus 
some 50 men and women from other walks of 
life." Means and standard deviations are given. 
Within the handbook, various references are 
made to "peripheral standardisation" in terms 
of means and standard deviations for “оссира- 
tions, clinical syndromes, social, educational, sex 
and age classes, etc." This information, if it 
existed, would make the test unique; this re- 


viewer, however, has been unable to find any such : 


data either in the handbook or in published arti- 
cles. The test user is invited to substitute, for 
the actual validation data, a mental estimate of 
how important the given factors are in the given 
situation where prediction is needed which is to 
be based on an "understanding of the intrinsic 
meaning of each of” the 16 scores. The test user 
presumably obtains this understanding by read- 
ing descriptions presented in the handbook such 
as the following : 


FACTOR F. SURGENCY AND DESURGENCY. * It is well 
known and frequently confirmed that the factor is a 
powerful component in what used to be called extra- 
version-introversion, Eysenck's study in the abnormal 
range shows that it is associated with a basic tempera- 
mental difference which causes some individuals (the 
surgent individuals) to show conversion hysteria re- 
sponses in emotional difficulties while the desurgent 
individuals tend to show free floating anxiety and de 
pression as in typical anxiety hysteria. 


The reviewer found that he and other psycholo- 
gists reading this passage tended to interpret it 
as meaning that Eysenck had found that scores 
on Factor F differentiated conversion hysteria 
from anxiety hysteria. At present, neither 
Eysenck nor any other psychologist has pub- 
lished evidence showing that Factor F discrimi- 
nates between any two groups of mental patients. 
This free, though unwarranted, linking of factor 
scores to mental illnesses is found throughout 
the handbook. “As far as resemblance to clinical 
diagnostic categories are concerned it will be 
readily seen that A and H are two factors in 
schizophrenia, O corresponds to free anxiety or 
neurasthenia, F differentiates conversion hys- 
teria and manic conditions from anxiety neu- 
rosis and depression, Qs is the pattern of the ob- 
sessional character, L is the paranoid component 
and C is a measure of general neuroticism.” 
Again, no published evidence can be found that 
shows that these factor scores actually do dis- 
criminate in this manner between the specified 
groups of mental patients, Presumably, these 
clinical terms are used as hypotheses, possible 
explanations of the factors. But to make this 
clear to the test user, the handbook would have 
to be entirely rewritten. 

We conclude that The 16 P.F. Test could be 
used in a harmful manner. Either the statements 
about the diagnostic value of the factor scores 
should be confirmed, or the present handbook 
withdrawn and rewritten, At present the test has 
no known validated use. 


J. Ricuarp WrrrENBORN, Research Associate 
in Psychology, Yale University, New Haven, 
Connecticut. 

This questionnaire is based on a factorial ap- 
proach to the study of personality. Accordingly, 
the virtues and the limitations to be considered 
should include a review of the virtues and limi- 
tations of both a factorial approach and of a 
questionnaire method. When items are scored оп 
the basis of their factorial composition, it in ef- 
fect means that their intercorrelations have de- 
termined the manner in which they are grouped 
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for scoring, and on this basis it may be claimed 
that all the items which comprise a given factor 
score are to some degree or another measuring 
the same thing, i.e., that is they tend to distin- 
guish between people in a similar way. Scores 
based on this kind of behavioral homogeneity 
have certain advantages. (a) Scores of differ- 
ent magnitudes are likely to have the same kind 
of behavioral significance (the distinction be- 
tween individuals is on the basis of degree rather 
than kind). (b) The person who interprets the 
score has a better chance of knowing what the 
score means than a person who is forced to in- 
terpret scores which represent heterogeneous 
and unspecified behavior. (c) Because of their 
behavioral homogeneity, it is possible that such 
scores may have an exceptional validity for 
homogeneous criteria of similar content and re- 
duced validity for criteria which are heteroge- 
neous in content. Ordinarily, this should mean 
that factor scores can be combined more eco- 
nomically in the multiple prediction of criteria 
than can scores which have no particular claim 
to behavioral homogeneity. { 
According to the manual, the authors aim “to 
leave out no important aspect of the total per- 
sonality" (and apparently they assume that the 
basic aspects are limited in number and for the 
most part have been discovered by their studies). 
This is a notable.aim inasmuch as important as- 
pects of the personality may be defined ad in- 
finitum on the basis of an almost countless va- 
riety of desiderata. It should be emphasized, 
however, that factors are based on patterns of 
intercorrelation. Inasmuch as the pattern of in- 
tercorrelation among items is determined by the 
habits which the sample of items may represent 
and by the habits which are present in the sample 
of subjects employed, factors cannot be claimed 
to have a significance which transcends time, 
place, and circumstance. As a matter of fact, 
they may have no significance at all which is 
relevant to the purposes of testing and as a 
consequence they are best regarded as a kind 
of economy or shortcut in describing a sample 
of behavior; the practical value of the economy 
is relative and dependent upon a correspondence 
between the sampling considerations which de- 
termined the selection of items and subjects and 
the anticipated practical testing situation. In any 
event, it seems unwarranted for the test users to 
accept naively the label which the factor analyst 
applies to his factor scores. Before factor scores 
can be used intelligently, the reader should ex- 


amine the items which contribute to the factor 
and compare his own inference concerning the 
basis for their homogeneity with the inference 
provided by the factor analyst. He should next 
refine his inference by a scrutiny of the items 
which do not form a part of the factor score in 
question. This type of scrutiny is particularly 
desirable when no external validity is provided 
for the factor score. 

The examiner's use of the factor scores ob- 
viously cannot be dependable until he knows 
their reliability; just as important and per- 
haps less obvious is the desirability of the ex- 
aminer's knowing the manner in which the factor 
scores are intercorrelated. Even though the fac- 
tor analyst may claim that his factors are rela- 
tively independent, their actual dependence or 
independence of the factors must be empirically 
determined. 

The reliabilities of the factor scores for Forms 
A and B combined range from .50 to .88. The 
intercorrelations among the factor scores are not 
found in the manual. Although it is claimed that 
the traits inferred by the authors and measured 
by the questionnaire are based on a variety of 
studies employing different kinds of material, 
the manual offers the examiner no specific in- 
formation concerning the samples of subjects 
and data employed in making the inferences or 
concerning the manner in which the nature of a 
trait was inferred. The norms are based primarily 
on college students. 

From the information provided by the manual 
it may be remarked that the reliabilities of the 
scores are rather low for the study of the indi- 
vidual, that the samples (subjects and variables) 
on which the factors are based are not specified, 
that the interdependence of the scores is not 
given, and that the possible practical validity of 
the scores is for the most part a matter for 
speculation. Despite these limitations (which 
may be limitations in the preparation of the j 
manual), it is quite probable that the question- 
naire may be valuably employed in a variety of 
personnel research undertakings. The ques- 
tionnaire as it stands is not a finished tool. It 
represents a very worthwhile and ambitious be- 
ginning, however, and this reviewer takes pleas- 
ure in suggesting its use wherever trial ap- 
proaches to the evaluations of new aspects of 
personality are desired. 

[88] 


*Social Distance Scale, Seventh Experimental 
Edition. Ages 15 and over; 1925-51; 4 scales listed 
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below ; mimeographed ; no data on reliability and valid- 
ity; no manual; 20 or more sets of the 4 scales, 8€ per 
set; 2¢ per copy of any one scale; postage extra; тоё 
per specimen set, postpaid; 20(25) minutes; Emory S. 
Bogardus; the Author, 3518 University Ave, Los An- 
geles 7, Calif. * 

а) ETHNIC DISTANCE SCALE. 1925-51; formerly call 
Racial Distance; 1 form, '51. v 
b) OCCUPATION DISTANCE. 1925. 

C) RELIGIOUS DISTANCE. 1925. 

d) ECONOMIC DISTANCE. 1925. 
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"Some Differential Attitudes Among Adolescent Grow as 
Revealed by Bogardus' Social Distance Scale." Sociol & Social 
Res 35:38-45 S-O 'so. * (PA 25:6071) 

Dowarp T. CAMPBELL, Assistant Professor of 
Psychology, The University of Chicago, Chi- 
cago, Illinois. 

The Bogardus Social Distance Scale is prob- 
ably the most used single test of social attitudes, 
with a popularity that shows no signs of waning 
after 27 years. It is also one of the oldest tests. 
Only Harper's A Social Study? is older among 
attitude tests that have been used beyond the 
дси Coin Oniveriyy sche, Cage! Gumi 
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research іп which they were originally pre- 
sented. The Social Distance Scale provides the 
backbone of Bogardus' outstanding research on 
race attitudes in the United States.? It is also a 
major instrument in Hartley's recent research,’ 
that of Murphy and Likert,* Zeligs and Hendrik- 
son (4, 5, 6), and of many others we have made 
no attempt to cite here. Perhaps even.more out- 
standing is the fact that such subsequent re- 
search has reinforced Bogardus' main conclu- 
sions as to the generality within the United 
States of a hierarchy of preferences for nation- 
ality and ethnic groups, a hierarchy which even 
minority groups share in its essential outline.” 
The test has also been used to measure social 
distance toward professions, religious groups, 
conscientious objectors, etc. It is probably the 
most frequent illustration of attitude measure- 
ment cited in social psychology texts. 

The test itself is simple. While the various 
users have modified the instructions, and oc- 
casionally the items, the general format has usu- 
ally been kept. In its original version it went like 
this: "According to my first feeling reactions, 
I would willingly admit members of each race 
(as a class, and not the best I have known nor 
the worst members) to one or more of the classi- 
fications under which I have placed a cross: (1) 
To close kinship by marriage. (2) To my club 
as personal chums. (3) To my street as neigh- 
bors. (4) To employment in my occupation in 
my country. (5) To citizenship in my coun- 
try. (6) As visitors only to my country. (7) 
Would exclude from my country." Under the 
7 points as column headings were spaces to rate 
a large number of ethnic and nationality groups 
(1). 

In spite of this general acceptance, the test 
has had a cinderella-like history in its reputa- 
tion for scientific respectability within the test 
and measurement fraternity. Some 10 months 
after Bogardus first published the scale in 1925, 
Allport and Hartman published their article * 
which is often cited as the beginning of scientific 
attitude measurement. (Actually efforts at atti- 
tude testing date back to 1920, at least.) Taking 


2 Bogardus, Emory S. Immigration and Race Attitudes. Fore- 
word by Jerome Davis. Boston, Mass.: D. C. Heath & Co» 
1928. Pp. xi, 268. * (PA 2:3223) А 

3 Hartley, Eugene L, Problems in Prejudice. New York: 
Us s Crown Press, 1946. Pp. x, 124. (РА 20:3231), . 4 

4 Murphy, Gardner, and Likert, Rensis. Public Opinion a" 
the Individual: A Psychological Study of Student Attitudes 0n 
Public Questions, With a Retest Five Years Later. New York: 
Harper & Brothers, 1938. Pp. ix, 316. * (РА 12:5448) " 

5 Allport, Floyd H., and Бао D. A. “The Measuremen! 
and Motivation of Atypical Opinion in a Certain Group. Am 
Pol Sci R 19:735-60 N "25. * 
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off from this article, L. L. Thurstone? threw 
the elaborate machinery of the psychophysical 
methods into the problem of attitude test con- 
struction. The weight of this tradition and its 
greater claims for mathematical sophistication 
gave the Thurstone scales a superior prestige 
that today seems largely specious. (In general, 
the preponderant emphasis of the Thurstone 
methods was on the precise scaling of item in- 
tensity, a nonessential by current standards. 
Item homogeneity is today the main objec- 
tive,^*) So dominating was this aura of scien- 
tific respectability that even Bogardus himself 
felt its weight, and in 1933 (3) published a re- 
vision of his scale with "equal" steps, as estab- 
lished by processing some 60 social distance steps 
through the equal-appearing-intervals method.? 
The items of this scale are as follows: “(1) 
Would marry. (2) Would have as regular 
friends. (3) Would work beside in an office. 
(4) Would have several families in my neigh- 
borhood. (5) Would have merely as speaking 
acquaintances. (6) Would have live outside my 
neighborhood. (7) Would have live outside my 
country." So far as the present reviewer is aware, 
this revision has not been extensively used. Hart- 
ley, for instance, used the original scale, adding 
an additional step “to my school as classmates" 
between Bogardus' steps 3 and 4. So far did 
the reputation for scientific respectability of 
the Bogardus scale slip that McNemar +° was 
able to ignore it completely in his otherwise 
thorough review of the literature. 

Fads in attitude measurement have, however, 
changed, and today the apex of respectability is 
occupied by a specialized language of internal 
consistency known as Guttman Scale Analysis,° 
with its criteria of unidimensionality and re- 
producibility. The 20 or 30 Thurstone scales of 
social attitude are made obsolete by this criterion 
(and incidentally also by Thurstone’s own later 
criterion of interitem factor analysis). And al- 
though Guttman has not expressly stated so, 


6 Thurst L. L. “The Method of Paired Comparisons for 
Sociak Values.” J Abn & Social Psychol 21:384-400 '27. (РА 
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the old Bogardus scale is а perfect illustration 
of the hierarchical unidimensional set of items 
that scale analysis requires. (For scale analysis 
purposes, slight rewording would be required of 
Item 6 in the original scale, so that it would 
read "As visitors to my country," and thus 
avoid the double-endedness of its original word- 
ing. In the 1933 revised scale, Item 5 would 
have to drop the "merely.") Two very recent 
research reports have explicitly confirmed this, 
finding that in larger batteries of items dealing 
with attitudes toward Negroes, only items in 
the social distance domain “scaled.” 1112 

The Social Distance Scale shares, of course, 
the difficulties of all measuring instruments de- 
pending upon voluntary self-description by the 
population under study. This weakness is to 
be found in all published attitude tests, and in 
all interest tests except those utilizing differen- 
tial information profiles. In spite of this limita- 
tion, much valuable research has been done with 
such techniques, characteristically using ano- 
nymity as a substitute for disguise. Another 
note of caution seems indicated where, as in the 
studies of Hartley? and Murphy and Likert,* 
the social distance test involving a large number 
of out-groups is scored to get one general ethno- 
centrism or xenophobia score. As Cronbach * 
has so ably pointed out, any repetitive response 
required of the respondent may create “response 
sets,” which in an irrelevant way increase the 
internal consistency of the test. Both Murphy 
and Likert, and Hartley report split half relia- 
bilities, corrected, in the range .94 to .97 with 
the social distance test involving 21 to 32 social 
groups to be judged. This reliability for a test 
taking but 10 minutes is so high as to cause sus- 
picion rather than comfort. The present reviewer . 
would also tend to interpret as evidence of re- 
sponse set rather than validity Hartley’s finding 
that the social distance expression to three non- 
such ethnic groups (e.g., Pireneans) correlated 
.80 with the composite social distance score based 
on 32 actual ethnic and nationality groups. These 
qualifications are minor, however, and do not 
affect the more sociological use of the test for 
ranking social groups in popular favor. Among 
social attitude tests, the Social Distance Scale 

11 Schuessler, К. F, “Item Analysis in Scale Analysis.” Paper 
read at the 46th Annual Meeting of the American Sociological 
Society, Chicago, September 7, 1951. | de 

12 Suchman, Edward A. "Attitude Research in the Elmira 
Study." Paper read at the 46th Annual Meeting of the American 
Sociological Society, Chicago, September 6, 1951. 

13 Cronbach, Lee J. “Further Evidence on Response Sets and 
Test Design." Ed & Psychol Meas 10:3-31 sp '50. * (PA 


PA 25:681 
Cronbach, Lee J. “Response Sets and Test Validity.” Ed РД 
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is so good, and so naturally suited to its purpose, 
that if Bogardus had not invented it, some one 
else would have. Such a situation is rare indeed 
in the social sciences. 


[89] 
*Social Intelligence Test: George Washington 
University Series, Revised Form. Grades 9-16 and 
adults; 1930-49; 3 editions; 1 form; no data on relia- 
bility and validity in manual ; no norms for pth scores ; 
manual ('49); names and faces sheet (’48); soc per 
specimen set; postpaid; Е. A. Moss, Thelma Hunt, and 
K. T. Omwake; Center for Psychological Service, 
George Washington University. * 2 s 
а) SECOND EDITION. 1930-49; 6 scores: judgment in 
social situations, recognition of the mental state of the 
speaker, memory for names and faces, observation of 
human behavior, sense of humor, total; 1 form, '49; 
$3.75 per 25; 49(55) minutes; prepared with the assist- 
ance of L. G. Woodward. 
b) SHORT EDITION. 1944; 5 scores: same as for Second 
Edition except for memory for names and faces; $3.75 
per 25; 40(45) minutes. А 
с) SP (SPECIAL) EDITION. 1947; 3 scores: judgment in 
social situations, observation of human behavior, total ; 
$3 per 25; 30(35) minutes. 
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For reviews by Glen U. Cleeton and Howard 
R. Taylor, see 3:96; for a review by Robert L. 
Thorndike, see 40: 1253. 
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*SRA Junior Inventory. Grades 4-8; 1951; 5 
scores: my health, getting along with other people, 
about me and my school, about myself, about me and 
my home; IBM; 1 form; separate answer pads or an- 
swer sheets must be used; $1.15 per 25 ода 5Ф рег 
specimen set; cash orders postpaid; nontimed (30-40) 
minutes; H. H. Remmers and Robert H. Bauernfeind ; 
Science Research Associates, Inc. * 
а) FORM AH (HAND SCORING EDITION). 49€ per test 
and answer pad; $1.90 per 25 answer pads. 
b) FORM AM (MACHINE' SCORING EDITION). 394 per 
test; $2.90 per 100 IBM answer sheets ; $1.50 per set of 
scoring stencils. 
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Dwicut І. Arnotp, Professor of Education, 
and Director of Guidance Testing, Kent State 
University, Kent, Ohio. 

The development of problem check lists for 
use in the high school and elementary school is 
a promising step in developing guidance pro- 
grams and educational programs more nearly 
geared to the actual living problems of boys and 
girls. The publication of instruments in this area 
and the addition of further data are definitely 
desirable. 

Basic procedures indicated in the development 
of this inventory seem sound and thorough. 
The items in the inventory were developed from 
statements of pupils. The inventory has been 
given to a sufficiently large number of pu- 
pils representing various regions and types of 
schools so that the norms must be recognized as 
being not only adequate but well presented. Ta- 
ble 7, giving the percentage of children checking 
each item from the various subgroups by sex, 
grade, community, region, and social economic 
status, is a very significant table. In fact, it 
greatly enhances the value of such an inventory 
and answers a number of questions which may 
properly be raised about this instrument. These 
data show that such a device is usable in the 
fourth and fifth grades. They show the extent 
to which the grade level and type of community 
influence the number of items checked by these 
children. The data in this table are so significant 
that its value may easily be overlooked. This and 
similar studies can well be the source of sig- 
nificant information for curriculum development 
as well as for suggesting types of guidance pro- 
grams and activities needed. This table also sug- 
gests an important use that may be made of such 
instruments, 

The suggestions given in the manual for use 
are well stated and helpful. No caution is found 
in the manual about the possibility of negative 
reactions from parents and children in the use 
of the inventory. Although this does not occur 
frequently in using such problem check lists in 
the school, users should be cautioned to prepare 
the students adequately and to have a reasonably 
favorable attitude in the community before such 
an instrument is extensively used. It is probably 
a desirable suggestion that such an inventory 
should be used on a voluntary basis unless it is 
used without names for research purposes. There 
should be some caution in the use of these in- 
ventories regarding the danger that users may 
get into psychological problems of greater depth 
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than they are prepared to handle. It is true of all 
personality inventories and check lists that harm 
may be done by attempting to deal with prob- 
lems at a much deeper level than these instru- 
ments allow. The major emphasis should be on 
the recognition of problems as normal and on the 
common sense ways of dealing with these prob- 
lems or of talking them through with trained 
persons who are working in areas where they 
have had adequate background and training. 

A profile chart for a problem check list seems 
to be entirely out of place. For example, what 
does it mean to indicate that a child who marks 
one item stands at the r4th percentile in the 
number of problems checked about himself 
whereas a student who marks four items is at 
the soth percentile? This profile violates the gen- 
erally accepted principle that high points on 
the profile should represent desirable scores. 

Failure to give credit to the extensive and 
excellent work done by Ross Mooney at Ohio 
State University in the development of problem 
check lists points either to negligence or to in- 
tentional omission, neither of which is defensible. 

The form of the inventory using several dif- 
ferent sheets and a special carbon answer pad is 
a needlessly complicated and much more expen- 
sive instrument than is needed for scoring which 
involves simply counting items checked. Ac- 
cording to prices from the 1951 catalogue the 
SRA Junior Inventory will cost a minimum of 
16¢ a pupil whereas the Mooney Problem Check 
List available for grades 7-9 is available for 6 to 
7 cents apiece. 

While this inventory seems to function satis- 
factorily at the junior high school level, the 
wording of items seems decidedly more appro- 
priate for grades 4 to 6 than for junior high 
school. For grades 7-9, the Mooney Problem 
Check List, Junior High Form, is decidedly 
more appropriate. Aside from high cost and 
needless complexity, the SRA Junior Inventory 
seems a satisfactory instrument for the purpose 
intended, especially for grades 4—6. 


[91] 

*SRA Youth Inventory. Grades 7-12; 1949-50; 9 
scores: my school, looking ahead, about myself, getting 
along with others, my home and family, boy meets girl, 
health, things in general, basic difficulty; IBM; 2 edi- 
tions; 1 form, '49; manual for grades 9-12 (49) ; sup- 
plementary manual for grades 7-9 ('50) ; separate an- 
swer sheets must be used; $1.15 per 25 profiles; 756 per 
specimen set; cash orders postpaid; nontimed (35-45) 
minutes; H. H. Remmers, Benjamin Shimberg, and 
А, J. Drucker; Science Research Associates, Inc. * 

а) FORM AH (HAND SCORING EDITION). 49€ per test and 


answer pad; $1.90 per 25 answer pads; 50€ per scoring 
stencil. 

b) FORM AM (MACHINE SCORING EDITION). 30€ рег 
test; $2.90 per 100 IBM answer sheets; $2.50 per set 
of scoring stencils. 
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Kennetu E. CLARK, Associate Professor of 
Psychology, University of Minnesota, Minne- 
apolis, Minnesota. 

This checklist of 298 questions “has been 
designed as a tool to help teachers, counselors, 
and school administrators to identify quickly the 
problems that young people say worry them 
most. Its results should also prove useful to 
students and their parents." 

+ Items for the eight major areas of problems 
tapped by the inventory came from analysis of 
“hundreds of essays" obtained by asking stu- 
dents in more than 100 high schools throughout 
the country to state anonymously in their own 
words “what things bothered them most.” 
Questions prepared to cover as wide a range of 
the problems as possible were then administered 
to “thousands of high school students in every 
section of the country.” The resulting inventory 
covers in very satisfactory fashion the problems 
young people consider to be most important. 

Responses to items are either check marks (if 
the statement “expresses something that has 
been a problem to you”) or blanks. A self-scor- 
ing answer sheet is provided in addition to sep- 
arate machine scorable answer sheets. Scoring 
of the responses in the eight areas may be done 
by the student himself but scoring of the Basic 
Difficulty items is reserved for the counselor or 
teacher. The general wording of questions, 
methods of scoring, and methods of translating 
raw scores into percentile scores are carefully 
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planned and make the instrument exceedingly 
easy to administer. 

The interpretation of scores is not easy. The 
authors suggest that the high points on a profile 
of scores be used by students to stimulate them 
to solve their own problems and that teachers 
use the inventory to learn more about their stu- 
dents and their problems. Counselors would 
want to interview first those persons with high 
Basic Difficulty scores and should consider that 
any score above, say, the 75th percentile indi- 
cates a need for counseling. In counseling, item 
responses for an individual or for a class may be 
discussed. The authors caution that the use of 
scores by themselves to indicate maladjustment 
is not warranted—that these scores merely indi- 
cate the relative frequency, not the intensity or 
severity, of problems. The manual is devoted in 
large part to discussion of appropriate uses of 
the inventory. 

Items in the inventory are divided into eight 
areas, on a judgmental basis, checked by inter- 
nal consistency analysis. Evidence presented in- 
dicates that each of the eight areas is sufficiently 
homogeneous to warrant use of a single score for 
it. Intercorrelations of area scores are reported 
as ranging from .20 to .67 with a median inter- 
correlation of .46. No test-retest reliabilities of 
area scores are presented: hence, the user can- 
not estimate whether area scores are relatively 
stable indicators of problems over a long time 
interval. No evidences of longitudinal time 
changes in scores of any individual or group are 
presented. The reliabilities of the area scores, 
estimated by Kuder-Richardson methods, seem 
high enough for individual use with the possible 
exception of the health area key. 

The norms are more adequate than those for 
most instruments of this sort. The normative 
sample is large, spread across the entire country, 
and stratified to approximate known population 
characteristics. Separate norms аге presented 
for sexes, and condensed norms are provided for 
16 subgroups broken on sex, grade, and rural- 
urban distribution. In addition, the percentages 
of young people checking each item are pro- 
vided, broken down by sex, grade, geographic 
region, rural-urban distribution, religion, and 
family income. 

д It is apparent that the authors consider this 
instrument to be both a measure of adjustment 
and an aid in the counseling interview. Thus, 
‘the inventory is supposed to provide an indica- 
tion of what a student thinks are his problems.” 


Used for this purpose, no validation is possible. 
The authors do expect, however, that the in- 
ventory will have external validity. Preliminary 
evidence on this point, presented in the manual, 
indicates that 22 out of 35 students independ- 
ently rated as well adjusted scored above the 
median on the Basic Difficulty key, and that 36 
of 57 poorly adjusted students had scores below 
the median. This validity evidence is not strik- 
ing, perhaps because of the unreliability of the 
independent ratings of adjustment, and is prac- 
tically useless to the teacher or counselor, since 
interpretations of profiles will presumably be 
made using the 75th percentile as the cut-off 
point (at the authors’ suggestion) rather than 
the median. 

This inventory thus shows promise of being 
useful as a screening or diagnostic instrument in 
school counseling situations, but is not currently 
ready for such use, except in the hands of a 
carefully trained person. It may be used as an 
aid to interviewing, but again, only if the person 
doing the interviewing is well qualified. For ex- 
perimental or survey purposes it is probably as 
good as any available device of its kind. 


FRANK S. Freeman, Professor of Psychology 
and Education, Cornell University, Ithaca, New 
York. 

This checklist of 298 questions, subdivided as ' 
indicated above, is intended to assist school per- 
sonnel, parents, and the pupils themselves in 
identifying problems that junior and senior high 
school pupils indicate as worrying them most. 

Original materials for the inventory were ob- 
tained by having high school pupils write un- 
structured, anonymous essays on problems of 
greatest concern to them. Essays were obtained 
from “more than 500” respondents in “about 40 
schools.” The content of these was analyzed; 
earlier investigations and inquiries regarding 
teen-age problems were surveyed; and on the 
basis of the analyses and surveys, the preliminary 
list of items was devised. These items were then 
edited by members of the staff in Psychology 
and Education of Purdue University. Thus, the 
items arrived at had the benefit of direct con- 
tribution from the persons primarily concerned, 
professional evaluation, and comparative study. 
The preliminary inventory was administered to 
approximately 15,000 pupils in grades 0—12 and 
about 4,000 in grades 7 and 8, widely distributed 
geographically. Actual statistical analysis of the 
test results was made upon a stratified sampling 
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of 2,500 cases, stratification being based upon 
sex, school grade, religion, urban or rural resi- 
dence, and economic level. 

The manuals accompanying the inventory 
were carefully prepared; they give a complete 
descriptive background and a full statistical ac- 
count of the results, as well as a discussion by 
the authors of the possible uses which may be 
made of the inventory's results by educators, 
psychologists, and parents. The authors are 
properly cautious in discussing the interpreta- 
tion of an individual's scores and responses, 
pointing out that these must be used together 
with other information about that individual. As 
in the case of all other pencil and paper per- 
sonality inventories, there is always the danger 
that in this inventory, too, the nonprofessional 
persons in psychology or counseling will be too 
much impressed and guided by scores, norms, 
and percentile ranks in interpreting an indi- 
vidual's responses. The authors, however, have 
done well to caution repeatedly against mechani- 
cal use of the results and against failure to dis- 
tinguish between items that suggest concern 
with relatively superficial problems and those 
that suggest basic difficulties requiring interview 
or therapy with one who is professionally qual- 
ified to deal with problems of personality and 
adjustment. 

Reliability coefficients of this inventory were 
calculated separately for each of the eight areas, 
using the Kuder-Richardson Case II formula. 
These coefficients varied from .75 (Health) to 
94 (My Home and Family), the median being 
.88. It.is doubtful that the areas and individual 
items of this instrument satisfy the conditions 
and assumptions specified for the use of this 
method of estimating reliability. And even if 
the inventory did satisfy the specified conditions 
and assumptions, we may seriously question 
whether this is the most appropriate procedure 
for estimating reliability of this type of testing 
device. The procedure employed yields an in- 
dex of the test's internal consistency at the par- 
ticular moment the individuals were answering 
the questions, as the authors themselves recog- 
nize. In using an inventory like this one, how- 
ever, we should be more interested, as psycholo- 
gists and counselors, in how consistent and 
persistent each individual's concerns are from 
day to day, over a length of time, rather than 
how an individual feels at a particular moment. 
The practical difficulties of determining relia- 
bility by the test-retest method, here being sug- 


gested, are considerable. One such study has 
been reported by Jacobs (5) with a small num- 
ber of cases (N = 48) retested after an interval 
of one month, The reliability coefficients ranged 
from .72 (Health) to .88 (My Home and Fam- 
ily), with a median of .82. 

Estimation of validity of a personality inven- 
tory is always difficult ; and this inventory is no 
exception. In the first place, we may agree with 
the authors that since the worries, concerns, and 
problems included in the inventory were ini- 
tially specified by the teen-age groups them- 
selves, the items undoubtedly have a degree of 
face validity. In addition, a biserial coefficient 
of correlation was calculated for each item with 
the total score of its own area (N = 1,000). For - 
the eight areas the range of the medians is from 
.50 to .70; the median of these medians being 
approximately .55. The range of r’s between 
items and their respective area scores is great, 
in one instance having an approximate range 
from .95 to .25. Clearly, then, the degree of in- 
ternal homogeneity of items is, on the whole, 
moderate and varies considerably between items. 
Another validating procedure used was to in- 
tercorrelate total scores of the eight problems 
areas. The median coefficients for the eight areas 
varied from .39 to .51; and the median of the 
medians is approximately .46. These would in- 
dicate a moderate, but, under the circumstances, 
not excessive, amount of communality between 
areas. Other tentative validating procedures 
were used: such as judgment of psychologists 
and guidance counselors regarding diagnostic 
value of items, comparisons of counselors' rat- 
ings with inventory scores, and comparisons of 
scores obtained by "known groups." These re- 
sults offer some support for the inventory, but 
they are at present tentative. 

On the whole, this inventory appears to be 
one of the sounder instruments in its field. The 
authors describe in detail and with care their 
procedures in deriving the items and in stand- 
ardizing them. They present sufficient data to 
enable prospective users of the inventory to 
evaluate it for themselves. They have drawn 
their raw materials from a wide area and from 
large numbers of subjects. They recognize the 
present limitations of the inventory and are 
properly cautious in discussing its possible uses. 
In the hands of professional psychologists and 
counselors this instrument should be useful and 
more valid than some others of the same type in 
facilitating the discovery of areas of difficulty 
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and maladjustment as a basis for subsequent 
counseling or therapy. 
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*Study of Values: A Scale for Measuring the 
Dominant Interests in Personality, Revised Edi- 
tion. College and adults; 1931-51; 6 scores: theoreti- 
cal, economic, aesthetic, social, political, religious; 1 
form, °51; manual ('51); $2.50 per 25; 40€ per speci- 
men set, postpaid; nontimed (20) minutes; Gordon W. 
Allport, Philip E. Vernon, and Gardner Lindzey; 
Houghton Mifflin Co. * 
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Factors and Reli 


Harrison С. Goucu, Assistant Professor of 
Psychology, University of California, Berkeley, 
California. 

This is a 1951 revision of one of the better 
known and more widely used personality tests. 
The items have been somewhat streamlined and 
modernized, and an attempt has been made to 
sharpen the conceptions underlying the social 
scale. The result is a test similar to the earlier 
version in all important aspects, but of more 
pleasing format, more clearly differentiated di- 
mensions, and improved statistical reliability. 

Evidence is cited in the manual showing the 
degree to which the various scales reflect group 
differences. These citations, and an abundance 
of evidence from the research literature, indi- 
cate that the six value scores do relate in the 
manner specified by the type-theory to a wide 
variety of criteria. For example, clergymen at- 
tain high scores on the religious scale, business 
administration students on the economic scale, 
and medical students on the theoretical scale. 

The manual also presents an interesting and 
useful section on research applications of the 
test. This section is helpful in indicating the pos- 
sible range of application of the instrument, and 
in providing hints concerning the authors’ con- 
victions about the general utility and meaning- 
fulness of the test. ; 

The test does not seek to measure the absolute 
strength of each of the six values, but rather the 
relative intensity. Thus, a high score on one value 
must be compensated for by a lower score.else- 
where. This mode of scoring may not be а 
deficiency, as some users claim, but it may lead 
to serious interpretational difficulty if the user 
thinks in terms of absolute levels. It would ap- 
pear to the reviewer that a modification of scor- 
ing could be achieved which would permit an 
assessment of both relative and absolute strength. 

The six values are based on Spranger's theory 
of types. The definitions appear to be somewhat 
invidious and judgmental, to a greater degree 
than is required by the “ideal type" approach. 
TThe definitions, and perhaps to an equal extent 
the questions in the test, tend to make the the- 
oretical and aesthetic values “good” ones, and 
the political and economic values “bad.” The 
danger in this is that later research concerned 
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with adjustmental and personological correlates 
of value preferences, as measured by this test, 
will necessarily rediscover the relationships al- 
ready built into the Study of Values. Biases of 
this kind would almost seem to be an inherent 
defect of tests constructed by a priori methods. 
It is an interesting theoretical question to ask 
how such bias can be avoided in a test where the 
test maker decides, by himself, what questions 
will be asked and which answers will be con- 
sidered “correct.” 

The test is suitable for high school and college 
groups. The language is too academic and in- 
volved for use in groups very far removed from 
a scholastic environment. There may be some 
question, too, as to whether the notion of value 
orientations of the kind invoked here has much 
applicability outside the realm of highly educated 
groups. 

The test is also fairly transparent in meaning 
and import. There are no internal checks on 
response validity, dissimulation, or on less con- 
scious response sets. Useful results with the test 
are thus predicated upon the interest and co- 
operation of the subject. 

One feature which is sorely missing from the 
manual is a discussion of the use of the test with 
individual cases. This is a somewhat surprising 
omission in view of the senior author’s identifica- 
tion with the idiographic point of view in psy- 
chology. The test does have interesting and 
useful properties in individual guidance and 
assessment, and it is to be regretted that the 
manual does not include some reference to this 
kind of use, and a few examples of individual 
interpretation. 

In summary, the Study of Values would ap- 
pear to be a dependable and informative instru- 
ment, It has a definite, if questionable, theoreti- 
cal basis, its statistical properties are adequate, 
and it yields scores which possess utility for 
both group and individual interpretation. The 
range of application of the test is somewhat 
limited, and the variables themselves may not 
have wide relevance. However, within the ap- 
propriate context the test possesses considerable 
merit and utility. 


Wittiam STEPHENSON, Visiting Professor of 
Psychology, The University of Chicago, Chi- 
cago, Illinois. 

This revised edition offers improvements over 
the original 1931 edition without changing the 
test in any essential respects. New questions 


have been introduced, the wording simplified, 
the scoring procedures streamlined, and fresh 
norms have been prepared. A major change in 
the basic feature of the test is the restriction of 
Spranger's "social" value to altruism or philan- 
thropy, to the exclusion of more mundane forms 
of conjugal, familial, and religious affections. 
The norms are also far less extensive for the 
revised edition. 

It is scarcely necessary for the reviewer to 
affirm or to question the general usefulness of 
this well-known test. It has been used in many 


researches and has had some indirect validation 


in several directions, e.g., in studies of semi- 
narians, novices in religious communities, col- 
lege students, married couples, and the like. 
As is well known, the test was based upon 
Spranger's Types of Men which specifies six 
types of persons : theoretical, economic, aesthetic, 
social, political, and religious. Assuming that 
personality is best known through a study of 
such values, the test attempts to indicate in which 
of these classes a person most dominantly be- 
longs. 

But twenty years have elapsed since this test 
was first introduced, and it is perhaps the oc- 
casion for evaluating it as а research tool, be- 
fore it sets out in its revived condition. It has 
always seemed an anomaly to the reviewer that 
one of the great critics of nomothesis and advo- 
cates of the ideographic approach to the study of 
personality should take part in the production 
of a test of this kind without fostering its sup- 
posed theoretical formulations on their own ac- 
count. One need not deny a certain very crude 
testability, along these test lines, of some of 
Spranger’s notions, but it should not at this time 
be taken on faith, or because definitive norms are 
available for it, or because external validations 
can be reported for it. After all, the ability to 
stand on one leg might provide similar norms 
and similar external validities. Some researches 
which are intrinsic to Spranger's theory and 
which seem to us to be essential have yet to 
be attempted, although methods now exist for 
such studies. Spranger's types were far more 
than averages of persons of a kind; a group or 
class of persons did not define them. Rather, 
the types were theoretical formulations that 
preceded the possible demarcation of classes of 
persons, It is important, therefore, to represent 
these theoretical matters ; and although Spranger 
is not easy to understand, it is possible to con- 
struct balanced block designed samples of state- 


м; 


‚ manual gives an extensive report of the statisti- 
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ments for Spranger's theory as such, at least 
with respect to what could reasonably be the 
independencies for his theory. Propositions can 
then be asserted about, say, any experimental 
subject, and empirical tests made of these. A 
theory like Spranger's cannot be proved or dis- 
proved for its "general implications" (every- 
where involved in the test under review). In- 
stead, we can test propositions about or in 


relation to it. Thus, when the present reviewer - 


uses this test, its questions are cut to pieces and 
divided into briefer units; a structured design 
is made of these, following R. A. Fisher's well 
known devices ; experiments are then conducted 
about a single curate, seminarian, or college 
student to see what Spranger’s theory as such 
can bring to light. We would suggest, therefore, 
that the time has come for those who issue tests 
of this kind to provide evidence that the theory 
they purport to be employing has intrinsic points 
of interest and value. 

Spranger was undoubtedly a subtle psycholo- 
gist, the first of Dilthey’s breed: it would be 
pleasant to see even one study, along the meth- 
odological lines for which Dilthey stood pre- 
eminent in his day, pursued in terms of singular 


propositional tests of the theory itself. Otherwise — 


there will be more decades of studies, finding 
little bits of fact here and there through the test 
but never any advance upon its elementary nomo- 
thetical formulations, against which, it so hap- 
pens, Dilthey, Spranger, and Allport have all 


` objected, and, we believe, correctly so. 


J Consult Psychol 15:515 D ' 5r. Laurance F. 
Shaffer. After twenty years of wide and varied 
use, the Study of Values has been revised thor- 
oughly, There are changes both in content and in 
format. The old "social" value, which had proved 
unreliable because of adherence to Spranger's 
excessively broad definition, has been redefined 
in terms of altruistic or philanthropic interest. 
мре of the items have been revised, to simplify 
wordings, and to eliminate outdated and over- 
specific cultural references. The new items give 
the impression that they are likely to maintain 
their meanings with the passage of time. The 


cal characteristics of the revision. New and re- 
vised items were selected by three successive 
item analyses, and each scale is internally con- 
sistent. Scale reliabilities are substantially im- 
proved, ranging from .87 to :92 on a one-month 
retest. Scale intercorrelations are mainly low. 


Norms are supplied in terms of the means and 
SD’s of 1,816 college students, and are further 
analyzed in terms of sex and of occupational 
groups. The new format simplifies the process of 
scoring and tes the need for a separate 
score sheet. In its excellent new revision, the 
Study of Values seems destined to several more 
decades of service as a tool for guidance and for 
research. 


For a review by Paul E. Meehl of the first 

edition, see 3:99. 
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X*Thurstone Temperament Schedule. Grades 9-16 
and adults; 1949-50; 7 scores: active, vigorous, im- 
pulsive, dominant, stable, sociable, reflective; IBM; 
2 editions; 1 form, '49; manual ('50) ; separate answer 
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edition; cash orders postpaid; nontimed (10-20) min- 
utes; L. L. Phurstone; Science Research Associates, 
Inc. * ) 
а) FORM AH (HAND SCORING EDITION). 49€ per test and 
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b) FORM AM (MACHINE SCORING EDITION). IBM; 30d 
per test; $2.90 per roo IBM answer sheets; $2.50 per 
set of machine-scoring stencils. 
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Hans J. Eysenck, Director, Psychology De- 
partment, Institute of Psychiatry, Maudsley 
Hospital, London, England. 

This inventory contains 140 “Yes” “?” “No” 
questions, which measure seven areas of рег- 
sonality : active, vigorous, impulsive, dominant, 
stable, sociable, and reflective. These seven areas 
were derived from a factorial analysis not yet 
reported in print, so that it is impossible to 
express an opinion regarding the basis of this 
questionnaire. It is disturbing to know, how- 
ever, that although this factorial analysis in- 
cluded the various factors isolated by Guilford, 
the result is only very superficially like Guil- 
ford’s set of factors. While the reviewer believes 
that factor analysis is indispensable in arriving 
at useful and fundamental dimensions of per- 
sonality, he feels that divergent results of this 
kind must make one very careful of accepting 
any particular solution unless it is strongly sup- 
ported by the evidence. Intercorrelations among 


“some of the seven traits are surprisingly high, 


orrelations of .71 and .52 being reported, so 


that one of the main advantages of factorial de- 


sign, namely, the independence of factors, is 
clearly sacrificed. Reliabilities for the seven areas . 
are not very high for any of the standardised 
groups (men, women, boys, and girls). Coeffi- 
cients of .45, .48, .46, occurring quite frequently. 
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with the highest coefficient .86, are still well 
below the .9 level. An ingenious argument is 
presented in the manual to show that even with 
a reliability as low as .4, not more than one sixth 
of the population will be misclassified as being 
in the highest quarter when their true scores 
are below the mean. This, of course, is true, but 
such a gross misclassification of almost 20 per 
cent of the population is a serious matter if any 
action is taken on the basis of the results. Also, 
of course, it should be borne in mind that like 
the proverbial dying man grasping at a straw, 
the user of a questionnaire, whether he be coun- 
selor or clinical psychologist, frequently inter- 
prets even very slight differences in profile 
scores, and he is unlikely to be dissuaded from 
this practice by any amount of statistical argu- 
ment regarding lack of reliability of the scores. 

No follow-up data are reported, except the 
failure of the schedule to discriminate between 
good and poor office workers. 

As a research instrument, no doubt, this 
schedule has a place. Its clinical and industrial 
use, however, must await proper validation 
studies. It should be added that in a text depend- 
ent for its appraisal so much on statistical de- 
tails of factorial analysis, more information 
should be given in the manual to enable the 
reader to form an adequate opinion. —.. ^ 


СнАВІЕЅ M. Harsu, Research Psychologist, 
U.S. Navy Electronics Laboratory, San Diego, 
California. xe 
This r4o-item self-administering question- 
naire is designed to give a brief appraisal of 
seven relatively permanent aspects of tempera- 
ment in persons who are fairly well adjusted. 
The simple items calling for “yes,” “?,” “no” 
responses are suitable for high school students 
or average adults, Instructions are clear and 
items are well spaced. The separate answer sheet 
is easy to use, the appropriate column always 
being aligned with the items on the suctessively 
narrower pages of the test booklet, each page 
revealing one more column of the answer sheet. 
Raw scores may be plotted directly on the norma- 


tive profile forms printed on the back of the . 

separate answer sheet, or the scores may be. 

Mo e relative to office worker norms printed | 
t 


in the examiner's manual. Each set of norms is 
based on a sample of from 400 to 1,200 persons. 

Split half reliabilities for the seven subscores, 
determined for five different samples of high 
school students or adults, vary from .45 to .86. 
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Retest reliabilities on a male executive group 
range from .61 to .82. The manual has a good 
discussion of the reliability of a profile, which 
for this type of schedule is more important than 
the reliability of the subscores. Tables of inter- 
correlation of subscores (for men and women 
separately) show that most of the scores are 
statistically independent, although two correla- 
tions are above .50. { | 

The seven aspects of temperament were de- 
termined by refactoring of Lovell's scoring of 
Guilford's three personality inventories. Items 
derived from several personality and preference 
tests were tentatively scored for these seven as- 
pects of temperament, leaving out any implica- 
tion of maladjustment, and the twenty most 
discriminating items in each area were selected 
for the present schedule. For various sample 
populations, the manual presents tables of inter- 
correlations between the subscores and the sub- 
scores of the Guilford-Martin questionnaires, 
the Study of Values, the Kuder Preference Rec- 
ord —V ocational, the Kuder Preference Record 
— Personal, and the Thurstone Interest Sched- 
ule. This statistical information will give helpful 
orientation until there is more direct evidence 
of the validity of the Thurstone Temperament 
Schedule as a supplementary aid to vocational 
guidance or placement. The manual frankly ad- 


mits the need of validity studies and discusses . 


the failure of the schedule to discriminate be- 


tween good and poor office workers in one study. 


The excellence and clarity of the manual en- 
courages a favorable attitude on the part of 
prospective users who could contribute to vali- 
dation studies of this little test, which may be 
of use in prescreening for longer or more dis- 


criminating specialized tests. 


Davin G. Ryans, Director, Teacher Character- 
istics Study, American Council on Education; 
and Professor of Education, University of Cali- 
fornia, Los Angeles, California. E 
This questionnaire was designed to describe 
normal, well adjusted individuals from the stand- 
point of important and stable personality traits. 
It is not intended to reflect psychotic or neurotic 


_tendencies. Seven areas of temperament are ap- 


praised: Active (referring to the tendency to 
hurry and be “оп the go") ; Vigorous (relating 
to the liking of physical activities and outdoor 
occupations) ; Impulsive (referring to the tend- 
ency to make quick decisions and ability to 
change easily from one task to another) ; Domi- 
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nant (referring to the desire for leadership, and 
for taking initiative and responsibility) ; Stable 
(referring to the tendency to be relaxed and to 
remain calm) ; Sociable (referring to the liking 
for other people and the ability to get along with 
others) ; and Reflective (referring to liking for 
quiet work and reflective thinking). 

The 140 items covering these seven areas are 
presented in question form. They are printed in 
a 6-page step-down booklet which can be used 
with either a self-scoring carbon answer pad or 
a machine-scorable answer sheet. Profile charts 
are provided for adults, with separate normative 
data for men and women, and for high school 
youth, with separate norms for boys and girls. 

Preliminary to development of the T'hurstone 
Temperament Schedule {һе intercorrelations 
among 13 personality scores yielded by the Guil- 
ford and Guilford-Martin schedules were factor 
analyzed by the centroid method, with scale 
reliabilities in the diagonal cells of the matrix, to 
determine the number of factors that seemed to 
be represented by questionnaire materials per- 
taining to normal persons. Nine factors were 
extracted and then rotated to oblique simple 
structure. Two factors were regarded as residual 
factors. The remaining seven formed the frame- 
work for the present instrument. All available 
interest and personality questionnaires were sur- 
veyed and 320 adapted items chosen as repre- 
sentative of the seven personality factors. Subse- 
quent item analysis against the upper and lower 
thirds of each area resulted in the selection of 
140 items. 

The norms available in 1950 were contributed 
to by 694 freshman men and 161 freshman 
women at the University of Illinois (Navy Pier), 
540 male and 496 female office workers, and 
Chicago high school students consisting of 419 
boys and 504 girls of ages 14 to 18. 

Odd-even reliabilities reported in the manual 
range from .45 for the Reflective scale to .86 
for the Dominant scale, based upon the scores 
of 106 subjects. Odd-even reliabilities given for 
five samples cluster around an > of .65. Test- 
retest reliability coefficients, based upon the 
scores of 81 male executives, range from .61 to 
.82 (median — .78) for the several scales. 

Intercorrelations of the scores on the seven 
scales are generally positive and relatively low. 
For 338 college freshmen the intercorrelation 
coefficients ranged from —.16 between the So- 
ciable and Reflective scales to .60 between the 


Dominant and Sociable scales, .58 between the 
Impulsive and Dominant scales, and .53 between 
the Impulsive and Sociable scales. The tendency 
for the Impulsive, Dominant, and Sociable scales 
to be significantly correlated also was noted in 
studies based upon 694 freshman men students 
and 161 freshman women students at the Uni- 
versity of Illinois. The only sizeable negative cor- 
relation reported was one of —.41 between the 
Sociable and Reflective scales, based upon the 
scores of 161 college freshman women. 

Of particular interest are intercorrelations be- 
tween scores on the scales of the Thurstone 
Temperament Schedule and scores on other per- 
sonality and interest inventories. Correlations 
between the scales of the T'hurstone Tempera- 
ment Schedule and the Kuder Preference Rec- 
ord—Personal are generally low positive and 
range from —.37 between the Impulsive scale 
(Thurstone) and Practical (Kuder) to .57 be- 
tween Dominant (Thurstone) and Sociable 
(Kuder). Correlations between the scales of 
the Thurstone Temperament Schedule and those 
of the Kuder Preference Record—V ocational 
are generally low, although there is a suggestion 
that the Vigorous scale (Thurstone) and the 
Clerical scale (Kuder) and the Reflective scale 
(Thurstone) and the Mechanical scale (Kuder) 
are significantly negatively correlated, whereas 
the Dominant scale (Thurstone) and Persua- 
sive scale (Kuder), the Sociable scale (Thur- 
stone) and the Persuasive (Kuder), and the 
Vigorous scale (Thurstone) and the Mechani- 
cal scale (Kuder) are significantly positively 
correlated. Correlations between scores on scales 
of the Thurstone Temperament Schedule and 
scores on the SRA Primary Mental Abilities are 
generally of zero order, the only exceptions 
being 7’s of .25 and .28 respectively between 
Dominant and Sociable (Thurstone Tempera- 
ment Schedule) and Word Fluency (PMA). 

Although the reliabilities of these scales are 
none too high, probably due to their shortness, 
and although the empirical intercorrelations of 
the scales clearly indicate they are not independ- 
ent (no claim for independence has been made 
by the author or publisher; rather, the factor 
analysis indicated oblique structure) the con- 
struction procedure followed appears to have 
been sound ; and the author and publishers have 
been particularly attentive to the supply of in- 
formation about the instrument and interpretive 
data, х 
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J Consult Psychol 15:170 Ap ’51. Laurance 
F. Shaffer. A distinctive and perhaps valuable 
feature of this questionnaire is its emphasis on 
normality. There are no items describing neu- 
rotic symptoms, and the seven scores that are 
obtained from it are all designated by trait terms 
descriptive of ways in which normal people may 
vary—active, vigorous, impulsive; dominant, 
stable, sociable, and reflective. The seven clus- 
ters in which the 140 items are arranged were 
obtained from factor analyses by Thurstone and 
Guilford. The actual items were selected for 
internal consistency within each cluster. The re- 
liabilities are moderate, .48 to .77 for the adult 
male group, the intercorrelations between trait 
scores are generally low, and the norms are based 
on adequately large groups. Despite all these 
statistical virtues, the interpretation of the sched- 
ule is almost surely subject to abuses. Many 
users will be tempted to follow the intriguing 
suggestions for using it in vocational guidance, 
without paying enough attention to the cautious 
footnote that calls attention to the need for vali- 
dation. 


[94] 
Vineland Social Maturity Scale. Birth to maturity ; 
1935-47 ; individual; 1 form, 46, c1936; manual C47); 
$1.50 per 25; $1 per manual; postage extra; $1 per 
specimen set, postpaid; (20-30) minutes; Edgar. 
Doll i Educational Test Bureau, Educational Publish- 
ers, Inc. * 
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WILLIAM M. CRUICKSHANK, Director, Educa- 
tion for Exceptional Children, School of Educa- 
tion, Syracuse University, Syracuse, New York. 

Since it is essential that clinical testing keep 
in step with theoretical advances in personality 
description, it is fortunate that the clinician has 
available a scale such as the Vineland which can 
offer vital information on the results of the in- 
dividual's maturational interaction with his so- 
cial milieu. Although there are no direct meas- 
urements of the influence of interpersonal 
contacts, most of the items of the scale indi- 
rectly bear the impact of the developing organ- 
ism's response to the socialization process. 

The categories of adequacy which the author 
has set up to facilitate evaluation reflect very 
well the processes involved in the maturation of 
social competence. The areas of occupation, lo- 
comotion, communication, and socialization de- 
lineate the instruments available for moving 
toward people while those of self-help and self- 
direction (the latter concerning itself mainly 
with the measurement of the individual’s com- 
petent use of money) point up the necessity for 
the development of independence and freedom 
which enable one to move more confidently and 
thus more competently among others. —. 

The test does not follow the usual procedure 
of laboratory measurement since it employs the 
method of report, rather than examination or 
observation. A person intimately familiar with 
the subject, such as parent, nurse, or institu- 
tional guardian, responds to careful interviewing 
by the examiner on the subject’s demonstrated 
performance of the 117 items covered by the 
scale. The danger inherent in utilizing the po- 
tentially biased observation of untrained and 
perhaps emotionally involved reporters has been 
pointed out by some as a critical shortcoming of 
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the instrument. However, the author's claim that 
it demands administration by clinically trained 
examiners, and that the careful basis for scor- 
ing items and questioning the respondee allows 
for a minimum of inaccurate evaluation as to 
whether a behavior is or is not evidenced by the 
subject, seems to adequately handle these ob- 
jections. 

The final score is computed from the total 
number of items successfully performed, with 
consideration given in scoring for lack of op- 
portunity and for performances which are in 
a transitional stage. The score can be converted 
very simply into a social age and thence, if de- 
sired, into a social quotient. Studies by the au- 
thor have revealed that, on the average, the 
social quotient tends to be higher than the in- 
telligence quotient and, further, that when the 
IQ is low, the SQ tends to be higher and when 
the IQ is high, the SQ tends to fall below it. The 
latter may be due to the fact that there is a rather 
definite cutting off point above which one can- 
not become any more socially competent. 

It would appear that a person's social age 
provides a more effective basis of classification 
for purposes of care and training than would the 
mental age. Allied to this, the scale has demon- 
strated its ability to differentiate between true 
mental defectives who are socially inadequate 
and people who are merely of subnormal in- 
tellect but who are quite competent in managing 
their personal and social lives. It has also been 
found valuable as a measure of therapeutic im- 
provement; for intercultural comparison, i.e., 
Negro and white, Jewish and non-Jewish; in 
evaluation of the acoustically, visually, and other 
physically handicapped individuals; and as an 
index for the measurement of growth or change. 

The author has indicated that a new manual 
containing more detailed analysis of the behavior 
required for each item, additional standardiza- 
tion data, and clinical experimentation in which 
the instrument was utilized to produce empirical 
evidence is in the process of development. This 
should encourage even more widespread use of 
the scale by practicing clinicians. 


FronENCE M. TEAGARDEN, Professor of Psy- 
chology, University of Pittsburgh, Pittsburgh, 
Pennsylvania. 

This is а point and age scale combination of 
117 items of performances “іп respect to which 
children show a progressive capacity for looking 
after themselves and for participating in those 


activities which lead toward ultimate independ- 
ence as adults." It is scaled to test these capaci- 
ties from birth through age 25 plus and gives 
social ages through year 30 plus. Social quotients 
may also be derived. The functions tested are 
self-help (general, eating, dressing), self-direc- 
tion, locomotion, occupation, communication, 
and social relations. The manual of directions 
suggests that each of these functions be tested 
“serially.” Each item within the age range in- 
volved is scored either plus, plus F (temporary 
failure under certain conditions), plus N.O. (no 
opportunity), minus N.O., plus-minus, or minus. 
Determination of “по opportunity" scoring is 
sometimes quite difficult. 

Standardization data were obtained from “ten 
normal subjects of each sex at each year from 
birth to thirty years of age, or a total of 620 
subjects." Extensive research has been done 
with the scale, the manual itself reporting 59 
studies selected as being "representative." Any 
one familiar with the literature in this field real- 
izes that research on the Vineland scale has con- 
tinued since the publication of the manual in 
1947. 

Information with which to score items and 
to derive ratings of social maturity may be ob- 
tained from the subject himself, in the case of 
older subjects, or from parents and other adults 
in the case of children. Clinical use of the scale 
shows that it is perhaps even better as an inter- 
viewing device with parents than as a measuring 
scale for the child. Often information secured 
from the mother and from the father does not 
agree at all at certain points. The clinician thus 
gets a valuable “lead” even if he is not sure how 
the particular item should be scored. Frequently 
also parents will say, when asked for information 
on some item of the scale, “Oh, I believe the 
child could do that himself but I have always 
done it for him.” Excellent counseling data! Or 
again parents sometimes say, “Well, we never 
do that in our home (“Plays simple table 
games?") but I guess maybe we should play 
with the children." 

Constant use of the scale leads one naturally 
enough to see some weaknesses. For example, 
certain items frequently have to be juggled some- 
what in the case of young children living in city 
apartment houses and others living in residential 
districts or rural areas directly along through 
highways. Again “Gets drink unassisted?” (age 
level 2—3) is usually answered negatively by 
parents of children of this age as they say, “He 
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can't reach it himself.” “Brushes or combs hair 
acceptably without help?" (age level 7-8) ap- 
pears to be one thing for boys with short hair 
and quite another thing for girls who may have 
long hair that needs considerable dressing. Girls 
seem likewise to be penalized somewhat on the 
item "Does small remunerative work?" (age 
level 10-11) since many girls have no oppor- 
tunity for earning money. Parents seem a little 
doubtful sometimes as to how they should 
truthfully reply to the standardized questions 
*Washes hands unaided?" (age level 3-4) and 
“Washes face unassisted?’ (age level 4-5). 
Scoring directions for some items of the “loco- 
motion” category such as “Goes about home 
town freely?” (age level g-10) and “Goes to 
nearby places alone?" (age level 15-18) do not 
require that any social caution be displayed. This 
leaves the examiner in a considerable scoring 
quandary at times in the case, for example, of 
unmarried mothers who get around town all 
right so far as “locomotion” is concerned. “Self 
direction” is, however, required at other points 
in the scale. 

In spite of these experiential difficulties, in 
spite of the clinician’s reluctance to report social 
ages carried to the second decimal place, and in 
spite of his unwillingness to do much statistical 
handling of data of uncertain accuracy, the scale 
has undoubted value, particularly for interview 
and counseling purposes. Mentally deficient 
children in superior homes often display amaz- 
ingly high social quotients, thereby giving clues 
as to possible social development and as to super- 
vision. Contrariwise, the clinician often finds a 
child whose IQ on intelligence tests seems spuri- 
ously high. The social quotient of such a child 


may give the answer; he may be socially less . 


developed than we would have a right to expect 
from his intelligence. He may be the square peg 
in the round hole or the bull in the china shop 
or the bungler. One hundred and seventeen items 
stich as Doll has scaled for us are valuable ad- 
juncts to the clinician who is attempting to ap- 
praise the strengths and weaknesses of his client. 
Social ages and social quotients can often be 
reported with considerable conviction. Even 
when special circumstances make the use of such 
mathematical measures unwise, the incidental 
data elicited by the scale and the informant s re- 
actions to the questions often far outweigh these 
deficiencies, When used wisely, the Vineland 
Social Maturity Scale adds greatly to our clini- 
cal insights. 


For reviews by C. M. Louttit and John W. M. 
Rothney and an excerpt from a review, see 
3:107; for reviews by Paul Н. Furfey, Elaine 
F. Kinder, and Anna S. Starr of Experimental 
Form B, see 38:1143. 
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Washburne Social-Adjustment Inventory, Thas- 
pic Edition. Ages 12 and over; 1932-40; a revision of 
The Washburne Social Adjustment Inventory, Sapich 
Edition (see 40: 1262) ; 9 scores: truthfulness, happi- 
ness, alienation, sympathy, purpose, impulse-judgment, 
control, wishes, total; IBM; 1 form, '40; manual 
(до); $2 per 25; 30€ per manual ('40) ; 35€ per speci- 
men set (does not include manual), postpaid; separate 
answer sheets may be used; $1.10 per 25 IBM answer 
sheets; $2.40 per set of stencils for machine-scoring of 
answer sheets; postage extra; nontimed (30-50) min- 
utes; John N. Washburne; World Book Co. * 
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Minneapolis, Minnesota. 

This questionnaire type personality inventory 
consists of an 8-page booklet which includes an 
answer sheet for the 122 items, most of which 
are answered by “yes” or "no." The items appear 
to be of a rather obvious character (e.g., “Do 
you sometimes enjoy the sight of an animal or 
a person being hurt?” and “Did you ever act 


greedily by taking more than your share of any- 
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thing?") such as are found in the majority of 
personality inventories. The purpose of the test 
as stated in the directions for administering is 
“to determine the degree of social and emotional 
adjustment.” The keys are scored for Truthful- 
ness, Happiness, Alienation, Sympathy, Pur- 
pose, Impulse-Judgment, and Control. There is 
also a “Wishes” subtest the scoring of which is, 
however, stated to be “пої absolutely necessary." 
There is a 4-page manual of directions for ad- 
ministration; to interpret the test results, how- 
ever, it is necessary to order the manual of inter- 
pretation separately, which the reviewer regards 
as unjustifiable. It does not appear too much to 
ask of a testmaker that he make his instrument 
as compact and economical as possible for pur- 
poses of meaningful use. 

To the extent that this instrument has been 
used in the differentiation of criterion cases of 
maladjustment which have required help in a 
mental hygiene clinic and “normals” who have 
required no such assistance, it takes а salutary 
cognizance of the need for empirical meaning in 
adjustment inventories. However, the distinc- 
tion between “social” adjustment and “ето- 
tional" adjustment is by no means clear to this 
reviewer. It is also not clear whether Happiness, 
Alienation, Sympathy, Purpose, or the other 
concepts tested are to be regarded as trait con- 
cepts; nor is there any attempt at experimental 
or empirical definition of these concepts. Al- 
though mental hygiene cases may legitimately 
furnish a criterion group for the vague concept 
of “maladjustment,” it is not equally clear what 
criterion groups could be obtained to give ex- 
perimental or empirical meaning to these sub- 
tests. Purpose, Impulse-Judgment, and Control 
are abstract concepts the experimental definition 
of which is no simple task. Since the criterion 
groups for “adjusted” and “maladjusted” are so 
broadly defined it is difficult to see what informa- 
tion is gleaned from deviant scores other than 
the knowledge that they indicate “poor adjust- 
ment." It is by no means suggested that such 
knowledge is useless; however, the test would 
appear to be most useful as a crude screening 
device. It should also be noted that the obvious 
character of the items renders them suscepti- 
ble to distortion both intentional and uninten- 
tional. 

The biserial correlation coefficient of validity 
is given as .90; the test-retest reliability for col- 
lege students is .92 when the administrations are 
separated by a school semester. 


For a review by Daniel A. Prescott of the 
original edition, see 38:928; for excerpts from 
reviews, see 3:110. 


[96] 
Willoughby Emotional Maturity Scale. College 
and adults; 1931; title on test booklet is Willoughby 
EM Scale; 1 form; $1.50 per 25; 10€ per specimen set; 
cash orders postpaid; nontimed (20) minutes; Ray- 
mond R. Willoughby; Stanford University Press. * 
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and Adjustment in a Guid- 
eachers.”” J Ed Res 32:653-9 


Lyste W. Скокт, Director of Personnel, Uni- 
versity of Kentucky, Lexington, Kentucky. 

This scale was developed to indicate the in- 
dividual’s degree of freedom from childish emo- 
tional attitudes and immature motives and to 
reveal the extent to which he can make adjust- 
ments to his environment. Each of the 60 items 
describes, in terms of a hypothetical subject, a 
type of situation and a reaction to it. The rater 
attempts to rate the real subject by checking 
those items which equal a fair description of 
characteristic behavior, actual or probable. The 
manual states a subject may rate himself with 
only a small loss of precision—a rather rash 
claim when presented without qualification. 

The scoring of the scale is simple—a total 
score of weighted items which have been checked 
is averaged and multiplied by 10. The manual 
states that this represents the pooled judgment 
of 100 expert students of personality as to the 
amount or degree of emotional maturity. 

The claimed reliability of rating (which is the 
agreement between two raters) is .54 for col- 
lege student raters. Certainly the use of the scale 
is somewhat limited in view of the reported re 
liability. The author indicates there is "reason 
to believe that the services of raters experience 
in the estimation of personality will be consid- 
erably increased, possibly to .70 or .8о.” No re 
search is offered which has any bearing on the 
statement, and it should have no bearing on the 

` proven reliability of the scale. 

Tentative results on a small group of college 
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students indicate that correlations with instru- 
ments alleged to measure emotional stability will 
be in the neighborhood of .60 or .70. Evaluation 
of the validity of the scale depends upon the 
degree to which one accepts the author's state- 
ment that “There is probably no better criterion 
of the trait in question in existence at present 
than the judgment of experts." First, one could 
ask, just who are the experts and on what basis 
were they selected as experts. This statement 
alone causes one to look upon the scale with 
caution. 

The norms for the scale are based on 70 stu- 
dents, equally divided as to sex. If the scale is 
of value in counseling, why has not the author 
expanded these norms of 1931? 

The statement is made that the self-ratings 
do not show the usual tendency to overestima- 
tion, but this statement is not explained further. 

The author points out in his journal article, 
in response to the criticism that the language is 
too involved, that “while the scale is not intended 
to be restricted to use by psychiatrists, it can- 
not be used by naive or uneducated persons." 
This limitation is not incorporated into the 
manual, which is itself far too brief and does 
not do justice to the amount of statistical work 
involved in the development of the scale, as de- 
scribed in the journal article. 

Practically no research has been reported on 
the scale and certainly the original research with 
70 college students, each rating himself and two 
fellow students, is insufficient to warrant the 
use of the scale as a counseling tool. The elabo- 
rate claims made as to the uses of the scale, such 
as an aid in selecting industrial workers, teach- 
ers, lawyers, and administrators, rather than 
using references, certainly have not been proven 
valid; and until additional researches are com- 
pleted and published in a more complete manual, 
it is rather doubtful that this scale can be con- 
sidered a reliable instrument for counseling. 


[97] 

X*The Wishes and Fears Inventory. Ages 4-8, 8- 
16; 1949; 8 scores: wishes, positive identifications, 
negative identifications, desired activities, undesired 
activities, changes desired in oneself, fears, earliest 
recollection; individual; 1 form, 2 levels (labeled 
Forms А and В); 1 form ['49] ; mimeographed; man- 
ual [49]; (Forms A, B); no data on reliability and 
validity; 254 per specimen set, postage extra; record 
blanks must be reproduced by test user; (5-15) min- 
utes; Martin L. Reymert; The Mooseheart Laboratory 
for Child Research, Mooseheart, Ill. * 
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*The Wishing Well. Grades 4-7; 1945-51; an analy- 
sis of children's needs; 9 scores: belonging, achieve- 


ment, economic security, fears, love and affection, guilt, 
sharing, world outlook, total; r form, '49; mimeo- 
graphed manual ('51); withdrawn from publication; 
nontimed (30) minutes; Evaluation Division, Bureau 
of Educational Research, Ohio State University. * 
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for Identifying Some Needs of Children." Ed Res B 24:169- 
77+ О 17 'as. * 
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*Work Preference Inventory. Applicants for em- 
ployment; 1946-50; 14 scores: reliability, creativeness, 
conservatism, ambition, masculinity, introversion, anx- 
iety-depression, persuasive, social service, theoretical, 
artistic, mechanical, economic, scientific; IBM ; 1 form, 
’46; manual [’50]; separate answer sheets must be 
used; $30 per 100; $2 per тоо IBM answer sheets; $7 
er set of 14 machine scoring stencils (revised ’50) ; 
5 per 100 profile sheets ; 20€ per manual; 756 per spec- 
imen set (does not include scoring stencils) ; postpaid ; 
nontimed (15-75) minutes; Robert W. Henderson; the 
Author, 940 Eighth St., N.E., Massillon, Ohio. * 


For reviews by Edwin W. Davis, John С. 
Flanagan, and Gilbert J. Rich, see 3:113. 
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*Your Activities and Attitudes. Adults in later 
maturity; 1948-49; 2 scores: attitudes, activities; I 
form, ’48; preliminary manual (40); $2.75 per 25; 50¢ 
per specimen set ; cash orders postpaid; nontime 

90) minutes; Ernest W. Burgess, Ruth S. Cavan, and 
AM J. Havighurst; Science Research Associates, 
nc. 

REFERENCES 

т. Cavan, RurH Ѕномів; Burczss, Ernest W.; Havic- 
HURST, ROBERT J.; AND GOLDHAMER, Hersert. Personal Ad- 
justment in Old Age. Chicago, Ill.: Science Research Asso- 
ciates, Inc., 1949. Pp. xiii, 204. * (PA 25:2340 f 

2. Havicuurst, ROBERT J. AND SHANAS, ETHEL. "Adjust- 
ment to Retirement: The Fossils—A Case Study." Sociol & 
Social Res 34:169-76 Ja-F 'so. * (РА 25:981) 

3. SHANAS, ETHEL. "The Personal Adjustment of Recipients 
CE Ola Age Assistance." J Geront 5:249-53 Л '50. * (РА 25: 
1670. 

H Havicnuunsr, Rosert J. “Validity of the Chicago Attitude 
Inventory as a Measure of Personal Adjustment in Old Age.” 
J Abn & Social Psychol 46:24-9 Ја °51. * (PA 25:7469) 

J Consult Psychol 13:385 O °49. Laurance F. 
Shaffer. The first standardized inventory in- 
tended specifically for elderly adults * It consists 
of three parts, an adult activities inventory meas- 
uring reported social participations, an attitude 
inventory, and questions on background infor- 


mation. * 
, For related reviews, see ТОТ. 
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*[Re Your Activities and Attitudes.] Cavan, 
RurH SnowLE; Burcess, Ernest W.; HAVIGHURST, 
Rosert J.; AND GoLpHAMER, HERBERT, Personal Ad- 
justment in Old Age. Chicago, Ill.: Science Research 
‘Associates, Inc., 1949. Pp. xiii, 204. $3.25. * (PA 25: 
2340) 

J Appl Psychol 34:217 Je ’50. Albert К. 
Chandler. The distinctive contribution of this 
book to applied psychology lies in its detailed 
account of the development, testing and applica- 
tion of an Attitude Inventory and an Activity 
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Inventory for the study of persons past sixty 
years of age. The aim of these inventories is to 
secure data on activities and attitudes in various 
areas including health, family, friends, work-and 
economic security. In testing the validity of 
these inventories, interesting auxiliary sched- 
ules were developed: a checklist of personal 
characteristics which an interviewer might ob- 
serve, a set of word portraits, and a list of symp- 
toms supposed to indicate senility (Appendixes 
D, E, F). * Tables 7 to 17 (pp. 48 to 59) are 
based on a smaller “study group” of 499 men 
and 759 women (pp. 46-48 and Appendix C). 
The relation between the “study group” and the 
total group is not made clear in the text. Reply- 
ing to a query from this reviewer Dr. Havig- 
hurst states: “The ‘study group’ consisted of all 
the respondents except the two major occupa- 
tional groups, namely the retired teachers and 
the retired ministers and their wives. Thus, the 
‘study group’ consists of the people who are 
described on pages 170 and 171 under para- 
graphs (d) and (e), and the groups described 
in the first few paragraphs on page 171.” * The 
authors have made an exceptionally valuable 
contribution by their thorough, cautious, and 
critical development of the two inventories and 
by using them in a study of a large number of 
cases. 

Sociol & Social Res 34:218 Ja-F ' 50. Arthur 
Chen. * an important...[volume] by distin- 
guished figures in American sociology. * 


PROJECTIVE 
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*The Blacky Pictures: A Technique for the Ex- 
ploration of Personality Dynamics. Ages 5 and 
Over; I950; а measure of psychosexual development; 
individual; 1 form; $10.50 per set of testing materials 
and 25 record blanks ; $3.50 per 25 record blanks; post- 
paid; (35-55) minutes; Gerald S. Blum; Psychologi- 
cal Corporation. 


REFERENCES 

т. Brum, Geratp S. A Study of the Psychoanalytic Thee 
of Psychosexual Development. Doctor's thesis, Stanford ‘Unt 
versity (Stanford, Calif.), 1949. (Abstracts of Dissertations... 
1948-49, 1949, pp. 202-4.) (PA 25:4952, title only) 

2. Brum, GERALD S. “A Study of the Psychoanalytic Theory 
of Ps: hosexual Development.” Genetic Psychol Monogr 39: 
$99 ius, сбило B. A Reply to Seward's "Psych 

‚ BL DU, eply to Seward's 'Ps: is, 
Deductive Method, and the Black Test? ” J Aba ККЕ 
Psychol 45:536-7 л 'so. * (PA 25:644) 

4. KLEHR, Навор, An Investigation of Some Personality 
Factors in Women With ‘Rheumatoid Arthritis, Doctor's thesis, 
Northwestern University (Evanston, Ill), 1950. (Summaries 
of Doctoral Dissertations....June-September 1950, 1951, pp. 


05-6. « 
5. SEWARD, Јонм P. “Psychoanalysis, Deductive Method 

E AEN rab J Abn & Social Psychol 20035 л ^en: 
6. Aronson, Marvin Lucius, A Study of the Freudian 


Theory of Paranoia by Means of a Group of Psychological 


Тена. Doctors ена, усеш Abstracts атт toe mod d 
7. МіснАІ-Ѕмітн, .; HAMMER, EMANUEL; AND Spitz, 

Herman, “Use of the Blacky Pictures With a Child Whose 

Oedipal Desires Are Close to Consciousness." J Clin Psychol 

7:280-2 Jl '51. * (PA 26:927) 

ArszERT ErLis, Consulting Psychologist, 56 

Park Ave., New York 16, New York. 

The Blacky Pictures constitute an interesting 
projective technique of personality evaluation 
which was originally devised to test several spe- 
cific Freudian hypotheses. The object of the test 
is to discover whether the subject has acquired, 
in the course of his psychosexual development, 
a significantly high degree of oral eroticism, oral 
sadism, anal sadism, oedipal intensity, mastur- 
bation guilt, castration anxiety (males), penis 
envy (females), positive identification, sibling 
rivalry, guilt feelings, positive ego ideal, and 
love-object relationships. The test consists of 
eleven cartoons depicting a dog, Blacky, going 
through various sexuo-amative experiences with 
other dogs representing Blacky’s papa, mama, 
and sibling. Test responses consist of (a) spon- 
taneous TAT-like stories about the cartoons; 
and (b) answers to specific questions, many of 
which are stated in multiple choice form, about 
the cartoons. 

Unfortunately, the Blacky Pictures manual 
provides virtually no information on the relia- 
bility or validity of the test ; nor does the article, 
“A Study of the Psychoanalytic Theory of Psy- 
chosexual Development” (2), which contains 
the author’s main study of the test. Instead of 
being properly standardized, the Pictures have 
been used to test several specific hypotheses— 
namely, Freudian assumptions that boys and 
girls in our culture react differently in their 
psychosexual developments. Over and above 
this, a kind of internal consistency study of the 
test has been made, in that tetrachoric correla- 
tions have been computed to show the relation- 
ship between how subjects react to one aspect 
of the test and how they react to another aspect 
of it. The obtained coefficients of correlation аге 
rarely above .40. 

The only true validity study of the test that 
is mentioned in the manual is one in the course 
of which Blacky Picture ratings on a single sub- 
ject who was under psychoanalysis were checke 
with the analyst's ratings of the subject, and 
were found substantially to agree. To check this 
single validity study, the present reviewer ad- 
ministered the test to one of his patients who 
had had some 200 hours of psychoanalysis, an 
then presented her test protocol to more than 30 
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clinical psychologists and psychological interns. 
This study is still in progress; but so far the 
results show that while the writer’s ratings of 
the patient, based on his knowledge of her as an 
analysand, correspond closely to her ratings of 
herself, there are wide discrepancies between 
these ratings and those of the clinical psycholo- 
gists and interns who rated the subject’s Blacky 
protocol. 

The writer’s experience with the Blacky Pic- 
tures has thus far indicated the following short- 
comings of the test: 

a) Some of the cartoons are not in the least 
clear, and may easily confuse many subjects. 
This is particularly true in the case of the 
Masturbation Guilt and Castration Anxiety car- 
toons. 

b) The practice of giving the test Inquiry, 
consisting of structured questions, after each 
card has been spontaneously evaluated, rather 
than reserving the Inquiry until after all spon- 
taneous stories have been given (as is usually 
done in Rorschach inquiries), invites biased re- 
sponses to all but the first cartoon. 

c) The multiple choice questions in the In- 
quiry are often transparent; so that the intelli- 
gent and sophisticated subject can easily figure 
out the “good” responses. 

d) The test seems to be more adaptable for 
use with children and with unsophisticated 
adults than with the college students with whom 
it was originally employed and partly standard- 
ized. 

е) The manual encourages psychologists first 
to interpret the test with a minimum of back- 
ground information on the subject, and inde- 
pendently from other test data ; and, considering 
the present status of psychoanalytic theory in 
general and the test theory in particular, this may 
easily lead to dangerous diagnostic practices. 

f) As stated above, reliability and validity 
studies in connection with the Blacky Pictures 
are at present virtually nonexistent, and conse- 
quently all test interpretations must be made 
with the utmost caution and reserve. 

In sum: the Blacky Pictures make an intri- 
guing projective technique of personality evalu- 
ation which appears to have some good possi- 
bilities of aiding in the uncovering of some of 
the subjects’ specific sexuo-amative problems. 
At the present time, however, the test is sadly 
in need of supporting validational studies, and 
until these appear its diagnostic employment 
should be more experimental than routine. 


J Consult Psychol 14:332-3 Ag ' 50. Laurance 
F. Shaffer. * Although the manual and the re- 
search monograph...suggest the purposes and 
uses of the test, no attempt is made to provide 
full training for its interpretation. The author 
wisely suggests that it be used only by clinical 
psychologists with general competence in other 
projective tests and with a broad knowledge of 
psychoanalytic theory. 


J Proj Tech 15:109-11 Mr ^51. Ephraim 
Rosen. Although projective techniques stem in- 
directly from psychoanalytic theory, the major- 
ity of them are not closely tied to psychoanalytic 
or to any other definite personality formulation. 
Usually one does not find a clear statement of 
either the theoretical concepts which have guided 
the construction of the test or of the conceptual 
system to be used as an aid in test interpreta- 
tion. The Blacky Pictures are unique in attempt- 
ing to evaluate the subject’s status in terms of 
a number of specified psychoanalytic variables. 
* The evidence for the validity of the technique 
is avowedly incomplete. As of the present, evi- 
dence for validity is drawn from two sources— 
the feeling of clinicians who have used the test 
that it is extremely helpful, and an investigation 
by the author into sex differences in psycho- 
sexual development and interrelations between 
psychoanalytic dimensions as manifested by 
Blacky Picture protocols of a fairly large number 
of college students. This investigation showed 
that a large majority of the significant quantita- 
tive differences in test performance between male 
and female students and of the dimensional inter-, 
correlations found were derivable from psycho 
analytic theory. Parenthetically, it may be noted 
that the methodology of this study has been 
attacked by Seward, primarily on the basis of 
Blum’s procedure of looking for Freudian ex- 
planations for all empirically found relationships. 
A more stringent deductive method would in- 
volve prediction of the relationships from the 
theory rather than consideration of only those 
relationships empirically found. A second possi- 
ble criticism of the study is that the lack of formal 
clarity in much of psychoanalytic theory makes 
it difficult to assess the subjective probability 
that the relationships found by Blum do flow 
logically from psychoanalytic tenets. Neverthe- 
less, Seward has challenged only a few of Blum’s 
specific findings and interpretations. To this 
extent, at least, there is evidence for test validity. 
The evidence does not necessarily imply useful- 
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ness and validity in the clinical setting, but it 
contrasts with the lack of objective validity evi- 
dence of any type in the case of a number of 
other projective techniques. In summary, the 
Blacky Pictures constitute a simple and in- 
genious device which promises to contribute to 
future clarification of systematic problems of 
personality and to take its place as a welcome 
addition to practical clinical instruments. Not 
the least of the favorable prognostic indications 
for it is the author's sophisticated awareness of 
such problems as the identification of the levels 
of personality tapped by a projective technique, 
the nature of interpretation, and the effect of 
partial structuring in a projective technique. The 
manual communicates a clarity of thinking about 
these issues that is refreshing by contrast with 
the muddled state of much past and some present 
work in projective psychology. : 

О J Child Behavior 2:474-6 О ’50. M. M. 
Genn. * The author deals ably with a problem 
which has troubled many users of projective 
tests. All projective tests, though the stimuli 
are relatively unstructured, usually contain 
stimuli which are intended to elicit responses 
to particular areas. For example, the Rorschach 
Plate VI is purported to contain the male sexual 
symbol. Frequently, examiners “get the subject 
coming and going." If he ignores the stimulus, 
he is said to be repressing strong feelings about 
it. If he deals with the stimulus, he is said to be 
overconcerned about it. The author of the Blacky 
Test (which contains highly structured situa- 
tions) handles this problem as follows: “Inter- 
pretation of spontaneous stories is dynamically 
oriented, with emphasis upon latent, rather than 
manifest content. Particular attention is paid 
to emotional intensity, ego defense mechanisms, 
blocking, evasiveness, symbolism, artificiality, 
significant slips, and so forth. The fact that a 
patient responds to Cartoon VIII, for example, 
with a sibling rivalry story, is not in itself an 
indication of strong disturbance in the area, 
since the stimulus situation is loaded in that 
direction. Instead the degree of sibling involve- 
ment is judged primarily in terms of intensity 
and how the rivalry theme is handled, not simply, 
whether or not it is present." The author then 
gives examples of so-called “Strong” and “Not 
Strong” stories. These examples should prove 
extremely helpful to users of the test in deciding 
whether there is a problem in a particular area. 

` * The test has not been tried out on children. 
Although it seems likely that the test is appro- 


priate for children, it still remains to be seen 
whether children's productions will lend them- 
selves to the same levels of interpretation, as 
those set up for adults. * the validity of the 
Blacky Test has been established to some extent. 
However, the reliability, or the extent to which 
a protocol is actually representative of the sub- 
ject's psychosexual development, has not as yet 
been demonstrated. This problem, however, 
exists in relation to all projective tests. 
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*Children's Apperception Test, Revised Edi- 
tion. Ages 3-10; 1949-50; also called CAT; indi- 
vidual; 1 form, '49; [revised] picture 2 ['51] ; revised 
manual ('50); [revised] record sheet ('50) ; $6 per set 
of то pictures in folder and manual; $6 per set of 30 
record analysis booklets; $11 per set of pictures, man- 
ual, and 30 record analysis booklets; postage extra; 
(15-50) minutes; Leopold Bellak and Sonya Sorel 
Bellak; C.P.S. Co., P.O. Box 42, Gracie Station, New 
York 28, N.Y. * 
REFERENCES 

т. Bettak, Lrororp, AND BELLAK, Sonya S. “Ап Introduc- 
tory Note on the Children’s Зрвеггернаа Test (САТ). J Proj 
Tech 14:173-80 Je 'so. * (PA 25:4152) я 

2. Gurevitz, SAuL, AND KLAPPER, ZELDA S. “Techniques for 
and Evaluation of the Responses of Schizophrenic and erebral 
Palsied Children to the Children's Apperception Test (C.A.T.).” 
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Joun E. Bett, Associate Professor of Psychol- 
ogy, Clark University, Worcester, Massachu- 
setts. 

The materials of this test were designed to 
provide pictures for children about which stories 
might be told; the test complements, then, the 
Symonds Picture Story Test for adolescents and 
the Thematic Apperception Test for adults. The 
CAT differs from these two tests in having аш- 
mals as the characters. The assumption under- 
lying this choice of animals is that children will 
identify themselves more readily with animals 
than with human beings and will reveal in their 
stories the “dynamic meaningfulness of the in- 
dividual differences of perception of standard 
stimuli." The range of situations presented per- 
mits expression of fantasies relating to such 
problems as those of feeding, other oral activity; 
sibling rivalry, parent-parent and parent-child 
interrelationships, aggression, acceptance by the 
adult world, night fears, toileting, masturbation, 
and growth. ( 

The administration is similar to that used in 
the TAT, but the instructions are less formal. 
The authors suggest adapting the procedure to 
the particular age and spontaneity of the child. 
A story is sought about each picture under 25 
good conditions of rapport between the examiner 
and the child as possible. | 

Interpretation based on the symbolic meaning 
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of the content of the fantasies is emphasized in 
the brief manual, which describes typical themes 
that may be elicited by each of the то pictures. 
As may be surmised from the situations por- 
trayed, the themes relate primarily to problems 
suggested by psychoanalytic theory. Four case 
illustrations with selected stories amplify the 
authors’ technique of interpretation. Interpreta- 
tion is facilitated by an analysis sheet for check- 
ing ro possible variables in each story: main 
theme, main hero (heroine), attitudes to pa- 
rental figures, family roles and identifications, 
figures or objects or external circumstances in- 
troduced, objects or figures omitted, nature of 
anxieties, significant conflicts, punishment for 
crime, and outcome. 

Since the major contribution of the Bellaks 
in introducing this test has been to develop a new 
set of pictures, the evaluation of their work at 
this point must depend upon the potential use- 
fulness of the plates in calling forth fantasies. 
A limited number of trials of the test with chil- 
dren in the suggested age range indicates that 
fantasies are called forth by the pictures. Most 
children appear to respond enthusiastically to 
the pictures and to warm up readily to the task 
of story telling. The pictures are sufficiently sim- 
ple that most older children, at least, understand 
their content, yet ambiguous enough to stimu- 
latea variety of response from different subjects. 
Unfortunately the pictures are printed on a matt 
finish cardboard that soils easily, even though it 
duplicates (more adequately) the artistic qual- 
ities of the original drawings by Violet Lamont, 
an illustrator of children's books. While others, 
such as Balken and Masserman, and Le Shan 
and Le Shan, have used projective story telling 
to pictures as a research and diagnostic method, 
they have not published their pictures and have 
made it difficult for others to replicate their ex- 
periments or to apply their insights in diagnosis. 
'The Bellaks have thus done a real service in 
making available a standard set of pictures which 
now may be widely used and studied—and, from 
all appearances, the representation of children's 
problems through animal characters was a touch 
of genius. 


L. ]озЕрн Stone, Professor of Child Study, 
Vassar College, Poughkeepsie, New York. 
The basic procedure and method of inter- 
pretation of this test derives from the Thematic 
Apperception Test: its plan to use various ani- 
mals in humanly family-centered action stems 


from a suggestion by Dr. Ernst Kris that “we 
could expect children to identify much more 
readily with animals than with humans.” The 
manual states that “ideally, we should like to see 
the C.A.T. used for children from three to 10; 
Symonds Picture-Story Test, for adolescents ; 
and the T.A.T. for adolescents and adults.” 
There are то picture stimulus cards “loaded” to 
produce material in certain directions deter- 
mined by the authors’ theoretical and clinical 
orientation ; the reviewer feels that caution must 
be used in interpreting the fact that a particular 
protocol does supply material. of the expected 
sorts; it follows, too, that the user must bear in 
mind that other aspects of childhood activity— 
and possibly other aspects of child personality— 
are neglected. 

The pictures were designed to elicit responses 
to “feeding problems specifically, and oral prob- 
lems generally ; to [investigate] problems of sib- 
ling rivalry ; to illuminate the attitude toward pa- 
rental figures and the way in which these figures 
are apperceived ; to learn about the child's rela- 
tionship to the parents as a couple—technically 
spoken of as the oedipal complex and its culmi- 
nation in the primal scene: namely, the child’s 
fantasies about seeing the parents in bed to- 
gether. Related to this, we wish to elicit the 
child’s fantasies about aggression ; about accept- 
ance by the adult world, and its fear of being 
lonely at night with a possible relation to mastur- 
bation, toilet behavior and the parents’ response 
to it.” In our own use of the test we have ob- 
served only partial approximation of these stress 
areas. 

No information has yet been published on the 
standardization or validation of the CAT al- 
though reference is made to preliminary “ex- 
perience,” on the basis of which the 10 published 
cards were selected from an initial series of 18, 
and “typical” themes (with no indication, as 
yet, of their actual typicality) are offered. How- 
ever, it is the reviewer’s understanding from 
correspondence with the authors that they and 
others are carrying on active research on the 
attributes of the test which includes investiga- 
tion of the suitability of adding cards to extend 
stimulation especially in the direction of peer and 
school relations and attitudes. 

As with the TAT (and many other projective 
devices) the stories elicited by the CAT cards 
may be evaluated in various ways, depending 
upon the purpose of administration and upon 
the tastes and training of the administrator. The 
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CAT Blank for Recording and Analysis may 
properly be considered separately from the test 
materials themselves : it is the reviewer's opinion 
that the test will be strengthened (though stand- 
ardization may be retarded) if there is inde- 
pendent study of the stimulus cards and of the 
efficacy of various methods of interpretation and 
analysis, including that of the blank. Moreover, 
the richness of thematic material produced in 
response to stimuli like the CAT is never satis- 
factorily reducible to any single set of rubrics 
to be applied for all purposes. Hence, Bellak's 
blank is better regarded as one guide to analysis, 
just as he has supplied a very similar one for the 
ТАТ. The reviewer's personal experience in 
(unsystematic) use of the method has been that 
the blank's categories omit much that he has 
found most fruitful in understanding a particular 
child. 

One further remark is prompted by the man- 
ual's claim "that the CAT is relatively culture- 
free." Seven of the 10 cards include objects or 
utensils (flowered hats, pipes, cereal bowls, 
furniture, plumbing, etc.) which not only are 
not “culture free" but also bear a certain middle 
class stamp. Moreover, the style of drawing and 
the selection of animals portrayed are such that 
the pictures are congenial to young American 
children and to our stereotypes of certain “per- 
sonality traits" of specific animal species: by 
this very token the test cannot be "culture free." 

At its present stage of development the CAT 
is a very promising tool for the experimental 
student of children's attitudes and personality 
development and for the clinician who is ready 
to use the results of this test, like those of the 
TAT or interview material, to build up a clini- 
cally shrewd and (hopefully) consistent concept 
of a personality; it is not ready for those who 
wish quantitative or even qualitative guideposts 
to normative status or to differential diagno- 
sis. 


Am J Orthopsychiatry 20:844-5 О ' 5o. Adolf 
G. Woltmann. * an important milestone in our 
search for better and more suitable test materials 
for children. The test is based upon the fact that 
children identify more readily with animals than 
with human figures. The pictures were....chosen 
in such a way as to elicit stories which constitute 
projections revealing the basic problems of chil- 
dren from three to ten years. By placing animals 
in social situations, reactions can be elicited to 
parents, siblings, eating habits, fear of loneliness, 
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anxieties, oedipal situations, aggression, cleanli- 
ness, masturbation, sexual curiosity, etc. The 
test consists of only ten cards, which are applica- 
ble to both boys and girls. This eliminates con- 
stant sorting and shortens the testing time. The 
test administration is the same as for the T.A.T, 
A. special analysis sheet aids in the scoring and 
final interpretation of the stories obtained. Each 
story is analyzed in terms of the main theme, the 
main hero, attitudes to parental figures, family 
constellations, the introduction of figures, ob- 
jects or external circumstances (friend, enemy, 
punisher, etc.), figures or objects omitted, the 
nature of anxiety, significant conflicts, attitudes 
toward punishment and the final outcome. The 
C.A.T. isa very valuable and important addition 
to the clinical tools now available for the under- 
standing of the child's personality, and this re- 
viewer sincerely believes that in years to come it 
will occupy a selected place among projective 
methods. 

Am J Psychiatry 108:317-8 O "51. Herbert 
Herman. It is gratifying to know that now... 
we...have a projective technique that may be 
applied to the age-group of 3 to 10. * may be 
profitable in the hands of psychiatrists, psychol- 
ogists, social workers, and teachers as well 
as psychologically trained pediatricians. * The 
C.A.T. is well on its way toward finding for 
itself an important role in the field of projective 
techniques in child psychiatry. It may be well 
to point out, however, that, since it is a projective 
technique, its value in this field is directly pro- 
portional to the ability of the individual attempt- 
ing its interpretation. When such limitations are 
recognized, the C.A.T. may be used to good 
advantage. 

J Consult Psychol 14:161 Ap ’50. Laurance 
F. Shaffer. A most appealing thematic apper- 
ception test for young children, the C.A.T. pre- 
sents animal characters in situations significant 
to the problems of childhood. A little bear helps 
one of two bigger bears in a tug-of-war. A lion 
sits with pipe and cane while a mouse views him 
from a mousehole. An adult spaniel spanks à 
young one, the background a very obvious bath- 
room. Suggestive protocols in the brief manual 
seem to show that the animal figures are suffi- 
ciently human-like to evoke identification, but 
enough unlike humans to free the child from in- 
hibitions against expression. The analysis sheet 
provides for an interpretation mainly in terms 
of content: theme, hero, attitudes to parental 
figures, family roles, anxieties, conflicts, and the 
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like. The C.A.T. is a ready and needed clinical 
tool, and also a provocative research instrument 
for future studies of age, socioeconomic, intellec- 
tual and ethnic groups. 

О J Child Behavior 2:469-70 О ’50. M. M. 
Genn. * The pictures were drawn by Viola La- 
mont, a children's illustrator. While these pic- 
tures might do very well for illustrations in a 
children's book, they have some limitations as 
test material Picture #6 which contains ani- 
mals sleeping or hibernating in a cave, is very 
dimly drawn, and some young children to whom 
the reviewer exposed the picture in the clinical 
situation could not determine what it was and 
were unable to fabricate anything. Picture #3 
is a large lion seated in a chair, with a small 
mouse in the corner. This picture is intended to 
elicit stories about the relationship with the 
father-figure. However, the mouse is so tiny 
and indistinctly drawn that some children miss 
it. The vagueness of some of the stimuli might 
be justified if it were the intention to elicit re- 
sponses to unstructured forms, as in the Ror- 
schach. However, since the purported aim is to 
elicit dynamic story-telling based on concrete 
stimuli, some of these pictures are inadequate. 
Picture #10 appears to be “overstructured” for 
diagnostic differentiation. This is a picture of a 
small puppy apparently being spanked by an 
older puppy in a bathroom. Some children simply 
tell that the child is being spanked for not going to 
the toilet. One wonders if a picture of the two 
figures simply standing near the toilet would 
elicit a story more clearly related to the child’s 
own experience. The manual does not provide 
instructions for the introductory steps in the 
administration. Presumably it is assumed that 
the clinician will use the same instructions as 
recommended for Murray’s Thematic Apper- 
ception Test. The manual does give some useful 
hints for interpretation. The check-list contain- 
ing points to be noted is especially helpful. In 
spite of the above-mentioned shortcomings with 
respect to some of the art work and the content 
of some of the pictures, the C.A.T. meets a long- 
felt need for pictured stimuli containing figures 
with which children could easily identify. The 
author acknowledges the fact that the publica- 
tion of the test does not necessarily imply that 
the technique is fully standardized, and points 
out the need for the accumulation of further 
data, 

TAT Newsletter 4:5-7 su ’50. Robert R. 
Holt. The standard set of TAT plates issued by 
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Murray contains three pictures specifically for 
boys and girls, two of them suggested by San- 
ford on the basis of his experience with the chil- 
dren of the Shady Hill study. Using these to 
substitute for a couple of the adult series, the set 
is supposed to be adaptable for use with children 
of seven years or younger. Sarah Schafer and 
Mary Leitch have demonstrated that some of the 
pictures have a limited usefulness even with 
nursery school children. Why, then, a special set 
of cards for children? The primary practical 
reason is that the TAT is rarely a very produc- 
tive instrument with preadolescent or younger 
children. Though the material it provokes is 
often scanty, it is useful enough to be employed 
routinely by a number of persons working with 
children. But if a picture-story test could really 
tap the rich fantasy life of the child it might have 
far more usefulness for diagnostic and therapeu- 
tic work, and for research. It looks as if the Bel- 
laks may have provided us wifh such a test. The 
principal theoretical reason for designing a new 
set of pictures was suggested by Ernst Kris, the 
manual tells us. “Theoretically, we had reason to 
assume that animals might be preferred identifi- 
cation figures from three years up to possibly 
ten,” the authors say, “and thus we set out to 
create, pictorially, situations vital to this age 
range.” A further advantage cited of using ani- 
mals is that they make the test easily usable with 
Negro children and almost any others within 
the Western-European cultural framework. The 
pictures themselves were drawn by Violet La- 
mont, an illustrator of children’s books, and show 
a sure professional touch. The sprightly animal 
figures, some naturalistic, some with human ac- 
cessories like hats or a pipe, have style and 
charm, and certainly appealed to a complete 
sample of one population, the reviewer's own 
children, These little girls, aged four and a half 
and three years, fell to the game of making up 
stories with relish, and seemed to identify them- 
selves with the animal children with great ease 
and naturalness. Of course, such a miniature 
try-out cannot show anything more than the fact 
that it is possible for very young children to take 
to the test and to produce usable material freely. 
Eighteen pictures were used by the authors and 
a number of other clinicians in preliminary try- 
outs, and the ten most useful retained. They seem 
to have been well chosen to touch upon the most 
significant problems and conflicts that most chil- 
dren in our society are likely to encounter. * 
the authors state their feeling that when one is 
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dealing with projective techniques, all that is 
necessary by way of standardization is to try a 
new one out and see that it gives useful material. 
For the rest, one relies upon "the basic hypothe- 
sis of projective—or apperceptive—phenom- 
ena,” and “the individual case can stand by it- 
self.” They do not wholly side-step the problem 
of norms, however, saying that it would be very 
desirable to assemble them for different age 
levels, but too difficult a task for a single team 
of investigators. The fact that this seems to be 
S.O.P. for the production of new projective tests 
does not make it any more desirable. The authors 
do go a good deal beyond what is usual, however, 
in their generous offer to cooperate with all 
users of their test, "to organize all available 
findings and make it available to users of the 
C.A.T., upon request" and to act “аз an informal 
clearing house." * Persons who are unskilled in 
analysis of thematic materials will probably find 
the examples of interpreted stories more helpful. 
The orientation given is sound, but of course no 
one should expect to learn interpretation from 
reading a manual, short or long. Taken as a 
whole, the test (and particularly the manual, 
which is rife with minor errors and carelessness) 
show signs of being put out in a little too much 
of a hurry—compare the Four Picture Test, 
published after 18 years and the experience of 
thousands of administrations. Nevertheless, the 
C.A.T. is a very promising-looking addition to 
the available instruments for testing young chil- 
dren. 
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XExpressive Movement Chart. Ages 3-5; 1946-48; 
individual; revised recording form (748); examiner's 
guide (748); $3.50 per set of figure cards, examiner's 
guide, and 25 recording forms; $1 per set of figure 
cards and guide; 156 per recording form; $5 per copy 
of The Personality of the Preschool Child ('46—see 
3:1245) which serves as the manual; postage extra; 
(45) minutes; Werner Wolff; Grune & Stratton. * 
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X*Four-Picture Test (1930). Ages 12 and over; 
1948; also called ЕРТ; individual; 1 form; manual 
(748) ; $16.90 (65 guilders) per set of testing materials 
(bound), postage extra; nontimed (15-70) minutes; 
D. J. van Lennep; publisher and distributor in Neth- 
erlands and Belgium: National Institute of Industrial 
Psychology, Utrecht, Netherlands; distributor in all 


other countries: Martinus Nijhoff, The Hague, Neth- 
erlands. 
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Jonn E. BELL, Associate Professor of Psychol- 
ogy, Clark University, Worcester, Massachu- 
setts. 

This test, designed in 1930, is based on the 
principle of storytelling to pictures. Four subtly 
colored pictures are provided which represent 
four different situations: (a) being together with 
one other person (a room with two persons 
talking across a long table) ; (b) being personally 
alone (a bedroom); (c) being socially alone 
(a street scene with a single figure under a 
lamppost) ; (d) being together with many others 
in a group (a tennis match, with observers). 
The pictures are sufficiently ambiguous (or 
polyvalent) to permit highly personalized inter- 
pretations. 

A unique feature of the technique is the de- 
mand that the subject produce a continuous 
story that incorporates all four pictures. Neither 
order nor time limit is prescribed; most exam- 
inees take about 35 minutes. Stories may be oral 
or written. The examiner may present the pic- 
tures for one minute, remove them, and then 
ask for written stories. If the story is banal, he 
may request another story ; if only three pictures 
are combined, he may urge the subject to try 
to incorporate the fourth or he may remove the 
fourth picture and ask the subject to imagine the 
picture needed to complete the series ; if the sub- 
ject cannot combine the pictures, the examiner 
may ask for a separate description of each picture 
and then attempt to combine two or more. 

Interpretation follows the two basic patterns 
for projective analysis: the formal, and the con- 
tentual. Study of the content for its personality 
significance follows the patterns used in the 
analysis of the TAT or other storytelling pro- 
jective techniques. It is in the formal analysis 
that one of the advantages of this method is most 
clearly revealed. Study of such formal elements 
as the order and span of combination and num- 
ber of pictures used gives data not ordinarily 
obtained from most picture-story methods. From 
such factors the examiner may be helped in аг- 
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riving at conclusions about the level and man- 
ner of intellectual functioning. 

A new interpretative suggestion to be applied 
to the study of story materials is also contained 
in the test blank and elaborated in the manual— 
the use of the time quality of the fantasy for in- 
terpretation. The author points out the difficulty 
schizophrenics have in giving a coherent time 
quality to the figures in the story ; the successive 
(after each other) time sequences in the stories 
of depressives ; and the final, or future, emphasis 
in the productions of hysterics. He follows 
Hugenholtz in dividing time into four forms— 
physical, vital, animal, and human—and in sug- 
gesting the interpretative value of each by itself, 
when combined, and when interrupted. The ap- 
plication of concern about the time is a princi- 
ple that may ‘be applied equally well to other 
types of story materials, but it is especially ap- 
propriate here because of the demand for a con- 
tinuous story. 

In general, the technique has the following 
advantages: (a) a brief administration time; 
(b) suggestiveness of the stimulus materials ; 
(c) an aesthetically pleasing quality to the color 
and composition of the pictures; (d) the values 
implicit in the necessity of combining the pic- 
tures to produce a story; and (e) the wide age 
range of subjects with whom the method may 
be used. 

One of the advantages (the brevity) may ap- 
pear also as a disadvantage. The risk in inter- 
preting such limited productions is that over- 
emphasis upon minor details may lead to in- 
valid conclusions that cannot be checked within 
other parts of the response. Perhaps when one 
has used the technique as extensively as the au- 
thor, dependence upon information from other 
diagnostic methods for validation is less im- 
portant. Without such a background, an inter- 
preter should probably regard conclusions from 
the method as hypotheses of a most tentative 
nature, This would suggest that the two major 
uses of the test will be as a rough screening device 
and as a contribution with some unique elements 
to a larger battery of projective techniques. 


E. J. G. Braprorp, Senior Lecturer in Educa- 
tion, The University of Sheffield, Sheffield, Eng- 
land. 

The four pictures which form the basis of this 
test are coloured reproductions of scenes painted 
inan “impressionist” style. In one picture, a bed- 
room, no human form is depicted ; in another is 


a solitary figure in a dark, damp, and otherwise 
deserted street; in a third, two people in a room 
are presumably in conversation ; the remaining 
picture depicts a social occasion on a tennis 
court, a game in progress and four seated on- 
lookers, male and female. The subject is re- 
quired to construct one story incorporating all 
four scenes. The scenes are well chosen to dis- 
close the subject’s general attitudes to society, 
its values, and moral standards. The scenes may 
be incorporated in any order, and the order 
chosen may in itself have definite significance. 
In the resulting stories, dependent, aggrieved, 
depressed, aggressive, and hysterical personality 
types can usually be recognised with little diffi- 
culty. 

Unfortunately, and this applies to most pro- 
jection tests, only about one third of adult sub- 
jects will produce really adequate material, and 
it is rare to get satisfactory responses from male 
juvenile delinquents. In the experience of this 
reviewer, with those whose command. of lan- 
guage is below average, as is the case with many 
delinquents, the material produced is so meagre 
that the time spent is not worthwhile. Hence a 
preliminary reading and vocabulary test is ad- 
visable. However, the blindness to details and 
even to whole pictures is very revealing. 

The reviewer’s experience supports the au- 
thor’s claim that better results are obtained if 
the pictures are withdrawn after a minute or 
two for inspection, and the story is then dictated 
by the subject. Well educated, intelligent sub- 
jects are less self-conscious if allowed to write 
out the story in a room by themselves. Both in 
hospital work and in vocational selection, the 
reviewer has found this test to assist materially 
in the interpretation of the results of an intelli- 
gence battery (incorporating verbal and per- 
formance tests) applied individually, the Four 
Picture Test following the battery during the 
course of the interview. 


Ернклтм Rosen, Assistant Professor of Psy- 
chology, University of Minnesota, Minneapolis, 
Minnesota. 

This projective technique consists of four am- 
biguous water color pictures, varying in size 
from 4 by 5%4 to 5 by 6 inches, designed to · 
represent "four very general and existential 
situations." Picture I depicts two figures, of am- 
biguous sex and age, conversing in a room. It 
is intended to represent "being together with 
one other person." Picture II shows a bed on 
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which the outline of a head on a pillow сап be 
very dimly made out. It symbolizes "being per- 
sonally alone." The third picture, intended to 
symbolize “being socially alone," is of a figure 
standing under a lamppost in a dark and rainy 
atmosphere. Picture IV depicts a tennis match 
watched by several spectators and symbolizes 
“being together with many others in a group.” 

The four pictures are placed on a table close 
together so as to form a square. The subject is 
then instructed to make up a story combining 
them and relating them to each other. He may 
start where he wishes and follow any order. 
The examiner makes a verbatim record of the 
story. There are several alternative modes of 
administration: The subject may tell his story 
after looking at the pictures for one minute, after 
which they are removed ; the subject may write 
his story (thus making group administration pos- 
sible) ; or combinations of these variations may 
be used. 

A flexible procedure analogous to testing the 
limits on the Rorschach may then be employed. 
'The examiner may ask for a new story related 
to anywhere from one to all four of the pictures, 
depending on the subject's performance in the 
test proper. Thus, if one of the four pictures is 
only loosely connected to the other three in the 
original story, the subject may be asked to tell 
a new story incorporating all four or to describe 
а substitute picture that he could include in his 
story. 

The manual provides a guide for formal analy- 
sis in terms of such variables as order of pictures 
in the story; extent of differentiation within a 
picture; use made of time, space, and thoughts 
and dreams of a figure to connect the pictures ; 
conflicts; and themes. Such formal analysis is 
considered to be only the beginning of inter- 
pretation, for the examiner must then proceed 
` clinically and intuitively. Four protocols and 
interpretations are presented in the manual to 
illustrate formal and clinical interpretation. 

The technique is ingenious and original. A 
priori, it would seem to provide a method of in- 
quiry into many areas not easily approached by 
most projective techniques. Thus rigidity versus 
flexibility can be evaluated by the subject's suc- 
cess in shifting to a new story. Insight into the 
subject's social relations can be obtained by ob- 
serving his response to the many figures in Pic- 
ture IV. The analysis of the use made of time 
and causality in connecting the pictures—for 
example, connection by- making each event the 


result of the preceding event versus connection 
by making each event a function of a goal to be 
reached by the performer—is an original ap- 
proach to dimensions of personality hitherto 
little studied. 

On the other hand, the Four Picture Test suf- 
fers from certain deficiencies. The test, perhaps 
due to the difficulty of reproducing the water 
colors, is quite expensive. More important, 
norms have not been compiled as an aid to that 
part of interpretation based on formal analysis. 
Apparently Dr. van Lennep worked on the 
technique for 18 years before making it public, 
thus accumulating an immense number of proto- 
cols. Analysis of these protocols, however, has 
not been presented statistically. We do not know 
what sort of stories are typical of various groups, 
what the effects of various administrative pro- 
cedures are, nor what validation data exist. Dr. 
van Lennep is not concerned with standardiza- 
tion. In line with a tradition stronger in Euro- 
pean psychology than in the United States, he 
seems to imply that emphasis on interpreter skill 
is incompatible with emphasis on objective 
norms. Understanding, intuition, and internal 
consistency are sought, rather than objective 
evidences of validity. Thus, though a skilled 
examiner may find the test very useful, we have 
no scientific indications of where its limitations 
and greatest usefulness lie. 


TAT Newsletter 2:2-4 Mr '49. Robert R. 
Holt. In America, the production of new tests 
is so prolific that a special section of the Psycho- 
logical Abstracts is devoted to them and every 
month sees a goodly crop. Typically, these are 
brain-children of no long gestation; the bright 
idea is materialized in the form of test materials 
which are tried out long enough to show some 
promise, and forthwith published. Our Dutch 
colleague puts us to shame with his Four Picture 
Test (FPT), conceived in 1930 and brought 
forth into the light of commercial distribution 
only after 18 years of careful work. One thinks 
of the European tradition of patient hand-work 
by master craftsmen, as compared to American 
assembly-line mass production. There are of 
course advantages to our own way; most con- 
spicuously, we are more concerned than is the 
author of the ЕРТ to present normative data 
and summarize биг experience with a test sta- 
tistically so that others may be helped a little 
over the initial stages of building up their ow? 
fund of experience. But it does not take more 
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than a reading of the manual and a few trials of 
the test itself to convince one that Dr. van Len- 
nep has produced a small but firmly-wrought 
masterpiece. The pictures, manual, and a half a 
dozen test blanks indicating the variables used 
in analyzing stories, are all done up nicely be- 
tween boards in the manner of the Rorschach 
plates, making a slim light volume indeed for 
the price. The author assures me, however, that 
the unusual cost of producing exact duplicates 
of the original watercolor pictures makes it im- 
possible to market the test for less. The reproduc- 
tions do indeed seem to be excellent, being nearly 
indistinguishable from actual watercolors. They 
are done chiefly in brownish greens, except for 
the contrasting russet of the tennis court in IV. 
In size they range from four by five and a half 
inches (I) to slightly over five by six inches 
(IV)...I is am interior scene; two persons are 
іп a room with а table between them and book- 
shelves in the background ; one is standing, ges- 
turing towards the other. II, a bedroom, con- 
tains a bed with a window behind it and the very 
dim indication of a head on the pillow. III de- 
picts a rainy night in which someone is leaning 
against a lamp-post. IV is a scene at a tennis 
court, showing two players and in the foreground 
two couples looking on and conversing. In the 
first three pictures, the figures are all fairly am- 
biguous as to sex and age and all expressions 
are indeterminate due to the deliberately sketch- 
like, somewhat slap-dash style of the paintings. 
These four pictures were planned to represent 
“the four existential situations: being personally 
alone (i.e., in the kind of solitude one seeks, as 
in II), being socially alone (i.e., rejected, as in 
III), being together with one other person (I), 
and being together with many others in a group 
(IV)." The unique feature of the test is that 
"the subject has not only to give a meaning to 
each of the four pictures, but must do it in such 
a manner that the interpreted pictures can be 
combined in one story. This demands, in addi- 
tion to a higher intellectual function, the imagin- 
ing of connecting links in which projections oc- 
cur easily and almost necessarily." The subject 
is told that the pictures are unrelated in the way 
they are laid out, and that he is to make up a 
story which combines all four of them in any 
order he likes, starting or finishing with any 
one, Clinicians who have had experience with 
the Picture Arrangement subtest of the Wech- 
sler-Bellevue scale will rightly expect that many 
psychotics and persons of inferior mental ca- 
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pacities (particularly, those with defects in the 
function of anticipation) will be unable to per- 
form the task as thus presented. As a result of 
experiences with such patients and with chil- 
dren, van Lennep suggests a variety of altered 
techniques of administration. In general, their 
essence is to accept whatever the patient can 
produce on however low a level, and then to 
urge and lead him to produce on higher levels 
of story production (instead of description) and 
combination (instead of separate consideration 
of the pictures). In addition, various methods 
of “testing the limits" by requiring stories for 
combinations different from the one sponta- 
neously chosen are outlined. The author indicates 
that many valuable protocols have been obtained 
by having subjects write out their own stories. 
Finally, he recommends the variation of allow- 
ing one minute's inspection of the cards, then 
taking them away before the subject writes or 
tells his story. By means of this admirable flexi- 
bility and ingenuity of administrative methods, 
the FPT's creator has found ways to use it with 
all kinds of subjects, and allows for its free 
adaptive utilization. With the gain in freedom 
there is a corresponding loss in standardization ; 
there are no indications of systematic differences 
in the results according to the particular varia- 
tion employed. Presumably the tester builds up 
his own norms, and the assumption is made that 
the data may be used in roughly the same way 
no matter how they are obtained. And while this 
assumption is undoubtedly true within limits, it 
is uncertain just how broad the limits are. The 
little manual contains, besides a brief theoretical 
introduction, description of the pictures and in- 
structions for administration, 20 pages on in- 
terpretation and four protocols with detailed 
analyses. Van Lennep introduces the reader to 
the interpretation of his test in this manual in . 
much the same way Tomkins does for ће TAT 

in his book : by extensive cataloguing of obtained 

form-varieties of each significant aspect of the 

stories. Thus we are prepared for the variety of 

protean forms the material may take somewhat 

at the expense of direct help in getting at the 

meaning of the stories, Still, considering the 

brevity of the manual and the probably inevitable 

difficulties of translation, the author has done а. 
fine job of communicating some of his insights 

relative to the interpretation of thematic ma- 

terial. There is grist here not just for the clinician 

who wants to learn the use of this test, but also 

for those who want to extend their proficiency 
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with the TAT. One of the most interesting and 
original contributions to interpretation is van 
Lennep's treatment of time as it is used in stories. 
Almost nothing has been written about the sig- 
nificance of the kind of time in which the subject 
lives as projected in TAT stories; judging by 
what the Hollanders seem to have been doing, 
we have been missing a subtle but revealing di- 
mension, Building on the work of P. Th. Hugen- 
holtz, van Lennep distinguishes the following 
principal kinds of time: physical time (eg., 
“eight o'clock") which occurs seldom and is of 
little importance ; vital time, “when an event is 
aimed from the past at the future" ; animal time : 
“the future is brought towards the moment” ; 
and human time, in which man unfolds his own 
boundless future. Depressives, we are told, “live 
in the after-each-other of things"; they tend to 
picture events as due to preceding events in the 
form: “something happened because of some- 
thing else in the past.” Hysterics, on the other 
hand, have a final or future-oriented quality to 
their temporal relations: one does something 
in order to bring about a desired state of affairs. 
The similar analysis of the use of space is less 
transferable to the TAT, being closely related 
to the problem of organizing four separate pic- 
tures. What other features of the FPT differen- 
tiate it from the TAT? First, we learn more 
about the organizational abilities of the subject, 
from the way in which he combines the scenes 
into a connected narrative. Second, there is the 
additional variable, the particular order of pic- 
tures chosen. One looks in vain for indications 
of possible significances of the different orders, 
however, beyond the statement that particular 
ones are common or rare. Third, by asking for 
another story with a different order of pictures 
from that chosen by the subject, one can learn 
about the flexibility vs. rigidity of his thought, 
the ease with which he is able to shift. Fourth, 
it does seem that the author is right in claiming 
that projective material is more easily got out 
of certain subjects through the transitional ma- 
terial they must introduce to provide continuity 
between the four pictures. It looks as if these 
particular pictures make it possible to tell more 
than the TAT usually does about a person’s 
broader social relationships—those beyond his 
contacts with the few significant persons in his 
life. This last fact is probably attributable to the 
fact that picture IV (the tennis court) contains 
more people than does any TAT picture and in 
a different type of activity ; indeed, it is the only 


one for which a pretty good analogue cannot be 
found in one of the editions of the ТАТ. To 
sum up the difference in а (somewhat over- 
simplified) word, there are richer possibilities 
for formal analysis in the FPT than in the TAT, 
while the latter provides more by way of personal 
content. Of course, the last fact is due in large 
part to the greater length of the TAT. The 
Four Pictures are remarkably well chosen for 
their possibilities of bringing out significant ma- 
terial about sexual, passive, affiliative and ag- 
gressive needs, the latter particularly in terms 
of rivalry—a striving not much touched off by 
the TAT. For all these reasons, one can prob- 
ably learn more from a single FPT story than 
from one TAT story; minute for minute, van 
Lennep’s test may be more productive than Mur- 
ray’s. Certainly the author has trained himself 
to a great sensitivity and resourcefulness in 
working with a minimum of material, by stick- 
ing to an essentially one-story test. But it seems 
to be inherently limited. My feeling after a very 
insufficient trial of the FPT is that after the 
first spontaneous story, more could be learned 
by using other pictures than by using the same 
ones in recombination. Where time is not at a 
premium, and when one wants to go as deeply 
as possible into personality dynamics, the TAT 
would have the edge every time—in the opinion 
of a grossly prejudiced party ! 
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*Group Projection Sketches for the Study of 
Small Groups. Groups of from 3 to 40 people (ages 
16 and over); 1949; 20 scores: grouped under 3 cate- 
gories (sociodynamics, group structure, outcome 0! 
group process): communication clarity, content-pro- 
cedure ratio, information providing, goal concentration, 
problem source, value orientation, tension level, tension 
direction, pacing level, personal interdependence, per- 
sonal affect, participation spread, role differentiation, 
in-group feeling, individuality of members, quality of 
group product, organization of outcome, creativity of 
group product, group satisfaction with outcome, moti- 
vation to execute outcome; I form; no data on relia- 
bility and validity; $2.50 per set of 5 pictures an 
mimeographed manual (content same as J Social 
Psychol 33:77-102 F ’51), postpaid; (60) minutes ; 
William E. Henry and Harold Guetzkow; University 
of Michigan Press, * 
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Starting with the assumption that groups have 
unique personality-like properties and charac- 
teristic ways of functioning, which might be 
studied by methods similar to those for study- 
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ing individual personalities, Horwitz and Cart- 
wright created a “Group TAT” a few years ago. 
Henry and Guetzkow have, with permission, 
taken the idea up and developed it into the pres- 
ent test. The result is a promising-appearing 
instrument, which rather regrettably has been 
published before much work has been done to 
actualize its apparent potentialities. 

There are five pictures about each of which a 
group is asked to write a story, taking about 10 
minutes for each: a conference group of men, a 
single man, two men of different ages, an older 
woman and a man, and an informal group of 
men. It seems unfortunate that there is no equiv- 
alent form for women, or at least a few more 
pictures offering all-female and mixed groups. 
There are many feminine and mixed groups 
that might well be studied by a technique such 
as this, and some essential features of their 
"group personality” may not come through 
clearly because of the masculine nature of the 
present pictures. The pictures are, however, at- 
tractively drawn and are large enough (21 by 
18 inches) to be seen easily by all members of a 
moderate sized group. 

The technique of administration (which in 
many ways is similar to that for the individual 
ТАТ) is carefully presented in the manual, with 
enough detail to cover most problems the user is 
likely to encounter. Several protocols from dif- 
ferent kinds of groups are accompanied by de- 
tailed interpretations which make it clear how 
a variety of group characteristics may show 
themselves in the stories. The authors also offer 
20 rating scales for aspects of group dynamics, 
structure and process which may be assessed 
from the set of five stories as a whole. These 
scales seem to be well chosen to quantify sig- 
nificant facts about a group; scorings on the 
scales are offered for two sets of stories, and 
there are some brief comments with each to 
indicate the kinds of inferences from stories that 
may be made in making the ratings. The ratings 
and “freehand” analyses, however, can be done 
only by clinically trained persons who are al- 
ready competent in interpreting the individual 
TAT, and who in addition have a good under- 
standing of group dynamics and structure. 

No data are given on the validity of inferences 
drawn from the test nor on the reliability of the 
scoring categories, though some are promised 
for the future. The authors are careful to state 
that the illustrative cases in the manual, each 
with a brief statement from independent sources 


about the group concerned, do not constitute 
validation and are not presented as such, Never- 
theless, they do convey a strong impression of 
the validity of the blind analyses, and suggest 
ways in which validation research could easily 
be done. 

The authors have used the Group Projection 
Sketches with white-collar staff workers in in- 
dustry, adult academic groups, and the like. Ap- 
parently the test may be used with even quite 
psychologically sophisticated persons, but it is 
not at all clear how well adapted it would be to 
subjects with less than college education. In its 
present state of development, it is primarily a 
test for use in research on small groups, where 
it should provide very useful data. One can 
think of a number of situations where it might 
prove useful: diagnosing difficulties in confer- 
ence groups, comparing the functioning of semi- 
nars or of different clinical teams, perhaps fol- 
lowing the progress of therapeutic groups, or 
locating trouble spots in certain kinds of busi- 
ness or industrial setups where people must work 
together in moderate numbers. For a test of 
this character, it would be most useful to know 
how groups respond to retesting: whether the 
test is sensitive enough to reflect changes in 
group structure and functioning after attempts 
have been made to bring them about. If so, it 
might be a very useful aid to the diagnosis and 
treatment of group difficulties. 

That branch of social psychology and soci- 
ology known as group dynamics has been devel- 
oping rapidly in recent years and shows dis- 
tinct promise of great social usefulness. The 
present test is to my knowledge the only instru- 
ment of its kind available to this new discipline 
for aiding in the diagnostic study of groups, and 
though too little work has been done with it 
for a proper evaluation at this time, it seems 
worthy of wide experimental trial in the hands 
of persons with the requisite interpretative skills, 


N. W. Morton, Director, Operational Research 
Group, Defense Research Board, Ottawa, Can- 
ada. 

These five 21-by 18-inch pencil sketches were 
prepared as a basis for a group product which 
may reveal some of the characteristics of the 
group and its mode of cae The product 
is a story about the human figures in each pic- 
ture or at least an interpretation of the scene 
presented. This story or written interpretation 
produced by the group is formally comparable 
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with a written TAT response from an individual. 

While there appears to be no reason why ob- 
servations of the behaviour of the members of 
the group while engaged in the production of 
its stories may not be made if desired, the in- 
tention of the authors is to provide an instru- 
ment which will not require such a direct assess- 
ment. That is, the evaluation of the group is 
entirely in terms of its products. 

Two forms of analysis of the stories are sug- 
gested. It is emphasized by the authors that these 
are provisional only, being based on preliminary 
experience rather than on extensive experi- 
mental study or validation. 

The two methods of analysis consist of a 
clinical type appreciation, wholly verbal in form, 
and a series of ratings conceived more or less 
systematically to assess the way the group works, 
its formal characteristics, and the nature of its 
product. In both methods projection appears to 
this reviewer to be treated very literally; that 
is, that the most plausible hypothesis about the 
group—the proposition concerning it most likely 
to be true—is that its various characteristics will 
each be mirrored directly in the respective fea- 
tures of its interpretation of the pictures. This 
is as if one were calling a series of shots with 
each individual outcome (ie, the particular 
characteristic of the group) uncertain, but the 
long term odds in one's favor. Probably without 
further evidence about the test, this is the most 
reasonable thing to do. 

The three main divisions of ratings, on the 
group's sociodynamics, structure, and product, 
seem to the reviewer to be obviously of quite 
different face validity. The last is a more or less 
direct. assessment, certain features of the first 
(e.g., value orientation) are fairly so, while the 
judgments about group structure involve a con- 
siderable and thereby rather uncertain process 
of inference. 

In achieving the economy of recording and 
using only the final story produced by the group 
about each picture rather than the necessarily 
detailed account of how it arrived at its con- 
clusions, it is judged that the chief loss is in 
direct information about the relationships of 
individuals to the group and within it. A princi- 
pal effect is to transfer emphasis from descrip- 
tion of individuals interacting in the group to 
evaluation of the group as a whole. One gains 
the impression that a further result may be the 
assumption of a considerable degree of homo- 
geneity in the group. Thus the method may be 
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better adapted to the description of a well defined 
group having certain settled and homogeneous 
qualities than to groups more heterogeneously 
or temporarily composed. 

The sketches are well drawn. The selection of 
subject matter appears to the reviewer to repre- 
sent some degree of bias toward the younger 
male urban academic or white collar level type 
of interest or experience. In this respect, and in 
the clarity rather than vagueness of the draw- 
ings, the material is thought to be more definitely 
oriented and structured than has been true of 
other TAT pictures. 
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*H-T-P: A Projective Device and a Measure of 
Adult Intelligence. Adults; 1946-50; individual; 
drawing form ('46); scoring folder ('50) ; post-draw- 
ing interrogation form ('50) ; manual (47); $1.25 per 
25 drawing forms; $2 per 25 scoring folders; $2 per 25 
interrogation forms; $2.25 per manual; postpaid; (60- 
90) minutes; John N. Buck; Western Psychological 
Services, Box 775, Beverly Hills, Calif, * 
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ALBERT Ertis, Consulting Psychologist, 56 
Park Ave., New York 16, New York. , 
Buck's original manual for the H-T-P is сег- 


tainly one of the worst horrors ever perpetrated 
in the field of clinical psychology. For de- 
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riving IQ's and other quantitative scores from 
the H-T-P, the standardization data presented 
in the manual are inadequately based on only 
140 cases, evenly spread over 7 mental levels 
from imbecile to superior intelligence. The stand- 
ardization group for qualitative personality in- 
terpretation is also most inadequate, based on 
the study of only 150 disturbed individuals, 
ranging from adult maladjustment and psycho- 
neurosis to psychopathy, epilepsy, and psycho- 
sis. If any specific facts on the reliability or 
validity of the H-T-P were obtained from these 
two standardization studies, they are conspic- 
uously missing from the manual. Considering 
that, at the time this manual was published in 
1948, Buck had admittedly had “ten years of 
study and clinical usage” of the H-T-P, the man- 
ual displays incredible naiveté, fanaticism, and 
arrant disregard for any attempt at scientific 
validation of the material presented. 

Fortunately for Buck, however, the manual 
appears to be as unreliable as a projective test 
of its author’s personality as many of its dog- 
matic maxims prove to be when employed in 
the projective testing of patients and other in- 
dividuals. For Buck has recently followed up 
the manual with complete mimeographed pro- 
ceedings (14) of the H-T-P workshop held at 
the Veterans Administration Hospital, Rich- 
mond, Virginia in the spring of 1950; and these 
proceedings indicate that he is a reasonably open- 
minded psychologist who is still learning a good 
deal about his own test and who is willing to 
revise many of his originally rash pronounce- 
ments in the light of subsequently gained ex- 
perience with, and criticism of, the H-T-P tech- 
nique. Since the new mimeographed material 
seems to supersede much of the material in the 
original manual, the present review will be lim- 
ited to a discussion of the H-T-P as used in con- 
junction with the new material and with John 
Т. Payne’s supplementary mimeographed paper, 
Observations on the Use of Color with the 
H-T-P. (15) 

The H-T-P technique is unquestionably a 
valuable addition to the clinical psychologist's 
battery of projective tests of personality evalua- 
tion, Two vitally important questions quickly 
arise, however, in connection with its employ- 
ment: (а) How shall it be administered? and 
(b) How shall its results be interpreted? 

Buck’s method of administering the H-T-P 
includes several basic steps: (a) individual ad- 
ministration of the test; (b) postdrawing inter- 


rogation; (с) individual administration of the 
chromatic H-T-P (the drawing of the house, 
tree, and person with various colored pencils 
instead of with regular pencils) ; (d) postdraw- 
ing interrogation of the chromatic drawings ; (e) 
quantitative and qualitative scoring of the test, 
including the estimation of specific 1Q’s; and 
(f) general qualitative interpretation of the 
test, particularly in relation to the personality 
evaluation of the subject and his differential 
diagnosis. This procedure is a thoroughgoing 
one that is likely, under ideal testing circum- 
stances, to provide maximum material for test 
interpretation. The present reviewer, on the basis 
of much experience with the H-T-P, is skepti- 
cal, however, about the practical value of this 
involved procedure, for several reasons: 

a) When done in the orthodox manner as out- 
lined in the new mimeographed manual, the 
administration, scoring, and interpretation of 
the H-T-P takes several hours. Considering the 
unknown reliability and, especially, validity of 
the H-T-P intelligence estimations and per- 
sonality interpretations at the present time, and 
considering the time available to the psycholo- 
gist in a normal clinical situation, it is unlikely 
that the H-T-P is normally worth this many 
hours of a busy psychologist's time. 

b) The standard administration procedure 
does not provide for the drawing of persons of 
both sexes by the subject. The writer has found 
it most valuable to administer the test so that 
the subject first draws a house, then a tree, then 
a person, and then a second person of the sex 
opposite to the one drawn first. This combines 
the advantages of the H-T-P and the Machover 
figure drawing technique and often adds valuable 
material for interpretative purposes. 

c) The present writer sees no particular value 
in employing the standard H-T-P booklets for 
the actual drawings. Any ordinary size white 
paper serves the same purpose. 

d) The quantitative scoring of the H-T-P, 
and particularly the obtaining of specific IQ's, 
has been found to be of relatively little value, 
considering the time and effort involved. 

е) In practice it has been quite feasible to 
gather H-T-P drawings on a group rather than 
an individual basis; much of the interpretative 
value of the tests can be obtained from such 
group-gathered drawings. 

f) The postdrawing interrogation may also 
be done on a group basis; or it may be omitted 
entirely. The time and effort spent in making 
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this postdrawing interrogation on an individual 
basis is questionably expended, as against utiliz- 
ing this time for a general psychological inter- 
view of the subject, in which reference to the 
drawings may play some part. 

For the foregoing reasons, this reviewer would 
strongly advise clinical psychologists who use 
the H-T-P to take Buck's administration and 
scoring procedures with decided skepticism and 
to adapt testing methodology to their own realis- 
tic work schedules. Used in this freer manner, 
the test may yield, for an expenditure of rela- 
tively little time and effort, some extremely use- 
ful interpretative material—albeit not nearly as 
much as Buck would like us to believe that it 
normally yields. 

Regarding the manner of interpreting per- 
sonality evaluations and making differential di- 
agnoses from H-T-P protocols, Buck is to be 
especially commended for his recently found 
skepticism about attributing specific meanings 
to specific aspects of the figures drawn by the 
subjects. Thus, he continually warns us in the 
new mimeographed manual, *you don't want to 
put too much weight on any one or two isolated 
items"; "nothing (if one can make such a 
general statement about anything in the H-T-P) 
always and necessarily represents any one 
thing"; “colors do not have any absolute and 
universal meaning"; "nothing in the quantita- 
tive scoring system can be taken automatically 
at face value"; “по specific patterns need ever 
be expected on the H-T-P”; “no single sign is 
of and by itself conclusive evidence of anything ; 
it is the total constellation that matters.” These 
are salient warnings long overdue from over- 
enthusiastic exponents of drawing interpreta- 
tion and are to be taken most seriously by the 
sound clinical psychologist who uses this test. 

Unfortunately, Buck frequently contradicts 
his own warnings and makes rash general and 
specific interpretative statements about the 
H-T-P which, as yet, are not backed up by any 
factual evidence whatever. He presents, in fairly 
dogmatic form, hypothesis after hypothesis 
which may, logically, seem to be true but which 
have not yet been psychologically and scien- 
tifically established. For example: 

Each drawn whole, the House, the Tree, and the 
Person, is to be regarded as a self-portrait. * You can 
use the H-T-P in blind analysis without any informa- 
tion about the patient and arrive at a fairly accurate 
diagnosis in the majority of cases. * If the ground line 


is spontaneously drawn it appears to be because the pa- 
tient, himself, feels insecure * If it is drawn going up- 
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ward to the right of the page, the subject probably feels 
he has a pretty uphill row to hoe in the future. 

When the perspective score is depressed we assume 
that the subject is losing insight. * Inadequate detail- 
ing....definitely suggests withdrawal tendencies. * A 
subject who makes sparing and unemphasized use of 
shading..is sensitive to..his relationship with other 
people. * Placement to the left of the vertical midline 
tends to emphasize immediate satisfaction of impulses * 

The House drawn with the side wall only visible... 
connotes very serious withdrawal and oppositional 
tendencies. * The wall (or walls) of the House seems 
to depict the Ego of the subject and its strength or 
weakness to symbolize the corresponding strength or 
weakness of the Ego. * If there are no panes drawn 
in the window, we presume that the subject has defi- 
nite oppositional tendencies. * The Tree appears to be 
the best subconscious portrait of the Self of the three 
wholes. * The disproportionately small head is...the 
Obsessive's expression of a desire to deny the site of 
painful thoughts and guilt feelings. An overly large 
mouth implies oral eroticism. * 

'These are all interesting hypotheses; it is 
possible that some of them may eventually be 
proved factually true. At the present time, how- 
ever, Buck presents no objective evidence in 
their support. As Payne notes in his Observa- 
tions on the Use of Color with the H-T-P: 
“These observations will have to be proven, or 
disproven, as the case may be, by additional 
research. Certainly more questions have been 
raised than answered." It must be sadly noted, 
however, that at the date of their 1950 presen- 
tations, Payne had been experimenting with the 
chromatic H-T-P for 5 years, and Buck with the 
achromatic H-T-P for more than 15 years. Un- 
der the circumstances, it is surely reasonable to 
expect considerably less speculation and more 
factual supporting material from both these au- 
thors. But, as yet, we are given literally hun- 
dreds of speculative statements—and virtually 
not a single fact—on personality interpretation 
through H-T-P analysis. 

In sum: the H-T-P technique appears to be 
a valuable addition to the projective testing bat- 
tery of the clinical psychologist under the fol- 
lowing conditions: when the test is used in a 
nonrigid, practical manner; when the time de- 
voted to its administration, scoring, and inter- 
pretation is reduced considerably below that 
advocated by Mr. Buck; when interpretations 
are made on a cautious, distinctly realistic basis, 
and the interpreter endeavors to get a little valid 
material rather than a mass of speculative mean- 
derings; when the H-T-P interpretations are 
made in the light of considerable other informa- 
tion known about the subject; and when these 
interpretations are backed by wide experience 
with clinical psychological procedures in gen- 
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eral, and projective drawing techniques in par- 
ticular. The sooner Buck gives us some factual 
data in support of his several hundred specula- 
tions concerning H-T-P personality interpreta- 
tions, the sooner we shall be able to take these 
speculations more seriously. 


EpHraim Rosen, Assistant Professor of Psy- 
chology, University of Minnesota, Minneapolis, 
Minnesota. 

New developments in administration, scoring, 
and interpretation of the H-T-P have occurred 
since the test was reviewed in The Third Men- 
tal Measurements Yearbook: (a) A chromatic 
form of the test, in which the subject has a choice 
of eight colored crayons with which to draw, 
has been devised. Choice of colors and methods 
of using them are believed to point to a variety 
of personality factors. (b) To the original scor- 
ing modes have been added some new ap- 
proaches, particularly one with the striking 
name “Quality of the Quantity.” This approach 
qualitatively evaluates scores and discrepancies 
between intratest scores by clinical interpreta- 
tion of the pattern into which the scores fall. 
(c) On the basis of cumulative test records, the 
test author and other H-T-P users have added 
further suggestions as to the meaning of specific 
qualitative “signs.” 

Most of the criticisms of the test made by the 
reviewers in The Third Mental Measurements 
Yearbook seem to have retained their validity. 
Quantitative scoring is as complex and time- 
consuming as ever. The qualitative scoring sys- 
tem and interpretation are, on the whole, as 
subjective as before. Validity data, in general, 
are meager. However, it is equally true, as the 
earlier reviewers pointed out, that the H-T-P, 
like many other projective devices, may well be 
useful in the hands of skilled clinicians and that 
with further work it may develop into a proved 
instrument. 


Psychiatric Q Sup 23:376 pt 2°49. * The ad- 
ministration of the test is relatively simple ; how- 
ever, the scoring is so complex that it seems 
doubtful whether anyone except the author can 
achieve adequate results. This reviewer has used 
the test and found rather poor correlations with 
other standard intelligence tests. In addition, its 
power as a diagnostic tool is questionable. Cli- 
nicians versed in Freudian concepts will derive 
useful material concerning personality dynamics 
from the subjects’ drawings. There is no doubt 
that the test has possibilities as a projective tech- 
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nique; but as the author implies in his manual, 
it needs much more work before it can be con- 
sidered a proved clinical instrument and take 
its place alongside the T.A.T. and/or the Ror- 
schach. However, for clinicians well versed in 
personality 4ynamics, the H-T-P test should be 
useful even in its present state of development. 


For reviews by Morris Krugman and Kath- 
erine W. Wilcox, see 3:47. 
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*Horn-Hellersberg Test. Ages 3 and over; 1945- 
49; based upon drawings taken from Horn Art Apti- 
tude Inventory (see 3: 171); a measure of "capacity to 
function or to adapt to a given surrounding” ; 1 form, 
^45; no data on reliability; no data on validity in man- 
ual (for data presented elsewhere by the author, see 5 
below) ; no description of normative population ; mime- 
ographed manual, second edition (40); $2 per 25 tests; 
25¢ per manual; 35¢ per specimen set; postage extra; 
(30-90) minutes; Elizabeth F. Hellersberg ; the Au- 
thor, 641 Whitney Ave., New Haven, Conn. * 
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versity, Lewisburg, Pennsylvania. 

Hellersberg has developed a psychological 
technique based upon a section of the Horn Art 
Aptitude Test. Twelve small rectangles, each of 
which contains portions of lines taken from well 
known pictures, and a blank rectangle are pre- 
sented to the subject. A picture is drawn in each 
rectangle, a title is written beneath, and the 
subject is interviewed at length to determine 
the significance of each drawing. Since the pic- 
tures are drawn in pencil, erasures may be made 
during the test. Careful notes are taken by the 
examiner during the entire performance, and a 
complete record of the interview must be made. 
Supplemental material for graphological infer- 
ences may be obtained from a 5- to 10-line pen- 
and-ink description the subject writes on one 
of the productions. 

The pictures are studied for themes and items. 
In the population tested by Hellersberg and her 
associates, certain picture elements were found 
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to be of frequent occurrence; others occurred 
rarely. Pictures related to daily life, for instance, 
indicate a good social rapport. Those with mor- 
bid or horror elements are assumed to indicate 
dysphoria and loss of contact with reality. At- 
tempts to imitate Charles Addams, however, 
might occur in the drawings of pseudo sophisti- 
cates. The examiner must probe carefully to de- 
termine whether the productions indicate a sense 
of reality. Bizarre horror-drawings, for instance, 
might be plotted on Chart A as lying in the 
"Danger Zone" and in the column indicating 
“Loss of Reality Contact.” Percentages are cal- 
culated from this chart. These indicate the posi- 
tions of items and themes in the following zones : 
objective, expressive-emotional, repressive-ra- 
tionalization, and danger. Mere scribbling by 
an adult would lie completely outside these zones 
and indicate loss of the sense of reality. Nor- 
mal individuals have a flexibility in their 
productions. On the stereotypy-individuality di- 
mension, they vary between objective repre- 
sentational drawings and those expressive of 
wholesome affects. 

Table B is provided for including data about 
form elements. For instance, a drawing might 
be done in bold, heavy strokes; it might depict 
a scene of violence; it might be original or trite. 
The clinician’s notes taken during the test and 
the interview are used to arrange form elements 
in this table. Tempo of work, erasures, mood 
fluctuations, ego-involvements, signs of pleasure 
or frustration—these are examples of data in- 
cluded in the table. In particular, the use of sym- 
bols must be determined. Many of them are ob- 
vious to the experienced clinical psychologist. 
A. thorough inquiry is required in order to work 
up the material to be included in this table. 

At least an hour is needed for administering 
*he test and conducting the interview. Though 
the manual contains some examples of scoring 
problems, the labor of working out the chart 
and the table is arduous. Subjective criteria must 
be employed. Great skill is needed in eliciting 
from the subject his interpretation of each pro- 
duction. Both the verbalizations and the pictures 
must then be evaluated. Apparently, there are as 
yet no scoring tables or samples analogous to 
those available to users of the Rorschach. Ex- 
perience, common sense, and intuitive clinical 
judgment must guide the examiner. In our cul- 
ture, with its emphasis upon individuality, a wide 
variety of items and themes is to be expected. 
Pueblo Indians who were tested by the Horn- 


Hellersberg technique were relatively easy sub- 
jects compared to the urban groups Hellers- 
berg and her colleagues have examined. Never- 
theless, the test yields “an effective yardstick of 
the individual’s present capacity to function in 
the adult world of our American civilization.” 

The rationale of the Horn-Hellersberg Test 
is inadequately, though plausibly, set forth. No 
statistical evidence is offered to demonstrate the 
validity or the reliability of this test. Both the 
chart and the table do present an array of data 
that are impressive. How can the examiner be 
at all certain that these data are of value in ag- 
sessment of the sense of reality? Behavior data 
of importance, no doubt, are brought into ex- 
pression by means of this technique. Affirma- 
tions, specious generalizations, and brief case 
notes are included to justify the assumption that 
they reveal the individual's degree of rapport 
with reality. Nevertheless, Hellersberg does state 
that the test is still in its experimental stages. 
Perhaps objective validating evidence will be 
forthcoming. It may be, indeed, that this test 
will “complement other personality tests such 
as the Rorschach Ink Blots, Thematic Apercep- 
tion [sic] Test, Bender Gestalt, Drawing of 
Human Figure, Handwriting Analysis, etc." At 
present, no valid data are offered to support this 
hope. 

A Rorschach, if intelligently administered and 
cautiously interpreted, might yield data about 
an individual's sense of contact with reality. At 
the same time, it might bring into expression 
many other facets of the personality. On the 
grounds of economy of time alone, it is doubtful 
whether clinical psychologists will have much 
use for the Horn-Hellersberg Test. It does, how- 
ever, offer many possibilities for research in- 
vestigations. There are opportunities for estab- 
lishing norms for scoring, quantifying data, and 
comparing the results with those obtained from 
other projective techniques. Certainly, there 15 
a need for an objective assessment of the indi- 
vidual's status on the objectivity-subjectivity 
continuum. This technique may mark the begin- 
ning of research in this area. In short, further 
research is certainly to be encouraged. The clini- 
cal serviceability of the test at its present stage 
of development is rather doubtful. 

After a conscientious perusal of this mono- 
graph, together with a little experience in trying 
to score a few tests, the reviewer came to three 
conclusions. First, there is a need for a clearly 
written, explicit manual with adequate scoring 
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samples, norms, and instructions for interpreta- 
tion. Secondly, “the whole problem of reality de- 
serves far greater consideration than it has thus 
far received.” This test should be an influential 
stimulus to further attacks on the problem. 
Thirdly, the results of Horn-Hellersberg Test 
reported in this monograph derive their value 
from Hellersberg’s enthusiasm, clinical experi- 
ence, and proficiency in making shrewd infer- 
ences. Perhaps, had she appraised the sense of 
reality through the use of any other worth while 
technique, she might have obtained material 
that would be equally convincing. This fact, 
however, does not warrant the assumption that 
other clinical psychologists will be able to employ 
this technique in its present form with similar 
proficiency. 


Т. W. Ricuarps, Professor of Psychology, 
Northwestern. University, Evanston, Illinois. 

This test, like the Geosign Test of Reichen- 
berg-Hackett, presents the subject-with partially 
structured graphic figures which he is asked to 
complete in any way he chooses. These represent 
“reality,” apparently, and what the subject does 
is interpreted as bringing to light “the concrete 
manner in which the individual realizes his inner 
strength in a life situation.” Hellersberg feels 
that in this sense.the H-H Test is a complement 
of tests such as the Rorschach, Bender-Gestalt, 
Drawing of the Human Figure, and so forth. 

The manual presents several illustrative cases, 
but no normative sample of productions. There 
are no data on reliability or validity, though she 
does refer to analysis of 40 cases of adolescents 
followed up after a year’s work experience in 
which she was able to “predict which subject 
was prepared for adult life and which was lag- 
ging in maturity.” 

After the 13 drawings of this test are ob- 
tained, together with a handwriting sample, pro- 
ductions are discussed with the subject, and he 
indicates preferences. His total performance for 
each picture is evaluated according to an inter- 
pretive scheme which leans heavily on princi- 
ples of “depth” psychology or psychodynamics. 
This scoring yields a pattern by which the draw- 
ings are considered as varying between the “Ob- 
jective Zone" through intermediary zones to 
“Toss of Reality Contact.” This classification of 
responses requires the exercise of considerable 
judgment by the scorer. ] 

It should be evident that the Н-Н Test (аз 
proposed by its author) offers (a) a technique 


for eliciting productions and (b) a scheme for 
interpreting the significance of these produc- 
tions. In the opinion of this reviewer, the first 
of these contributions is worth further explora- 
tion; by completing a partial structure the sub- 
ject may well reveal his individuality in impor- 
tant areas. The second offering of the test—its 
interpretive scheme—requires theoretical as- 
sumptions which themselves need to be tested 
at a much more basic level. This reviewer does 
not feel that the assumptions are invalid, but 
that, since they are assumptions, they do not 
justify the formal structure of the scoring system 
here proposed by the author of the test. 


For related reviews, see 109. 
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*[Re Horn-Hellersberg Test.] HELLERSBERG, ELIZ- 
ABETH F. The Individual's Relation to Reality in 
Our Culture: An Experimental Approach by 
Means of the Horn-Hellersberg Test. Springfield, 
Ill.: Charles C Thomas, Publisher, 1950. Pp. x, 128. 
$3.25. * (PA 25:2131) 

B Menninger Clinic 15:155 Jl "sr. Walter 
Kass. The title of this monograph could more 
accurately be: An Introduction to the Horn- 
Hellersberg Test. This test is a semi-structured 
projective drawing completion technique in 
which various given lines within a series of 
squares are the basis of drawings freely made by 
the subject. After an interview with the subject 
about the meaning of the title and content of 
each drawing, personality analysis and diagnosis 
of psychopathology is made from the individual 
ways of dealing with reality revealed in the draw- 
ings. It is regrettable that Dr. Hellersberg did 
not have space to develop her culture-reality, ac- 
culturation concepts, and document them more 
liberally from the ample source materials she has. 
This test has rich psychodiagnostic possibilities. 
It is recommended as a diagnostic tool for clini- 
cal psychologists, a research tool for anthro- 
pologists, sociologists and other investigators of 
personality expression. Я 

J Clin Psychol 7:295 Jl ^51. * The author 
claims to identify several signs which may be 
taken as indicators of a breakdown of contact 
and adaptability to reality. These signs probably 
have suggestive significance but there is no ob- 
jective data in this monograph in support of the 
author's intuitions. 

J Consult Psychol 15:168-9 Ap ' 51. E. Lowell 
Kelly. * essentially a manual for the Horn- 
Hellersberg Test. Although the ten-page Ap- 
pendix is entitled "Instructions for administra- 


109 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 184 


—————————————————————————————————————— 


tion and interpretation," it is in itself a very 
inadequate manual, so it is fortunate that 
the body of the text is devoted largely to 
further instructions and to discussions of a 
series of illustrative cases with whom the in- 
strument has been used. The Horn-Hellers- 
berg Test is a projective device in which the 
subject is instructed to finish incomplete line 
drawings. This task is followed by an interview 
inquiry structured by the examiner to provide 
the necessary information for scoring and inter- 
preting the subject's test behavior. Although re- 
sponses are scored on many determinants— 
sequence, movement originality, etc.—the author 
regards the test as primarily useful in evaluating 
a subject's relation to reality, and has derived a 
method of content scoring to provide an index 
of reality adjustment regardless of the subject's 
cultural background. It would appear to be a 
useful technique. The author states that she has 
used the test with 2500 cases and has "found 
that if 36 per cent of all drawn items are placed 
in the Objective Zone, the individual is still able 
to function normally as an adult in our civiliza- 
tion.” Unfortunately, however, no normative 
data are provided, and there is no indication as 
to what may be expected by way of test-retest 
reliability or interexaminer agreement in scoring 
and interpretation. 

Psychol Service Center J 2:250-2 S ' 5o. Al- 
fred Jacobs. * "Relation to reality," as measured 
by the test, appears to be based on a comparison 
of the content and the associations connected 
with the drawings of one individual to the draw- 
ings of other individuals in the culture. However, 
no norms are presented in the monograph. The 
author states that if 3676 of the responses fall in 
an "Objective Category," the individual is able 
to function normally as an adult in our culture, 
so that a criterion measure of adjustment to 
society is implied. It might be disputed, con- 
sidering the failure of the author to submit rele- 
vant evidence, that neither the test score nor a 
measure of adjustment to society are highly re- 
lated to what the psychiatrist considers “sense of 
reality." Also absent are tests of the significance 
of the assumed relationship between scores on 
the Horn-Hellersberg Test and whatever opera- 
tions were used to define adjustment to society. 
The author describes the validity of the test as 
depending on "inner and qualitative consistency, 
a concept which more and more displaces that 
of a proof by mere quantitative methods" (p. 
52). The reviewer is not convinced that current 


psychological literature provides any evidence 
for the increasing popularity or acceptance of à 
non-quantitative definition of validity. Despite 
the lack of communicable reliability and validity 
statements, the author states that "vocational 
counsellors, teachers, guidance agents, social 
workers, who must make certain decisions re- 
garding an individual's training, placement or 
vocational choice, can use such a test to help 
control and supervise the effect of any new ar- 
rangement made by or for the subject." (p. 
102). The test appears to be capable of eliciting 
varied responses which are scored in terms of 
dimensions developed by the author, and in 
terms of categories previously used in the 
scoring of such tests as the Rorschach and 
the Bender-Gestalt. Assuming that the scoring 
has reasonable reliability, which would appear 
questionable for some of the complex dimen- 
sions, the test should prove a promising one for 
personality investigation. Aside from sampling 
drawing behavior, the test does not appear to 
differ appreciably from other commonly used 
projective techniques. In view of the recent 
deluge of projective materials in clinical psy- 
chology, the question may be raised at this point 
as to whether it will be more fruitful to extend 
the validation of accepted tests rather than look- 
ing for new projective devices, and, as in the 
case of the Horn-Hellersberg Test, whether it 
is legitimate to insist that new tests support their 
claims of being improvements or of being dif- 
ferent by the type of evidence which is de- 
manded for all types of psychological research. 
As in the case of other projective techniques, 
the Horn-Hellersberg Test seems to elicit re- 
sponses which give information about both as- 
sociative and motivational factors. Because of 
the lack of previously established responses to 
the ambiguous test stimulus, the responses which 
are elicited are frequently widely generalized 
and characteristic of the individual in many other 
situations. Since responses which are strongly 
motivated also seem to generalize more widely 
than less strongly motivated responses, strongly 
motivated responses appear to be more easily 
elicited by the ambiguous test stimulus, and in- 
formation about motivation is obtained from the 
test. In view of the lack of reliability and validity 
data, the use of the test as a clinical instrument 
at this time might be premature. In view of the 
quantifiability of some of the scoring dimen- 
sions, the test would provide an interesting and 
amenable area of investigation. 
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Q J Child Behavior 3:114-5 Ја '51. Joseph 
Katz. * The author has developed a systematic 
scoring scheme which is intended to serve as a 
measure for evaluating the normality of an indi- 
vidual’s relation to reality. This consists of 
classifying the subject matter drawn according 
to a check list of 40 topics under 9 degrees of 
subjective relatedness of these topics to the in- 
dividual. The tabulation of the items drawn are 
then seen to distribute themselves to form a 
scatter which normally falls, for the most part, 
into the Objective Zone, which is the zone of 
closest contact with reality. * The author cate- 
gorically states that by repeated experimenta- 
tion she has found that if 36 per cent of all drawn 
items are placed in the Objective Zone, the in- 
dividual is still able to function normally as an 
adult in our civilization. However, no details or 
data as to the scope or nature of the experimenta- 
tion is presented; nor is mention made of the 
possibility of so called normals who might con- 
ceivably fall below this 36 per cent objectivity 
or of some schizophrenics who may very well 
attain higher than this rigid criterion score. 
Later in the book, the author herself correctly 
states that the qualitative verbalizations, fluctua- 
tions in performance, general approach, and 
most important, the pertinent background of the 
individual, are all indispensable factors which 
have to be considered before passing judgment 
on the individual’s relation to reality. Again, the 
author speaks glibly of the high reliability of 
diagnosis and prediction made by means of this 
test in the past, without presenting ample sub- 
stantiating proof. The book is very well illus- 
trated with drawings, interpretations, and brief 
case histories of a variety of normal and ab- 
normal subjects presented. The interpretation 
of some of the drawing features and symboliza- 
tions are at times somewhat speculative but in 
general are treated cautiously and within a sound 
framework of personality development. The pres- 
ent investigation, while leaving much to be de- 
sired, nevertheless helps to establish the Horn- 
Hellersberg Test as a valuable contribution to 
projective psychology and as a very useful tool 
for the experimental study of such a vital but 
neglected concept as the individual’s relation to 
reality in our culture. 


[110] 


*The Kahn Test of Symbol Arrangement. Grades 
3-16 and adults; 1949; а projective technique for dif- 
ferential personality diagnosis ; $18.50 per set of test- 
ing materials, postpaid; nontimed (25-30) minutes; 


Theodore C. Kahn; Guidance Tools, P.O. Box 802, 
Wilmington, Calif. * 
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2. Kaun, Turopore С. “An Original Test of Symbol Ar- 
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chol 15:439-44 О '$1. * (PA 26:7147) 

EDWARD JOSEPH SHOBEN, JR., Associate Pro- 
fessor of Education, Teachers College, Colum- 
bia University, New York, New York. 

While it has much in common with certain 
of the projective techniques, the Kahn Test of 
Symbol Arrangement (KTSA) differs in that 
it uses essentially well structured stimulus ob- 
jects, such as hearts, stars, dogs, an anchor, etc., 
rather than amorphous ones. The rationale for 
this is not completely clear but seems to lie in 
the assertion that in the selection and manipula- 
tion of symbols the individual “reacts holistically 
with all of his dynamics concentrated at a given 
point at a given time.” Thus, in the selection 
and manipulation of the symbols of the Kahn 
Test of Symbol Arrangement, the subject is 
presumed to reveal his typical modes of approach 
to the various symbols impinging upon him in 
his daily life. 

However that may be, this test presents an in- 
teresting and easily administered task, appar- 
ently suitable to a wide range of subjects. It con- 
sists simply of 15 plastic objects which the 
subject is required to line up on a strip of felt 
marked off into 15 consecutively numbered seg- 
ments of equal size. He is then asked to verbalize 
his arrangement. This is gone through twice. 
The next step involves having the subject inter- 
pret each of the objects symbolically in the sense 
that the American flag is symbolic of the United 
States as a nation. When this is done, he must 
recall the second of his arrangements. A final 
procedure has the subject line up the objects in 
order of his preference or “liking” for them. 
There are some interpolated tasks in which the 
subject must superimpose one object on another, 
but these do not (as yet, at least) enter into the 
scoring in any significant way. 

Scoring is in terms of categories descriptive 
of the subject’s approach to the “line-up,” “sym- 
bolization,” and “preference” tasks. Inter-scorer 
reliabilities of the order of .97 are reported. This 
impressive coefficient is buttressed by a reported 
retest reliability of about .95, indicating that not 
only are the categories amenable to independent 
evaluation among judges, but that the behavior 
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that the test elicits tends to be stable from ad- 
ministration to administration. 

The only evidence for validity lies in a study 
of the KTSA's power to discriminate a group 
of brain-damaged psychotics from a nonpsy- 
chotic control group. The differentiation was 
successful, but a number of cautionary notes 
must be sounded. First, the control group ex- 
ceeded the brain-damaged psychotics in IO, ed- 
ucational attainment, and occupational status; 
these, rather than "personality factors," may 
have determined the performance differences. 
Second, while the overlap in scores is not great, 
there is no information as to the number of false 
positives or false negatives picked up by the test. 
"Third, studies of this sort are by no means con- 
clusive until cross-validational investigations 
have eliminated the possibility that sampling 
fluctuations did not account for differences. Fi- 
nally, with such extreme groups as these, one 
may legitimately wonder if the test improves 
diagnostic prediction at all. Would not the same 
amount of time spent in an interview result in 
the same separation of the diagnostic categories? 

Another point must be made relative to the 
use of the instrument in its present stage of de- 
velopment. While the scoring categories seem to 
promise much in terms of their objectivity, there 
is as yet no indication of how they are related to 
specific nontest aspects of behavior. Until such 
evidence is gathered, it must remain a tenable 
hypothesis, especially in the light of the rela- 
tively uncontrolled educational and intelligence 
factors in the only available validation study, 
that cognitive rather than psychodynamic fac- 
tors are being tapped by the KTSA. 

On the other hand, the KTSA is a simpler, 
more widely applicable situation than most in- 
struments at hand for investigating develop- 
mental patterns and various attributes of psy- 
chopathological behavior. On a research basis, 
its use should be strongly encouraged. As a test, 
it is still essentially unproven. 


[111] 
*Machover Draw-A-Person Test. Ages 2 and over; 
1949; also called Machover Figure Drawing Test; 
individual ; $3.50 (17s. 64.) per manual, Personality 
Projection in the Drawing of the Human Figure: A 
Method о} Personality Investigation (see 5 below), 
which contains directions for administering and inter- 
preting and associations record sheet; associations 
record sheets must be reproduced by individual test 
users; cash orders postpaid; (5-60) minutes without 
associations, (20-90) minutes with associations ; Karen 
Machover; Charles C Thomas, Publisher, 301-327 
East Lawrence Ave., Springfield, Ill. * (English pub- 


lisher: Blackwell Scientific Publications, Ltd., Oxford, 
England.) 
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РнплрІ. Harriman, Professor of Psychology 
and Head of the Department, Bucknell Univer- 
sity, Lewisburg, Pennsylvania. 

Complex in its interpretation, the technique of 
personality investigation through drawings of 
the human figure is simple to administer. Only 
IO or 20 minutes are needed to have the individ- 
ual draw a person and then to draw another of 
the opposite sex to that first drawn. The exam- 
iner obtains useful clinical data from the associa- 
tions which are given at the end of the test. Ina 
future publication there is to be a full discussion 
of the significance of information elicited by free 
associations and directive questions. 

Interpretations of “salient characteristics” of 
the individual require but 10 or 15 minutes. 
More complete analysis of the personality, of 
course, takes a much longer time. Nevertheless, 
the method is recommended because of its ease 
of administration, its economy in time, and its 
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interest-value for both the individual who is 
tested and the clinical psychologist who knows 
how to interpret the material. The Machover 
monograph is in the nature of a report-of-prog- 
ress. Fifteen years of work have been devoted 
to the development of the technique, principally 
in improvements of the drawing analysis as “а 
valuable supplementary clinical method." That 
the method is not overly difficult to master is 
apparent from this quotation: “With proper 
training and experience....facility in the use of 
the method may be gained in a relatively brief 
time * graduate psychology students have been 
able to acquire a grasp of basic principles....after 
some orientation lectures.” 

The monograph is divided into three parts. 
Part I is the least rewarding section of the book. 
Although she does mention some of the litera- 
ture on creative drawings, Machover does not 
yet appear to have developed a satisfactory theo- 
retical basis for her procedures. It is only fair to 
state that she realizes this weakness and that fu- 
ture publications are in the making. In the 29 
references given at the end, there are many sug- 
gestions for a lucid and convincing rationale, 
but they are not utilized in her disappointing 
introductory section. Many discussions of ex- 
pressive art are readily available, and it should 
not be hard to locate them. It is rather arrestive 
to observe that Viktor Lowenfeld’s analyses * 
of expressive drawings were overlooked. Simi- 
larly, she takes no cognizance of recent investi- 
gations into the psychological aspects of creative 
art. In familiar territory of creative literary 
work, such scholars as Lowes would have much 
to contribute to a better understanding of ex- 
pressive activity in almost any form. The vast 
amount of folklore connected with various parts 
of the human figure is completely ignored. Even 
a hasty perusal of Frazer’s Golden Bough or of 
a good commentary on Hebrew folklore in the 
Old Testament would seem to be of great assist- 
ance in establishing a more convincing basis of 
theory. Certainly, an extensive reading of refer- 
ences listed by Goodenough? and by Anastasi 
and Foley * would be of value in working out a co- 

1Lowenfeld, Viktor. Creative and Mental Growth: A Text- 
Book ов Art Éducation, New York: Macmillan Со., 1947. Рр. 
Дело Florence L. Measurement of Intelligence by 
Drawings. Yonkers, N.Y.: World Book Co., 1926. Pp. ИА 
ККИ Apne and Роу, LC eos Я ТҮ Н fatorical 
Lipi HEP RT Jen. J General Psychol 25:111—42 

Anastasi, Anne, and Foley, John P. “A Survey of the Liter- 
ature on Artistic Behavior in the Abnormal: 11, Approaches 
and Interrelationships." Anm NY Acad Sci 42:1-112 Ag 11 
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herent, logical theory for the figure-drawing test. 

The basic assumption is that an individual's 
experience with his own body results in sym- 
bolic investments of various parts of the buman 
figures which he draws in the test period. He 
becomes profoundly ego-involved in the task, 
and deep conflictual needs and tensions come 
into expression. Thousands of drawings col- 
lected in psychological clinics are said to con- 
firm this assumption. It is asserted that the 
technique may prove to be as valuable in locating 
personality assets as it has already been proved 
to be in appraising deficits of personality. Not 
only do the separate parts of the drawing have 
significance, but also the structural and the 
formal aspects reveal a great deal. 

Whether or not similar findings might be 
found among normal individuals goes unmen- 
tioned. When this reviewer tried out the Mach- 
over technique on college students, he found 
that the better adjusted ones either took the test 
facetiously or became irked by inability to reach 
their level of aspiration in pencil drawings. 
Badly adjusted students did become ego-in- 
volved and exhibited symptoms of dysphoria, 
not only in their drawings and associations but 
also during the test itself. Normal women 
seemed to be preoccupied with depicting current 
fashions in clothing and hairdo for their own sex 
and in humorous caricatures of "Joe College" 
males. The normal men labored to produce 
Vargas-type females. When normal males drew 
their own sex, they became deeply involved, and 
a wide variety of significant drawings was pro- 
duced. This little excursion into a use of the 
Machover technique is what made the reviewer 
disappointed with Part I. 

Part II is written with convincing enthusi- 
asm. No doubt, many a stimulating discussion 
lies behind the innumerable hypotheses and 
inferences in this section. The symbolic invest- 
ments of each part of the human figure are de- 
scribed in a style that never departs from ebul- 
lience. Naturally, the writer's sincere conviction 
about the value of figure drawing as a method of 
personality investigation carries weight with the 
reader. Here and there a bold generalization is 
supported by case notes, and then the case notes 


ture on Artistic Behavior in the Abnormal: III, Spontaneous 
Productions. American Psychological Association, Psychological 
Monographs, Vol. 52, No. 6, Whole No. 237. Washington, D.C.: 
the Association, Inc., 1940. Pp. iii, 71. * (РА 15:3387) 

Anastasi, Anne, and Foley, John P., Jr. “А Survey of the 
Literature on Artistic Behavior in the Abnormal: IV, Experi- 
mental Investigations." J General Psychol 25:187—237 Jl '41. 
* (PA 15:5115 
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are cited as justification of the generalization. 
Just as an eloquent orator can obfuscate his au- 
ditors, so does Machover succeed in keeping the 
reader from asking such questions as: What is 
the evidence? Are there any objective proofs? 
Why, for instance, is the Adam's apple some- 
times indicative of sexual weakness (p. 58)? 
The style often becomes fervent and exhortatory, 
as if to induce a sort of religious conversion to 
the Machover technique. Only by re-reading the 
section with zealous devotion to the ideals of 
scientific expository writing can one escape be- 
ing "sold on" the test. Though evidence and 
proof are lacking, the discussion of interpreta- 
tion is decidedly worth study. Shrewd clinical 
observations abound. Hypotheses worthy of ex- 
perimental investigation illuminate almost every 
page. Like many other proponents of ingenious 
projective techniques, Machover is a facile, con- 
vincing writer and an experienced clinical 
psychologist. Were she to “sell” a technique for 
the investigation of personality by any other 
worth-while method, she would, no doubt, suc- 
ceed in establishing it in "many progressive 
clinical units throughout the country." Likewise, 
no doubt, the questions about rationale and 
about validation would still be pertinent. In its 
present stages of development, Part II may be 
of greatest use in stimulating objective research, 
not in personality investigations where human 
welfare is involved. 

Part III includes seven brief illustrative ex- 
amples of the method. Clinical histories, repro- 
ductions of the drawings, and füll interpreta- 
tions are given. These furnish useful data for one 
who would understand the method of interpre- 
tation and the underlying concepts of the tech- 
nique. They also bring out the fact that the 
procedure is challenging, worth while, and 
plausible as a supplementary clinical tool. In its 
present stage of development, it will attract a 
great deal of attention and stimulate further 
study. This little book is of great importance to 
the psychologist who is concerned to keep up-to- 
date in psychodiagnostics. Machover is to be 
commended for making a report of progress. 
Nevertheless, drastic improvements are needed 
before the draw-a-person technique should be 
employed in personality appraisals. This pre- 
liminary report offers substantial hope that they 
will be effected. 


Naomi Stewart, Educational Testing Service, 
Princeton, New Jersey. 


Drawings of the human figure, along with 
other projective devices, have been used in clini- 
cal evaluations for a number of years, and a cer- 
tain amount of “figure-drawing lore,” varying 
in content from installation to installation, has 
accumulated. However, no detailed exposition 
of figure-drawing analysis such as that pre- 
sented in Machover’s Personality Projection in 
the Drawing of the Human Figure has been 
hitherto available. 

The method of administration is similar to 
that often followed in figure-drawing analysis. 
The subject is given a letter-size sheet of paper, 
a mnedium-soft lead pencil with eraser, and asked 
to “draw a person.” Neither the sex of the fig- 
ure to be drawn, nor its age, attire, nor attitude is 
specified. It is made clear that the drawing of a 
whole person is desired. When one drawing is 
completed, the subject is given another sheet of 
paper and asked to draw another figure of the 
opposite sex from the first. The drawing process 
is inconspicuously observed and notes made 
concerning sex of figure drawn first, sequence of 
parts, omissions rectified only at the examiner’s 
suggestion or urging, approximate time re- 
quired, and so forth. At the option of the ex- 
aminer, the subject may also be asked to reply 
to various questions concerning each person 
drawn. Ostensibly “associations” to the figures, 
the questions are actually designed to elicit the 
subject’s attitude toward himself and others. 
The “associations” data, however, are regarded 
by Machover as providing useful adjunct infor- 
mation only. They are not an essential part of 
the technique. 

Using only the drawings, it is presumably 
possible “їп a significant proportion of cases” to 
make accurate judgments covering the subject’s 
“emotional and psychosexual maturity, his апх- 
iety, guilt, aggression, and a host of other traits, 
and to arrive at a comprehensive understanding 
of his personality and behavior dynamics. The 
proportion of cases in which accurate judgments 
of this sort may be made is not indicated. No 
specific information is given concerning the ex- 
tent of agreement among different analysts at- 
tempting to make such judgments on the basis 
of a given pair of drawings, nor concerning the 
extent of agreement among judgments based 
on different pairs of drawings by the same 10- 
dividual. Machover states that her interpreta- 
tive principles have been reasonably verified mn 
clinical application to thousands of drawings 
studied in coordination with individual case rec- 
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ords over a period of 15 years, and that numer- 
ous studies involving the "blind" interpreta- 
tion of drawings have repeatedly validated the 
method, but data and details are not furnished. 

Although the Machover principles of inter- 
pretation are frankly empirical in origin, an at- 
tempt is made to provide a basic rationale for 
the method. Machover holds that the drawing 
of a person, in involving a projection of the 
"body image," furnishes a natural vehicle for 
the expression of one's body needs and conflicts. 
Her technique of interpretation is essentially an 
attempt to reconstruct the major features of this 
“self-projection.” 

The Machover method involves a careful and 
detailed examination of both the content of the 
drawings—the manner in which each part of the 
body and the clothing is drawn—and the struc- 
tural and formal aspects of the drawings—ab- 
solute and relative size of the male and female 
figure, placement on the page, symmetry, per- 
spective, type of line, shading, erasures, etc. 
In the interpretation, considerable emphasis is 
placed on the overall mood or tone conveyed by 
the figure through its facial expression and pos- 
tural attitude. Considerable attention is also 
paid to “conflict indicators"— differential treat- 
ment given to any area of the figure by conspicu- 
ous omissions, subtle breaks, dimming out or 
reinforcement of line, erasures, or shading. In- 
terpretation of the particular graphic treatment 
is based upon the functional significance of the 
part stressed. 

The essential character of the Machover 
method is perhaps best conveyed by quoting the 
author : 

The process of drawing the human figure is for the 
subject, whether he realizes it or not, a problem not 
only in graphic skill, but one of projecting himself in 
all of the body meanings and attitudes that have come 
to be represented in his body image. Consequently, the 
drawing analyst should feel free to extract from the 
graphic product what the subject has put into it. He 
should feel free to interpret directly aspects which, 
with striking literalness, often reflect real life problems 
and behavior of the individual who is drawing. The fig- 
ure is, in a way, an introduction to the individual who 
is drawing. Thus, when a subject erases his arms and 
changes the position of them several times, it may be 
literally interpreted that the subject does not know 
what to do with his arms in his behavior. If the fist is 
clenched, he may literally be expressing his belliger- 
ence. If the eye of the figure has a pensive, furtive, or 
bewildered gaze, it may often be characteristic of the 
individual who is projecting. 

The theme of “striking literalness” does in- 
deed loom large throughout the interpretation, 
and where the system is not directly literal it is 


indirectly so, through the mediation of various 
psychoanalytic equivalences which are assumed 
to exist. It is stated, for example, that overem- 
phasis of the mouth is frequently tied up with 
food fadism and gastric symptoms, profane lan- 
guage, and temper tantrums; that the individual 
who is quick to take offense and is resistive to 
authority may show moderate ear accentuation 
on that basis; that hair emphasis, whether it oc- 
curs on the head, on the chest, as a beard, or a 
mustache, may be regarded as an indication of 
virility strivings. Button emphasis, it is pointed 
out, occurs mainly in the drawings of mother- 
attached individuals, suggesting that the psy- 
chological significance of buttons may be 
connected with the umbilical symbol of mother- 
dependence; ties are described as varying from 
“the uncertain, tiny, and debilitated-looking tie 
of the individual who is despairingly aware of 
his weak sexuality, to the long and conspicuous 
tie, sometimes decorated with phallic like details, 
of the sexually aggressive individual who is 
driven to excessive compensation by fear of 
impotence.” Similarly, the stance of the figure 
is regarded in the same light as the stance of a 
real person, so that “insecurity of footing” shown 
in a drawing is taken as indicative of possible 
“insecurity of footing” on the part of the subject. 
By the same token, it is held that drawings char- 
acterized by a certain type of stance quite directly 
indicate fear of sexual assault on the part of a 
female subject, or anticipation of resistance to 
fantasied assault on the part of a male subject. 

Although undeniably literal, the technique of 
interpretation is not, however, as atomistic as 
these scattered excerpts may suggest. Machover 
stresses that the meaning of specific drawing 
traits must be considered in the context of their 
interrelation and cautions that a grasp of the 
mechanical details of drawing analysis cannot 
substitute for the knowledge of personality dy- 
namics and clinical syndromes which is indis- 
pensable to the proper use of the method. Never- 
theless, despite this emphasis on the importance 
of context and interrelationships, she gives very 
little explicit attention to the possible influence 
on drawing behavior of cultural and socio-eco- 
nomic background and educational level. Sex 
differences, adult-adolescent-child differences, 
and gross intelligence differences (feebleminded 
versus normal) are considered, but the possible 
effects of age differences among adults or of 
differences in intelligence among adults are al- 
most completely ignored. (Differences in artis- 
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tic skill, it might also be noted here, are held to 
be unimportant so far as the interpretation of 
figure-drawings is concerned.) 

From a theoretical standpoint, an additional 
question arises. As Goldworth (9) has pointed 
out, Machover, in emphasizing the projection 
of body needs and conflicts, appears largely to 
disregard the role of visual, objectively deter- 
mined stimuli in the formation of the subject's 
body image. Yet from the work of Ross,* 
Schilder,? and Lowenfeld,? it would appear that 
visual as well as tactile and kinesthetic experi- 
ences are involved. Particularly from Lowen- 
feld's work, it would, in fact, appear that there 
are basic differences among individuals with 
respect to the predominance of the two kinds of 
components in their body image. If so, it would 
seem that to whatever extent the “body image" 
is projected in the subject's drawing of a human 
- figure, the resulting product cannot be inter- 

preted in the same manner for all individuals. 

Because it-has been available in published 
form only since 1949, not much research has to 
date been reported concerning the Machover 
technique as distinct from other methods of 
figure drawing analysis, and very little of this 
has been on a sufficiently large number of cases 
to assure even a modicum of reliability. In the 

` study already mentioned, Goldworth investi- 
gated the incidence of "conflict indicators," us- 
ing the Machover criteria, in the drawings of 
normal, neurotic, schizophrenic, and brain dam- 
aged subjects. His data fail to substantiate 
Machover's formulations with respect to “con- 
flict indicators." To a certain extent, however, 
other data he reports may be taken as affording 
corroborative evidence for her statements re- 
garding the significance of size differences be- 
tween the male and the female figure and con- 
cerning the meaning of breast emphasis in 
drawings of the female figure. They may per- 
haps also be considered to confirm certain of 

Machover's observations regarding the inter- 

pretation of facial expression and stance. 

On the other hand, a study by Fisher and 
Fisher (8), designed to test the validity of cer- 
tain "paranoid signs" mentioned by Machover, 
fails entirely to support her formulations in this 

1 Козз, Nathaniel. “The Postural Model of the Head and 
Face in Various Positions (Experiments on Normals)." J Gen- 
eral Psychol 7: 144-62 °32. * (РА 7: 451) 

2 Schilder, Paul. The Image and Appearance of the Human 
Body: Studies in the Constructive Pris of the Psyche. 
London: Kegan Paul, Trench, Trubner & Co., Ltd., 1935. Рр. 
353. (PA 9: 5693 

3 Lowenfeld, Wiktor. Creative and Mental Growth: A Text- 


book on Art Education. New York: Macmillan Co., 1947. Pp. 
xv, 304. * 
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area. Further, Fisher and Fisher find rather 
poor agreement among trained raters as well as 
among untrained raters attempting to describe 
facial expression and stance, both of which figure 
prominently in the Machover technique. 

Until a considerable amount of additional evi- 
dence becomes available, it would seem unwise 
to rely on judgments based solely on interpreta- 
tion of the figure drawings. This does not pre- 
clude the possibility of using such drawings to 
furnish additional cues or otherwise to help 
clarify particular problems relating to individual 
cases, provided this is done in light of a great 
deal of more objective information. 

The few studies of the Machover technique, 
or selected aspects of it, which have been made 
to date have differed considerably in their orien- 
tation and their methodology. The findings— 
some possibly confirmatory, others apparently 
failing to confirm—may perhaps indicate that 
certain of the Machover formulations are more 
promising than others, both with respect to oper- 
ational reliability and with respect to funda- 
mental validity. Perhaps it will be possible even- 
tually to rationalize the various specific findings 
and reformulate the basic hypotheses accord- 
ingly. At the present time it would appear that 
the greatest value of the Machover technique 
lies in the focus for research in the figure-draw- 
ing area which it affords. 


For related reviews, see 112. 
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*[Re Machover Draw-A-Person Test.] Macno- 
VER, KanEN. Personality Projection in the Draw- 
ing of the Human Figure: A Method of Person- 
ality Investigation. Springfield, Ill.: Charles 
Thomas, Publisher, 1949. Pp. ix, 183. $3.50. * (Oxford, 
England: Blackwell Scientific Publications, Ltd. 175. 
6d.) (PA 23: 3217) 


Am J Psychiatry 106:395 N 749. Т. W. 
Richards. This contribution to the literature on 
projective techniques will be received enthusi- 
astically by those to whom the theoretical as- 
sumptions of dynamic psychology are congenial, 
and examined with suspicion, probably, by those 
who look for statistical or other quantitative evi- 
dence of validity for a test of personality. * the 
author relies for verification for her hypotheses 
regarding personality almost entirely on clinical 
evidence. She seems to expect that her discus- 
sion will seem plausible with only this. * Аз 
a report of methodology....Dr. Machover's book 
is a stimulating contribution. We need just such 
qualitative discussions to generate scientific 
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study of personality. As a contribution to the 
facilities of the clinician, it is to be expected that 
one's experience with the method in work with 
individual cases will confirm or contradict Dr. 
Machover's evaluation of her own experience. 
It is hoped that someone—Dr. Machover, per- 
haps—will bring together the several contribu- 
tions to the analysis of personality through draw- 
ings, and develop an integration with the 
theoretical principles which seem implicit in 
each or all of them. This task has been ignored 
in the present volume. It would be unfortunate 
if the author’s failure to integrate her work with 
that of Goodenough, Buck, and others served in 
any way to foster the notion of a “Machover 
technique” such as the illusion of specificity 
which was created for the Rorschach. It is the 
reviewer’s opinion that essential contribution 
to our understanding of personality will come 
not from some new technique, but from the 
study of what communality is revealed in the 
use of various techniques of appraisal. 

Am J Psychol 62:312-3 Ap ’49. * The bibli- 
ography....[suggests] the psychiatrical and clin- 
ical bent of the book. * The book, which is in- 
telligently and well written, offers agreeable 
relief from the new grist of divinations upon the 
hand-writing of the great. 

Am J Psychother 4:174-5 Ja ’50. John С. 
Watkins. * interesting and readable little book 
* One who has not been previously acquainted 
with this method of picture analysis will be 
amazed at the wealth of clinical impressions 
that can be obtained by the technique. * The 
book concludes with a brief bibliography and a 
rather thorough index. Excellently bound, the 
printing is on a high grade of paper. The inter- 
pretations are psychoanalytically oriented, and 
one cannot help but be impressed with the ex- 
tensive experience which the author has ob- 
viously had in the use of the method and with 
the shrewd clinical insight she shows in her 
interpretations. However, the work suffers, as 
do so many other methods in the field of pro- 
jective testing, with inadequate objective vali- 
dation. The various interpretations given dif- 
ferent features are offered without statistical 
results to support them. There is vague mention 
that these are based on a large number of cases, 
but the author does not share with us the exact 
findings from these data. Because of this severe 
limitation, objective and experimental-minded 
clinicians will find much to criticize in the work 
and will tend to use the interpretations presented 


with great caution and reserve until they are 
more universally proven. In this respect the 
reader will find it interesting to compare the 
subjective method of drawing-analysis utilized 
by Machover with the more objective, but also 
more cumbersome, techniques used by John N. 
Buck in his development of the HTP (House- 
Tree-Person) test. The psychiatrist, the psy- 
choanalyst and the clinical psychologist will find 
the book highly stimulating and well-worth read- 
ing. They will be grateful to the author for giv- 
ing us the benefit of her experience and insight 
in the use of the method. They will use it them- 
selves, many times with excellent results, sup- 
plementing other psychodiagnostic techniques. 
But still, they will probably exhibit a cautious 
and healthy skepticism in its use and consider its 
findings as tentative and subjective to confirma- 
tion by other diagnostic methods which have 
been more widely and thoroughly validated. I + 
feel certain the author, herself, would want her 
methods to grow into wider use in this conserva- 
tive and scientific way. The method is no sub- 
stitute for clinical insight. Rather it provides the 
shrewd and experienced psychodiagnostician 
with many valuable clues and leads into the 
underlying assets and liabilities of a human per- 
sonality and into the dynamics of its functioning. 
As such the method of analyzing personality 
through its projection into drawings of the hu- 
man figure deserves a place among the diagnostic 
weapons in the arsenal of the modern clinician. 

Am J Sociol 57:205-6 S '51. Olive West- 
brooke Quinn. Karen Machover has admirably 
delineated the uncertain stance of a relatively 
new technique for investigating personality. She 
obviously did not intend this little volume as a 
manual for administering and analyzing the 
“draw a person" test but rather for showing the 
potentialities and present shortcomings of a pro- 
jective technique for personality analysis. The 
author states forthrightly that she has no inten- 
tion of trying to communicate the method to 
the reader; one hopes, however, that, if this 
technique is as useful as its advocates claim, 
someone will present the principles clearly, stat- 
ing the criteria for analysis and the underlying 
rationale. The author suggests that such a work 
must await the outcome of experimentation now 
in progress or proposed ; certainly, experimental 
verification of the present tentative hypotheses 
must be undertaken if social scientists are to 
have confidence in the method. There are some 
omissions, however, which make the book un- 
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satisfactory. In general, the treatment of the- 
oretical considerations is adequate, but the con- 
ceptual framework has sometimes been neglected. 
While the author suggests a concept of the “nor- 
mal" personality and mentions “normality or 
adjustment indicators," the latter are not speci- 
fied, and the implication is clear that the “пог- 
mal" configuration is not one which can be 
ascertained in positive terms but simply one 
which is comparatively free from the indicators 
of abnormality. Mrs. Machover obviously recog- 
nizes this lack but seems to consider it relatively 
unimportant: "the differentiation of normal 
Írom abnormal by drawing analysis is a less 
real problem than the effectiveness of the method 
in determining the personality and dynamics of 
the behavior of the individual." It is true, how- 
ever, that at the present time clinical psycholo- 
gists are using this particular technique for 
making judgments that differentiate the normal 
and the abnormal. A further shortcoming is that, 
although it is quite clear that the sexes are ex- 
pected to draw differently, there is no firm 
statement of the varying significance of the 
indicators for the two sexes. Mrs. Machover 
points out that her data are more nearly complete 
for males than for females and that most of the 
interpretative data relate to drawings made by 
males, but the occasional mention of alternative 
explanations when drawings are made by female 
subjects becomes a distracting feature which 
contributes little to understanding the test. 
Omitting interpretation for female subjects com- 
pletely would have been better than this wholly 
inadequate treatment of the subject. Configura- 
tions of indicators are stated to be the proper 
basis for analysis; yet careful study of the sam- 
ple analyses fails to reveal these patterns and 
leaves the reader with a strong suspicion that 
conclusions are based upon a simple additive 
process rather than a weighing and balancing 
of indicators. A technique should not be aban- 
doned or refused recognition because it does 
not do everything. What the author succinctly 
calls “economy of method" makes it imperative 
that every effort be made to determine the valid- 
ity and reliability of the test and to perfect the 
technique if it can be shown to be dependable. 
A method of personality investigation which so 
simplifies problems of administration would be 
invaluable if it could be made precise and clearly 
communicable. 

B Menninger Clinic 15:75 Mr '51. Irene Hol- 
lingsworth. * a more organized presentation of 


personality analysis through interpretation of 
drawings of the human figure than has yet been 
available * While there may be doubt as to the 
validity of some of the particular findings, this 
book is an excellent way for the interested cli- 
nician to become further acquainted with ways 
of using a very useful tool, after which he can 
adapt it to his own particular needs. 

Brit J Psychol, Gen Sect 40:47-8 S "49. 
М (agdalen) D. V (ernon). * as the author her- 
self states, she has started from a particular the- 
oretical position, mainly psycho-analytic, with 
regard to the significance of many of the features 
of the drawings, for instance, as to their phallic 
symbolism. Working then largely from cases 
upon the nature of whose personalities and per- 
sonality disorders she has already decided, the 
author proceeds to conclude that particular fea- 
tures of the drawings demonstrate personality 
characteristics the existence of which she has 
already postulated. In this way she has, it is 
true, reached a fair agreement between drawing 
interpretation and clinical diagnosis. But it seems 
likely that it would be equally possible to start 
with some other theory as to the nature of the 
personality, and demonstrate with equal clarity 
its projection into the drawings. Nor is there 
any indication how far the drawing is actually 
carried out in accordance with schematic ideas 
originating within the individual; or how far in 
an attempt, varying with the individual's sophis- 
tication and skill, to make a representational 
drawing. Again, a determining effect may be 
exerted by social customs as to how the human 
figure should be drawn—differing surely in dif- 
ferent cultural groups. Considerations such as 
these must be taken into account in making ac- 
curate interpretations. Thus, however useful this 
method may be to the individual clinician in 
indicating particular characteristics of his pa- 
tients, its use as a general projection method for 
personality analysis is fraught with difficulty. 

Can J Psychol 3:242 D ^49. Noël Mailloux. 
[Review in French; editor's English abstract 
follows.] * this book is very useful for clinical 
psychology, from the technical point of view. 
Remarkable for its fullness and for the extreme 
simplicity of its treatment, it contains data sup- 
ported by observations accumulated over 2 
period of fifteen years. Its greatest merit lies 11 
the fact that the author has tried to formulate 
his explanations in dynamic terms. 

J Abn & Social Psychol 45:785-6 O '50. John 
E. Bell. This handsomely printed monograph 
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...contains....many suggestions for clinical re- 
search * The least satisfactory part....is the the- 
oretical discussion with which it begins. While 
the author manifests a sincere theoretical inter- 
est, and indicates that research to answer the- 
oretical questions is in progress, she seems in 
a hurry to proceed to the technical aspects of 
her method, and even in the section on theory 
introduces an illustration of interpretation which 
is nearly as lengthy as the discussion of theory. 
The basic theorem that drawing serves a func- 
tional purpose for the individual who is making 
the drawing is briefly noted. The author then 
dismisses previous literature in the subject with 
a few generalizations and two specific references 
and advances to the rewarding part of her mono- 
graph, the discussion of her clinical method. It 
is in the second part of the volume, the discussion 
of principles of interpretation, that the author 
hits her stride. * The author wisely and con- 
scientiously makes use of the verb “may” to 
indicate the tentativeness of her conclusions, 
since she has not here quoted any statistical basis 
for her interpretative cues. On several occasions 
she points out that her work is based upon pa- 
tient comparison of drawing indicators and case 
history material over a period of fifteen years. It 
is to be hoped that the details of such study may 
be presented at a future time in such a manner 
that independent evaluation of her conclusions 
may be accomplished. * Until validation studies 
are published it cannot be expected that this 
simple clinical tool will reach its full usefulness. 
As a guide for clinical experimentation with a 
promising and expedient diagnostic method this 
monograph amply justifies its publication. 

J Consult Psychol 13:223 Je ’49. Laurance 
Е. Shaffer. Because of the simplicity of its ad- 
ministration and the scope of its claimed use- 
fulness, the draw-a-person projective test is 
rapidly gaining a wide recognition. * There can 
be little question of the merit of the projective 
theory on which the test is based, that “the hu- 
man figure drawn by an individual....relates in- 
timately to the impulses, anxieties, conflicts and 
compensations characteristic of that individual. 
In some sense, the figure drawn is the person...” 
On the other hand, psychologists may rightly 
demand that authors supply them with more 
public evidence about a new method than the re- 
assurance that it was “developed....in the course 
of studying thousands of drawings in clinical 
contexts.” The monograph—alas, like many 
others in the projective field—abounds with de- 


tailed interpretive hypotheses unsupported by 
research data. In many instances quantitative 
criteria are not impossible to obtain, and real 
validation could be accomplished. Further, the 
cry against “atomistic” validation is unjustified 
when the interpretive suggestions are given in 
terms of the relationships of specific picture 
qualities to specific personality characteristics. 
*Disproportionately large heads will often be 
given by individuals suffering from organic 
brain disease....” [How often?] “Full lips, given 
to the male figure, are regarded as an effemi- 
пасу indicator..." [What is the x? with some 
reasonable criterion of masculinity-femininity? | 
« „Ае paranoid individual....gives much graphic 
emphasis to the eyes..." [What is the overlap 
of paranoid and non-paranoid groups when 
judged independently by this feature? | One may 
hope that the future of this promising technique 
will be marked, not by slavish adherence to 
formulas, but by sound research to determine 
its validity—and its limitations. 

J Crim Law & Criminology 40:782 Mr-Ap 
’50. Robert Н. Gault. * The present book in- 
cludes analyses of drawings made by several 
persons who had been charged with crime. The 
layman, on reading them, will naturally be puz- 
zled by such a statement as the following relating 
to a drawing that had been made by a young 
man who wasa car thief (Page 131) : “The omis- 
sion of hands would relate to the patient’s guilt 
regarding his car stealing and also regarding his 
lack of achievement. (Guilt aggravated by the 
demands of his guardians.) He had never made 
an adequate adjustment to people or things.” 
Couldn’t hands be omitted from the drawing 
merely because the drawer found it difficult to 
include them? In another case “Regression and 
collapse of judgment are seen in....the confu- 
sion of profile and full face... (Page 119).” Could 
this peculiarity, also, be due to lack of skill in 
drawing? Somewhat analogous questions were 
asked years ago when the Binet Tests were new. 
Untiring research has answered them, or sub- 
merged them in practical results. It is to be 
hoped that such an outcome will be forthcoming 
in relation to Projective tests. 

Psychol В 47:84-5 Ja’50. Lillian Wald Kay. 
It would be unfortunate if we came to feel that 
our techniques in personality study were apart 
from our theories. If theories are valid, they 
should be demonstrable. If techniques are valid, 
it must be because they stem from some hypothe- 
sis which is basically true. This should hold for 
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all aspects of our research even when practical 
considerations are most important, as it is un- 
derstandable that they should be for the clinician 
whose primary concern is for the patient's wel- 
fare. It is, therefore, to be deplored that a promi- 
nent clinician, describing an important projec- 
tive device, should state that: "Incentive for, 
and primary focus of, investigation centered 
around perfection of the drawing technique as 
a clinical tool for personality analysis, rather 
than around any theoretical hypotheses" (p. 20). 
The statement is, of course, not adequately de- 
scriptive of the content of the book. The first 
part, “Personality Projection in the Drawing of 
the Human Figure,” actually has a section on 
theoretical considerations. The second part, 
“Principles of Interpretation,” draws heavily on 
psychoanalytic theory, as well as on empirical 
clinical experience. The third part, “Illustrative 

_ Case Studies,” is an attempt to relate the theory 
of projection in drawing the human figure to 
examples of drawings accompanied by case his- 
tories. * [Nevertheless] the method seems to 
have considerable validity in clinical procedure. 
As one reads the illustrative cases, Dr. Mach- 
over's analyses can be followed without much 
argument. The question which often arises in 
connection with new projective techniques— 
“Would I see that if I didn’t know the case 
history ?”—is answered by her report on “blind” 
interpretations (pp. 25 to 27). The drawing of 
the human figure is an extremely intriguing pro- 
jective device. It is, admittedly, still in its in- 
fancy and communication of method is limited, 
as for most techniques which emphasize pat- 
terns of relationship rather than "scoring." It 
is very much to be hoped that readers will be 
stimulated by this book to further study and 
research. Dr. Machover introduces some chal- 
lenging experimental problems. In spite of the 
format of the book, it is a progress report rather 
than a manual. It would be undesirable, with so 
many people working on this new method, to 
accept this report as a manual and mark the 
problem solved. 

Psychosom Med 12:138-9 Mr—Ap ' 50. Samuel 
Waldfogel. * The assumption underlying this 
method as stated by the author (page 35) is 
“that the human figure drawn by an individual 
who is directed to ‘draw a person’ relates inti- 
mately to the impulses, anxieties, conflicts, and 
compensations characteristic of that individual." 
Support for this assumption is presented in the 
form of some eight case histories where clinical 


material is presented side by side with the draw- 
ings and their interpretation. While these do 
not lack conviction, they cannot be accepted as 
sufficient validating data. The only deference to 
accepted methods of test validation is made (p. 
26) where the author refers to a brief experi- 
ment conducted by herself and another compe- 
tent judge, in which an attempt was made to 
match drawings with case records. We are told 
that '...a degree of accurate matching was 
achieved that was much better than chance." No 
specific figures are given. In regard to test re- 
liability the author assures us that drawings 
from the same individuals obtained over long 
periods may be so much alike as to constitute 
personal signatures. Again no supporting data 
are given and no mention is made of the degree 
of consistency among interpreters. Allusions are 
made to studies in progress that should ulti- 
mately supply this information but it would 
seem that at our present stage of methodological 
sophistication in personality measurement, an 
introduction to a new test should contain at 
least preliminary data of this sort. * Each char- 
acteristic of the drawing is considered separately 
and its implications are enumerated. Many of 
the statements made have an almost glib quality 
and some are startling in their resemblance to 
physiognomic stereotypes. This similarity exists, 
we are told, because the average individual has 
been exposed to these stereotypes and has either 
consciously or unconsciously adopted them. 
Again the argument would be more convincing 
if supported by precise and detailed documenta- 
tion. A fair idea of the writer's method of ex- 
position may be gained by the following excerpts 
which are typical: “The concave or orally re- 
ceptive mouth is frequently encountered in the 
drawings of infantile, dependent individuals...” 
(p. 44) ; “The eyes are the chief point of con- 
centration of the feeling of ‘self’ and the vulner- 
ability of the self,” (p. 47) ; “Тһе Adams apple 
..has been seen mostly in the drawings of 
young males as an expression of a strong virility 
or masculinity drive,” (page 58) ; and “Severely 
shaded or reinforced fingers are generally re- 
garded as guilt indicators.” (page 64). To some, 
these statements will have a high degree of 
plausibility ; to others, they will sound like non- 
sense. However, regardless of their truth or 
falsity, it is certain that such generalities are 
more easily made than proved. Although one 
may quarrel with the nature of the evidence 
that Dr. Machover presents to support her as- 
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sertions, there can be no question of her in- 
genuity and originality. She has, through the 
careful observation of a very ordinary phe- 
nomenon, laid down a new path in the field of 
personality investigation. While it is too early 
to say where this will eventually lead, it beckons 
us to explore it further. Practitioners, in the 
meantime, are proffered another instrument to 
assist them in their work. While this technique 
will not give them the precision in personality 
description and measurement that is so sorely 
needed and so sadly lacking, it will provide them 
with an additional projective method for sup- 
porting their diagnostic impressions. 

О R Psychiatry & Neurol 4:252-3 Jl °40. 
Isabelle V. Kendig. The significance of this book 
hinges upon the validation of the basic hypothesis 
upon which it is developed, viz., that drawings 
of the human figure represent projections of the 
subject's own attitudes, particularly his concep- 


tion of the body image. То be sure, there has 


been general validation, by Allport and others, 
that all expressive movements, whether in gait, 
gesture, or handwriting, reveal intrapsychic ten- 
sions, conflicts, and formulations, compensatory 
or otherwise, of the Ego role. To this extent the 
body-image of the individual may reasonably 
be expected to emerge in response to the direc- 
tive to “draw a person." Where validation is 
specifically lacking....in regard to the meaning to 
be attached to any given detail, for instance, that 
a concave mouth indicates fixation at the oral 
level, that buttons signify mother-dependence, 
and elongated feet or a conspicuous neck-tie are 
phallic symbols. In general, psychoanalysis, of 
which such drawing interpretation must be con- 
sidered at least in part derivative, has escaped 
from its bondage to fixed symbols. While for- 
merly all long, narrow objects—swords, guns, 
sceptres, etc., were wont to be accepted as sym- 
bols of the male sex organ and all broad objects 
—bowls, boxes, rooms, etc., as symbols of the 
female genitalia, no longer are such generic 
meanings accepted. Instead, there has been a 
growing realization of the extremely individual- 
istic language of symbolization. Long antedat- 
ing psychoanalytic doctrine, however, are the 
theories of constitutional typology which in our 
own culture may be traced from the specula- 
tions of Hippocrates and Galen through the 
French, German, Italian and Swiss schools to 
the recent work of Sheldon in this country. Run- 
ning through them all is the assumption that 
physique and character are inextricably related. 
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Tending to substantiate these typological the- 
ories, essentially akin however differently ver- 
balized, we find an underlying belief of men 
everywhere that certain specific features are ex- 
pressive of significant personality traits—thin, 
compressed lips of cruelty, a broad forehead and 
well-spaced eyes of frankness, generosity and 
nobility, a strutting gait of braggadocio. Upon 
the virtually universal acceptance of such “stere- 
otypes” do literature and drama depend to con- 
vey meaning swiftly and economically. Given , 
such consensus as to the meaning of various 
physical features on the one hand, and the psy- 
choanalytical concept of projection and symboli- 
zation, with such general psychological valida- 
tion as has been given to “expressive movement,” 
on the other, the twin supports upon which this 
experimental drawing technique rests become 
clear. The book itself should be regarded only 
as a progress report. Recurrently it is stated 
that research is going on to answer questions 
as to the constancy of projection, the significance 
of mood, and, through the use of associative 
material, the meaning to the individual of the 
stereotypes employed. To date the chief valida- 
tion, as in psychoanalysis, and in such other pro- 
jective techniques as the Thematic Appercep- 
tion Test, has come from success in personality 
evaluation and diagnosis especially in so-called 
“blind” analysis and from the improvement fol- 
lowing treatment based upon it. The detailed 
presentation of the interpretive concepts utilized 
by the writer gives other workers an opportunity 
to test their usefulness and thus the publication 
of the book at this time seems justified. 

Rorsch Res Exch & J Proj Tech 12:259-60 
no 448. Hanna F. Faterson. Fifteen years of 
research have preceded the publication of Mrs. 
Machover's book * Obliged because of require- 
ments laid down by the publishers....the author 
was forced to condense her presentation. She 
concentrates on the basic formulations of inter- 
pretative principles. Those who look for tables 
of intercorrelations, levels of confidence, and 
traditional data on reliability and validity will 
be disappointed. But those who are clinically 
oriented and who are experienced in the use 
of projective techniques will appreciate this in- ` 
valuable addition to methodology which enables 
us to understand what the patient tells us about 
himself in his graphic projections. The funda- 
mental working hypothesis underlying the analy- 
sis of drawings is “that the human figure drawn 
by the individual who is directed to ‘draw a per- 
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son' relates intimately to the impulses, anxieties, 
conflicts, and compensations characteristic of 
that individual" (p. 35). The data consists of 
the drawings of a male and a female figure in 
the order determined by the subject, and of a 
fairly detailed list of questions, amplified and 
modified as needed. The questions are intro- 
duced with the instructions to “make up a story 
about the person as if he were a character in a 
novel or a play." The author's observation that 
resistance to these tasks is likely to be a function 
of timidity on the part of inexperienced ex- 
aminer is born out by the experience of many 
who have used this and other projective meth- 
ods. * The volume is divided into three parts— 
introductory material, principles of interpreta- 
tion, and illustrative cases. The didactic section 
on interpretative principles is necessarily "frag- 
mented" into analysis of specific items. In a 
sense, such a presentation may be thought of 
as parallel to the teaching of Rorschach scoring 
as part of Rorschach interpretation. Actually, 
mastery of these items no more guarantees suc- 
cess in drawing analysis than mastery of scoring 
and interpretation of individual scoring symbols 
guarantees skillful use of the Rorschach method. 
In neither case can a list of “signs” yield either 
diagnostic impressions, or insight into personal- 
ity. “As with....all projective tools, grasp of 
mechanical details...cannot substitute for the 
knowledge of personality dynamics and clinical 
syndromes which is so indispensable to the 
proper use of the method (p. 22)." The case 
studies, which take up fully one third of the 
book, are presented with related clinical history, 
and with clear reproductions of the drawings. 
In analyzing these, the author goes far beyond 
the preceding text, showing the method at work. 
Anyone seriously interested in using the tech- 
nique would do well to study these analyses care- 
fully, since they constitute the "meat" of the 
book. These eight cases also illustrate the validity 
of the method, since they contain a detailed 
demonstration of the relationship between 
graphic projection and clinical data. A detailed 
index of twenty-one pages will prove a boon 
to anyone who actually works with drawings, 
and who is constantly running into a snag with 
such items as "symmetry," "reinforced line," 
"hair emphasis," "evasiveness," or any other 
feature under consideration. * in the experience 
of the reviewer and many colleagues who have 
used the method for several years, the Machover 
Figure Drawing Test has already established 


itself as an indispensable part of a battery of 
projective methods used in personality evalua- 
tion. Drawings invariably make their unique 
contribution to the total personality picture: an- 
swering questions raised by other tests, calling 
attention to factors underplayed or not mani- 
fested elsewhere, clarifying or corroborating, by 
means of a different medium of expression, evi- 
dence from other tests, and, like the Rorschach, 
varying in richness, clarity, and eloquence from 
case to case. 


[113] 
*Make A Picture Story. Adolescents and adults; 
1947-48; commonly called MAPS; a measure of for- 
mal psycho-social aspects of fantasy production; indi- 
vidual; figure cards ('47) ; background pictures ('47) ; 
figure location sheets ('47); manual ('48; see 7 be- 
low) ; $13 per complete set of testing materials and 25 
figure location sheets; $11 per set of testing materials, 
excluding manual, and 25 figure location sheets; $1.50 
per 25 figure location sheets; theatre and carrying case 


1 d $13.50 when ordered with set, $15 when 
ori 


ered separately; postpaid; nontimed (45-90) min- 
utes; Edwin S. Shneidman; Psychological Corpora- 
tion. 
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ALBERT I. Rapin, Associate Professor of Psy- 
chology, and Director, Psychological Clinic, 
Michigan State College, East Lansing, Michi- 
дап. 

This instrument is a variation upon the The- 
matic Apperception Test. Whereas ТАТ con- 
sists of some populated and some unpopulated 
pictures, MAPS gives the subject an opportunity 
to construct his own dramatic situation with a 
given supply of backgrounds and people and to 
tell his story about them. 

The test material includes 22 background pic- 
tures, some of which are highly structured (liv- 
ingroom, bathroom, etc.), while others are un- 
structured and vague (dream, stage, etc.). One 
blank background is included. Sixty-seven card- 
board cutout figures (about 5 inches in height) 
in various poses and with different facial ex- 
pressions (including several with blank faces) 
make up the population from which the testee 
may select his dramatis personnae. The 67 fig- 
ures fall in the following categories: males (19 
figures), females (11), indeterminate as to sex 
(2), children (12), legendary and fictitious (6), 
animals (2), minority figures (10), and blank 
faces (5). The examinee is instructed to place 
one or more of the figures on the given back- 
ground and to tell a story about it. The rest of 
the instructions concerning the story are similar 
to those given with TAT. A special Figure Lo- 
cation Sheet with reproductions of the back- 
grounds is used for the recording of the sub- 
ject's choice of figures and their placement in 
the picture. 

Investigation of “psycho-social aspects of 
fantasy production" is the chief aim of the author 
of MAPS. Consequently, formal scoring cate- 
gories which bear upon the problem of inter- 
personal relationships have been devised. The 


scoring categories deal with such questions as. 


the number of figures used by the testee, the 
number of times the same figure was used, the 


actual figures selected, the activity and -inter- 
action of the figures, their meaning, and so on. 
Out of a total of about eight hundred "signs" 
derived from the scoring categories some several 
dozen differentiate to a statistically significant 
degree between schizophrenics and normals. In 
addition to the "formal" sign approach, it is 
also recommended that the stories obtained be 
treated like TAT stories, for the interpretation 
of personality dynamics. 

For clinical use, usually, only 11 background 
pictures are used. It is advocated that they 
include some of the unstructured pictures as well 
as the blank one. Flexibility in the choice of 
backgrounds to fit the particular case being 
studied is recommended. 

So far, the use of MAPS has been rather 
limited. The sign approach for differential diag- 
nosis is time-consuming ; in addition to the hour 
and a half that it takes to administer the test, 
some time must be spent in sign counting. More- 
over, the range of psychopathologic populations 
upon whom such data have been obtained is lim- 
ited. No material on women is available at pres- 
ent. If the less "formal" story-interpretation 
TAT-like approach is used, no distinct advan- 
tage over the more time-honored test is evident 
or has been reported. 

Though it is questionable whether MAPS, at 
its present stage of development, offers much 
that is new to the busy clinician that may reward 
him for his time, further research with it may 
uncover greater potential. The opportunity that 
the subject is given to construct his own dramatic 
situation may be an advantage over TAT, facili- 
tating the projection of an “inner world” and 
leading the clinician to deeper insights. 


Cartes R. STROTHER, Professor of Clinical 
Psychology, University of Washington, Seattle, 
Washington. : 

The MAPS test was developed initially as a 
method of studying schizophrenic phantasy. The 
subject selects one or more of 67 cutout figures, 
which he places on one of 22 background pic- 
tures selected by the examiner. The subject then 
constructs a story, as in the Thematic Apper- 
ception Test. A record blank is provided, оп · 
which each of the background pictures is dupli- 
cated and provision is made for noting the choice 
and placement of the figures used with each 
background. The instructions and inquiry fol- 
low the usual TAT form and the story protocols 
are recorded verbatim. 
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In addition to modifying the TAT materials, 
Shneidman has developed a method of “formal 
analysis" to supplement the conventional analy- 
sis of thematic content. Formal analysis is defined 
as "analysis..in terms of which figures are 
chosen, how many are chosen, where they are 
placed on the background, how they are handled 
by the subject and what relationship they bear 
to each other." For purposes of the formal analy- 
sis, a comprehensive list of 800 "signs" (i.e., 
categories of number, selection, placement, ac- 
tivity and interaction of figures, etc.) was drawn 
up on an a priori basis. The frequency of occur- 
rence of each of these signs in the records of an 
experimental group of 50 psychotics (mainly 
paranoid schizophrenics) was compared with a 
rather carefully screened and reasonably well 
matched group of “normal” controls. The sig- 
nificance of the difference between percentages 
of the two groups was determined for each of 
the 800 signs. Sixty-four of the signs were sig- 
nificant at the 10 per cent level—42 “normal 
signs” and 22 “psychotic signs.” These signs 
were weighted plus one and minus one respec- 
tively and “difference scores” were then com- 
puted for each of the тоо subjects. 

In an attempt to validate the signs, both 
groups were split in half and a new critical ratio 
for the difference in proportions was determined 
for each of the 64 signs which had been found 
significant for the group as a whole. Differences 
were significant (10 per cent level) for 51 of 
the 64 signs. These 51 signs were then applied 
to the second halves of the two groups and sig- 
nificant differences were found on 29 signs (22 
normal and 7.psychotic signs). “Difference 
scores” discriminated significantly between both 
experimental and both control groups, with non- 
significant differences between the two experi- 
mental groups and between the two control 


groups. This is, of course, validation in a very 


restricted sense and the manual carefully states 


the limitations of the sign approach and of the 
data presented. No attempt is made to present 
an analysis of thematic content, which would 
probably have been of more interest to prospec- 
tive users than the formal analysis. 

The MAPS is somewhat more difficult to ad- 
minister than the TAT because of the necessity 
for manipulation of the figures and for formal 
as well as content recording. With many indi- 
viduals, however, this disadvantage is out- 
weighed by the fact that the MAPS test is more 
stimulating than the TAT. This is particularly 


true of adolescents and insecure or inhibited pa- 
tients. There is the further advantage that the 
materials may be used as “miniature psycho- 
drama” for therapeutic purposes. 


For related reviews, see 114. 


[114] 

*[Re Make A Picture Story Test.] SHNEIDMAN, 
Epwin S. “Schizophrenia and the MAPS Test: 
A Study of Certain Formal Psycho-Social As- 
pects of Fantasy Production in Schizophrenia as 
Revealed by Performance on the Make a Picture 
Story (MAPS) Test.” Genetic Psychol Monogr 38: 
145-223 N '48. * (PA 23:2816) 


J Personality 18:385-7 Mr ’50. Robert R. 


Holt. * Enough evidence is presented to support 
the contention that the MAPS Test really has 


something that the TAT does not, and thus to 
justify it as a contribution to the only too rap- 
idly growing heap of devices available to the _ 
clinical psychologist. After reading through the — 
monograph, one feels that the test itself remains 
the author's most solid contribution. As a manual 
to accompany the test....Schizophrenia and the 
MAPS Test suffers from its attempt to be a 
research report. Considered under the latter 
heading, it is painstakingly clear and comprehen- 
sive—so much so that its deficiencies of method 
are readily apparent. Only a few of its nearly 
80 pages present the kinds of information one 
wishes from a manual, although in this space it 
does provide more useful information than one 
usualy gets with a new projective technique 
these days. There are norms for many formal 
aspects of the test performance, based on 50 
schizophrenics and 50 nonneurotic veterans 
temporarily hospitalized for somatic conditions. 
(The selection of these normal controls was un- 
usually well handled, and may serve as a mod 
for similar studies.) The materials are carefully 
described ; there are a few pages on the principal 
aspects of administration, and brief mention of 
uses other than in psychodiagnosis— principally 
as an aid in psychotherapy. The unique feature 
of the MAPS Test is that the subject partici- 
pates actively in structuring the scene about 
which he builds a story. * this requirement of an 
incitation to active participation would appear 
to be an important asset of the test. It is regret- 
table that the monograph is devoted entirely to 
the formal features of the test performance. t 
is acknowledged that the thematic content of 
‘the stories that result is the most important as- 
pect of the test, but we are told nothing about 
the ways the author would suggest that content 
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should be analyzed or interpreted. There are 
only a very few indications of the "popular" 
stories told to a few particular backgrounds, 
and nothing about "popular" interpretations of 
the numerous figures. The variety of materials 
will probably make it difficult to assemble useful 
norms. Nevertheless, it is to be hoped that nor- 
mative studies of the kinds mentioned will ap- 
pear soon, to shorten the otherwise lengthy 
period of groping on the part of the clinician 
who attempts to use the technique. A few words 
about the book as research; The design is ade- 
quate enough, if conventional, and well carried 
out. Limited as it was to the attempt to find test 
*signs" which would differentiate normal from 
schizophrenic subjects, the study could hardly 
contribute much to our understanding of schizo- 
phrenic fantasy. * It is in the use of statistics 
to demonstrate the differentiation of the groups 
that the research is weakest. The decision to 
treat the data in terms of percentages and their 
standard errors was not a happy one; it had the 
result that the significance of many differences 
involving extreme proportions was underesti- 
mated ; that of differences between proportions 
near 50 per cent was often seriously overesti- 
mated. The calculation of direct probability via 
binomial expansions would have avoided these 
difficulties. The procedure was to identify all 
signs which gave even a slight degree of dif- 
ferentiation, then to combine them (without 
weighting) into a total score. The distributions 
of the resulting scores show an impressive 
amount of overlap, considering the extreme na- 
ture of the groups in question. In the range 
where the scores of the two groups overlap fall 
46 per cent of the “normals” and 68 per cent 
of the schizophrenics. Of course, much of this 
result is due to a few atypical extremes in both 
groups ; their medians are still quite significantly 
differentiated—if one assumes that the sign- 
scores constitute an equal-interval scale, The 
author admits, however, that they do not; he 
gives this inequality as a reason for working 
with medians instead of means, but then goes 
on to perform the same operations (in obtain- 
ing a critical ratio) that he forbade himself in 
getting a measure of central tendency. Other 
examples of faulty statistical reasoning could be 
cited, but would not greatly affect the general 
picture of the research. Despite the weakness of 
the original research in the context of which it 
is presented, the MAPS. Test shows distinct 
signs of being an important contribution to psy- 


chological testing. The fact that so young a tech- 
nique could have been the basis for a special 
group of papers at the 1949 APA meetings is 
testimony to the vigor of Dr. Shneidman's brain 
child. We may confidently expect to see it used 
in more significant researches in the future, as 
well as in diagnostic and therapeutic work. 
TAT Newsletter 4:7-9 su ' 50. John E. Bell. 
Under normal circumstances, a monograph of 
this dimension would scarcely merit independ- 
ent review. As a report of a single clinical ex- 
periment, it follows the traditional pattern of 
carefully defining a problem, reviewing previous 
literature, describing procedures and popula- 
tions, presenting data, and arriving at conclu- 
sions, which in this instance are not of singular 
importance. Because the monograph also serves 
as the manual for a new and important projective 
method, the Make-A-Picture Story (MAPS) ` 
Test, it becomes worthy, however, of especial 
attention. It is the chapter on materials and 
procedure (III) and that on quantitative re- 
sults containing the MAPS Test Signs (V) that 
will have the most interest for the clinician. * 
the author has focused his attention on formal 
analysis of the protocols of matched groups of 
50 psychotic patients, primarily paranoid schizo- 
phrenics, and 5o hospitalized non-psychotic 
patients. By formal analysis he means the study 
of "which figures are chosen, how many are 
chosen, where they are placed on the back- 
ground, how they are handled by the subject, 
and what relationship they bear to each other." 
He has developed a list of signs, or objective 
scores (800 in this study) by which such analy- 
sis might progress. While recognizing that con- 
tentual analysis, after the manner of TAT inter- 
pretations, would probably provide the major 
diagnostic value of the MAPS Test, he bases 
his study on the objective aspects of the fantasy 
creations, claiming rather broadly that “the 
unique contribution of this study lies in the in- 
terpretation of the formal aspects of fantasy pro- 
duction." While the MAPS Test does permit of 
ready analysis by formal techniques, it is to be 
recognized that this is not a unique property of 
this test. Formal analysis is a characteristic, if 
not always explicit, interpretative method of all 
projective techniques, including all those which 
are based on fantasy productions. While formal 
analysis in the MAPS Test is not a unique 
interpretative method, the monograph illustrates 
that it may be possible to develop it here with 
greater ease than in other techniques because of 
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the large number of quantifiable results. It must 
be stressed, however, as the author has stated, 
that use of the MAPS Test for formal analysis 
will depend upon the multiplication of normative 
studies with many clinical groups. This is one 
of the handicaps in using formal analysis in any 
projective device, a limitation that applies with 
as much force to the MAPS Test as to any other. 
The MAPS Test, as Klopfer has indicated in his 
Preface to this monograph, adds “information 
not easily revealed by existing techniques” ; and 
lends "itself readily for inclusion in a well inte- 
grated battery." One of the advantages observed 
in the use of the Test is that the subject, in telling 
a story about a picture he has created, has po- 
tentially less stress in revealing fantasy than 
when he is required to associate to a more or- 
dered stimulus as in the TAT. This may lead to 
. greater productivity in fantasy ; it may, however, 
be disadvantageous in not reflecting as readily a 
subject’s reactions under pressure. The freedom 
of choice of figures would seem also to enhance 
"the possibilities...for personal identification 
and interpersonal projections." The inclusion 
of figures representing minority groups serves 
to make the Test of usefulness in the study of 
racial attitudes and tensions. The materials are 
of wide enough interest to make them applicable 
to the study of children, adolescents and the 
total adult range. Further uses of the materials 


for a sorting test and for a “miniature psycho- 


drama" have been suggested by the author. Dr. 
Shneidman's study well illustrates another po- 
tential use of the Test. In his final chapter he has 
attempted to generalize on the characteristics 
of schizophrenic fantasy and behavior on the 
basis of the signs that significantly (i.e., in this 
study below the 10% level of confidence) differ- 
entiate schizophrenics from normals. Thus he 
tentatively concludes that schizophrenic fantasy 
is marked by “(a) intra-group variability; (b) 
self-identification; (c) variability of identifica- 
tion; (d) social isolation; (e) over-inclusion ; 
(f) inappropriateness; (g) symbolization; (h) 
desire for environmental simplification ; (1) in- 
hibition of fantasied violences; (j) punitive con- 
science; (k) lack of identification with normal 
masculine role; (1) religiosity ; and (m) debase- 
ment of women." While he has overextended his 
conclusions by failure to quality that his groups 
of schizophrenics is a paranoid group, and has 
spoken of his conclusions as relevant to “schizo- 
phrenia," he has demonstrated that as a tool for 
research into the nature of fantasy behavior the 


MAPS Test may be of more than considerable 
value. The monograph has thus served to intro- 
duce effectively a new technique that appears 
distinctly worthy of further experimentation and 
clinical application, 
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XThe Mosaic Test. Ages 4-adults; 1930-51; 1 form 
['30]; no data on reliability and validity; no manual; 
no norms; [revised] directions for administering ['s1] ; 
$37.50 per complete double set of testing materials ; $24 
per complete single set; postpaid; nontimed (20) min- 
utes; Margaret Lowenteld ; distributed by Psychologi- 
cal Corporation. * Y 
REFERENCES 
, T P pum E Verc Фе ey Test." Am 
rthopsychiatry 9:232-6 Ja '39. 13:4 

2. WERTHAM. dM A ox: reas S Differential- 
Diagnostic Method of Interpreting Mosaics and Colored Block 
Designs," Am J Psychiatry 98:124-31 Jl "41. * (PA 16:10 5) 

3. DIAMOND, BERNARD L., AND SCHMALE, HERBERT Т. Di е 
Mosaic Test: 1, An Evaluation of Its Clinical Application," 
Am 7 Or 14:237-50 Ap '44. * (PA 1833142) : 

4. Himmetweit, Н. T., лмо Eysenck, Н. J. “Ап ХШ. 
mental Analysis of the Mosaic Projection Test." Brit J Med 
Psychol 20:283-94 О '45. * (PA 20:1156) 5 

5. Bett, JOHN ELDERKIN. S 
pp. 410-20. Їп his Projective Techniques: A Куна, Appro 
to the Study of the Personality. New York: Longmans, Green 

Co., Inc., 1948. Pp. xvi, 533. * (London: Longmans, Green 
& Co. Ltd., 1949.) (PA К Ж ЕЕ С ; 

6. Coum, Hanna. “The Value of Projective Methods in the 
Psychological Examination of Children: The Mosaic Test in 
Conjunction With the Rorschach and Binet Tests." Rorsch Res 
Exch & J Proj Tech 12:216-37 no 4 '48. * (PA 24:1194) 


7. GORMAN, Mary Млкслкет, An Evaluation of the Mosaic 
Test Grade School Children. Master’s thesis, Catholic 
Uni of America (Washington, D.C.), 1948. (PA 23: 


2990, title only) " 
8. BELL, Jouw ErpznKIN. “The Case of Gregor: Psychologi- 
cal Test Bata,” DES ites Exch & J Proj Tech 13:155-205 
no 2 "49. 24:2589 Я 
9. МсСоітоси, ATHOMAS L., AND GIRDNER, JOHN В. Use of 
the Lowenfeld Mosaic Test With Mental Defectives.” Am J 
Moser DU 53:486-96 Ja '49. * (PA 23:3271) 4 


bility and Validity." Discussion by Anni Weiss Frankl. Am 
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sonality. Edited by Lawrence Edwin Abt and Leopold Bellak: 
New York: Alfred A. Knopf, Inc., 1950. Pp. xvii, 485 XO 
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*Plot-Completion Test. Grades ¢ 
guised personality test to be administered in an Le 
lish class by an English teacher to elicit more honest 
answers; І form ['46]; manual ['46]; $1.50 per 25i 
IOÉ per manual; 156 per specimen set; postpaid; B 
timed (45) minutes; Sarah I. Roody; W. Wilbur Hat- 
field, 211 West 68th St., Chicago 2r, Ill. * 
REFERENCES in 
т. Roopy, Saran I. “The Plot-Completion Test: For Дейк 
High Schools by Guidance Counselors, School Psychiatr 4 
ead: Teachers of Literature.” J Exp Ed 12:45-7 S '43- 
18:1263) dad 
2. Коорү, Saran I. “Plot-Completion Test." Eng! J 34: 
5 My 'as. * (PA 19:2760) 


Ковевт C. CHALLMAN, Clinical Psychologist, 
301 Kenwood Parkway, Minneapolis, Minne- 
sota. + 1 
'This test consists of 10 incomplete stories 
about high school pupils who get involved in 


Chap. 20, “The Mosaic Test," * 
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minor conflict situations. Five endings are given 
for each story and the testee is asked to rank 
each ending having in mind its trueness to life 
and the extent to which it is in line with expecta- 
tions based on the personalities of the characters. 
Two of the endings to each story are realistic, 
one contradictory to the facts, one a fortunate 
coincidence, and one “morbid or sadistic.” It is 
expected that the pupil will identify himself with 
the central character and thus reveal his per- 
sonality. The fact that it is to be given in an 
English class by an English teacher is thought 
to reduce the wariness of the testees. 

The split half reliability is reported as .84 
though there is no indication as to the group 
upon which this figure was obtained. Its validity 
was assessed by correlating the scores of 9r 
pupils with their scores on Noll's What Do You 
Think? A Test of Scientific Thinking (r “be- 
tween .14 and .78 on the one percent level"). 

The incomplete stories appear to the reviewer 
as being interesting to high school pupils and 
the endings are apparently in keeping with the 
author’s intention. Whether the test has any 
merit as a personality test cannot be assessed 
since the only evidence of validity given is the 
relatively low correlation with a test which can 
hardly be considered a criterion. The idea upon 
which the test is based is good, but unfortunately 
many ideas which appear to be good turn out to 
be valueless when put to the proof. 


Percrvat M. Ѕумомрѕ, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. 

This test is essentially a measure of certain 
outcomes in English instruction, but its author 
recognizes that the test also has some projective 
possibilities. It may be used either as a teaching 
device, by having pupils discuss the several al- 
ternatives, or as a measure of improvement, in 
which case the test naturally should not be dis- 
cussed, 

The test consists of 10 brief unfinished inci- 
dents, episodes, or plots with five possible end- 
ings for each. The subject is asked to rank each 
of these endings “according to how probable you 
think it is as an outcome of the given situation.” 
It is suggested that those who finish the test early 
go through it again marking with a star the 
ending that would make the best story for each 
of the 10 plots and with a check the ending that 
seems to be the most advisable course of action. 
The difficulty that most individuals have in 


ranking five items (as contrasted with selecting 
best and worst or even rating) seems not to 
have been considered by the author. 

In every story, two of the endings are “not 
only possible but highly probable.” One ending 
disregards some of the facts in the story, another 
presents an “extraordinarily fortunate coinci- 
dence.” In several of the stories a closing “vio- 
lates some fundamental law of nature in order 
to bring about a happy ending.” Some alterna- 
tives represent “morbid or sadistic” outcomes. 
A “morbid” ending is one in which a person 
becomes ill, receives a severe punishment or in- 
dulges in strong aggression. 

A split half reliability coefficient of .7168 
(sic!), raised to .8350 by the Spearman-Brown 
formula, is reported for an undefined group. The 
author reports that errors made on the test tend 
to be similar to those made in short stories actu- 
ally written by the same subjects. This reviewer ' 
is mystified by the statement that the test serves 
as a measure of adjustment to life because it 
correlates with Noll’s What Do You Think? A 
Test of Scientific Thinking (see 40: 1263). The 
only norms presented are median scores for a 
twelfth grade group. The size and composition 
of this normative group are not reported. 

This test undoubtedly has some value for an 
English teacher in indicating how realistic and 
logical a student is in creating the structure for 
a short story. Its value as a projective technique 
remains to be demonstrated. Cursory inspection 
leads one to believe that the several alternatives 
are so concerned with logical and moral factors 
that the test would have only limited usefulness 
as an instrument for revealing personality trends 
or the quality of a pupil’s “adjustment to life.” 
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Rorschach. Ages 3 and over; 1921 to date; also com- 
monly referred to as Rorschach Method, Rorschach 
Test, Rorschach Ink Blot Test, Rorschach Psychodiag- 
nostics, and Rorschach Method of Psychodiagnosis ; 
many variants have appeared such as Harrower Group 
Rorschach, Harrower Multiple Choice Test, Monroe's 
Inspection Technique, Grassi’s Graphic Rorschach, 
and Marseille Rorschach Mail Interview; many modi- 
fications of the variants are in use, such as the Ampli- 
fied Multiple-Choice Rorschach and the Ranking Ror- 
schach Test; in addition to the Rorschach original 
Psychodiagnostic Plates, parallel plates have been pre- 
pared by Zulliger and Harrower and Steiner; Her- 
mann Rorschach. n 

а) PSYCHODIAGNOSTIC PLATES, FOURTH EDITION. 1921- 
45; $10 per set of 10 cards mounted on heavy card- 
board; $3 per pad of 100 record blanks; $6.50 per copy 
of the 1949 English translation of Rorschach's Psycho- 
diagnostics, Fourth Edition; Hermann Rorschach; 
Grune & Stratton, Inc. ; 
b) BEHN-RORSCHACH TEST. 1942; а parallel set of ink 


ы 
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blots ; $10 per set of то cards; $3 per pad of 100 record 
blanks; $4.75 per manual; Hans Zulliger; Grune & 
Stratton, Inc. 

C) PSYCHODIAGNOSTIC INKBLOTS. 1945; a parallel set of 
ink blots; $5.70 per set of 10 cards; $2.10 per manual ; 
M. R, Harrower and M. E. Steiner; Grune & Stratton, 
Inc. (These ink blots on slides for use with standard 
projectors are distributed by Psychological Corpora- 
tion at $20 per set, postpaid, for glass, slides and $10.50 
per set, postpaid, for Kodaslides.) 

d) HARROWER'S GROUP RORSCHACH. Ages 12 and over; 
1941-45 ; $20 per set of the original Rorschach ink blots 
on slides for standard projector, postpaid; $10.50 per 
set of ink blots on Kodaslides, postpaid ; $3.75 per 25 
record blanks, postpaid; (75-90) minutes; M. R. Har- 
rower and M. E. Steiner; distributed by Psychological 
Corporation. 

€) HARROWER'S MULTIPLE CHOICE TEST. Ages I2 and 
over; 1943-45; same slides as for Harrower’s Group 
Rorschach; $1.75 per 25, postpaid; 30¢ per specimen 
set, postpaid; (45) minutes; M. R. Harrower and 
М. Е. Steiner; distributed by Psychological Corpora- 
tion. 

f) RORSCHACH METHOD OF PERSONALITY DIAGNOSIS, 
1939-42; $1.60 per 25 record blanks; Bruno Klopfer 
and Helen H. Davidson; World Book Co. 
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HELEN SamcENT, Chief Psychologist, Winter 
VA Hospital, Topeka, Kansas.* 

DESCRIPTION AND HISTORY. А critical ap- 
praisal of the Rorschach Test must take into 
consideration the popularity and prestige which 
it has won in spite of still unsettled questions of 
technique, scoring, rationale, and standardiza- 
tion. On the other hand, its limitations despite 
extensive experience amassed through its use, 
must also be recognized. These paradoxes can 
best be understood and evaluated in the setting 
of its history and development. - 

'The Rorschach has, in recent years, become 
the best known and most widely used projective 
method for the description of personality, both 
normal and abnormal, in terms of dynamic 
processes. The test, which consists of a set of 
10 inkblots (5 achromatic and 5 in which color 
is combined in part or all of the blot) was origi- 
nated by a Swiss psychiatrist, Herman Ror- 
schach, who published a monograph on the 
method in 1922 shortly before his death. In spite _ 
of the incompleteness of his own work (or per- 
haps because of it), his originality in devising 
the method itself and his penetrating thinking 
on problems of interpretation have exerted a 
strong and continuing influence upon subsequent 
research. In spite of hundreds of published 
studies which have appeared in increasing num- 
bers since the first English publication by Ober- 
holzer in 1924 (1) the Psychodiagnostik (285) 
itself is still the basic reference and the arbiter 
of many questions. 

Although in the 1930's a few pioneer workers 
with a small but active following introduced the 
test in this country and began important re- 
search, not only experimental psychologists but 
the then more orthodox and behavioristically 
oriented clinical psychologists viewed the 
method with cold skepticism and active an- 
tagonism. Although controversy has by no 
means died out, the issues now center around 
the Chief Medical Director. he. statements’ and. прота об 
published by the author are a result of her own study and do 
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questions of research methodology, problems of 
establishing reliability and validity, and alarm 
over the widespread and uncritical acceptance of 
inferences drawn from the Rorschach alone. Its 
admission to the armamentarium of the clinical 
psychologist is no longer debated: the problem 
now is, rather, how it shall be used, by whom, 
and in what way. Kelley (1734) has recently 
compared certain Rorschach reports to 35-cent 
horoscopes (a just attack upon abuse of the test, 
though unfair to its demonstrated effectiveness 
when properly used by qualified clinicians). 

The increased status of the test in clinical work 
began with the publication of the first treatise by 
Beck in 1937 (62). Until 1942, when the Klop- 
fer-Kelley (277) and Bochner-Halpern (240) 
texts appeared almost simultaneously, the Beck 
work was the only reference in book form other 
than Rorschach's own monograph. Klopfer and 
Kelley has remained a standard volume. Beck's 
earlier book, now supplemented by a more recent 
two volume work (364, 415) and the extensive 
discussions in Rapaport (535) and in Schafer 
(749) are the most complete resources now avail- 
able for the practicing clinician. Rorschach ex- 
amining is, however, not a skill which can be 
acquired by reading, but only through instruc- 
tion and experience. Until after World War II, 
training was carried out mainly through the 
seminars and workshops offered by Beck at Mi- 
chael Reese in Chicago, by Klopfer at the 
Rorschach Institute founded at Columbia in 
1939, and by Marguerite Hertz at the Brush 
Foundation in Cleveland. In addition, since the 
war, an increasing number of courses are being 
offered at universities and in clinical training 
centers. 

AREA OF USEFULNESS. The Rorschach is used 
in almost every conceivable situation requiring 
the psychological appraisal of individuals, in- 
cluding child guidance, vocational selection, and 
college counseling as well as clinical diagnosis. 
It has been administered to individuals and 
groups in such widely varied settings as indus- 
trial concerns, schools, military installations, 
clinics and hospitals. Although for research 
purposes diversified application is to be encour- 
aged, the limitations of the test for applied use in 
such an assortment of problems must be recog- 
nized. By far the largest body of experience with 
the Rorschach has been accumulated in the clini- 
cal setting. It is in the area of mental illness, 
therefore, that we can place relatively greater 
confidence in what the data can reveal Even 
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here, however, it is important to distinguish fact 
from theory and evidence from proof. Ror- 
schach findings like the findings from any other 
clinical method (whether this is a testing de- 
vice, an interview, or an observation) depend 
upon the knowledge, experience, sensitivity, and 
sound judgment of the clinician more than upon 
the eliciting power of the inkblot stimuli them- 
selves. This decidedly limits the value of the in- 
strument in the hands of the untrained or un- 
qualified worker. 

The skilled Rorschach examiner, on the basis 
of experience (still considerably in advance of 
experimental methodology to demonstrate cru- 
cially what is believed to be clinically valid), uses 
the test chiefly for what it can contribute, in con- 
junction with other psychological methods, to 
the understanding of certain important aspects 
of personality. The test offers a means not only 
for appraising capacity, both intellectual and 
emotional, but for analyzing the way in which 
capacity is used in problem solving, in adapta- 
tion, and in control of thinking and impulse. 
Thus it offers an index of the extent to which 
normal functioning is preserved, disturbed, dis- 
organized, or destroyed. 

The test is expected to yield not a catalogue of 
traits or predispositions, and certainly not even 
a list of signs of abnormality or impairment, but 
a view of personality processes as they function. 
The theory which supports the use of ambiguous 
stimuli (such as the inkblots) to elicit a response 
which is then interpreted in terms of an individ- 
ual’s idiosyncratic imposition of structure upon a 
perceptual stimulus field having only rudimen- 
tary structure of its own is the “projective hy- 
pothesis” (Rapaport, 535). This theory, that 
perception is individually motivated, selective, 
and organized in terms of the person’s needs, 
experiences, and habitual patterns of response, 
as well as by properties of the stimulus itself, 18 
an extension of Freud’s concept of projection as 
a mechanism of defense. It has been elaborated 
by Frank,? Bellak,? and others as a rationale for 
the projective tests. Because ambiguous stimuli 
are unfamiliar problems to be solved, they set 1n 
motion processes of perceiving, associating, Of- 


2 Frank, Lawrence К. Projective Methods, Springfield, Ш: 
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ganizing, selecting, and rejecting which provide 
the opportunity to study these processes in ac- 
tion. Since, furthermore, the unfamiliar situa- 
tion is anxiety instigating, the adequacy or mis- 
management of function under stress may be 
observed. 

Some of the questions with which the Ror- 
schach analyst approaches the response record 
are of the following nature: How accurately or 
arbitrarily does this person perceive what is seen 
by most others? How rich is the experience and 
knowledge he brings to bear on it? How much 
anxiety is aroused in the face of the problem of 
responding to vagueness? What is his way of 
handling anxiety and to what extent can he get 
it under control? How responsive is he to stim- 
uli from the outer world which normally excite, 
please, or threaten? What does he do when im- 
pulse is aroused—does he try for escape, become 
cautious, or show disorganization, or is he re- 
sourceful enough to delay and direct the impulse 
into an appropriate response? How productive, 
flexible, and adaptive is he? How much tension 
and drive does he experience? 

It is, of course, in the disturbance of processes 
such as the above (which are inferred from their 
verbal end products) that abnormality and im- 
pairment are most evident, and it is on this basis 
that special kinds of pathology characteristic of 
various nosological groupings of mentally ill pa- 
tients have been studied through their reflection 
in the patterns of response found in the Ror- 
schach. A large body of evidence is available 
(though poorly systematized) which describes 
the typical performance in the psychoses (schiz- 
ophrenic and affective) and neuroses (hysteria, 
obsessional neuroses, anxiety states, etc.), in 
organically impaired patients, and in character 
disorders. Diagnostic criteria for distinguishing 
these and other groups range from Rorschach 
indications which are so universally agreed upon 
by clinicians as to be considered pathognomonic 
to a variety of signs either so over-specific and 
unsupported by adequate research, or so diffuse 
and general, as to be differentially meaningless. 


CONTROVERSIES OVER ADMINISTRATION TECH-. 


NIQUE. There are several different procedures 
for the administration of the Rorschach method, 
all of which have enthusiastic adherents and se- 
vere critics. All techniques are based on agree- 
ment on the following points: (а) that the pur- 
pose of the experiment is to get the subject to 
react to the blots (telling what they look like to 
him) in his own way with a minimum of sugges- 
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tion from the examiner, (5) that a full verbatim 
record of all'the subject’s verbalization is essen- 
tial, (c) that time to the first response on each 
card and total time for the experiment be re- 
corded, and (d) that responses must be followed 
up by inquiry to determine to what part of the 
blot the response refers, what aspects of the card 
determine it, and how the percept is conceived 
and organized. 

On the last point there is general acceptance 
of a division of the test into two parts, the first of 
which is known variously as the Association 
(Beck), the Performance (Klopfer), or the 
Test (Rapaport) and the second of which is 
called the Inquiry. It is on the handling of the 
inquiry that "schools" differ most sharply. Beck 
and Klopfer advocate going through the entire 
series of ten cards without interrupting the sub- 
ject by questions. At the end of this period, the 
cards are re-presented, responses are read back 
to the subject, and questions are asked to elicit 
the additional information necessary for scoring. 
Rapaport, however, has introduced a “‘blind- 
fold” inquiry (535) which follows the presenta- 
tion of each card. Without showing the card 
again (except when absolutely necessary to clear 
up confusion in regard to location), the subject 
is queried on those responses which are not read- 
ily identifiable or scorable on the basis of the 
spontaneous verbalization. Most workers also 
advocate additional probing (after the initial in- 
quiry), a procedure which Klopfer introduced 
and termed “testing the limits." Answers to the 
more leading questions in this period are not 
allowed to influence scoring, but yield fuller in- 
formation for qualitative interpretation. Both 
immediate and delayed inquiry are proposed by 
their advocates to minimize suggestion. Beck 
and his followers hold that card-by-card inquiry 
alters the experiment in that it influences the re- 
sponse to subsequent cards. Rapaport, on the 
contrary, insists that questioning after the re- 
mainder of the test has intervened reveals less 
of the original response determinants and re- 
quires more suggestive questioning. The Beck 
method does play down the interpersonal rela- 
tionship between examiner and subject during 
the association, thus reducing examiner influ- 
ence and making the objective conditions of test- 
ing more nearly equal in the hands of different 
examiners, The Rapaport technique, in the hands 
of skilled examiners, provides a rich and fresh 
material on a subject’s immediate experience 
and allows the clinician to establish better rap- 
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port which, in a sense, means equating the sub- 
jective conditions of testing. The main objection 
to the latter method is the unpredictable effect 
which even minimal examiner-attention to cer- 
tain responses and not to others exerts upon the 
subject who is quick to detect what the examiner 
thinks is important. The method of choice, how- 
ever, depends largely upon the clinician's pref- 
erence for a controlled method by which indi- 
vidual behavior in an objectively standardized 
situation can be compared or for a less controlled 
technique which allows richer observational data 
and more leeway for intuitive interpretation. 
There are other variations in administration 
technique among the leading experts with regard 
to wording of instructions, amount of pressure 
exerted to elicit response, position of subject in 
relation to examiner (Beck’s subject sits with 
back turned and is handed cards over his shoul- 
der), handling of the cards, and the like. Only 
the instructions, or manner of presenting the 
task, is of much importance. Beck and Klopfer 
both offer an initial explanation about the ink- 
blot test, Beck utilizing a standardized intro- 
ductory statement in contrast to a less formal 
introduction by Klopfer. Rapaport uses only 
Rorschach's original question “What might this 
be?", answering requests for more information 
as they arise, rather than seeking to anticipate 
them. The latter procedure has much to recom- 
mend it, Initial explanations may be useful in 
quieting the anxiety and guilt feelings of the 
examiner for the task he imposes, but they have 
a variable effect on the subject and ordinarily 
are not so helpful as dealing with questions, con- 
fusion, or uneasiness at the time they become 
manifest. The selection of administration method 
must be guided largely by the examiner’s train- 
ing and preference. No systematic research has 
been carried out to demonstrate which is best. 
Even the criteria of “best” would be in doubt. 
The above discussion refers only to individual 
Rorschach testing. Various methods for the 
group administration of the test have also been 
devised. The Harrower group method, using 
slides and booklets in which responses are writ- 
ten, produces protocols which can be scored and 
interpreted in much the same way as if the test 
` were individually administered (199, 316). The 
material elicited suffers only from lack of detailed 
inquiry data on doubtful responses. Beck calls 
this method “а test but not a Rorschach." How- 
ever, experience suggests that the situational 
variables and the mode of presentation do not 
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seriously alter the meaning of response, Har. 
rower's multiple choice: form is not recom- 
mended, since it limits response possibilities in 
a way which is incompatible with the principles 
of projective test construction and is less well 
standardized than the better established non- 
projective inventories. 

SCORING SYSTEMS. The variety of scoring sys- 
tems is at first bewildering to the beginner in 
Rorschach. Actually, the difference is in symbol 
and method of tabulation, rather than in the test 
variables they represent. The clinician familiar 
with one “shorthand” (Beck's term to describe 
the reduction of complex response material to 
symbols for purposes of analysis) can easily 
learn to translate from one to another. 

In all systems the following are represented 
in scoring: (а) the proportion of responses to 
the card as a whole, to prominent detail, and to 
various small or unusual details ; (b) the deter- 
minant of each response according to the in- 
fluence of qualities inherent in the figures them- 
selves, such as form, color, movement, and 
chiaroscuro or shading; (c) content classified 
into various categories; and (d) various quanti- 
tative relationships and percentages derived from 
the above scores, plus qualitative aspects of 
verbalization, organization, form level, original- 
ity, or commonality. 

The beginner will bé annoyed to find different 
ratios and indices computed in each system and 
will discover that a determinant such as shading 
is called Y in one system, K in another, and Ch 
in a third, for reasons known only to the symbol 
originators. As indicated above, these differences 
are not as important as they may at first seem. 

Somewhat more disquieting, but also more 
apparent than real, are differences in interpreta- 
tion which strike at the core of the problem of 
validity to be discussed below. M. (movement) 
for example, has been thought to represent in- 
telligence, native endowment, creativity, wealth 
and flexibility, phantasy activity, unconscious 
wish, amount of inner resource, and control of 
impulse. Beck emphasizes the wish fulfilling and 
creative character of M (which he calls the wish 
or "waking dream"), whereas Rapaport stresses 
its implications for wealth of intellectual resource 
and ideational control, depending on form qual- 
ity. Beck has emphasized certain types of shad- 
ing response as indicative of self-appraisal, self- 
criticism, special forms of anxiety. Rapaport 
points to the relation of chiaroscuro to anxiety, 
but more particularly to the manner in which it 
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is handled. Klopfer and Hertz each utilize inter- 
pretations peculiar to. their own. systems. The 
several experts, moreover, have different lists 
of "popular" responses and different standards 
for evaluating the good or poor quality of form. 

A system of scoring less widely known, and 
too new to be evaluated at this time, is an elabo- 
rate sign approach by Charlotte Buhler (767). 
Its major purpose is to provide for more ade- 
quate quantitative evaluation of protocols and a 
more objective appraisal of the level of personal- 
ity integration. A method which aids in the 
rapid evaluation of large numbers of records 
for research or screening purposes is the Munroe 
Check List (219). - . 

RELIABILITY AND VALIDITY. From the stand- 
point of objectivity, the above variations and 
differences between schools of Rorschach are a 
weakness of the test but are also a strength in 
that they show flexibility and almost unlimited 
possibilities for development. At the same time 
the complexity of the test variables and the com- 
prehensiveness of rationale for their interpreta- 
tion create for the Rorschach the same problems 
in establishing reliability and validity that other 
projective methods encounter. Variables are 
interrelated and interdependent, and "results" 
are not simply the product of the test, but of 
the interpreter who reads and organizes them. 
Scores are only a step toward interpretation 
rather than the interpretation itself as is the 
case with more objective tests. For example, in 
an inventory which purports to distinguish neu- 
rotic from non-neurotic subjects by comparison 
with established cutting scores, the obtained 
score is the diagnosis. The score itself may thus 
be checked for agreement with a criterion meas- 
ure, No Rorschach score, singly or in combina- 
tion, has any such meaning except as the inter- 
preter considers it in relation to the entire 
record, Furthermore, various scores have dif- 
ferent meanings according to the setting in 
which they appear. Methods for dealing with 
patterns of scores proposed by Cronbach (795); 
Buhler (787), Zubin (358), and others are 
hopeful, but these still form only the basis for 
conclusions and cannot be directly read. 

In addition to the validation problems which 
are inherent in the nature of the instrument and 
its use, there is the difficulty of establishing valid 
criteria for those aspects of personality with 
which the test deals. There are virtually no 
sound measures for internal states (which may 
or may not be manifest in behavior). The very 
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existence of these is an inference whether drawn 
from Rorschach or other evidence. It is a strange 
paradox that the psychiatric interview, the case 
history, and the unstructured clinical observation 
(bases of inferences which are equally complex 
and equally invalid as scientific evidence) have 
been so frequently used as diagnostic and de- 
scriptive criteria with which the Rorschach and 
other projective test findings are correlated. We 
demand of projective tests an accuracy of pre- 
diction which cannot be demonstrated for other 
clinical methods, yet we use the latter to judge 
whether Rorschach conclusions are "right" or 
“wrong.” While agreement may be viewed as 
evidence in the direction of validation, disagree- 
ment is a negative datum not subject to explana- 
tion. 

Validity studies on the Rorschach may be 
roughly classified into three groups: (a) “sign” 
studies in which contrasting groups of known 
composition are compared in terms of difference 
on various scores; (b) matching or correlation 
of diagnoses and/or total personality descriptions 
with clinical or history data, and (c) veri- 
fication of one-dimensional Rorschach interpre- 
tations (intellectual control, anxiety, impulsiv- 
ity, etc.) by comparison with other measures 
of the same variable. 

The difficulties in the first two procedures have 
been cited above. The first type of research has : 
normative value in that it has helped to establish * 
expectancies for groups and for the range of 
variation in the scores themselves ; this, however, - 
is not validation. The second approach has been 
valuable in those studies in which criteria have 


‚ been carefully chosen. The third method is prom- 


ising but has only begun to be exploited. One 
study of intellectual control (Williams, 596) and 
one of impulsivity (Gardner, rr09) show a 
promising agreement between Rorschach judg- 
ments and external measures of the same in- 
ferred characteristic. Williams' study is unique 
in that an experimental method for produc- 
ing a phenomenon predicted from the Rorschach 
(good or poor intellectual control under stress) 
was devised. Gardner showed satisfactory cor- 
relation between Rorschach measures of im- 
pulsivity and impulsivity rated on the basis of 
another test and on direct acquaintance and ob- . 
servation. Both of these studies, although they 
are interpretation rather than score oriented, 
fall back upon correlations of scores and score 
patterns with the criteria, rather than upon in- 
terpretations (the basis of which may be inter- ' 
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changeable). A promising method for validating 
interpretations, utilizing the О) technique, has 
been developed by Kobler in a study of Szondi, 
Rorschach, and full battery inferences in rela- 
tion to therapists’ judgments. Benton has re- 
cently reviewed a number of validation studies, 
the results of which are conflicting (908). 

The reliability of the Rorschach, defined as 
consistency of measurement, can be judged only 
by agreement between interpreters, since the 
self-agreement of a single Rorschach protocol 
(split half) is affected by variations in the 
stimulus value of the ten cards, and agreement 
between Rorschachs administered to the same 
patient after an interim is disrupted to an un- 
known degree by changes in the subject. Al- 
though two series of alternate cards, the Behn 
Rorschach and the Harrower set, are available, 
the published studies are so few in number that 
results are inconclusive. 

SUMMARY. The: Rorschach test is a clinical 
technique, not a psychometric method. As such 
it has the advantages, and is subject to the limi- 
tations, of other complex, flexible clinical tools. 
In its present stage of development it is an aid 
to psychological investigation and interpretation, 
the usefulness of which depends upon the clini- 
cian who applies it. It is to be hoped that dif- 
ferences in regard to administration, scoring, 
and interpretation will be reconciled through 
research rather than argument. It is to be hoped 
also that validation studies in which care is exer- 
cised in the choice of criteria, and in which the 
interpretation rather than the score will be the 
candidate for proof, will be forthcoming. The 
test has no magic; vital diagnosis and treatment 
planning should not rest on its results alone. At 
present as much critical attention should be di- 
rected to the processes engaged in by the clini- 
cian who interprets it as to the validation of the 
test itself. Within these limits, the Rorschach is 
a method for the study of personality which has 
no threatening competition today. ' 


` For reviews by Morris Krugman and J. R. 
Wittenborn, see 3:73; for related reviews, see 
118-28. 
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*[Re Rorschach.] BrussEL, JAMES А.; HrrCH, 
KENNETH S.; Амр Prorgowskr, Zyemunt A. А Ror- 
schach Training Manual, [Third Edition]. Utica, 
N.Y.: State Hospitals Press, 1950. Pp. 86. Paper. 
$075. * (PA 25:4560, 4581) 


Am J Psychiatry 108:236 S "51. George A. 
Ulett. * a helpful manual for Rorschach training 


Da 
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consisting of 2 articles that appeared first in the 
Psychiatric Quarterly in 1942. Other brief Ror- 
schach manuals have since appeared, but this 
reprinting is timely owing to the present in- 
creasing tempo of neuropsychiatric training. The 
manual contains pictures of the Rorschach cards 
in miniature and helpful summary tables. Dr, 
Piotrowski's article contains a brief but very 
interesting historical account and developmental 
sketch of Rorschach theory. His discussion of 
the meaning of the separate scoring symbols is 
excellent and reflects a wide background of 
clinical experience. The book suffers from want 
of organization, as both articles cover, in part, 
the same material. Combining the 2 sections 
could have prevented duplication of material 
and produced greater clarity. The use of a some- 
what different scoring system in the 2 papers 
will be confusing to the beginner. An attempt 
to simplify the scoring of shading responses is 
commendable, but the inexperienced trainee who 
will wish for a means to understand current 
Rorschach literature should be given a clearer 
comparison of the author’s system with standard 
accepted terminologies. Considerable space is 
used to conyince the reader that the Rorschach 
test is a valuable procedure, and to this end there 
is included a study of 50 patients in whom the 
Rorschach and clinical diagnoses are compared. 
The manual as a whole contains much worth- 
while information but, as the writers themselves 
state, it cannot serve as the “sole armamentar- 
ium” with which to plunge into Rorschach ad- 
ministration and interpretation. 

B Menninger Clinic 15:192 S ’51. Walter 
Kass. * It is regrettable that assumptions, Tà- 
tionale, interpretive speculation are undifferen- 
tiated. Too meager for introductory teaching and 
too elementary for advanced students, when use 
alone, it is recommended as supplementary read- 
ing by all Rorschach students. 

Q J Child Behavior 3:111 Ja'51. M. М. Gem. 
This manual is a re-printing under one covet of 
two previously published monographs An In- 
troduction to Rorschach Psychodiagnostics ™ 
Psychiatry by Brussel and Hitch and A Ror- 
schach Compendium by Piotrowski. The sec 
tion by Brussel and Hitch explains in simple, 
readable form, what the Rorschach is and how 
it is administered. There are colored тергойис- 
tions of the plates and a comprehensive form 
containing all the tabulations the Rorschach ex- 
aminer might make. The authors give concise 
and careful descriptions of all the scoring sy™ 
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bols, what they are usually considered to mean, 
and what significance can be attached to an ex- 
cess or deficit of the various signs in any one 
protocol, especially as related to the various diag- 
nostic entities. Finally, this section includes a 
report of the Fort Dix study in which a close 
agreement was found between the Rorschach 
diagnoses and the psychiatric diagnoses of 50 
patients with a variety of mental ailments. This 
section by Brussel and Hitch is a good, brief 
introduction to the Rorschach method. Piotrow- 
ski in his section covers not only the administra- 
tion technique, but the history of the Rorschach 
and the theory behind it. The latter discussion is 
particularly valuable for potential users of the 
test. The most important contribution Piotrow- 
ski makes is an excellent and comprehensive re- 
definition of each scoring symbol. These defini- 
tions go beyond what is usually found in most 
training manuals, for they are closely tied to 
dynamic principles. Piotrowski defines and dis- 
cusses these symbols in relation to the everyday 
functioning of individuals, and this frees the 
trainee from the confinement of interpreting 
Rorschachs in terms that only another Ror- 
schach examiner can understand. Both sections 
would be extremely useful to beginners. Pio- 
trowski's section should be useful even to those 
examiners who have already had some. experi- 
ence with the Rorschach. As the authors them- 
selves indicate, it is understood that a reading of 
this manual alone will not provide proficiency in 
Rorschach interpretation. The potential Ror- 
schach examiner should have also a sound un- 
derstanding of dynamic principles of personality, 
as well as considerable supervised experience 
and training. 
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*[Re Rorschach.] Bunter, CHARLOTTE; BUHLER, 
Kart; AND Lerever, D. Мету. Development of the 
Basic Rorschach Score With Manual of Direc- 
tions, Rorschach Standardization Studies, No. 1. Los 
Angeles 46, Calif.: Charlotte Buhler (1127 North 
Sweetzer Ave.), 1948. Pp. ix, 190. Paper, mimeo- 
graphed. $4.95. * (РА 23:745) 

Am J Orthopsychiatry 19:722-3 О *49. Her- 
man Molish. * The Rorschach records were ob- 
tained through individual administrations in 
which questionable deviations from the usual 
procedures occurred: (1) "Total time of the 
test is not measured because it is affected by the 
subject’s verbosity, a factor irrelevant to diag- 
nosis.” (2) "The subject is told to give three to 
five responses per card....subjects who see very 
little on certain cards and suddenly a great deal 


on others are always allowed to continue because 
the difference in responsiveness is significant.” 
One wonders, however, if increased responsive- 
ness can be obtained in all cases if the examiner 
has originally induced a mental set by instruct- 
ing the subject to give only three to five re- 
sponses per card. (3) “It is best to rely as little 
as possible on the subject’s opinion of what 
determined his perceptual process." This state- 
ment is made to defend the unorthodox series of 
alternative questions employed during the in- 
quiry period. As justification for this method 
the authors state that individual Rorschach rec- 
ords given "in the usual way" compare favor- 
ably with group Rorschachs. True, the group 
Rorschach method can also suggest certain de- 
terminants by the printed instructions, but this 
cannot be comparable to the situation where the 
examiner employs suggestion by such questions 
as those cited by the authors. * The statistical 
treatment is extensive; but the selection of sizes 
of groups within the population upon which 
these statistics are based invites criticism. The 
thirty normals who comprise the criterion group 
in establishing the Basic Rorschach Score are 
very possibly one of superior adults. Especially 
small, are the normal and schizophrenic groups. 
After the authors accept their weighted scores, 
we are informed, “In a number of instances the 
scoring weights from this chart were modified 
in the light of the clinical experience of the two 
authors.” It thus appears that the authors al- 
ready have lost faith in their original hypothesis 
that “typical personality pictures in Rorschach 
terms can be described statistically.” They do 
not discuss to any extent their reasons for these 
changes based on their clinical experience. One 
does not deny the value of clinical experience, 
but to conveniently accept statistical findings 
on the one hand when they agree with hypotheses 
and discard them on the other certainly cannot 
be justified in scientific method. An important 
step in this research is the formulation of four 
levels of personality integration. These levels 
are determined statistically by the distribution 
of the Basic Rorschach Scores in the various 
clinical groups. The rationale of these levels of .. 
integration is to be commended. Tt stresses the 
need of functional classification rather than the 
traditional clinical categories. These levels are: 


` adequacy, conflict, impairment, and reality loss. 


The problem of overlapping between clinical 
groups and the so-called “blurring effect" within 
the four levels, recognized by the authors, is not 
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attacked in the present study but is left for 
further research. Just how useful these levels are 
in terms of their present description is doubtful. 
Personality structure, whatever its clinical clas- 
sification, is too complex to be fitted into levels 
` described by a series of terms as “unabsorbed 
and defective reality awareness,” “over-wide as- 
piration focus,” etc, For the present, this study 
will be found valuable not in terms of its appli- 
cation to Rorschach analysis, but only as a be- 
ginning point in the understanding of the 
extensive problem at hand in quantifying Ror- 
schach data, and describing personality struc- 
ture on a statistical basis. , 

Am J Psychiatry 106:558-9 Ја ' 50. Douglas 
M. Kelley. * The book is quite technical, based 
primarily on Klopfer's work, and a knowledge 
of his scoring system is essential for its under- 
standing. Some Rorschach experts will cer- 
tainly disapprove the changes which are pre- 
sented primarily in Chapter. 10 of the book and 
which, taken as a whole, in spite of the discus- 
sion of these authors, yield а definitely differ- 
ent approach with different results from those 
achieved by a Rorschach given in the accustomed 
fashion. Such deviations are not new but must 


be emphasized particularly in a book of this type 


which is quantitatively oriented. From this point 
of view, it is not especially important as to 
whether the changes in technique are good or 
bad since the results presented can be assumed 
to hold good only when the authors' technique 
is employed. This means that, unless all workers 
agree to accept this new method of administra- 
tion and inquiry, little gain is to be had from the 
volume as a clinical work except the indication 
that it seems to be successful in the hands of the 
present workers. It is, of course, highly interest- 
. ing asa research project but a great deal of vali- 
dation by other workers must be presented be- 
fore the changes introduced can be generally ac- 
cepted. It does, however, represent an excellent 
example of the type of study which can be done 
in this complicated field and certainly as such is 
of interest to all experts. The statistical data 
seem adequate as far as they go. The book....in- 
troduces a system of clinical groups or ranks 
which can be discriminated by the modifications 
of the technique. Four ranks are indicated : level 


I represents essentially variations of normality; _ 


level II represents neurotics; III contains de- 
pressions, manics, alcoholics and nonpsychotic 
organics ; and IV, schizophrenic categories, This 
method of evaluation deviates markedly from 
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the usual nosological approach but may be of 
some practical value. The Rorschach sign list 
which consists of the usual Rorschach symbols 
alone or in combined form is well worked out 
and may prove of use along with ordinary 
Rorschach procedures, In addition to a compre- 
hensive discussion of their method of adminis- 
tration, the authors have provided ample data 
explaining their specific modifications in the 
scoring system. They also present interesting 
discussions on the psychodynamics of Rorschach 
signs and, of course, include a wealth of statisti- 


cal data with a vast area of tables which will: 


hopelessly confuse the nonstatistically trained 
clinician but serve toward validation of their 
theory. A number of case presentations are given 
depicting examples of various types of psycho- 
pathology and are of real value, The book should 
certainly be provocative of considerable argu- 
ment among Rorschach workers and as such 
may perhaps provoke some clarification of this 
complicated but most useful technique. 

Arch Neurol & Psychiatry 63:677-8 Ap ’50. 
Mortimer M. Meyer. * a real contribution * Al- 
though the study is essentially a statistical one, 
and will have most direct meaning to Rorschach 
workers familiar with statistics, it has impor- 
tance for the clinical psychiatric and psychologic 
fields, both for its further objective, statistical 
examination of this widely used instrument and 
for the new technic—"the basic Rorschach 
score." * Since the tables and figures are re- 


ferred to only in a general way, it might have 
‘been more helpful if the tables had been placed 


in the appendix, as they tend to interrupt the 
reading for the many who are not statisticians. 
Although the book is mimeographed, rather than 
printed, it is an excellent publication. 

B Menninger Clinic 14:35-6 Ja ' 5o. Walter 
Kass. * The content and treatment....are geared 
to the understanding of the more experienced 
Rorschacher. 

J Abn & Social Psychol 45:173-6 Ja '50. 
Maria А. Rickers-Ovsiankina. * an ambitious 
and, in many ways, an impressive attempt at 
standardizing the diagnostic significance of 
quantitative aspects of the Rorschach Test. The 


procedure employed for this purpose is the. 


development of a statistically validated sign list, 
consisting of over one hundred signs....based on 
the Klopfer scoring system. Although no men- 
tion of origin is made, the similarity between 
this list and the so-called "check list" of Ruth 
Munroe is evident. The newness of the present 
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approach lies in the systematic application of 
this list to a large variety of clinical subjects 
(518 all told, ranging from normal to schizo- 
phrenic) and in the careful statistical testing of 
each item with respect to its usefulness in differ- 
entiating one clinical group from another. Ac- 
cording to the relative incidence of these signs 
in the various clinical groups they were assigned 
numerical weights, ranging from +5 to —5, the 
plus signs indicating frequency of this sign in 
groups “more nearly normal,” whereas the 
minus signs reveal an association with the less 
adjusted groups. The algebraic summation of 
all weighted signs occurring in a specific record 
constitutes its Basic Rorschach Score—the focal 
contribution of this investigation. * the frequent 
occurrence of a certain Rorschach sign in a 
particular clinical group or on a certain integra- 
tion level, as well as its tendency to appear in 
conjunction with some signs and not with others, 
in turn throws additional light upon the psycho- 
dynamics of the chief Rorschach categories. It 
is this latter material which seems to the present 
reviewer to be of most immediate interest to 
Rorschach workers. The value of an overall 
measure of adjustment, as the BRS purports to 
be, is readily appreciated by everyone in re- 
search. The usefulness of the BRS, however, at 
the present stage of the investigation is limited 
by evidence of wide overlap in scores obtained 
from the different clinical groups. One therefore 
doubts that many clinicians will be ready to 
discard the more conventional psychiatric clas- 


sifications for the sake of this new measuring. 


device, although there is in principle a good deal 
of truth in the statement “To’predict a patient's 
chances for recovery with reference to his job, 
his family, and his practical affiliations, it would 
be a more functional classification to determine 
the patient's disintegration level rather than to 
call him a psychopath or a psychoneurotic." (p. 
13) Confidence in the clinical usefulness of the 
proposed scale also suffers from the employment 
of precariously crude gradations within the scale, 
a jump from —3 to +3 being assigned to such 
minor changes in raw scores as R=24 vs 
К = 25, Е% = 20 vs F% =21, sum C — 235 
vs sum C — 3. With respect to the theoretical as- 
pects of the study, our main dissent is with the 
concepts employed and with their implications. 
The central concept of integration, e.g., seems 
to be used interchangeably with that of adjust- 
ment. Although it cannot be denied that these 
two concepts are often linked, they are by no 
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means synonymous, if only because of their 
disparity in semantic area of discourse. The 
promise of quantifiable integration levels within 
the personality organization presents stimulat- 
ing potentialities both theoretically and practi- 
cally. A difficulty arises, however, in trying to 
think of conflict, defect and reality loss as real 
levels or relative degrees of some one function, 
such as integration. It is hard to imagine how 
these concepts, referring to totally different psy- 
chological phenomena, can be related in linear . 
fashion either to each other or to the highest 
level in the hierarchy, that of adequacy of inte- 
gration. To return now to the discussion of the 
psychodynamics of individual Rorschach signs. 
Categories such as FM (animal movement), 
m (movement of inanimate objects), and C" 
(achromatic color), the psychological meaning 
of which hitherto had not been demonstrated 
very convincingly in the literature, gain in in- 
terpretative clarity. Somewhat ‘new light is 
thrown upon F% (percentage of form re- 
sponses), C ит (weighted total of color re- 
sponses) and the various shading categories. 
Although not everybody would agree with all 
inferences drawn, this analysis is quite enlight- 
ening and will no doubt enhance our understand- 
ing of these problems, within certain limita- 
tions of technique to be discussed later. In a few 
categories, however, the conclusions seem more 
misleading than constructive, as e.g., the blanket 
assertion that “W increase is a negative sign." 
(p. 38) In cases like this one senses the diffi- 
culty in trying to deal with Rorschach factors in 
a crudely quantitative fashion without consider- 
ation not only for qualitative aspects of the re- 
sponse, but even for objective discriminations 
of a finer nature like the “instantaneous whole" 
or Wv response vs the more complex W’s, anal- 
ysis of W’s by cards, etc. Part II of this study 
describes the authors’ technique of administer- . 
ing the test. It involves important departures 
from the established procedure. Informing the 
subject of the purpose of the examination, prod- 
ding for specific responses, requesting 3-5 in- 
terpretations per card, introducing alternatives 
during the inquiry and so forth, are sufficiently 
drastic changes to necessitate a good deal of 
caution in applying the findings of this study to . 
data obtained by a more conventional method. 
E.g., the novel observation that the peak for 
S (space responses) and Dd (small detail) falls 
into the normal group might very well bea result 
of prodding for responses. It seems safe to pre- 
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dict that these technical innovations will be the 
part of the investigation most widely discussed 
in the Rorschach world, but since such dissection 
is by nature highly specialized, a mere mention 
of its implications will suffice here. Part III pre- 
sents separately the statistics of this investiga- 
tion, by elucidating in detail (1) the different 
steps of analysis which led ultimately to the 
development of the BRS; (2) the establishment 
of its reliability and validity and finally (3) a 
variance analysis of the individual Rorschach 
signs, It is a commendable piece of work, well 
organized, clearly and comprehensively written, 
and may be recommended to anyone interested 
in the problem of quantification in projective 
techniques. When the Rorschach Test first be- 
came popular in this country two schools of 
thought emerged, one stressing the need for 
objectivity and norms and the other favoring a 
qualitative and more wholistic approach. In re- 
cent years this opposition has fortunately di- 
minished with both sides explicitly or implicitly 
admitting the indispensability of either method 
for a truly valid Rorschach analysis. It is there- 
fore a noteworthy development to see within 
the same year the publication of two books which 
not only are reviving the gap, but actually make 
it deeper than ever: L. К. Frank’s so decisively 
individual-centered interpretation of projective 
techniques at the one pole and the present volume 
at the other extreme. Clearly, the latter is the 
most vigorous and bold champion of making the 
test "as objective, as simplified and as brief as 
possible" (p. 50) that has appeared in the litera- 
ture, aside from Harrower’s multiple choice 
variation. To be sure, we are warned repeatedly 
not to interpret the concentration upon quanti- 
tative aspects of the analysis as an indication of 
rejection of its qualitative side. And yet, after 
having made this laudable avowal, the authors 
. Soon forget about it and permit themselves to be 
carried away by their statistical findings. At least 
such appears to be the case when it is asserted 
e.g., that no information regarding moral qual- 
ities or super ego functions could be elicited from 
a Rorschach test (p. 32), or when diagnostic 
inferences are made solely on the basis of the 
sign list (pp. 164 ff.). Last but not least, it is 
regrettable that in a publication which sets itself 
such broad aims, there is a conspicuous absence 
of reference to the vast Rorschach literature 
fie and abroad. Except for reliance on Klop- 
_ fer's work, one finds almost no mention of the 
“numerous clinical investigations which have 
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built up over the years the reputation of the 
Rorschach Test. Even the standardization stud- 
ies of Beck and of Hertz receive no citation here, 
Whatever the reason for this “isolationism,” it 
is surely as deplorable in research as it is in any 
other field. 

J Consult Psychol 13:64—5 F '49. Laurance 
F. Shaffer. Clinical acumen and statistical dis- 
cipline combined to procure this study of the 
quantification of the Rorschach. A list of 102 
diagnostic signs was developed in an attempt to 
differentiate among groups ranging from nor- 
mals to schizophrenics, By statistical procedures, 
positive and negative weights were attached to 
each sign, yielding an algebraic total designated 
as the Basic Rorschach Score (BR). Such scores 
appear to discriminate significantly among clini- 
cal groups, and further, to rank the groups in 
the order of the severity of personality disturb- 
ance shown. * While BR scores cannot be used 
for individual diagnosis, they may serve to ex- 
clude syndromes that do not apply, and may 
assist in gauging the level of adjustment. If fur- 
ther study sustains the validity of the BR scores, 
their convenience will increase the utility of the 
Rorschach in research. Despite minor flaws, the 
BR represents a forward stride in the difficult 
task of expressing significant aspects of per- 
sonality function in quantitative terms. * . 

J Nerv & Mental Dis 3:173 F ’50. (Zygmunt 
A.) Piotrowski. * The authors introduce а 
change in the administration of the test limiting 
the freedom of the association of the individual. 
“Our procedure makes the test administration 
more independent of the individual talents of 
the examiner and examinee.” One wonders 
whether making the test “more independent of 
the examinee” might not lower its validity. * 
This is clearly and deliberately a book for the 
specialist who uses the Rorschach for a diag- 
nostic aid. Everyone actively engaged in Ror- 
schach practice will wish to acquaint himself 
with, this contribution. It is a highly technical 
book and, for this reason, might discourage the 
general reader. 


Nerv Child 8:71 Ja '49. This is the prelimi- _ 


nary edition of a book destined to be the basis of 


all sound Rorschach work in the future. * Тһе 
folly of overestimating the Rorschach results 


is emphasized and a sound evaluation estab- 
lished, 
Occupations 27:425-6 Mr ’49. Goldie Ruth 
Kaback. * Personality analysis based on Ror- 
schach signs is not new. The particular weight- 
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ing of these signs, however, marks a new de- 
parture for clinical diagnosis. * Part I....includes 
an excellent discussion of the psychodynamics 
of Rorschach signs. However, this discussion is 
not for the novice. A thorough background in 
psychoanalytic theory and familiarity with psy- 
chopathology are needed in order to differen- 
tiate between the interpretations presented here 
and those held by other Rorschach workers. 
Part II consists of a Manual of Directions. The 
procedure is recommended for use for diagnostic 
purposes only, and if and when objective results 
are considered essential. The original presenta- 
tion limits the subject to not more than five 
responses to each card. The authors apparently 
have not considered the stimulus value, which 
differs from card to card. The inquiry also 
marks a new departure in Rorschach testing. 
The examiner asks direct questions regarding 
movement, posture, color, texture, etc. The au- 
thors, however, present good logical reasoning 
for the inquiry changes advocated. Testing the 
limits procedure is abandoned. Part III presents 
the Statistical Development of the Basic Ror- 
schach Score. The statistical treatment of the 
data leaves little to be desired. Figures and tables 
are clearly explained and can easily be followed. 
The basis for determining the weights assigned 
to each BRS is fully described. * The authors 
warn Rorschach workers that the mechanical 
use of the diagnostic sign list is to be avoided. 
The list will probably be most useful for quick 
personality appraisal. Content analysis is still an 
important technique for good psychological di- 
agnosis, This manual is not for the newcomer to 
the Rorschach field. It can best be used by those 
who have had extensive Rorschach training and 
experience with a variety of clinical subjects. 
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*[Re Rorschach.] BUHLER CHARLOTTE, AND LE- 
Fever, D. Wetty. A Rorschach Study on the Psy- 
chological Characteristics of Alcoholics. Yale 
University, Laboratory of Applied Physiology, Mem- 
oirs of the Section of Studies on Alcohol, No. 6. New 

aven, Conn.: Quarterly Journal of Studies on Al- 
cohol, 1948. Pp. viii, 64. Paper. $0.75. * (PA 22: 1232) 
ере {гош Q J Studies Alcohol 8:197-260 S 
47. 


Am J Psychiatry 106:78-9 Л 49. Albert Ax 
and Milton Greenblatt. This interesting and pro- 
Vocative monograph attempts to define charac- 
teristics of patients suffering from chronic alco- 
holism by relating Rorschach findings in 100 
alcoholics with findings for other groups * The 
authors are well aware of the complexity of their 
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problem, and the fact that alcoholics represent 
a wide diversity of types. Their differentiation 
between superficial and deep-level conflicts, how- 
ever, makes one pause to wonder whether acute 
pressüres are not highly selective phenomena re- 
lated to deeper problems even in the alcoholic. 
Further difficulty is encountered in comprehend- 
ing what is meant by a “а physiological factor 
underlying alcoholism.” “The reasonableness of 
such an assumption is strengthened by our find- 
ing that there are striking similarities in the 
response patterns of the alcoholic group and the 
nonpsychotic group. Further support for this 
assumption may be seen in the fact that educa- 
tional, moral, religious, and social influences, 
as well as manipulation of the environment, are 
more effective in some cases of alcoholism than 
formal psychotherapy alone.” In dealing with 
human behavior, can we ever neglect a physio- 
logical factor? And does the supposed lack of 
effectiveness of psychotherapy in alcoholism ar- 
gue in favor of an “organic” disease? The au- 
thors’ objective and statistical method of setting 
up a comprehensive list of discriminative signs 
empirically evaluated on clinical groups is ап 
effective approach to Rorschach diagnosis. Do- 
ing a “Discriminant Function” analysis would 
probably be the ideal method as it would take 
into consideration the intercorrelations of the 
various signs, but the work would be prohibitive 
for more than a few signs. The charting of the 
signs and marking their discriminative reliability 
by the 5 % confidence level is excellent. Of course 
with 99 signs, at least 5 signs significant at the 
596 level should occur by chance. Significant 
differences between groups can only be claimed 
when the number of significant signs exceeds 5. 
The crucial problem arises as to which of the 
significant signs are the real discriminative ones 
and which are the spurious 5 that occurred by” 
chance. The only way to separate satisfactorily 
the sheep from the goats is to apply the set of 
empirically determined discriminative signs on 
a new group of subjects and see which signs 
stand up to their discriminative task. The old 
problem of regression toward the mean is es- 
pecially dangerous here where there are so many 
signs to select from. While very likely some of 
the signs here found discriminative may hold 
up on a new group, it is very probable that some 
will lose all, and others a great part, of their 
discriminative value. Even more vulnerable is 
the weighting of signs, regardless by what 


' method. The arbitrary weights, as used here, 
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may be as good as any but when tried out on a 
new group the weights will nearly all be changed. 
Most experimenters with such data have found 
that the weights are so radically changed when 
applied to a new group as to be worthless. The 
net effect of weighting seems to be to exaggerate 
the discrimination as found on the original 
groups and actually to reduce the discriminative 
value of the signs when tried on new groups. 
They tend to exaggerate the effect of regres- 
sion to the mean and at the same time exaggerate 
the researcher's disappointment when verifica- 
tion is attempted. The obvious answer to the 
above criticism is to apply the identical set of dis- 
criminative signs to a new group of subjects and 
verify the predictive signs. Even without in- 
creasing the number of subjects, possibly a more 
rigorous method would have been to select ran- 
domly one half of the cases and pick the dis- 
criminative signs on the basis of the first half, 
then apply them to the second half and retain 
only those signs that discriminate for the new 
groups. A dubious trend in this monograph is 
the definition in clinical terms of a Rorschach 
sign (like m) on the basis of the clinical groups 
here used, then using the presence or absence 
of the sign as descriptive of the diagnostic group. 
Thus they arrive at the novel meaning of m as 
"tension tolerance" presumably from clinical 
observations (since the standard Rorschach au- 
thorities do not thus define m) and then pro- 
ceed to use the low m for alcoholics as indica- 
tive of low tension tolerance in the group. If they 
have other sources for this definition of m, they 
fail to specify them. Certainly external defini- 
tions of all the Rorschach signs are required if 
they are to be used for describing the personali- 
ties of the diagnostic groups. On the other hand, 
it is perfectly legitimate to use clinically de- 
scribed groups to give new meaning to the Ror- 
Schach signs, but one cannot do both on the 
same material. The above criticisms do not sig- 
nificantly detract from the value of this excellent 
monograph which represents a definite advance 
in Rorschach diagnosis. 

B Menninger Clinic 14:38 Ja '50. Laura 
Malkenson. * Before this approach can yield 
fruitful results, a better understanding of what 
such signs mean in terms of personality must be 
gained. The larger research which the authors 

are conducting may further such understanding, 
rendering the study they present here more 
meaningful. 

J Ат Med Assn 138:786 N 6 '48. *' The 
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monograph does not include a sample of the 
original Rorschach test records. A report on à 
Rorschach test investigation, without a sample 
of original test records, amounts to asking а 
reviewer to accept interpretations on faith, with- 
out the raw material that was interpreted. The 
authors do promise (page 13) that "detailed 
reports will be made elsewhere," since the pres- 
ent study is part of a larger one. Perhaps when 
these other reports are published, some Ror- 
schach test records will be included. Until then 
the report is incomplete. 

Psychiatric Q 22:770 О '48. The authors... 
report “а pattern of signs significant for the al- 
coholic of every clinical group." * However, one 
non-alcoholic group composed of non-psychotic 
epileptics and non-psychotic organics, had the 
same pattern. These signs, then, would be of 
little value in differentiating between the two 
groups. * The authors have conducted an excel- 
lent study which may well contribute to a better 
understanding of the Rorschach records of alco- 
holics and the results may be of some value in 
differentiating the alcoholic types. This well- 
written book should be of definite interest to all 
Rorschach workers whether or not they agree 
with the findings. 

Psychosom Med 11:303 N-D ’49. Gotthard 
Booth. * interesting * The procedure is based 
on the frequency of 99 “signs” in different clini- 
cal groups. Most of the “signs” are derived from 
the scoring system of Klopfer and Kelley, and 
carefully differentiated and quantified. Some of 
the "signs," however, appear to be arbitrary 
combinations of incongruous contents, e£. 
#77 “Confabulation, contamination" and #93 
“Dead, injured.animals, corpses, snakes, spiders, 
gorillas, monsters, ghosts or frightening fantasy 
figures = 3+.” No attempt is made to consider 
the gestalt character of the inkblots in general, 
and its specific variations in the ten cards. Thanks 
to the very rigorous statistical evaluation of ү 
sign frequencies the questionable signs do no 
appear among those which the authors foun 
significant. * The syndrome shared by the alco- 
holics and the neurological cases is the combina- 
tion of low tension tolerance and high anxiety, 
despite immediate discharge of tensions. This 
character of the alcoholic personality is not t e 
result of secondary alcoholic deterioration, ы, 
it is most outspoken among the younger r 
tients. The authors point out that the presenc? 
of a “primary organic factor” in alcoholism ia 
plains that alcoholism manifests itself first whe 
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the patient has to meet the demands of society 
as an adult, that he is benefited by social sup- 
port rather than by individual psychotherapy. 
Without such external help the alcoholic “feels 
up against a wall....and escapes...in forgetting 
by means of alcohol” (p. 41). Perhaps it would 
be more pertinent to say that the organically 
overemphasized dependent needs of the alco- 
holic can be satisfied on the level of strong group 
relationship. Only if the latter fails does the 
alcoholic experience the need for the chemical 
crutch of alcohol. 

О R Psychiatry & Neurol 3:364—5 Jl’ 48. Isa- 
belle V. Kendig. The questions which this study 
was designed to answer were framed by L. K. 
Frank as follows: “What in the alcoholic leads 
him, or drives him to utilize alcohol instead of 
all the other neurotic, psychotic or psychoso- 
matic patterns of defense or escape or release 
equally available ?" and “How does it happen that 
the individual develops craving for alcohol only 
at adolescence or in adult years, as contrasted 
with the early development of the neurotic and 
psychotic patterns, many of which derive from 
infancy or preschool years?" The answers were 
sought by the authors in Rorschach studies of 
alcoholics and contrasting normal and clinical 
groups. * the dependence of this study on such 
ill-defined concepts as psychopathy, psychoneu- 
rosis, schizophrenia, etc., must be considered 
a limiting factor. After a brief review of three 
recent Rorschach studies of alcoholism, in which 
the findings show an almost perfect negative cor- 
relation ( !), the authors set forth in some detail 
their own statistical procedures. Their scoring 
is based on the principles developed by Klopfer 
and Davidson but they themselves have devised 
an exhaustive check list of diagnostic criteria 
(99 items) in terms of which their data is ex- 
pressed. * Interpreting their data, the authors 
find “а pattern of signs significant for the alco- 
holic of every clinical group." They are dis- 
tinguished from nonalcoholics in general by sig- 
nificantly high anxiety (k plus K plus FK) in 
conjunction with low tension tolerance (low m). 
The only nonalcoholic group showing a similar 
pattern are organic cases (including epileptics) 
without psychosis, which tends to strengthen the 
assumption of an underlying physiologic factor. 
The alcoholic differs from the psychopath in 
Showing better functioning rationality (F per 
cent), greater self criticism (FK), greater sensi- 
tivity (Fc), and better emotional responsive- 
ness (Sum C). The answer to Frank's first 
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question is not explicitly given by the authors 
but presumably is to be found in the intolerance 
of tension (low m) which differentiates the al- 
coholic from the psychoneurotic and from most 
other clinical groups. The onset of his symp- 
tomatology in adolescence is a direct result for 
he is unable to meet the new demands which 
arise at that time involving the choice and pur- 
suit of a career and responsibility for a family. 
With a great deal of ambition or wishful think- 
ing (high W) but lacking directivity (low М), 
the painfulness of his failures in perseverance 
accentuates his difficulties. As already indicated, 
this study is weakened by its uncritical accept- 
ance of our present purely descriptive nomencla- 
ture but the approach is otherwise sound and the 
results fruitful to the extent that they uncover 
the dynamics of well-recognized behavioral pat- 
terns, whatever may be their ultimate status as 
psychiatric entities. 
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[Re Rorschach.] Fon», Manv. The Application of 
the Rorschach Test to Young Children. University 
of Minnesota, Institute of Child Welfare Monograph 
Series, No. 23. Minneapolis, Minn.: University of Min- 
nesota Press, 1946. Pp. xii, 114. $2.00. (London: Ox- 
ford University Press. 16s.) * (РА 21:807) 


Brit J Psychol, Gen Sect 38:42-3 S '47. T. A. 
Although much has been published about the 
Rorschach test as applied to adults and 'adoles- 
cents, the literature regarding its use with young 
children is comparatively scanty, and to this Dr 
Ford's book makes a welcome addition. She 
presents a careful analysis of the responses given 
by 123 children of ages between 3 and 8 years 
with whom she had wisely established previously 
a friendly relationship, so making the test situa- 
tion easier. Another aid to good co-operation 
was probably the high intelligence of the chil- 
dren, whose average І.О. was 124-35—not а 
typical sample of the population, as the author 
recognizes. In administering the test, Dr Ford 
made a deviation from the standard Rorschach 
procedure by restricting the use of each card to 
the upright position, so decreasing markedly the 
normal range of perceptual stimuli—a simplifica- 
tion which she found necessary with very young 
children, but which makes comparison difficult 
between her results and those of other investi- 
gators. To many Rorschach workers it may seem 
a pity that the modern Research Exchange scor- 
ing has been ignored in favour of the less differ- 
entiated original, though to the latter there is 
added a new category described as OL., "Or- 
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ganizational Link," which seems to deserve fur- 
ther investigation as a factor in estimating the 
intellectual development of young subjects. In- 
teresting tables are given of determinants com- 
pared with age, sex, and intelligence quotients, 
and an appendix contains three sample records 
scored by the author. In the study of these 123 
tests there is perhaps too much stress laid upon 
the points value of isolated determinants and too 
little upon the interrelation of components and 
the general personality structure. It is, however, 
clear that Dr Ford combines a scientific approach 
with an excellent contact with small children, 
and it may be hoped that this book will prove 
to be a trial trip before her fuller exploration of 
this field of enquiry. 

Can J Psychol r:214-16 D "47. Frances S. 
Alexander. As a contribution to the scientific in- 
vestigation of the Rorschach test applied to 
young children, this volume merits considera- 
tion. * No one will doubt the usefulness of Dr. 
Ford's study in exploring the use of the Ror- 
schach with young children. The reviewer feels 
that certain considerations must be kept in mind 
in accepting her data. These relate primarily to 
sampling and statistical treatment of results. In 
making comparisons with respect to sex and 
age, the number of cases is often too small to 
give valid age and sex trends. In fact, trends are 
made much of and the impression is given that 
these trends point to valid conclusions even 
though the critical ratios reported are below the 
limit of statistical reliability or omitted entirely. 
In this respect, great confusion exists in the 
discussion of colour determinants (pages 47, 

48, and 54). Sometimes the data are analyzed 
on the basis of the average number of a particu- 
lar determinant in relation to an age group, at 
other times, on the basis of the percentage of the 
determinant in the total R. The results drawn 
by these two methods are sometimes diamet- 
rically opposed (pages 45 and 73). It would 
have made for clarity to have kept the compari- 
sons in terms of proportions since R shows 
variation with age. The author presents a list 
of suggestions for further study which indicates 
that she realizes the limitations of her data. A 
useful glossary of Rorschach terminology is ap- 
pended. 

Fed Probation 13:54 Mr "49. 2. A. Piotrow- 
ski. * Ford studied 123 children, 57 boys and 
66 girls, with an age range from.214 to 8 years. 
This is the largest single group of children ever 
reported. However, the group is not representa- 


tive because its average IQ was 124 with a range 
from 9o to 157. The fathers of more than two. 
thirds of the children were professional people; 
only IO percent of the fathers belong to the 
average socioeconomic group. * An extensive 
and helpful bibliography of 154 items closes the 
monograph. The interpretation of children's 
records is more difficult than that of adults for 
two main reasons: First, the intensive process 
of growth and maturation which takes place in 
childhood complicates the interpretation of Ror- 
schach records. We should have different norms 
for different age groups. It would be easier to 
differentiate pathology from inadequacy due to 
the immaturity of young age. Furthermore an 
adult examiner finds it more difficult to under- 
stand children than adults. Therefore it is more 
difficult for him to translate children's Ror- 
schachs in terms of real psychological experi- 
ences than to explain adults’ Rorschachs in terms 
of attitudes and real feelings. It will take a great 
deal of effort to obtain adequate norms for all 
ages and intellectual levels in childhood. Dr. 
Ford's contribution is an important step toward 
this desirable goal. 

Nerv Child 7:344 Jl "48. This short study, 
which has been almost completely ignored, seems 
to us to have a very sound viewpoint, avoiding 
the mechanization of the Rorschach test which 
is unfortunately so prevalent today and which 
will eventually destroy its usefulness. 

Q R Biol 22:369 D '47. Helen Hewitt Arthur. 
* a careful exploratory study done to evaluate 
the use of the Rorschach test as it is now applied 
to very young children. It is a most provocative 
subject and one which the author...has dealt 
with in a scholarly but never pedantic fashion. 
* She aptly stresses the need for standardizing 
this test for more intelligent and valuable usage 
with the pre-school child. In addition to making 
these original observations, Mary Ford has com- 
pared her results minutely with the conclusions 
reached in the few previous studies of a similar 
nature. Of especial importance to clinical work- 
ers with children are not so much the formal 
conclusions which Mary Ford draws so ably as 
rather the implications of the insight to be 
gained into child thinking through the use of а 
properly standardized projective test. It is to 
be hoped that serious Rorschach students will 
take up some or all of the author's twelve sug- 
gestions for future research in this area. 

Rorschach Res Exch & J Proj Tech 11:53-5 
no I 47. This book promises to be of great 1" 
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terest to Rorschach workers because of the 
paucity of published material in this field. Un- 
fortunately, the promise is not completely borne 
out, for reasons inherent in the author's ap- 
proach. * Ford is chiefly concerned with the 
statistical and non-clinical aspects of the method, 
hence she has stressed especially those studies 
whose results were quantitatively expressed. * 
the scoring method....is an eclectic combination 
of the original Rorschach symbols with certain 
additions suggested by Beck, Hertz and Rickers. 
'The concept of small usual details (d) was not 
utilized, and the author states that she made no 
distinction between animal and human move- 
ment. This, however, is later contradicted by a 
breakdown (in Tables 6 and 11) into M, FM, 
and m responses. One new scoring category, 
that of "organizational links" (OL) or “num- 
ber of logical connections," is introduced. * With 
respect to the general pattern of responses, the 
earlier work of Klopfer and Margulies is sub- 
stantially corroborated. To these authors' three 
patterns of early Rorschach reactions, Dr. Ford 
suggests the addition of a fourth. This is the 
so-called *'perseverated-logic" stage, occurring 
most frequently with 3- and 4-year-olds, in which 
"magic repetition" occurs in response to details 
rather than to the whole card. In the color area, 
the author's findings disagree with Klopfer and 
Margulies, in that she finds that CF exceeded 
other color responses at all levels except at age 
5, where FC predominated. Many factors might 
contribute to this discrepancy, including differ- 
ences in scoring standards, which are difficult to 
apply at the preschool level, or differences in emo- 
tional maturity between the subjects used in the 
two studies. Since this is a crucial area for inter- 
pretation, the need for further careful normative 
research is emphasized. To provide data on relia- 
bility, 55 of the children were retested and corre- 
lation coefficients computed for the major cate- 
gories, Also, the split-half technique was used 
for selected determinants. The test-retest co- 
efficients were all positive, ranging from +.38 
to 4-.86 with a majority above +.70. The split- 
half method gave reliabilities of +.47 to +.84 
on the original test. This method is obviously 
inappropriate in Rorschach research since the 
different potentialities of the cards prevent their 
division into two equal groups. Despite this, it 
is encouraging to note the size of the coefficients. 
* Of major interest to many Rorschach workers 
will be the findings with respect to the validity of 
intelligence and personality diagnosis. The au- 


thor investigated Rorschach’s original contention 
that W, М, and F+% were related posi- 
tively and A% was related negatively to intelli- 
gence, and that О% and C increased with in- 
crease in the intellectual level, but also increased 
as it declined. The highest correlations found 
were between MA and OL (+.684), number 
of items mentioned (+.680), and R(+.634). W 
correlated only to a slight degree (+.108). F+ 
M, M+, and O+% were all positively re- 
lated to MA. Color responses as a group showed 
a slight but positive relationship to MA, and 
A% (in agreement with Beck but in disagree- 
ment with Rorschach) correlated slightly but 
positively (-+.070) with MA. In evaluating these 
findings, it is necessary to consider two factors. 
Most of the determinants vary with both CA 
and MA, a factor which immensely complicates 
the determination of such statistical relationships 
among young children whose mental ability is 
growing very rapidly. Also, it will be remem- 
bered that Rorschach's findings were based 
largely on work with adults and that Beck's study 
was of feeble-minded children ; thus, differences 
from both their results might well be expected 
in a homogeneous group of superior young chil- 
dren. Once again, this reviewer wishes to point 
out that adequate norms are needed at each year 
level before completely valid theories can be es- 
tablished beyond cavil. The section оп the va- 
lidity of the determinants of personality, which 
is a crucial issue in Rorschach work, is very 
disappointing. As criteria, Dr. Ford used scores 
on the Marston Introversion-Extraversion Scale. 
and on four selected items from the Olson Be- - 
havior Rating Scale, the ratings being made by 
nursery school teachers. The M: Sum C and 
R (VIII-X) : К ratios were utilized as a meas- 
ure of introversion-extraversion. In this su- 
perior group of young children, introversion was 
positively related to МА. In average or inferior 
groups, a different result might well occur. Com- 
parison of the M:Sum C ratio with scores 
on the Marston for 30 children showed a con- 
tingency ratio of 60%. Combination with 
R(VIII-X):R increased the ratio to 73%. 
This finding supports Klopfer and Kelley's claim 
as to the value of this ratio and, at the same time, 
indicates the extent to which determinants taken 
out of context lose their value. * The subject 
matter of this book is potentially of great im- 
portance. Very few studies of Rorschach reac- 
tions of young children have been published and 
none other to the reviewer's knowledge in which 
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such a painstaking, statistical analysis has been 
made. In the light of Dr. Ford’s stated aims, she 
has fulfilled her commitments in all except the 
fourth. Here, the reader is disappointed to find 
such a vital problem as that of the validity of the 
determinants of personality dismissed with a 
statement of coefficients of correlation with an 
Introversion-Extraversion Scale and four items 
(!) selected from the Olson Behavior Rating 
Scale. This same coldly non-clinical viewpoint 
pervades the entire volume and dilutes the en- 
thusiasm with which the reader might otherwise 
greet it. If the author's laborious statistical treat- 
ment had only been leavened with some attempt 
to delineate also the qualitative aspects of the 
records and to consider these in relation to more 
full-bodied criteria of adjustment, how much 
more valuable a study this might have been! 
Among its positive contributions, however, are 
the following : discovery that forbidding card 
rotation assists in administration to young chil- 
dren without decreasing the number of re- 
sponses; description of a new stage in the ap- 
pearance of characteristic early reaction patterns 
to the test; verification of the earlier use of 
movement by boys and of color by girls; estab- 
lishment of the reliability of the major deter- 
minants ; last but not least, underlining the ne- 
cessity for normative studies of young children 
of varying intellectual, social and economic 
status, 


For additional reviews, see 3:80, 
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*[Re Rorschach.] Harrower, M. R., AND STEINER, 


M. Erik 
- XX, 353. $8.50, * (Oxford, Brain. 
Blackwell Scientific Publications, Ltd. 63s.) (РА 26: 
2183 

J Am Med Assn 147: 1718 D 2251. * To 
fully understand the group procedures and their 
results it is advisable that the reader have some 
previous knowledge of scoring and interpreta- 
tive principles for the individual Rorschach, 
Since they are not given in the book. This edi- 
tion contains several studies not found in the 
first edition (1944). The most valuable is a sta- 
tistical comparison between the group and indi- 
vidual Rorschach methods as to the locations, 
determinants, and contents "pulled" by each 
card for normal persons, This, together with 
the studies of frequency of rejection and distri- 
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bution of anatomic answers on each of the то 
cards and the list of popular answers, can be a 
great aid even to the clinician interested only in 
the individual Rorschach. The section on in- 
terpretative principles in differential diagnosis 
provides a good review of pathognomonic indi- 
cators, for they are presented in a clearer and 
more condensed form than in the more basic 
texts by Klopfer and Beck. It tends to be dog- 
matic, however, and over-generalized, so that 
one who is unfamiliar with the Rorschach may 
be led to accept them without qualification. This 
is not intended to be the last word on the group 
Rorschach and multiple choice tests, but rather 
it is meant to point the way to future research. 
Clear in its presentation of experimental design 
and the statistical methods used, this book is an 
excellent example of the scientific approach to a 
study of personality. 

J Consult Psychol 15: 513 D '51. Laurance F. 
Shaffer. * 'The main body of the original book is 
unchanged, save for the deletion of numerous 
pages of superfluous graphs, and for a few mi- 
nor editorial corrections. The principal new ma- 
terial is confined to a 53-page added section on 
"recent developments," which does scant justice 
to its topic. Of the 34 “new” studies cited, 26 
are dated 1947 or earlier. Moreover, the sec- 
tion's poor organization gives evidence of hasty 
and careless preparation. A really up-to-date 
annotated bibliography would have been more 
serviceable. There is also a supposedly new sec- 
tion on “card-pull” that turns out to be a rear- 
rangement of some old data on the relative fre- 
quencies of locations, determinants, and content 
categories. The second edition does not give 
nearly as good support as it might to the genu- 
inely useful Group Rorschach. On the other 
hand, it maintains the unduly optimistic atti- 
tude of the first edition toward the Multiple 
Choice Test, in spite of the bulk of adverse evi- 
dence, 


For reviews of the first edition, see 3:82. 
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[Re Rorschach] Kanack, Сор Rut, Voca- 
tional Personalities: An Application of the Rem 
schach Group Method. Columbia University, Teac 
ers College, Contributions to Education, No. et 

arry D. Kitson, faculty sponsor. New York: Bureau 
of Publications, the College, 1946. Pp. xi, 116. $2.10. 
(PA 21: 1929) 


J Appl Psychol 33:612-3 D ^49. Boyd Mc- 
Candless. * The author is to be commended for 
a type of study greatly needed. It seems how- 
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ever, first, that such a study can hardly attain 
highly reliable or valid results as long as socio- 
economic pressures, family predilections, etc., 
determine the chosen career, often quite with- 
out regard to the personal wishes or capabilities 
of the individual involved ; and that, second, a 
Rorschach study is difficult if not impossible 
when scoring categories of the test are consid- 
ered (as is almost necessary) without the bene- 
fit of refinements in statistics of percentages, 
ratios, and sub-relationships. The most ardent 
proponent of the Rorschach would probably not 
regard it as infallible when adapted as a group 
method and scored for means, using isolated 
scoring categories. In the hands of a competent 
technician, used to illuminate other data, it often 
seems to be of great use in vocational counsel- 
ing; more "clinical" validations along the rather 
promising lines shown by the author may result 
from this interesting, carefully done, but prac- 
tically not too useful study, the final conclusion 
of which is necessarily that there are all types of 
persons who enter or study in the fields of ac- 
countancy and pharmacy. 


For additional reviews, see 3:82a. 
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*[Re Rorschach.] Krorreg WALTER С. Sugges- 
tions for the Systematic Analysis of Rorschach 
Records. Los Angeles, Calif.: U.C.L.A. Student's 
Store, University of California, September 1949. Pp. 
16. Paper, mimeographed. $0.35. * 


J Proj Tech 14:79 Mr "50. Winafred Lucas. 
This brief but comprehensive manual begins 
with a delineation of personality areas in which 
the Rorschach is able to contribute hypotheses 
and information. A succinct review of the analy- 
sis of quantitative Rorschach data begins the 
actual discussion of interpretative clues. This is 
followed by a sequence analysis of the ten cards. 
In this analysis both the inter-relationships of 
the cards and their content symbolism are dis- 
cussed in a practical way, valuable in the sketch- 
ing out of norms for the various cards so that 
the new Rorschach worker will have some idea 
what to expect before, instead of after, his first 
hundred records. The manual closes with an 
outline of relevant quantitative and qualitative 
aspects from which hypotheses concerning vari- 
ous areas of the personality may be drawn, as, 
for example, intellectual potentials or the avail- 
ability of inner resources as an aid in adjust- 
ment, These "Suggestions" have been in experi- 
mental use by graduate students for the last 


academic year and have proven very useful in 
closing the gap between the interpretative hy- 
potheses published in the literature and the 
many additional interpretative clues used by 
most experienced workers in actual clinical 
practice. The author would appreciate receiving 
information on any differences in findings or on 
any further clues in content analysis for the 
various cards. A revision of the manual will be 
made taking cognizance of such contributions. 


fe veal Zoi 

[Re Rorschach.] Mastow, PauL. The Analysis 
and Control of Human Experiences: The Indi- 
vidual Seen Through Rorschach, Volumes 1 and 
2. Brooklyn 2, N.Y.: the Author, 16 Court St., 1947. 
Pp. iii, 233; iv, 195. Paper, mimeographed. Out of print. 
* (PA 21:2326) 

Am J Orthopsychiatry 18:180-1 Ja '48. * a 
revision and an expansion of the author’s Ror- 
schach Psychology, which appeared in 1945, 
and constitutes the philosophy rather than the 
clinical knowledge of one not a psychologist, 
who mastered the Rorschach technique auto- 
didactically. The bibliography of 212 items tes- 
tifies to the author’s extensive reading, done in 
a few years. Personal contact with Mr. Maslow 
made it clear that he does not expect anyone to 
learn Rorschach technique from these two vol- 
umes. Yet even the trained Rorschach worker 
will find it hard to digest their contents. The 
author is as radical as he is revolutionary in 
some of his philosophical speculations and con- 
clusions. Basically, he is in opposition to many 
of the tenets of modern psychology, psychiatry, 
and psychoanalysis. He attempts to find his own 
system or formulation. Hence he tries to cover 
the whole realm of human behavior, from both 
micro- and macro-scopic angles. Generalizations 
are then presented as they might appear in a 
Rorschach setting. The books contain no case 
records analyzed and explained in terms of per- 
sonality constellations. Rather, items such as 
reason, dialectics of thinking, cultural frame of 
reference, constriction, anxiety, formation of 
ideas, and many others are discussed in general 
terms and then related to Rorschach language. 
Maslow finds the present systems of Rorschach 
scoring inadequate, He substitutes his own indi- 
vidual scoring, thus adding many over-refined 
combinations to the already comprehensive se- 
ries of Rorschach symbols. For instance, he 
breaks down human movement (M) into EAM 
(active), LAM (living), RAM (restrained) 
and FAM (flexor), Not only'does such an over- 
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elaboration lead to confusion, it serves no prac- 
tical purpose. To this reviewer it appears al- 
most: pathological to attempt to explore every 
crevice and niche of possible human behavior 
with philosophical speculation in which clinical 
reality is not considered. 

Am J Psychiatry 104:510—1 Ja 48. W. Don- 
ald Ross. * Maslow's book....contains neither 
preface nor summary to indicate the audience 
for whom it is intended or the function it is ex- 
pected to fulfill. It contains many ideas which 
might be of interest to Rorschach specialists, 
but these are diluted by ambitious generaliza- 
tions and mixed with discussions on social phi- 
losophy which seem hardly relevant to the start- 
ing point of the book. The Rorschach data which 
are included suggest that the author has a con- 
siderable knowledge of the technique and a pro- 
found understanding of people, but he presents 
these data dogmatically, without evidence, and 
makes constant use of an apparently original 
and almost neologistic scoring system, with no 
explanation of its origins. Many of his socio- 
logical ideas are also profound, and his future 
prophecies are indicative of a most fertile imag- 
ination. However, the connections of these with 
the Rorschach method are extremely tenuous, 
and the route by which they have been reached 
most circuitous. If there is a contribution from 
the Rorschach method to social science, the au- 
thor has obscured it by circumstantiality and 
excessive ambition. There are many typographi- 
cal errors, suggesting a hasty printing, and the 
footnote on the very first page of Volume One 
makes reference to a bibliography supposed to 
be presented in that volume but not appearing 
until the second volume dated in the following 
year. The book was apparently not conceived as 
a whole nor finished with care toward a definite 
purpose. There are two reasons for printing a 
review of it. One is to warn others that its pearls 
can be gleaned only at the expense of much 
wasted reading. The other is to give recognition 
to the brilliance of the author, which requires 
harnessing into more useful channels. There is 
much in the book which could profitably be 
printed if it were rewritten several times to 
eliminate the circumlocution of the author’s 
spontaneity, Some of the chapters could be pub- 
lished, with some revision, as essays on a great 
variety of topics. Better still, collaboration with 
someone of a different temperament might ena- 
ble the author to make a unique contribution to 
the integration of ‘the Rorschach technique with 
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psychopathology. At present the volumes are 
unique, but their contribution is not clear. 

Am J Sociol 53:318 Ja ' 48. Alfred R. Linde- 
smith. The reviewer finds it almost impossible 
to comment adequately upon the ruminations 
included in these two volumes, with their one 
hundred and eight chapters. The ostensible pur- 
pose of the books is to describe the inner work- 
ings of the human personality as revealed by 
the Rorschach technique. The author, however, 
uses the Rorschach test as a springboard from 
which he plunges into discussions not only of 
personality but also of such subjects as war, 
business, profits, science, fascism, and countless 
others, whose relevance to the central purpose 
is not adequately indicated. Comments specifi- 
cally directed at the Rorschach technique are 
vague, and the evidence upon which statements 
are made is lacking. The work itself may best 
be described as a kind of Rorschach test for the 
reader. The readability of the volumes is re- 
duced by the prevalence of typographical errors, 
awkward and incorrect sentences, poor type, 
and by the use of a superfluous system of alpha- 
betical symbols. If these faults were corrected 
and the entire study were condensed and made 
more compact by being limited to matters defi- 
nitely connected with the Rorschach test, some- 
thing of value might emerge. 

Nerv Child 6:475 O '47. A few years ago this 
author offered a philosophical discussion of the 
Rorschach method. He now adds an extensive 
study of the same type which attempts to apply 
Rorschach diagnosis to the total personality and 
to the various concepts most commonly known 
in modern psychology. Since not too many Ror- 
schach signs are applied, the uninitiated can also 
get something out of the intensive study. 

Psychiatric Q 21:701 О ’47. Only small parts 
of these two volumes deal directly with the Ror- 
schach method. Mr. Maslow touches on a Very 
wide variety of topics, great and small, in an 
unexpected order. The topics include various 
mental mechanisms, religion, the OPA, com- 
munism, fascism, psychotherapy, recent socio- 
economic changes in the United States, the 
unity of sciences, an outline of a future “empa- 
thetic” society, and many others. The author 
criticizes psychology for seeing only the bad in 
man, Psychoanalysis “is the single science that 
attempts to fill the breach left by a rejecte 
church in the lives of irreligious people. Were 
it not for this stepchild [psychoanalysis], SC 
ence would have absolutely nothing to offer to 
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the spirit of mankind." Mr. Maslow's work is 
rambling, vague, and almost incomprehensible 
in parts. Its quality level is very uneven ; it lacks 
synthesis and lucidity. Whatever unity there is, 
is due to the author's fervent and exalted appeal 
for “empathy” and for the recognition of the 
good in man. 


For additional reviews, see 3:86., 
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*[Re Rorschach.] Mons, W. Principles and Prac- 
tice of the Rorschach Personality Test. London: 
Faber & Faber Ltd., 1947. Pp. 164. * (PA 22: 4424) 
For latest edition, see 127. 1 


Am J Psychiatry 105:556 Ja "49. Fred V. 
Rockwell. This short book....is based upon his 
lectures on the Rorschach method given in the 
British army during the last war. It was com- 
piled in the field, without benefit of textbooks or 
even the author's own notes. The book, designed 
to serve as an introduction for the study of more 
advanced works, contains brief sections on ad- 
ministration, scoring, interpretation of the sep- 
arate scores, and assessment of personality, in a 
style very similar to Klopfer and Kelley's pres- 
entation, of which it might be termed a synop- 
sis. As such it could serve a useful purpose to 
the reader who is already familiar with the more 
detailed productions of various earlier writers. 
For anyone not acquainted with the Rorschach 
method, this book at best offers a rather one- 
sided orientation to a highly controversial field 
of investigation. 

Austral J Philos 26:132-3 S ’48. I. K. Wa- 
terhouse. * the author claims “freedom from 
dogmatism where the method is concerned" and 
proposes its “reduction to a basis of logic and 
common sense." In the text, however, there is 
little to justify this claim: it is clear from the 
outset that he follows most of the established 
procedures of administration, scoring, and in- 
terpretation and accepts the test as offering a 
valid and reliable assessment of various person- 
ality traits. Although the Rorschach technique 
has been so frequently criticised on these very 
points of reliability and validity, he provides no 
Satisfactory evidence or discussion to support 
the assumptions involved in these procedures. 
Again, the author remarks that when he began 
to use the Rorschach technique he had hoped it 
would be an "empirically standardised instru- 
ment," but that trial and error and consider- 
able reading had revealed disagreement between 
Workers and freedom in their definitions and 


interpretations. But despite the author's claim 
to eschew dogma and rely on logic and common 
sense, none of this disagreement is presented in 
this book. * The opening chapter, on The The- 
ory of the Test, does not give....a critical exam- 
ination of underlying theory, but a superficial 
account of the origin of the test, a sound warn- 
ing that the technique will not automatically 
provide a diagnosis, that much depends on the 
skill and experience of its user, a cursory de- 
scription of the ten standard cards and a short 
statement on the results of retests. * This book 
is an introductory sketch for beginners: the 
author has leant heavily on material already re- 
corded in available literature. * The neophyte 
who wishes to use the test will not be adequately 
equipped by a perusal of this book. He will need 
to take the author's prescription seriously and 
embark on a liberal and critical study of more 
advanced works, and also to receive some initial 
supervision from practised Rorschach workers. 

Brit J Psychol, Gen Sect 39:69 5 '48. Any- 
body who is proposing to use the Rorschach test 
will find this a very good practical manual. No 
attempt is made to present the method as easy 
or short; there is no suggestion that success is 
likely to be achieved by "blind" trial or in the 
absence of a considerable amount of information 
which cannot be contributed by the test itself. 
Rorschach is in fact presented as a useful tool 
for the psychologist or psychiatrist—especially 
the latter—who is well equipped in other ways. 


- The volume is the outcome of a large amount 


of practical experience. It is clearly presented 
and is mercifully concise. Nothing better con- 
cerning this technique has been published in this 
country. 

Brit Med J 4595:184-5 Ja 29 "49. Eliot 
Slater. Though the Rorschach test has become 
firmly established and is widely used in Britain, 
no really elementary textbook has been pub- 
lished until now, and the test remains largely 
in the hands of the expert. One unfortunate 
result of this is that its potentialities are not 
always realized as fully as they might be. The 
tester is seldom a clinician and so does not se- 
lect his subjects, and the clinician does not 
know how to select those patients for whom 
the test can most appropriately be used. Both 
classes of reader will welcome this book. Dr. 
Mons gives an account of the theoretical basis 
and of the actual technique of the test, and pro- 
vides the clinician with information about the 
indications for its use and about its limitations. 
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He emphasizes the fact that this is an introduc- 
tion only; he is writing for beginners, and does 
not take for granted in his readers any previous 
acquaintance with the test. He has evolved hfs 
own technique of interpretation, and this he 
describes minutely and clearly. The reader will 
have little difficulty in following the instruc- 
tions, and will find that most of his difficulties 
and questions have been answered. Although 
for so small a book it is remarkably compre- 
hensive, it is by no means an exhaustive de- 
scription of all the theories and techniques 
which have grown up round the Rorschach test, 
especially in recent years. It is rather an ac- 
count of the views and practice of one exponent, 
evolved by him from an extensive acquaintance 
with the literature and wide personal experi- 
ence. It is to be recommended as a most read- 
able introduction to the subject. 

J Consult Psychol 13:309 Ag '49. Laurance 
F. Shaffer. * 'The author makes a few original 
contributions to scoring, including Wv for in- 
ferior “cheap” wholes, and Mm for “expressive 
poses.” * While it offers little that is novel, the 
book is a clear and simple text, quite adequate 
for an introductory course. 

J Nerv & Mental Dis 109:281 Mr 49. (Zyg- 
munt A.) Piotrowski. * The presentation of the 
results which can be expected from the use of 
the test is reserved and testifies better than any 
other part of the book to the wide and sound ex- 
perience of the author. Dr. Mons has given us a 
lucid, lively and very informative introductory 
text for beginners. The book is very readable 
and can be highly recommended. 

Lancet 254:600 Ap 17 '48. * Mons's account 
is thoroughly intelligible: it does not conceal 
the difficulties of using the test, nor introduce 
unnecessary complications. On the other hand, 
it does not offer convincing evidence on the 
test's reliability or validity. This is not a criti- 
cism of Dr. Mons, however, since the extremely 
wide scope allowed for personal interpretations 
of results makes it impossible to produce gen- 
eral evidence of that kind. Some will feel that 
the use of the Rorschach test is as far removed 
from scientific psychology as palmistry or graph- 
ology; but this implies no disrespect, for the 
practitioner who has made an extensive study 
of human hands can learn a great deal about a 
man's character from the pair presented to him, 
especially if he is skilful in the conversation he 
makes in the meantime. The Rorschach expert 
conducts his test in the same way, using all the 
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additional information he can assemble to help 
him in interpreting the result. Can we then, 
safely attribute to the test the successes achieved 
by the skilled practitioner, or blame it for the 
failures and indiscretions of the unskilled? Is it 
not, perhaps, a measure of the examiner rather 
than the examined? 

Mental Health 8:81 F '49. T. A. In common 
with other pioneer Rorschach workers in this 
country, Dr. Mons learnt the hard way, teach- 
ing himself by reading, experience, and compar- 
ison of notes with colleagues. With his own past 
tribulations in mind he sets out to smooth the 
path of beginners in this complex method of 
personality assessment, and does so with much 
skill. * To produce a simple handbook on so 
elaborate a diagnostic instrument is a formi- 
dable task. Dr. Mons' courage is especially 
worthy of honour because, as noted in the 
preface, his book was compiled during the war, 
on active foreign service, having "neither text- 
books at hand nor access to my notes stored in 
England." It is an amazing feat to have pro- 
duced so good a book under such conditions, 
with only a rare slip for lack of references. Dr. 
Mons modestly describes his work as an intro- 
duction to more advanced studies, but in addi- 
tion to describing the mechanism of the test, he 
puts forward a number of original views and 
interesting suggestions upon the basis of a very 
wide range of cases. Besides adult work, Dr. 
Mons has given the Rorschach test to over a 


‘thousand children of ages between 4 and 16 


years, an experience probably unique in this 
country. It is to be hoped that he will later find 
time to publish his norms for various ages. This 
book is admirably practical in plan and content, 
with excellent advice upon methods of record- 
ing. A chapter entitled “The Practice of the 
Test" supplies a long-felt want in setting the 
student on the right road, while later chapters 
on scoring and assessment are clearly set ОШ 
and exemplified. The significance of separate 
signs in the various categories are lucidly de- 
scribed, with the use of familiar modes of rea- 
soning and examples from everyday life to illus- 
trate points which are further demonstrated by 
case records. It may be that too great stress 18 
laid upon isolated signs as diagnostic factors Ш- 
stead of upon the balance of all factors in the 
total record; it may be that some of Dr. Mons 
opinions will be challenged; but any pent 
should be far outweighed by appreciation, wit 

respectful congratulations to the author on prO" 


233 | 


ducing the first British textbook upon the Ror- 
schach method. 

Mind 57:389 J1*48. В. A. Farrell. * a helpful 
manual for the beginner * The tone....is cautious 
and undogmatic. But because the validity of the 
Test is not discussed, the significance assigned 
to some of the responses may strike the begin- 
ner and the outsider as rather arbitrary, and 
much of the procedure for assessing the person- 
ality as mere rules of thumb. In spite of all the 
careful help it provides, the book serves to rein- 
force the point that training and skill are re- 
quired for the adequate use of the Rorschach 
Test, and that scoring and assessment are partly 
a matter of personal interpretation by the tester. 
'The book ends with three examples which are 
scored and interpreted. These are useful in view 
of the difficulties of scoring and the vagueness 
of the terms used in interpretation. More exam- 
ples could have been included with profit to 
help the beginner through his apprenticeship. 

Nature 162:87 Jl 17 48. John Cohen. * It is 
refreshing....to read Dr. Mons’ clear and re- 
strained account of the techniques and interpre- 
tation of the Rorschach “‘ink-blot” test, an ac- 
count based on extensive experience with Serv- 
ice and civilian patients and on the examination 
of a thousand clerical records. With the difficul- 
ties of the novice in mind, Dr. Mons painstak- 
ingly explains how to administer the test, how 
to classify the responses according to a widely 
used, though somewhat pretentious, notation, 
and how to evaluate the significance of the re- 
sponses so as to yield an assessment of the per- 
sonality, particularly its "structural" features. 
Several new formulae useful for diagnostic pur- 
poses are suggested. It is to be hoped that this 
useful manual will stimulate workers to pursue 
much more rigorously than hitherto methods of 
validating the test which could greatly improve 
its potential value as a diagnostic tool. 

Psychiatric Q 23:405-6 Ар ’49. * clearly- 
written * Its main value seems to lie in its sim- 
plicity and concise form: It expresses no origi- 
nal views or thought and makes no significant 
additions to the more comprehensive texts of 
Klopfer and Kelley, Beck, Bochner and Hal- 
pern and others. * He practically achieves his 
stated aim of sticking to reason and logic in his 
explanations, and remaining free from the dog- 
matic setting forth of principles. And therefore 
we find, for example, that the section on scoring 
is not a rigid system, but simply a necessary con- 
venience clearly outlined as an anchoring point 
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for understanding of the total personality pic- 
ture. Dr. Mons does slight in his book an impor- 
tant aspect of the test, the inquiry period, which 
has been highly developed and perfected as a 
sensitive adjunct to the examination proper for 
bringing out meanings and undercurrents of the 
subject's original responses ; he also barely men- 
tions the testing-the-limits period in which the 
clinician can determine whether the subject will 
accept popular concepts or possibly be able to 
utilize color or shading determinants that had 
been neglected in the original protocol, or will 
be able to project movement to the blots. He 
does make the “helpful” suggestion of preserv- 
ing the cards by wrapping them in a cellophane 
covering, a procedure which has been looked 
upon with horror by experts here, (It adds a 
shiny quality to the cards which may be dis- 
tracting or may bring out new determinants in 
the record.) Another small point on which Dr. 
Mons' teachings differ from the findings of the 
accepted Rorschach authorities is his inclusion 
of the statement, “You may turn the cards any 
way you like" in his original instructions to the 
test subject. This is generally held to be unnec- 
essarily suggestive and it has been found better 
to say nothing about card-turning at the begin- 
ning but to be encouraging when the subject 
questions on this matter. The book is possibly 
adequate for one who is interested in the very 
barest principles of the Rorschach test that can 
be digested in short order. For those who will 
spend more time and effort, or who plan to go 
further into the subject, other published texts 
would be more satisfactory and give a more sub- 
stantial understanding. 

Psychosom Med 12:66-7 Ja-F ’50. Peter Н. 
Knapp. “A fool uttereth all his mind.” This 
quotation appears on the cover of a new manual 
by W. Mons on the Rorschach. One wonders 
about the choice of quotation. Does it just refer 
to the ink-blot method? Is there a conscious in- 
tent to be modest, which, in many ways, the 
writer is? Or is there a still deeper, unwitting 
confession? For there are vulnerable points in 
this small work of 164 pages. The author intro- 
duces it as a manual for beginners on the prac- 
tice of the Rorschach technique. As such it has 
useful features, It gives a clear description of 
the test, a diagrammatic representation. of the 
cards to facilitate location of responses and an 
adequate statement of the major scoring cate- 
gories. There are convenient summaries of work 
done by some leaders in the field—Hlertz's in- 


dicators of hysteria, Piotrowski's “organic 
signs,” and, above all, many of the statements 
of the Klopfer school. Sometimes the tenets of 
the Rorschach Research Exchange group are 
presented to the exclusion of other viewpoints ; 
some names of importance figure only fleetingly, 
such as Binder, Oberholzer, Beck, indeed Ror- 
schach himself; and considered as a practical 
guide the book has dubious aspects. For in- 
stance, the paragraph of instructions to be given 
to a test subject contains two outspoken sugges- 
tions, one that the cards are made of “Indian 
ink,” two, that “you may turn the card as much 
as you like.” One might ask if the work is more 
valuable than other instructional aids, such as 
that of Beck or that of Klopfer, from which it 
stems in such great part. Still, as a handbook, it 
has the virtue of compactness. However, it goes 
further. “We have to establish fundamental 
principles which can be checked by any experi- 
menter, and lay a foundation of common sense 
upon which to build up theory and practice.” 
This laudable statement on page 15 carries with 
it the implication that the author will attempt 
to clarify such basic principles of the method. 
At times he tries conscientiously to do this. 
Some of his thoughts on the significance of chi- 
aroscuro response are cautious and stimulating. 
At other times, for instance in the discussion of 
black and white used as color, C’, (page 75), 
the text seems to be merely a restatement of 
what others have said—which is both glib and 
arbitrary. Dogmatism dominates the interpre- 
tation of responses embodying “vista” (page 
68), and responses to the whole card (page 57). 
At times the thinking appears perplexed, as in 
the discussion of responses to the white areas 
(pages 59-60). Similarly his categorial accept- 
ance of FC responses as a sign of manual dex- 
terity" seems slipshod. The style is often loose, 
For example, on page 44 he refers to anatomy, 
blood, fire, and clouds as "pathognomic re- 
sponses.” “Pathognomic” of what? He does not 
say. Certainly little confidence is inspired in his 
deep understanding of clinical phenomena by 
statements such as the one on page 86: “The 
Schizoid response expresses a schism in the 
personality which enables it to live at one and 
the same time in reality and in fiction." On page 
96 we find a remark about rare detail responses ; 
they "arouse the suspicion that the person is try- 
ing to find an escape from the real problems of 
life by courting petty interests." Such turgid 
phraseology might be forgiven, but some state- 
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ments are downright wrong. Examples are his 
definition of “Confabulation” on page 46, ang 
his statement on page 14 that Rorschach got 
color responses “mainly from elated manics,” 
whereas the “movement response, such asa *rün- 
ning man,’ occurred mainly in the depressives” 
The Psychodiagnostik, and subsequent studies 
report as a uniform and significant feature in 
depressions the constriction of both color and 
movement responses. Three sample test proto- 
cols close the book. Many mathematical ratios 
are worked out. The responses themselves are 
not accurately delineated, far less the personali- 
ties of the subjects. The Rorschach method at- 
tempts to derive information about a person 
from his apperception of a complex group of 
gestalten. Its use depends upon careful observa- 
tion of the process of apperception in any given 
subject, and cautious testing of his production 
according to certain working hypotheses. There 
is, indeed, a need for “fundamental principles.” 
These will come only from careful comparison 
of Rorschach data with clinical data. To spect- 
late, a guess is that the clinical data will have to 
reach deep back into personal development, pos- 
sibly to preverbal levels. My own preference for 
investigation is along the lines of clinicians who 
have attempted such comparisons, Rors 
himself, Binder, and Oberholzer. However, one 
cannot criticize attempts at statistical validation 
on large groups of “types,” if the attempts are 
honest. Although this work stems from a “cok 
lection of over a thousand selected records” the 
statistical effort is not even made. What is left 
often seems naive, facile, or oracular. The authot 
states that he is compiling the book in the field, 
in response to requests that he publish some it- 
structions for beginners. Yet it is hard to s 
how his work serves much purpose as a ena 
this difficult field. With this manual as а Wi 
beginners might be poorly armed. They shouk 
be told not only that “A fool uttereth all his 
mind" but that “A little learning is a dangerols 
thing"—to avoid the risk of rushing in where 
angels fear to tread. 

Rorsch Res Exch & J Proj Tech 12: 258 b - 
4 48. Edward M. L. Burchard. This little a 
by a British psychiatrist is meant to serve 25? 
introduction to the more advanced works a 
Klopfer and Kelley, Beck, Binder, and P 
schach. The author further states in his рге F, | 
that he compiled it during the war when he wi 
neither notes nor textbooks available. This к 
cumstance perhaps explains the very T€ 
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documentation and one reference to this Journal 
as the Rorschach Exchange Research (p. 15). 
However, this reviewer is also puzzled by such 
statements as: "the movement response, such 
as ‘a running man,’ occurred mainly in depres- 
sives" (p. 14) ; responses given during testing 
for limits should be added in brackets as addi- 
tional responses; and “Responses which can- 
not be fitted with confidence into any of the 
score groups outside F, should be scored, either 
F, F+, or F—. The end results will not be ma- 
terially affected.” (р. 43—reviewer's italics.) 
Similar seeming confusions or important di- 
vergences from usual procedure occur fre- 
quently without any documentation to back them 
up. The book is logically organized as a manual 
—discussing in turn—administration, inquiry, 
recording, scoring, tabulation, significance of 
separate scoring categories, and interpretation 
or assessment of the total personality. A ten- 
page section then sets forth "some useful elab- 
orations of the method." Here are included re- 
finements in scoring from diverse sources— 
Beck’s “Z” and “Po,” the “Mm” score for ex- 
pressive poses, Earl's *F/C" and “C/F,” “Нах” 
and “Аах” for Rorschach's oligophrenic de- 
tails, etc. The book closes with a nine-page dis- 
cussion of Rorschach diagnosis of the normal 
personality, neurosis, psychosis, mental defect, 
epilepsy, enuresis, and juvenile delinquency; 
followed by a detailed analysis of three cases. 
This last is perhaps the most valuable portion of 
the book, since here the author does illustrate 
rather fully the manner in which he scores and 
interprets. The book is handsomely printed in 
two colors on good paper ; the author has an ex- 
cellent reputation in England ; and it is thus with 
some regret that this reviewer must state that, 
in his opinion, the book serves little useful pur- 
pose. The dilettante should not be encouraged 
to use the Rorschach method at all, and the 
serious student has little need for such a primer. 
Instead, he should be advised to begin his stud- 
ies with the more complete and better-docu- 
mented manuals of Klopfer and Kelley, Rapa- 
port, or Beck. As obiter dicta the reviewer 
cannot resist stating his disapproval of the in- 
clusion of the motto, “Thus are the secrets of 
his heart made manifest," on the dust jacket of 
what purports to be a serious, scientific book ; 
and his delight in Mons’ perhaps unconscious 
humor in warning of the dangers of blind diag- 
nosis on page 17: “Neglect of this precaution 
can otherwise lead to socially embarrassing situ- 
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ations, as in the case of a young Rorschach 
worker who practiced “blind” diagnosis and 
wrote “Schizophrenic with Homicidal tenden- 
cies” underneath the anonymous record of his 
Superintendent; this method is so informative 
that one should never test one’s friends or col- 
leagues” (reviewer's italics). 

Sociol & Social Res 33:241-2 Ja-F '49. Em- 
ory S. Bogardus. * succinct and incisive trea- 
tise * Mons makes his main contribution in his 
organization and interpretation of the symbols 
and in pointing out the structuring of personal- 
ity that certain reactions to the test seem to in- 
dicate, The author has strengthened the role of 
Rorschach as one method of approach to the 
study of certain types of personality. 


[127] # 
X*[Re Rorschach.] Mons, W. Principles and Prac- 
tice of the Rorschach Personality Test, Second 
Edition. London: Faber & Faber Ltd., 1950. Pp. 176. 
Ыы; б ОРИОНА Pa.: J. B. Lippincott Co., 1951. 
4.00. 


J Consult Psychol 15:447 O '51. Laurance F. 
Shaffer. The second edition of this compact and 
clear British text differs little from the first 
edition..except that civilian cases have been 
substituted for the military examples. 

Psychiatric Q 25:704 О '51. * differs only in 
minor points from the first edition. It is still in- 
tended for the beginner, presenting the Ror- 
schach method in a clear and simplified man- 
ner. Possible criticism may be directed to the 
discussion of determinants, in which oversim- 
plification suggests a possible injustice to the 
technique. In general, however, it is a compact 
little volume and may be recommended to those 
beginning the study of this test. 

Psychol Service Center J 2:145-6 Je ^50. 
Mary Morrow. The author's purpose....was to 
provide a much-needed primer for beginners in 
the use of the Rorschach technique, as well as 
a guide to better understanding of the test for 
people in allied fields or interested laymen. The 
book, itself, is a commentary on the fact that 
this is a difficult feat to perform. Although writ- 
ten in a clear and readable style, with the use of 
interesting clinical material, it falls short of its 
ultimate intentions. The reasons are not readily 
determined, for what seems a lack to one indi- 
vidual may not seem so at all to another who 
reads a book for a different purpose. As a text 
for neophyte Rorschachers, there are several as- 
pects of the book which would seem to encour- 
age poor methodological habits. For example, 
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the author does not stress the inquiry part of the 
test, which is generally regarded as a delicate 
and exacting operation, stating only that in- 
quiry is unnecessary except in the case of re- 
sponses which confuse the administrator. In 
helping others learn the Rorschach test, one of 
the major difficulties encountered is the inade- 
quacy of inquiry made by the beginner. With 
them it is necessary to emphasize that the ap- 
parent scoring of a response often does not in- 
clude the determinants actually utilized by the 
subject. The author also implies that the test 
can be readily self-taught and that the scoring of 
responses comes largely from one’s own ex- 
perience with many and varied kinds of records. 
Although one cannot gainsay the value of ex- 
perience, this thinking tends to place the ad- 
ministrator on an island of self-decision, whereas 
he should be encouraged from the beginning to 
attempt to objectify his use of the test by care- 
ful reference to the research done for this very 
purpose. This tendency is clearly reflected in the 
author’s own writing in which few references 
are cited for the many dogmatic statements. One 
is never certain whether these conclusions are 
gleaned from the author’s experience or the sum- 
marizing of other investigators. For example, 
in determining intellectual level from the Ror- 
schach, the author states, “The I.Q. in records 
without gross abnormality can be worked out 
to within 576 of the Stanford-Binet Test as ap- 
plied by Terman and Merrill." This method is 
not concretely defined, nor are references cited 
from which one might gain a more satisfactory 
explanation. No student should be asked to ac- 
cept such statements as fact without sufficient 
foundation. He should be encouraged to ask, 
“Why is this so? What is the basis for this con- 
clusion ?" And, his questions should be answered 
as fully as possible. As a text for the student, it 
does not seem to be sufficiently objective in its 
approach to the Rorschach test. For the allied 
worker or the interested layman, its exposition 
is too ambiguous and obscure, and therefore it 
is likely to increase his confusion. Therefore, 
although this book provides interesting reading 
and a great deal of informative material, in a 
field where much is needed, it does not attain 
its stated goal. 


For reviews of the first edition, see 126. 


[128] 
*[Re Rorschach.] Urrzrr, Grorce. Rorschach In- 
troductory Manual: A Primer for the Clinical 
Psychiatric Worker: With Interpretative Dia- 
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gram to Permit Clinical Use While Learni; 
Ink-Blot Technique. St. Louis, Mo.: ае 
Publishers, Inc., 1950. Pp. 48. $3.00. * (PA 24: 5262) 


Ат J Psychiatry 108:158 Ag ?5т. W. Donald 
Ross. This is a most ingenious “short cut” to 
the Rorschach method for clinical psychiatrists, 
It describes in a very simple manner how any 
psychiatrist can proceed to use the test on his 
own patients. It provides valuable tables and a 
quite inspired scoring template for dealing with 
the test performance. It gives enough of inter- 
pretative and diagnostic guidance to stimulate 
interest in using the method to develop one’s 
own interpretative skill with practice and the 

*help of the suggested further reading. It is 
oversimplified. It cuts many corners. It presents 
many valuable compromises, however, between 
the various scoring systems which are extant, 
It is too glib on the subject of clinical diagnosis, 
although this aspect is mitigated somewhat by 
the explanatory text. There is a danger that it 
may be used by psychiatrists as if they were 
in possession of the sensitive instrument pro- 
vided by the more complicated use of the test 
materials in the hands of a well-trained and 
experienced worker. Such psychiatrists might 
blame the failures on the technique rather than 
on their own rough use of it. It is to be hoped, 
however, that it may become widely used as an 
initiation into Rorschach possibilities, to be fol- 
lowed up by more intensive study and super- 
vision with the method, or by a more judicious 
use of referrals to clinical psychologists for Ror- 
Schach examinations than the psychiatrists pre 
viously could have made. * This reviewer is in 
agreement with Karl Menninger, who is quoted 
in the foreword as follows: “This is the most 
practical manual to serve as a primer to the 
psychiatrist that I have seen." Why Dr. Men- 
ninger's name is enclosed in quotes, while his 
statement has quotation marks only at the be- 
ginning, is something that remains for the at- 
thor to explain ! 1 

Am J Psychother 5:475-6 Il "51. Lily Н. 
Gondor. * There is no doubt that the author has 
used his excellent capacity for organization an 
his ingenuity to simplify the learning process ^ 
the test. It is as though the whole Rorschac 
method were served on a tray, so to speak, n 
palatable portions, ready for use—and this 15 
exactly where the danger of this manual lies. 
The author himself emphasizes that he is aWare 
of the incompleteness and the almost naive ove™ 
simplification of his presentation of the Ror- 


237 ] 


TESTS & REVIEWS: CHARACTER—PROJECTIVE 


128 


schach. * the oversimplification of the method in 
this manual, the chart and the checking-device 
are indeed a great temptation for any student 
of the Rorschach who wants to take a shortcut 
rather than devote the time necessary for the 
intensive study which proficiency in this tech- 
nique requires. There is no doubt that there is 
a lack of psychologically trained personnel, in 
comparison to the number of psychiatric pa- 
tients. These patients and their doctors, how- 
ever, are poorly served when the psychological 
report consists mainly of slogans and technical 
phrases. Unfortunately such reports are not 
rare. One suspects that the authors often do not 
know themselves what they are writing about 
and only use the terminology of Rorschach text- 
books without deeper understanding of the 
personality they have tested. This manual was 
originally devoted as a teaching device. As such 
it may render a good service as a primer for 
beginners who are learning the Rorschach tech- 
nique under the supervision of an expert in 
projective tests. 

Brit J Med Psychol 24:148-9 pt 2 "51. T. 
Alcock. It is difficult to reconcile Dr. Ulett’s ref- 
erence to “the long training required for mas- 
tery” of the Rorschach test with his claim that 
his monograph is designed to “bring this valu- 
able diagnostic test within the reach of many 
who are not otherwise familiar with (psy- 
chological) testing’....“without the otherwise 
wasted time of learning technique.” (Italics are 
the reviewer’s.) This beautifully printed short 
book aims at providing for the psychiatrist, un- 
learned in test procedure, a more than royal road 
to learning-as-you-go. The novice is advised 
that the author’s technique “permits almost im- 
mediate interpretation of personality trends and 
suggests clinical diagnoses.” While administer- 
ing the test he is recommended to score as he 
receives each response, searching for the appro- 
priate symbol in a table of scoring kept open 
before him ; this in addition to the already ardu- 
ous task of recording responses verbatim, with 
notes of observed behaviour and of timing in 
terms of seconds. To “facilitate the conversion 
of the subject’s scored responses into terms of 
a personality diagnosis” there is devised a visual 
aid in the form of a multicoloured circle whose 
segments are severally labelled imagination, 
deep inner feelings, emotions, associations, in- 
tellectual control, orderliness and apperception. 
Factors towards the centre of the circle are said 
to represent normality, decreasing towards the 


periphery. In this circle are punched holes into 
which scores are entered and the personality is 
then read off—(It is tempting to say “clocked 
out")—Xwith the aid of interpretive descriptions 
around each hole and tables of diagnostic points. 
It is comforting to read in the chapter on Clini- 
cal Diagnosis that “further verification of such 
Rorschach diagnoses on clinical material is 
necessary,” and it is also noted that “it will be 
obvious to a trained Rorschach worker that the 
interpretations suggested will not be complete.” 
It is to be feared that they might also be mis- 
leading. Since this work is specifically directed 
to its use by psychiatrists, it is perhaps taken 
for granted that the human relationship of 
tester and testee will not be disregarded, but 
there is a striking omission of any considera- 
tion of the test from the patient’s point of view, 
or of the means by which an optimal situation 
may be brought about and a maximum amount 
of valid material obtained. The theory of per- 
sonality as a dynamic process of interacting 
forces, which underlies the Rorschach method, 
is brushed aside. Of the four stages of the test, 
the introduction and inquiry are only lightly 
touched upon, while the valuable stage of limit 
testing is disregarded. Of his simplified Ror- — 
schach Dr. Ulett writes clearly and with con- 
viction, but as the famous couturier said 
“Simplicity is always expensive.” And over- 
simplicity in diagnostic methods may prove dan- 
gerously expensive to the patient. 

J Nerv & Mental Dis 112:177 Ag’50. * The 
text is arranged in a way to be understood by 
any psychiatrist and will orient him in the use 
of this valuable diagnostic tool. * the author has 
rendered a valuable service to psychiatry by his 
clear presentation of the fundamentals of the 
test, of the technical procedures and of a simpli- 
fied scheme of interpretation. This manual may 
be recommended with the satisfaction that the 
reader will be well repaid for his efforts. * 

J Proj Tech 15:279-80 Je ?5$1. L. Joseph 
Stone. This is a highly condensed manual of 
Rorschach procedure—rather eclectic, although 
its scoring principles are closest to the Klopfer 
system. The first half summarizes administra- 
tion (without testing the limits) and scoring 
(with estimation of form level according to the 
F+ and F— method) ; the latter half is a fairly 
complete but extremely abbreviated statement 
of essential interpretive principles. The core of 
the contribution is a circular chart or diagram 
placing before the user all the scoring categories 
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with their major interpretive significance indi- 
cated in small type. * There is obvious value— 
particularly in the process of Rorschach learn- 
ing—in any device which holds the multifari- 
ous Rorschach variables before one, which per- 
mits crosscomparison between them, and which 
serves constantly to suggest interpretive mean- 
ings and interrelations: this is the value claimed 
by the author. It is equally obvious that any 
kind of interpretive code sheet invites misuse by 
the unskilled who are also unscrupulous. Pre- 
cisely because the device is handy it is subject 
to this misuse. However, it is doubtful whether 
the author—who disclaims all uses save that of 
a more or less temporary crutch in the process 
of acquiring Rorschach skill—should be held 
responsible for possible misapplications. His 
manual is intended for the psychiatrist and the 
clinical psychologist in training and assumes 
some sophistication regarding personality or- 
ganization and its disorganizations. As far as 
the structure of the chart is concerned, many 
Rorschach workers will wonder why the F seg- 
ment of the chart does not lie between the seg- 
ments pertaining to diffusion and to texture in- 
stead of falling quite apart from these. This 
would appear to be a more logical placement 
and would also make it easier for the student to 
go from Ulett’s circular chart to the Klopfer- 
Davidson profile. As a handy condensation of the 
gist of much Rorschach material, there is much 
to recommend this manual. It is useful for the 
worker already reasonably sophisticated in clini- 
cal understanding who has not yet passed from 
the initial Rorschach stage of understanding 
scoring principles to the stage of full mastery 
of many variables and their interpretive values. 
It is not—and does not appear to be intended as 
—a substitute for adequate intensive training, 
There is always the danger, however, that such 
devices will serve as “dream books” and the 
profession should be on guard against such mis- 
use. Unfortunately, the ingenious dial-diagram 
itself, the essential contribution, is marred by 
some misprints. 

Psychiatric Q 24:617-8 Jl’50. The main pur- 
pose of this manual is to enable psychiatrists to 
use the Rorschach method as a rapid diagnostic 
tool. The essential parts of the procedure—ad- 
ministration, scoring and interpretation—are 
presented in concise and oversimplified form. 
A neat and colorful circular diagram serves as 
a visual aid for personality interpretation and 
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for rapid recognition of "abnormal" areas, Ma. 
jor Rorschach “signs” found in different clini- 
cal entities are combined in a well-organized and 
comprehensive table, and there is a second table 
which includes short definitions of scoring sym. 
bols as well as the popular and other frequently- 
encountered responses. In his introduction Dr, 
Ulett states: “It is our purpose to present an 
aid to learning the Rorschach test that will fa- 
cilitate its more rapid mastery.” If by “mastery” 
the author means learning basic Rorschach sym- 
bols and their individual meaning, then he has 
succeeded remarkably well in his purpose. How- 
ever, it is doubtful if persons unfamiliar with 
the Rorschach will be able to interpret a record 
adequately, or come to a diagnostic decision, 
on the basis of the information contained in 
this manual alone. One can easily visualize, for 
example, a person unfamiliar with the technique 
trying to fit his results into one of the constella- 
tions of “signs” and finding to his disappoint- 
ment that his record does not correspond to 
any one of the clinical subgroups. After a num- 
ber of such experiences, he may disregard the 
method as unsuitable for diagnostic purposes. 
And this is really not the case. The basic error 
lies in using statistically discriminative points, 
which are indicative of group trends only, for 
individual diagnosis. A small number of cases, 
usually those presenting extreme conditions, 
may be diagnosed by signs, but there are many 
others where only one or two—or even none— 
of the classical signs can be found. The author 
feels that the manual “allows some clinical ap- 
plication during the otherwise wasted period of 
learning technique.” Perhaps such a learning 
period, enabling a real understanding of the 
basic personality structure revealed by the Ror- 
schach, would not be so “wasted” in the long 
run, Recalling that the originator of this test 
was a psychiatrist himself it seems only “ap 
propriate that as many psychiatrists as poss! i 
should use the Rorschach. However, to the 
present day no magic short-cut for interpreting 
records has been found. This book cannot D 
place standard works on the Rorschach tec 
nique for anyone interested in interpreting ? 
record, but will serve as an excellent шой, 
tion to the basic principles involved. It will n 
prove useful as an adjunct in courses W 2 
this method is taught, or if read in coniun ; 
with some of the standard source books on 
Rorschach. 
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X*Rosenzweig Picture-Frustration Study. Ages 
4-13, 14 and over; 1944-48; also called Rosenzweig 
P-F Study; 2 levels; 1 torm, '48; $4 per 25 examina- 
tion blanks ; $1 per 25 record blanks; specimen set not 
available; postage extra; Saul Rosenzweig; the Au- 
thor, 8029 Washington St., St. Louis 14, Mo. * 

а) FORM FOR CHILDREN. Ages 4-13; 1948; manual 
(448) ; $1 ‚рег manual (reprinted from J Psychol 26: 
141-91 Jl '48; see 21 below) ; nontimed (20) minutes. 
b) REVISED FORM FOR ADULTS. Ages 14 and over; 1944— 
48; manual ('47); $1 per manual (reprinted from 
J Psychol 24: 165-208 О 47; see 15 below) ; nontimed 
(15) minutes. 
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Both forms of the P-F Study consist of 24 
crudely drawn cartoon-like pictures involving 
two central characters. One of the characters 
is placed in a mildly frustrating situation while 
the other is shown saying something which 
frustrates the first or helps to define the frustrat- 
ing situation, The adult sub ject is asked to write 
in the blank caption box what the person would 
answer, “the very first reply that comes into 
your mind.” Children are given a simpler formu- 
lation of the instructions. Two thirds of the 
pictures involve a directly frustrating obstacle; 
in the remainder one of the characters is accused 
of a minor offense. 

Responses are scored for direction of aggres- 
sion: ie, extrapunitiveness—aggression ex 
pressed against the environment ; intropunitive- 
ness—aggression turned inward on the subject 
himself; and impunitiveness—“aggression is 
evaded in an attempt to gloss over the тана 
tion." Responses are also scored for type 0 
reaction : i.e., obstacle dominance, йагы, 
and need-persistence, їп accordance with ДЕ 
response being concerned with the barrier, the 
subject's ego, or with a solution for the problem. 
Instructions are also given for detecting D. 
"trends" in the subject's responses: i.e., CR 
ing from one type or direction of response 2 
another in the course of the test. A group ps 
formity rating (GCR) may also be obtains М 
comparing the subject's scores with typical г 
actions to 12 of the items. ol 

‚ The Revised Form for Adults is standardize 
on 236 males and 224 females in the age Ves р 
20-30. Although this group included а bil 
tively small number of skilled and unski ^ 
laborers, the vast majority were middle class W 
a fairly high educational level (a mean че, 
of schooling of 13.5 years). It is not C b 
whether the norms based on this group ue 
be used for any adult since this point is n 
mentioned in the manual; in fact, the nor 


241 ] 


were bound into the already printed manual and 
follow directly after a normative table labeled 
“Revised Figures.” In another place (54) the 
author comments that the norms are only slightly 
different from the original tentative norms, and 
Bernard's norms (25) based on 175 college 
students of both sexes are also very similar to 
the original norms. In any case, caution would 
dictate their use only with people whose class 
and educational status is similar to that of the 
standardizing group. 

The manual does not include mention of re- 
liability although the manual for the children's 
form asserts that the adult form has a retest 
reliability of from .60 to .80. Bernard in the 
above mentioned study found retest r's of .50 
to .75, the retests being made about four months 
after the original tests. He also found the relia- 
bility of the GCR to be .45. These reliabilities 
are rather low for individual diagnosis. A. spe- 
cial study (13) showed the reliability of scor- 
ing to be adequate, since two scorers, after a 
series of discussions with each other, were able 
to increase their agreement from about 74 to 
85 per cent. It is unlikely, however, that the 
average user of the test could achieve that level 
of reliability. 

Validity is not mentioned in the manual, al- 
though in a previous article (4) the author 
claims that validity “has been repeatedly ex- 
amined on a clinical basis." He also mentions 
a comparison of extrapunitiveness on the P-F 
Study and on the TAT as yielding an r of 74. 
Other studies of validity (28, 44, 49, 60) have 
shown results varying from negative to slightly 
positive. All these studies were group centered, 
however, and cast little light on the individual. 
Perhaps special mention should be called to a 
study of Bernard (26) in which he concluded 
that “blind” analysis of the P-F Study was of 
value with normal cases and of still greater 
value when used in combination with other tests. 
Тһе GCR, however, was found to be of little 
value. 

Extensive examples are offered in the manual 
as an aid in scoring, but even so it is not an easy 
test to score; the distinctions between some of 
the responses placed in different catagories is 
often difficult to understand. The meticulous 
scoring of trends also appears to the reviewer 
as possibly attempting to get more out of one's 
data than there is in it. к 

It is assumed that the subject “unconsciously 
or consciously identifies himself with the frus- 
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trated individual...and projects his own bias in 
the replies given." Yet the author frankly ad- 
mits (in the children's manual) that there is 
no way of deciding whether the subject's replies 
reflect what he would actually say or whether 
he is censoring his answers or whether they 
*disclose a more covert or latent basis of the 
subject's manifest behavior." An inquiry is sug- 
gested after completion of the test to clarify “re- 
sponses which are too brief or ambiguous” and 
to permit a judgment from “the tone of voice 
or additional remarks” of the subject. In the 
reviewer's opinion it would be better to follow 
the procedure advised for children and have the 
subject read all his replies aloud. Or still better, 
to have the subject respond orally in the first 
place. This would also permit some judgment 
on self-censorship since the time before replying 
could be measured and it would permit a correct 
scoring of remarks apparently easy to classify 
but in reality ironic, or meant in some other 
special way. i 

Other possible imperfections are: (a) the 
tacit assumption that response to mild frustra- 
tion is predictive of response to strong frustra- 
tion especially when a threat to the personality 
is involved; (b) the absence of any assessment 
of intensity of reaction although one might ex- 
pect that to be very important; (c) the lack of 
clarity in the interpretation of the results after 
they have been obtained. Suppose a subject is 
more impunitive and less extrapunitive than 
the norm. Does this mean he inhibits normal 
aggression or avoids blaming someone when 
he should do so? Not necessarily, since Schmeid- 
ler (59), for example, found that the subjects 
in her impunitive group tended to be split into 
two subgroups, one of them composed of gentle 
and helpful people and the other composed of 
restrained and cold persons. 

The Form for Children was standardized on 
131 boys and 125 girls with approximately equal 
numbers of each sex at each age from 4 to 13. 
The children at age 7 and below were drawn 
from private schools and the rest from public 
schools—a class difference which might affect 
the obtained norms is reflected here. No relia- 
bility is reported and the only comment on va- 
lidity in the manual is “the clinical indications 
of validity...are promising but systematic re- 
sults must await the completion of a study now 
in progress." Presumably one aspect of this 
study was reported in 1950 by Rosenzweig and 
Mirmow (57) in which 79 per cent of a group 
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of 19 4-7 year-old children classified (on the 
basis of teachers' detailed records of their ad- 
justment in school) as socially inadequate had 
one or more "trends" in their records. On the 
other hand, the socially adequate and the socially 
uninterested children had percentages very sim- 
ilar to each other and lower than that for socially 
inadequate children—about 56. This group dif- 
ference of 23 per cent at the .07 level of con- 
fidence can hardly be considered very adequate 
validation. 

The instructions for administering this form 
are in general similar to those for the adult form, 
the most notable exception being that the 
younger children are required to give their re- 
sponses orally. Older children are asked to read 
all their responses aloud when they have finished 
the Study and explicit directions are given as 
to how to conduct the inquiry. Scoring is similar 
to that of the adult form and as with that form 
very complete scoring samples are given. Norms 
are given for each two-year age span between 
4 and 14. An illustrative record of an 11-year- 
old child is presented, scored, and given a rather 

superficial interpretation. 

` At present it appears to the reviewer that 
the clinical use of both forms of the P-F Study 
except in relation to other tests or case history 
material is rather hazardous. The data on re- 
liability, validity and to some extent the norms 
themselves are inadequate. As the authors men- 
tion, the fundamental problem of the projective 
level on which the subject approaches the test 
has not been solved, and the possible different 
meanings of similar records with respect to per- 
sonality has not been sufficiently explored. On 
the other hand, if given individually and orally 
together with an extensive and judicious in- 
quiry and with an attitude of readiness to dis- 
regard it if it contradicts other more valid ma- 
terial, the P-F Study may often yield valuable 
insights. 


PzncivAL M. Symonps, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. 

This test is recognized today as one of the 
established projective techniques with its own 
unique method for obtaining the responses of 
an individual to frustrating situations. The au- 
thor describes the test as consisting of a 
series of 24 cartoonlike pictures, each depicting two 
persons who are involved in a mildly frustrating situa- 


tion of common occurrence. The figure at the left of 
each picture is shown saying certain words which either 
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help to describe the frustration of the other individua] 
or which are themselves actually frustrating to him. 
The person on the right is always shown with a blank ` 
caption box above. * The subject is instructed to ex- 
amine the situations one at a time and write in the 
pae box the first appropriate reply which enters his 
min 


These responses must be classified by the 
examiner to indicate the direction of aggression, 
and the reaction type. Aggression can be di- 
rected in one of three ways; outwardly on the 
environment—extrapunitiveness ; inwardly on 
the subject himself—intrapunitiveness ; or eva- 
sively to avoid the frustration—impunitiveness. 
"Three types of reaction are recognized ; obsta- 
cle-dominance, in which the response concen- 
trates on the barrier occasioning the frustra- 
tion; ego-defense, in which attention is turned 
to the self of the subject ; and need-persistence, in 
which the solution of the frustrating problem is 
emphasized. All combinations of the six catego- 
ries give nine scoring factors (plus two additional 
variants). To assist scoring, Rosenzweig has pre- 
pared elaborate scoring samples for each of the 
24 situations. He suggests the use of an inquiry, 
as in the Rorschach, following the responses in 
order to clarify the intention behind the re- 
sponses. 

A record blank is provided for recording the 
scoring of the responses. In addition to enter- 
ing the classification of each response, certain 
additional comparisons are suggested. The pos- 
sible significance of these additional items in 
the record blank has not been demonstrated, and 
they add considerably to the task of analyzing а 
given record. First, there is a group conformity 
rating which indicates the extent to which а 
subject conforms with or departs from expecta- 
tion as determined by the responses of normal 
subjects. Second, there is a set of boxes called 
“profiles” in which the scores and percentages 
of responses of each of the six types can be re- 
corded. Third, there is a space for summarizing 
the relative order of direction of aggression apart 
from type of reaction and of type of response 
apart from direction of aggression. Finally, 
there is a calculation of trends of the change of 
the direction of type of response during the 
course of a given test. In one study using the 
children's form, Rosenzweig has demonstrate 
that the presence of a trend in a record is an Ш- 
dex of personal instability and of low frustra- 
tion tolerance. 

The author has met expectations in Mes 
ardizing the test and making it usable and useft 


243 ] 


TESTS & REVIEWS: CHARACTER—PROJECTIVE 


130 


by providing standardized instructions for ad- 
ministration and a helpful manual for scoring 
and interpretation and supplying norms on both 
the adult and children's forms. The adult norms 
are based on responses from 236 males and 224 
females between the ages of 20 and 29 with an 
average of 13.5 years of schooling. The chil- 
dren's norms are based on the records of 256 
boys and girls broken down into five age levels 
between the ages of 4 and 13. He has tested the 
reliability of the scoring and found in his trial 
85 per cent agreement between independent 
scorers, which seems satisfactory in view of the 
fact that tests in this experiment were adminis- 
tered without determining the meaning of the 
responses or of revised scoring samples. 

In spite of all the care that the author has taken 
in making his test useful, he is still unable to 
tell the worker how to interpret the results. But 
this is a difficulty that the Rosenzweig Picture- 
Frustration Study shares with all other projec- 
tive techniques, and Rosenzweig is fully aware 
of the difficulties. For instance, in spite of the 
fact that the instructions tell the subject to 
write the first appropriate reply which enters 
his mind, the examiner does not know whether 
the response is what the subject would do in 
a similar situation as depicted in the test, what 
he thinks he ought to do, or what he feels like 
doing but would not actually do. The examiner 
knows, for example, that the subject has extra- 
punitive tendencies, but he does not know where 
or how these will be expressed by the subject 
in actual life. Rosenzweig says that eventually 
experimental investigation will answer this ques- 
tion. This reviewer believes that scores on the 
P-F Study cannot be interpreted blindly. The 
test has value only when interpreted in the light 
of other facts known about the subject; hence 
its main value should be in helping to under- 
stand the meaning of behavior and personality 
in terms of the underlying modes of response 
to frustration. Rosenzweig suggests that, in 
the absence of experimentally determined mean- 
ings for the responses, one assumes that what 
an individual writes on the test indicates how 
he will respond in real life; but this is an un- 
tenable assumption, for one never knows whether 
it applies in a given case. Д 

The Rosenzweig Picture-Frustration Study 
is one of the recognized projective techniques. 
It is carefully prepared, and its author has done 
all that could reasonably be expected in making 
the test useful. 
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*The Rotter Incomplete Sentences Blank. Grades 
9-12, 13-16, adults; 1950; also called ISB; 3 levels, 1 
form; manual and standardization data for use with 
college level only; $1.25 per 25; $1.90 per manual; 
specimen set not available; postpaid; (20-40) minutes; 
Julian B. Rotter and Janet E. Rafferty (manual); 
Psychological Corporation. * 
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CnuamrLES N. Corer, Professor of Psychology, 
University of Maryland, College Park, Mary- 
land. 

This blank is designed for use as a gross 
screening instrument for maladjustment. It ap- 
pears to be simple to administer to either indi- 
viduals or groups. The manual contains a rather 
extensive set of examples, presented as guides 
for scoring ; adequate scoring reliability, even on 
the basis of familiarity with the manual alone, 
is reported. 

Specific validity studies of the Rotter blank 
have been performed. Although one may doubt 
the ultimate value of the criteria employed and 
deplore the small number of cases used in the 
validity studies, there is, nevertheless, some 
clearly presented evidence that the test has some 
validity as an indicator of maladjustment. The 
authors are to be complimented for their presen- 
tation of validity data in a form that clearly il- 
lustrates how their validity studies were per- 
formed, an unusual occurrence in the case of 
projective tests. Norms are presented for 85 
female and 214 male college freshmen at Ohio 
State University. Unfortunately, the validity 
studies are reported for samples that are not 
strictly comparable to those for which the norms 
are presented, and it is therefore not certain that 
the tentative cutting scores suggested on the 
basis of the validity studies would be applicable 
to the norm group. It is probably unwise to 
discuss specific cutting scores in the manual at 
all, since, as the authors point out, such cutting 
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scores should be developed in the light of the 
purposes for, and the groups on, which the test 
is to be used. 

There are separate scoring examples and 
norms for male and female subjects. In view of 
this, it seems quite likely that for noncollege 
groups the norms and the conclusions concern- 
ing validity are inapplicable. No norms or other 
information are presented for the Adult Form 
and the High School Form, which differ slightly 
in wording from the College Form. I would 
seriously criticise the publication and distribu- 
tion of these two forms for which there are no 
data, since this seems to invite the use of the 
college norms with results obtained for the adult 
and high school groups. 

Another feature of the test is its clinical inter- 
pretation, designed to obtain information of 
value in planning initial treatment interviews. 
Six case records are presented with a fairly de- 
tailed clinical interpretation. It is to be hoped 
that the users of the manual will exercise the 
same restraint and caution in their interpreta- 
tions as the authors apparently do. 

On the whole, this reviewer is well impressed 
with the prospects of this test as a useful screen- 
ing instrument. On the other hand, it should be 
emphasized that present data permit its use only 
for college groups and that even here the norms 
and cutting scores will need considerable re- 
vision before they have wide utility. It is un- 
fortunate that the authors have not considered 
developing a scoring system for the thematic 
characteristics of the test responses, rather than 
being content with clinical interpretation alone. 


WILLIAM SCHOFIELD, Associate Professor, De- 
partments of Psychiatry and Psychology, Uni- 
versity of Minnesota, Minneapolis, Minnesota. 

Of this instrument one might indeed remark: 
"E pluribus unum!" Sets of incomplete sen- 
tences have been sired so prolifically that the 
question of parenthood has drawn amusing at- 
tention. The Rotter Incomplete Sentences 
Blank, College Form is a revision of a revision 
by Rotter and Willerman of an Army instru- 
ment devised by Shor and others. The authors 
had two objectives : the provision of a technique 
having experimental and screening potential and 
the obtaining of information pertinent to diag- 
nosis and treatment. 

The ISB, College Form presents 40 incom- 
plete items on two sides of a simple record form 
which includes blanks for identifying informa- 
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tion—age, sex, marital status, etc. Each item has 
a single line, between 5 and 6 inches in length, 
This might seem to impose certain undesirable 
restrictions on those persons whose graph- 
omotor personality projections assume the form 
of rather expansive hieroglyphics. The authors 
mention the possibility of the subject crowding 
words on the single line or writing a double line 
within this space; the scoring system takes such 
behavior into account. 

An advantage claimed for the ISB is that the 
time required for scoring and analysis is gen- 
erally shorter than that for most projective tech- 
niques. Though the manual does not indicate 
the usual time required for scoring, the claim 
seems reasonable, The scoring examples for 
each item have been empirically selected so as 
to represent a 7-step scale covering a continuum 
from extremely good adjustment to need for 
psychotherapy as manifested in relatively small 
samples of male and female college students. 
Scoring examples are presented separately for 
the two sexes. It is possible for an inexperienced 
scorer to score a single record with good ac- 
curacy in as little as 15 minutes. 

Interscorer reliabilities of .91 and .96 are re- 
ported for trained scorers, using 50 male and 50 
female records respectively. Spearman-Brown 
reliability coefficients of .84 and .83 are reported 
for 124 male and 71 female college student rec- 
ords respectively. No test-retest coefficients are . 
reported. Such data would be very desirable in 
view of the probability that temporary moods 
and reactive states might serve to alter greatly 
the overall adjustment rating yielded by this 
instrument. 

For the most part the validity data are not 
impressive. Scores for each sex are reported 
separately for students in effective study and 
mental hygiene classes who were rated by their 
instructors as “adjusted” or “maladjusted” un- 
der a “forced choice" technique (Group I). А 
group of то (!) females were rated by "ad- 
vanced student clinicians" as definitely well or 
poorly adjusted. Finally, data are reported for 
46 males which were either self-referrals to a 
psychological clinic for treatment or cases гё- 
ferred for personal counseling by vocational 
counselors. The authors are to be commende 
for reporting the complete score distributions of 
all of these groups rather than resorting to 
the misleading statement only of mean scores 
and critical ratios. The distributions clearly de- 
pict a considerable overlap of adjusted and 
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maladjusted groups. Selection of an appropriate 
cutting score permitted good identification of 
the adjusted, and considerably poorer identifica- 
tion of the maladjusted students. The Group I 
distributions yielded biserial coefficients of .5o 
and .62 for the females and males respectively 
between ISB scores and classification as ad- 
justed or maladjusted. 

Normative data are provided in the form of 
actual frequency and cumulative per cent distri- 
butions of the scores of 85 female and 214 male 
college freshmen. The manual, which is very 
well organized and generally thorough in its 
coverage, includes a discussion of scoring pro- 
cedures, six practice cases illustrating scoring, 
and six cases illustrating clinical interpretation. 
The High School and Adult Forms of the ISB 
are identical with the College Form in 34 and 
36 of their 40 items respectively. However, the 
authors correctly caution against unreserved 
use of the college norms with these forms. 

Though more validity data with better clini- 
cal criteria are needed, the authors have pro- 
duced an excellent model for the conservative, 
objective standardization of instruments of this 
kind, The ISB certainly has much experimental 
potential and should prove useful as a gross 
screening instrument for college students. 


Am J Orthopsychiatry 21:649-50 Jl 'sr. 
Adolf С. Woltmann. * differs from others in 
that it allows the testee to include humorous an- 
swers. Most tests of this kind make no allow- 
ance for the scoring of humor and often specif- 
ically request the subject to refrain from 
answering humorously. Such omissions fail to 
recognize that humor is a healthy way to meet 
frustrating situations. * Like all test instru- 
ments of this kind, the Rotter ISB is not as well 
disguised as other projective methods. This 
might enable the sophisticated subject to fake 
answers in order to appear better adjusted than 
he may actually be. On the other hand, the gen- 
eral intent of what constitutes a good or bad 
answer is not readily apparent to most subjects. 
This test should not be scored in a purely math- 
ematical manner. Interpretation depends on the 
examiner's general clinical experience, skill and 
awareness of personality dynamics. In this way 
the Incomplete Sentences Blank is less of a 
straight paper and pencil personality test, but 
must be viewed essentially as a projective tech- 
nique. As such it is comparatively easy to ad- 
minister and takes less time to give and to score 
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than most projective tests. * This test is warmly 
recommended to clinical psychologists and also 
to psychiatrists or analysts who want to gain 
information about their patients in a minimum 
of time. 
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*Sentence Completions Test. Ages 12 and over; 
1940-47; a revision of Payne Sentence Completions 
Blank (28) ; 1 form, ’40; manual ('47) ; $1.50 per 25; 
$1.50 per 25 scoring and interpretation forms ; $1.80 per 
manual (contains the test and interpretation form); 
specimen set not available; postpaid; nontimed (45-50) 
minutes; Amanda R. Rohde and Gertrude Hildreth; 
distributed by Psychological Corporation. * 
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CHarLEs N. Corer, Professor of Psychology, 
University of Maryland, College Park, Mary- 
land. 

This test was developed to provide an instru- 
ment simpler than other projective tests but pre- 
serving their advantages. Apparently it may 
be interpreted clinically to estimate maladjust- 
ment. There is also an elaborate scoring system 
which yields values for each of a large number 
of needs, environmental press, and other vari- 
ables derived from the work of H. A. Murray. 

The test is simple to administer and is in- 
tended for persons whose mental age is at least 
12 years. Although the author (the manual is 
by Rohde alone) apparently believes that it may 
be used for adults, the only data presented come 
from studies of ninth grade students from one 
high school in New York City and one in Fort 
Lee, New Jersey. That extreme caution should 
be exercised in the use of the norms is indicated 
by the fact that the parents of only 30 per cent 
of the children were American born. 

In the reviewer's opinion, it would be im- 
possible to score a record on the basis of the 
instructions given in the manual with assurance 
that the scoring was properly done so as to per- 
mit comparisons of scores with the norms. Al- 
though each need, environmental press, and in- 
ner state is presumably to be scored in terms of 
its frequency of occurrence and its strength 
(3-point scale), criteria for counting the oc- 
currence and for rating the strength of a variable 
are not presented. Similarly, it is not apparent 
how the frequency and strength estimates are 
combined to give a single score for each variable, 
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although the norms given indicate that this is 
done. In none of the examples cited in the manual 
is a numerical score listed; the analysis usually 
is clinically oriented and the completed sentences 
are simply listed for the various needs they 
presumably indicate. Percentage of agreement 
among different scorers is reported to be fairly 
high but variable ; however, they must have been 
trained by methods not indicated in the manual. 
A test-retest reliability check is reported for 
some of the subjects over an eight-month period, 
The average test-retest correlations for the vari- 
ous variables were .68 and .64 for girls and boys, 
respectively. These are reported as .80 and .76 
when corrected for attenuation; the reviewer is 
unable to understand why they were corrected 
for attentuation. 

Unusually high correlations are reported be- 
tween scores on the test variables and a com- 
posite rating on each of the same variables (av- 
erage for girls .79; for boys, .82). These were 
averages of ratings made by teachers and the 
investigator. Insufficient information is given 
concerning the distribution of the original scores 
or of the ratings to permit an interpretation of 
the correlations, and the validity study was re- 
stricted to but a part of the original norm group. 

In its basic characteristics this test is essen- 
tially the same as other sentence completion 
tests. The scoring system is so poorly described, 
however, that it cannot be used, and the norms 
are inadequate. On this basis, the Rotter Incom- 
plete Sentences Blank seems to be a better in- 
strument, although it is applicable only to college 
students at present. Clinicians may, however, 
prefer this somewhat longer test and, with ex- 
perience, may find it useful. 


CHARLES R. STROTHER, Professor of Clinical 
Psychology, University of Washington, Seattle, 
Washington. 

This test consists of a list of 64 single words 
or short phrases, intended as the beginnings of 
sentences which are to be completed as rapidly 
as possible by the examinee. The items were 
selected so as to: (a) sample as many phases of 
personality as possible; (b) cover various areas 
of conflict; (c) be intelligible to persons with 
an MA often years or over ; and (d) allow maxi- 
mal latitude of response. The test was standard- 
ized on 180 ninth grade students in two junior 
high schools. . 

For scoring and interpretation, the test blank 
provides a check list of attitudes toward parents, 


siblings, self, friends, associates, religion, educa- 
tion, etc, and tabulation sheets on which are 
entered the frequency and strength of various 
“needs,” “inner states and traits,” “inner inte- 
grates,” “press,” and “cathexes,” according to 
Murray’s system of personality analysis. The 
manual gives abbreviated but reasonably ade- 
quate definitions of these terms. The checklist 


and needs-press scores are to be filled out by’ 


the examiner on the basis of inspection of the 
record as a whole, rather than by tabulation of 
responses to individual items. 

The manual provides a distribution of 
"needs," "inner state," and "inner integrate" 
scores only for a sample of 180 ninth grade 
cases. Because of the restricted nature of this 
population these norms are of very limited value. 
Interscorer agreement, for judges familiar with 
Murray's system, was .95 for two judges and .78 
among five judges. Test-retest correlations for 
ratings (not for particular items) for 44 cases 
retested after an 8-month interval, ranged from 
.44 to .98 (corrected for attenuation) for spe- 
cific needs. The validity criterion was the pooled 
judgments of several teachers and the investiga- 
tor, who interviewed each of тоо students and 
had access to their school records. Correlations 
of scores and ratings averaged .79 for girls and 
.82 for boys, with a range of .30 to .96. 

The clinician will find frequent use for a 
sentence-completion test, particularly for screen- 
ing purposes and in out-clinic work. The Rohde- 
Hildreth test is to be preferred for adolescents, 
with or without the Murray scoring, and the 
Sacks Sentence Completion Test? will be pre- 
ferred for adult subjects. The possibility of sub- 
stituting or adding items for special purposes 
increases the usefulness of this method. 


J Consult Psychol 12:284 Л-Ад °48. Law- 
rance F. Shaffer. The recent publication of the 
scoring form and manual has increased the po- 
tential usefulness of the well-known incomplete 
sentences technique. The scoring form provides 
a framework for evaluating the needs, inner 
states and presses shown by the sentence com- 
pletions according to a classification scheme 
adapted from H. A. Murray. There is also a 
more informal analysis of the examinees’ attt- 
tudes, interests, values and emotional problems. 
The manual defines and illustrates the «ш 

i di Sentence Com 


by Lawrence Edwin Abt and Leopold Bellak. New York: Alfred 
A. Knopf, Inc., 1950. Pp. xvii, 485, xiv. * 
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used in scoring, and gives normative data based 
on 680 ninth-grade students. Reliability is dis- 
cussed in connection with the problems of re- 
peated scoring by different examiners, and of 
repeated administration after various intervals 
of time. Validity is studied by means of teachers' 
ratings of the pupils. Case studies illustrate diag- 
nostic and clinical applications of the test. 
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XSymonds Picture Story Test. gades 7-12; 1948; 
individual; 2 parts (Sets А and B); 1 form; Set B 
may be administered alone or Sets А and B together; 
$5.25 per set of testing materials, postpaid; nontimed 
(60-70) minutes per set; Percival M. Symonds; Bu- 
reau of Publications, Teachers College, Columbia Uni- 
versity. * 
REFERENCES 
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man, American Council on Education Studies, Vol. 12, Series 
1, No, 32. Washington, D.C.: the Council, October 1948. Pp. 
vii, 110. Paper. * 


quis y ne ia Story Method of Personali) Study, New 
Сотк: Colum шуша Brave. зал. 60) (РА 24:850) 
E. J. С. Braprorp, Senior Lecturer in Educa- 
tion, The University of Sheffield, Sheffield, Eng- 
land. 

The chief limitations of the projection tech- 
nique based upon pictures as the primary or ini- 
tial stimulus are undoubtedly: (a) the intel- 
lectual efficiency of the person tested; (5) the 
culture pattern within which that person has 
developed his personality ; and (c) the strength 
of the inhibitive forces as they influence that 
person's freedom to give rein to his imagination. 
These limitations apply whether the investiga- 
tor is concerned primarily with making a diag- 
nosis or obtaining a psychometric estimate of 
the strength of the dynamic forces or merely 
using such tests as a preliminary screening to 
determine whether psychotherapy in any form 
is called for. 

It is the opinion of the reviewer, based ad- 
mittedly on limited experience, that the schools 
in the U.S.A. encourage, and succeed in de- 
veloping among the pupils, a greater readiness 
to the free oral expression of their thoughts than 
is achieved among the pupils in British schools : 
the American pupil is also more “test conscious 
than his British contemporary. Both these fac- 
tors may contribute to a more copious supply of 
material being obtained in response to the pic- 
tures. Consequently, it is of the utmost 1mpor- 
tance that for psychometric purposes specific 
norms should be worked out to correspond to 
the different culture patterns. Actual pictures 
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themselves may need modification ; e.g., picture 
I of series A is liable to be interpreted as a youth 
amidst an ehormous industrial plant because 
the pupil's experience does not lead him to think 
of skyscrapers as so many “homes.” Picture 2 
prompts the reaction “What on earth is 11?” 
because it does not fit neatly into the concept of 
letterbox, sweet machine, or first-aid outfit. 

The manual says “stories and case material 
were collected from 40 normal adolescent boys 
and girls.” Reference to the author’s book 
Adolescent Fantasy, which gives a very full ac- 
count of all the work which culminated in the 
construction of the test, shows that these 40 
pupils include 7 only children, 8 children from 
broken homes, 4 children with foreign parents. 
The medical histories include one of vomiting 
attacks over a period of 3 years, one of severe 
migraine for 9 years, one of infantile paralysis. 
Does “normal” in this context mean “repre- 
sentative sample"? If it does, is it related to a 
particular culture pattern? 

Although the results reported in Adolescent 
Fantasy have been examined statistically with 
care and insight, the author's caution has lapsed 
when he published the table of norms in the 
manual. These norms appear to have been re- 


‘produced from pages 97-98 of Adolescent Fan- 


tasy. Norms derived from the responses to 42 
pictures do not apply to a selected group of 20 
drawn from the original 42. If one examines the 
frequency of themes fon each picture, given in 
Appendix D of the same book, with a view to 
discovering whether the 20 pictures of the 
shorter version call forth similar themes to the 
discarded 22 pictures, the impression gained is 
that they do not. If one sums the positive (+5 
and over) and the negative (—4 and over) devi- 
ations from independence separately for each 
group of 20 and the 22 pictures, one finds that 
the shortened version is much more likely to 
produce erotic and economic themes or catego- 
ries of response, is more likely to produce family 
themes, less likely to produce aggressive themes, 
and much less likely to produce punishment 
themes than do the 22 rejected pictures. These 
differences suggest that revised norms are re- 
quired. 

In spite of the above criticisms, the reviewer 
is impressed by the possibilities of the test and 
believes it will become a valuable addition to 
the present small company of projection tests. 
One specially useful feature is to be found in 
the instructions, which attempt (but significantly 
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may fail) to focus attention upon features that 
will have diagnostic importance : e.g., What hap- 
pened before the scene depicted and what fol- 
lowed from it? What are the characters think- 
ing, doing, and saying? 

Though more time consuming than the van 
Lennep Four Picture Test, in which the subject 
is required to weave one story out of four pic- 
tures, Symonds Picture Story Test is better 
suited to persons of lower intellectual calibre. 
The reviewer has found that used in conjunc- 
tion, these two tests furnish information which 
is confirmatory to a degree which inspires some 
confidence in the method of projection. 


Rorsch Res Exch & J Proj Tech 13:347-9 S 
'49. Robert R. Holt. The publication, after about 
IO years, of the TAT pictures which Dr. 
Symonds had drawn by Lynd Ward raises again 
the important question: how are pictures for a 
thematic test to be chosen? * The original idea 
that stimulated Symonds’ research seems....to 
have been a sound one: that pictures might be 
found or designed which would be better suited 
for clinical use with adolescents than Murray’s 
and Morgan’s. But it is unfortunate that the re- 
search was designed in such a way that it could 
lead only to a competing set of pictures, not a 
systematic supplement to the original test, re- 
placing those cards which were shown to be 
of less value. For after all the years of work of 
Symonds, Milton Wexler and others on the 
Adolescent Fantasy Project, there is only the 
original assumption, in place of any proof, that 
the nicely gotten-out little pictures are in any 
way superior to Murray’s set for their avowed 
purpose. In fact, Symonds makes statements 
about the ease with which adolescents can iden- 
tify themselves with figures of any age or sex, 
even with objects and abstract ideas; these are 
results which cast doubt on the necessity of a 
special set of pictures. The method by which 
the 20 pictures were arrived at seems to be a 
sound one. * the author....had a well-known art- 
ist make a set of 42 wash drawings to his specifi- 
cations, representing a wide variety of situations 
and interpersonal relations of concern to ado- 
lescents, most of them with figures of teen-age 
boys and girls in them. Then after analysis of 
all the stories told by 20 boys and 20 girls to all 
pictures, the latter were rank ordered separately 
according to four criteria: “(1) those pictures 
which yielded the largest number of themes in 
the stories told about them; (2) those pictures 
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which yielded stories with the most important 
themes ; (3) those pictures which were judged 
best (on a five-point scale) by examiners who 
used them; (5) those pictures which yielded 
stories used for illustrative purposes in the re- 
port of the investigation.” A final combined rank 
order was made up, giving equal weight to each 
criterion (why the last was weighted so heavily 
is an obscure but perhaps unimportant point), 
and the 20 pictures with the lowest mean ranks 
were chosen for publication. Now if one were 
starting from scratch without any other basis on 
which to make a choice of test materials, a pro- 
cedure such as this would be quite in order. But 
to think to supplant the set of TAT pictures, 
with which many clinicians have the experience 
of many scores of administrations and interpre- 
tations, by virtue of a study involving only 40 
subjects seems unrealistic. When this much crit- 
icism has been made, there is a good deal to be 
said for the new pictures. A picture which sug- 
gests the topic of sibling rivalry, (B4) is one 
which is sorely missed in Murray’s test; jeal- 
ousy (A3) and a variety of parent-child situa- 
tions look also as if they might be quite useful. 
On the other hand, as a group they look as if 
they have more uniformity of style and mood 
than is desirable—the figures in successive pic- 
tures look enough alike to encourage persevera- 
tive carry-over stories. With one exception all 
are on the plane of daily reality. My impression 
(unbacked by any systematic evidence) is that 
it is stimulating to subjects’ imagination to have 
a variety of styles, media, types of characters, 
and degrees of imaginativeness in the pictures. 
Of course, such an off-the-cuff evaluation with- 
out any experience with the pictures is not wo 

much; the acid test is experience. As for the 
manual which accompanies the pictures, it says 
hardly anything that is not in Murray’s. Proce- 
dure and instructions are very similar, even to 
the stress on imagination in introducing the test 
to the subject. An extremely non-directive ap- 
proach is urged, “mild encouragement" being 
recommended only for "unusual cases in which 
material is produced very slowly." Symonds 
routinely includes a final period of association to 
each story as it is read back to the subject—@ 
good idea where feasible. * Analysis and inter- 
pretation of content and form are given brief 
treatment, again along rather conventional lines. 
The main task of content-analysis is "to tabu- 
late the principal psychological forces which are 
indicated in the stories"; a list of 14 are SUE- 
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gested. It is a rather miscellaneous group of 
needs, defenses and what Murray calls. inner 
states. Eleven formal aspects are also described, 
but their significance is left entirely unclarified. 
In general, Symonds has some sober and sensi- 
ble things to say about interpretation, but he 
seems so eager to prevent anyone from going 
off the deep end that he leans over backwards 
to deny the test any usefulness except in the 
context of much corroborating case data. One 
thing this manual has that is nowhere to be found 
in Murray's publications: a table of norms. The 
author deserves some sort of an A for effort, 
but for the effort only—the norms (Q,, Mdn. 
and Q, percentile points for 28 miscellaneous 
themes) are of questionable usefulness. They are 
no better than the representativeness of the 
original tiny sample of 40 on which they are 
based ; and there are no instructions for identify- 
ing the 28 themes, which do not even include 
all of the 14 themes listed in the section on analy- 
sis of content. There is nothing in the manual 
to indicate that Dr. Symonds has accumulated 
any clinical experience with his pictures, or in- 
deed that they were ever used with any subjects 
other than the original group of high school 
volunteers. Perhaps his sizeable volume, Ado- 
lescent Fantasy, has favorable answers to these 
doubting questions ; at any rate, the pictures are 
now available to others who may give them the 
kind of clinical try-out in simultaneous compari- 
son with Murray’s TAT that is very much 
needed at this point. It is devoutly to be hoped 
that comparative studies will be made, and that 
they will use as systematic and explicit a method 
for deciding the relative merits of all the pictures 
as Dr. Symonds has in his own research. 


For related reviews, see 133. 


[133] 
*[Re Symonds Picture Story Test.] Symonps, 
РЕкстул M. Adolescent Fantasy: An Investiga- 
tion of the Picture-Story Method of Personality 
Study. New York: Columbia University Press, 1949. 
Pp. xii, 307. $6.00. * (London: Oxford University 
Press. 325. 6d.) (РА 24: 650) 

Am J Orthopsychiatry 20:845-6 О ’50. Leo 
Kanner. * original, meaty, and highly informa- 
tive study * painstaking and carefully organized 
investigation * Symonds....has a way of letting 
the reader in on his plans and deliberations in 
a lucid style and with unswerving scientific hon- 
esty. His review of the literature, with its con- 
cise emphasis on essentials, gives a good basic 
orientation and leads to an understanding of 


the problems and areas of inquiry which have 
preoccupied the author in the course of his un- 
dertaking. * A number of problems arose during 
the investigation. Symonds discusses these 
thoughtfully, never yielding to the temptation of 
drawing hasty conclusions. * Some of the au- 
thor’s conclusions are extremely important: For 
the psychotherapist, a knowledge of the patient’s 
fantasy is of the utmost significance. Telling the 
stories “is a real growth experience out of which 
a boy or girl gains slightly more courage to ex- 
press his aggressions, goals, and ambitions, to 
assert his independence, to learn that his fears 
are trivial, to reduce in some small measure his 
burden of guilt, and to become more self-assured 
and confident.” * a good bibliography * a most 
valuable addition to the literature on the psy- 
chology of adolescents. This reviewer wishes it 
a wide circulation. 

Am J Psychol 63:160 Ja "50. Madison Bent- 
ley. * The “fantasy pictures” used are well re- 
produced in the attractive and well-written vol- 
ume, which is a major addition to the rapidly 
growing literature of this method. Two of the 
cases are fully “analyzed” in the text. A casual 
remark may be added upon the possible change 
from one generation to the next in a subject ad- 
ministered in Columbia’s Teachers College (cf. 
sup., a review of Thorndike’s Selected Writings 
from a Connectionist’s Psychology). 

B Int Bur Ed 24:74 q 2°50. A thorough study 
* The pictures....take into account the American 
background. Used with a similar age group else- 
where, they might not yield the same results. 

B Menninger Clinic 14:187 S ’50. John A. 
Grimshaw. This study....should interest the clin- 
ical psychologist. For reasons that the author 
did not make clear, he found it necessary to pre- 
pare a complete new set of pictures. * The pic- 
tures which are given in an appendix of the book 
are in general rather morose, gloomy, tragic in 
mood and seem to lack the variability and flex- 
ibility seen in the pictures used in the familiar 
Murray Thematic Apperception Test. 

Brit J Psychol, Gen Sect 40:229-30 Je ’50. 
M. Powell. * A conscientious account is given 
of the exhaustive analysis made of the stories 
obtained, including an inventory of themes, ta- 
bles of norms, and a section on quantitative rela- 
tions among groups of themes and relations be- 
tween the themes and the other variables of the 
study. In the light of his extensive experience 
with efforts to work out objective methods for 
scoring the stories, the author’s conclusion about 
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the value of such methods should carry consider- 
able weight: “I am more than ever convinced 
that the value of a projective technique is lost 
by any attempt to analyse or score it psycho- 
metrically—its value resides in the degree to 
which the interpreter is able to build up from the 
trends and the themes in the stories a dynam- 
ically integrated picture of the individual being 
studied." He is aware, of course, that interpreta- 
tions derived in this way are subjective, gov- 
erned as they are by the factors to which the 
experimenter is at the moment most sensitive ; 
but this limitation, he suggests, may be one of 
incompleteness rather than of inaccuracy. In 
contrast with some workers in this field, who 
advocate blind analysis of the TAT, Symonds 
emphatically denies the possibility of deducing 
facts in the real life of the subjects from their 
stories, and would emphasize that the proper 
purpose of the TAT and similar techniques in 
personality study is not to yield precise facts 
and information, but to reveal background mo- 
tives and tendencies which help to show the 
meaning and motivational significance of other 
facts known about the person. Perhaps the most 
interesting section of the book is a discussion of 
fantasy and character, which includes a com- 
parison of the stories with the life-history ma- 
terial. There is a useful account of problems 
connected with the process of identification of 
the subject with the characters in his stories, 
and one rather startling conclusion made by the 
author is that sex lines are no barrier whatever 
to identification : “Apparently boys can tell sto- 
ries in which a girl is the hero as easily as those 
in which the boy is the hero—and vice versa 
for girls, Consequently there is no reason for 
having separate sets of pictures for boys and 
girls.’ The author is refreshingly cautious in 
relating the fantasy material to the overt be- 
haviour of his subjects. He found, for example, 
but a negligible correlation between the nature 
of the fantasy material and “adjustment,” and 
concludes that this would indicate that the pic- 
ture-story method does not have diagnostic 
value, that is, does not help to distinguish be- 
tween normal and pathological subjects. “The 
same fantasies are produced by normal and by 
pathological individuals; this means that the 
same dynamic forces help to make a person 
normal that help to make him pathological.” It 
is difficult to find meaningful connexions be- 
tween fantasy and overt behaviour, he adds, be- 
cause these two forms of expression may par- 
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allel or oppose one another in a given individual, 
This relation in turn would seem to depend in 
each case upon the degree to which the fantasy 
need and the manifest expression of it are both 
expressions of some more general factor or dis- 
position. The section of the book which tabu- 
lates the norms for the themes obtained in the 
stories, although making a needed contribution 
to the literature in this field, may be of but little 
practical value to the English reader: e.g. many 
of the individual items may reflect the “mom” 
culture in which American adolescents are often 
said to live. This objection does not appear to 
apply, however, to the full discussions given 
of two individual cases, one of a well-adjusted 
boy and one of a boy with delinquent tendencies; 
for here the interest lies in the relations that 
are traced among all the variables which were 
considered in an effort to obtain a "dynamically 
integrated" picture of the two boys. The rich- 
ness of these two pictures suggests that the au- 
thor has been perhaps too modest in the claims 
for the value of his method. 

Cath Ed R 48:70-1 Ја '50. F. J. Houlahan. * 
Unfortunately, all of the pictures....are “on the 
whole gloomy, severe, morose, mournful and 
tragic.” Despite the obvious critical importance 
of the pictures for this study, the artist’s own 
interpretation is ignored except to say: “It is 
possible that the record of adolescent fantasy 
which this study yields is influenced to some ex- 
tent by the fantasy life of the artist who drew 
the pictures, but it is not believed that this is 
a very important factor, even though a con- 
stant factor.” * A tremendous amount of work 
went into the study. It is a very good illustra- 
tion of a method, and of a report of an investiga- 
tion. However, in this reviewer’s opinion, there 
is little validity to it. The pictures are admit- 
tedly of one general type. Whatever statistical 
formulas may show, generalizations based on 20 
boys and 20 girls are of dubious representative- 
ness, particularly when the two sexes are com- 
pared. It appears that the psychological precon- 
ceptions of the investigators must have had 
much influence in classifying the themes. On 
the other hand, the authority of the author and 
of his collaborators is such as to compensate 
for much of what the study lacks. 

J Consult Psychol 13:310-1 Ag ' 49. Laurance 
F. Shaffer. Symonds’ work is a skillful blend 
of objective and dynamic approaches, applied 
to the development of a picture-story technique 
for exploring the personalities of adolescents. 
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Several chapters are of general value in giving 
insight into adolescent needs, stresses and fan- 
tasies, apart from the particulars of the test it- 
self. * The volume is an indispensable back- 
ground and aid in the use of the Symonds Pic- 
ture-Story Test * 

J Proj Tech 15:276-9 Je ’51. Eva Ruth 
Balken. In this decade of intoxication with the 
merits of projective techniques as the psycholo- 
gist’s sine qua non of psychotherapy and psycho- 
diagnostics, it is refreshing to find a book which 
deals specifically with methodology and rationale 
and does not transcend the data to make gen- 
eralizations about either psychodiagnostics or 
psychotherapy. Adolescent Fantasy: An Inves- 
tigation of the Picture-Story Method is what 
its title claims it to be. It summarizes the results 
of a ten-year research project. Dr. Symonds’ 
earlier investigations of adolescent fantasy are 
not only the first application of the TAT, but 
the most important methodological contribution 
in this area. With the meteoric rise of clinical 
psychology and the ascendancy of the TAT-and- 
its-derivatives cult, the implications of this early 
study have been overlooked and the “criteria” 
exploited into: “Given the right set of pictures, 
the rest follows,” “Anything Goes,” and “Any- 
body can do anything and everything.” This in 
turn has been exploited into an equation of fan- 
tasy with symbolism, symbolism with the fan- 
tastic, the unreal, the impossible, the improba- 
ble, and of the fantastic with penetration to 
“deeper layers of the personality.” It is hoped 
that this book will not also be exploited in similar 
fashion and that it will serve to establish a new 
trend in clinical psychology—the investigation 
of fundamentals—and to serve as a model in 
methodology and rationale. The subtitle does 
more justice than does the main title to the 
contents of the book and its twofold purpose: 
“to prepare a set of pictures for general use as 
a projective technique,” and “to learn how the 
picture-story method operates.” A set of 20 pic- 
tures is now available and the book, as its table 
of contents reveals, is a faithful presentation of 
how the picture-story method operates with a 
given methodology and with a given rationale: 
“Exploration of the problem of classification of 
projective responses....The question posed in 
personality diagnosis (at present, at any rate) 
is not how much of this or that trait or function 
does a person possess, but what are his char- 
acteristic attitudes and responses.” If only Dr. 
Symonds did not distrust his own intuitions and 
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observations and formulate the problems in terms 
of the structure of personality, the quantitative 
element, so clearly brought out in his data, would 
not have to be subordinated to the finality of 
classification. Chapter XVI presents 21 con- 
clusions. This reviewer has selected conclusions 
2 and 3 as an orientation: “(2) Stories must 
successfully reveal personality when interpreted 
dynamically. Themes in the stories are those 
which would be expected from the findings of 
dynamic psychology and psychoanalytic theory. 
(3) The same dynamic principles which explain 
the personalities of poorly adjusted individuals 
also help us to understand the personalities of 
well adjusted individuals." The review of the 
literature is restricted to The Thematic Apper- 
ception Test and its antecedents in free-associa- 
tion tests. There is no discussion of the nature 
of fantasy, no indication that fantasy is an ac- 
tivity, a mental activity, and hence no compari- 
son with day-dreaming and imagination. There 
is no indication that fantasy represents the ful- 
filment of a wish and improves on unsatisfactory 
reality, that the impelling wishes vary with the 
character, sex, and circumstances, that with 
growing into adulthood and with the adult they 
are concealed with so much secretiveness that 
some of the wishes from which the growing- 
into-adulthood and the adult’s fantasies spring 
are such as to be entirely hidden from its creator, 
that the activity of fantasy covers the three 
periods of ideation and is linked up with some 
current impression, occasioned by some event 
in the present, which has the power to arouse 
an intense desire. Above all, there is no stressing 
of the quantitative element, the decisive element, 
and no description of the psychoanalytic find- 
ings regarding adolescence. In fact, there is no 
reference to either Sigmund Freud or Anna 
Freud. So the reader has no idea of what “find- 
ings of dynamic psychology and psychoanalytic 
theory” and what “dynamic principles” the au- 
thor is referring to in conclusions 2 and 3. The 
42 pictures used in this study were reproduced 
from sketches in black and white crayon made 
in accordance with two criteria : “A minimum of 
detail and characters with whom the subject 
can identify himself” and in accordance with 
content specifications, The pictures have a same- 
ness, a shadowy and unreal quality which they 
might not have had if the artist had not been 
bound by restrictions. They are easily identifi- 
able as the productions of one artist and have 
the effect of a continued story, which may ac- 
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count for the “warming-up effect" noted. Ex- 
perience has shown that drawing upon sources 
such as magazines provides greater opportunity 
for displacement and hence greater permissive- 
ness for communicating the story. As so often 
happens when the nature of fantasy is not ap- 
preciated, the dynamic interaction of uncon- 
Scious and conscious not recognized, and the 
distinction between internal and external his- 
tory not grasped, fantasy eventually becomes 
equated with symbolism and symbolism indis- 
criminately equated with the symbolic. So the 
Section; "Equations-Symbolism," while very 
informative, is still misleading with its equa- 
tions: (1) “love-acceptance-place in family- 
food-job.” (2) “clothes-poverty-riches-old- 
young-aspects of self.” (3) “poverty-depriva- 
tion, without desirable characteristics, mean.” 
(4) "job-affection-belonging and clothes-self- 
esteem.” (5) “being crazy-death wishes.” (6) 
“school-mother.” The fantasies have not been 
related to the totality of the research situation; 
hence the ubiquitous phenomena of transference 
and resistance have not been unfolded. A care- 
ful examination of the overtones and under- 
tones of this book reveals that Dr. Symonds has 
never been motivated by a belief in the magical 
powers of the research techniques and in the 
omniscience of the investigator and the exam- 
iner. It also reveals that again and again—as in 
the instance of the equations and in the discus- 
sion of Implications of Fantasy for Counseling 
and Psychotherapy—he is on the verge of recog- 
nizing the defenses against anxiety and finding 
the clues for their identification. But the failure 
to consider the importance of the directions has 
prevented appreciation of the extent to which a 
conflict had been created by the conditions of 
the research situation itself—a sine qua non in 
any investigation which claims to be an applica- 
tion of psychoanalysis—which would then de- 
lineate the individual’s mode of reacting to a 
conflictful situation. The subject not only takes 
a test of creative imagination. He is also re- 
quired to take a double role: actor and play- 
wright—“imagine yourself a story writer” ; then 
his freedom is checked by various admonitions 
and specifications: “Please don’t feel....,” and 
before the second picture the further check: 
“That was pretty good, but I am sure that you 
can do better. Perhaps you are afraid to say 
just what you are thinking about. It is all right 
to tell me anything you think of.” “Better” is 
equated with the “fantastic,” perhaps the im- 
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probable and impossible. The population con- 
sisted of 20 boys and 20 girls from a suburban 
city, all but one “well-adjusted” without observ- 
able conflict. (The 40 sketches show that dis- 
counting age and the like, they still were far 
from being a homogeneous group.) An initial 
conference preceded the testing. The 42 stories 
were obtained in a school period of 40 minutes, 
taking an average of 5.5 periods. At the com- 
pletion of the 42 stories came the association 
period in which the stories were read back to 
the subject. An interview with the parent (pref- 
erably the mother), one with the teacher, and 
an autobiography were obtained. The autobi- 
ographies were among the most helpful parts of 
the study. In addition, there were Sheviakoy- 
Friedberg Questionnaire, pupil ratings by the 
teachers, school records, and rankings for ad- 
justment. But it is not indicated how they and 
the biographical material were utilized in the 
delineation of fantasy. The Sheviakov-Fried- 
berg Questionnaire merits special discussion. 
This is the first and sole instance, to the re- 
viewer’s knowledge, in which it has been in- 
cluded in the methodology of investigations of 
projective techniques. It is scored for 26 cate- 
gories; in this study 24 were scored, but only 
five used in the quantitative relationship study: 
aggression, family, authority, identification, and 
opposite sex. (Chapter IX). It is regretted that 
the ratings were not scrapped so as to utilize the 
other scored categories. The quantitative and 
the qualitative analyses of the data were re- 
stricted to the identifying and classification of 
themes. There is a count of theme production 
by individuals and by pictures. Only those 
themes were listed which had occurred three or 
more times. The total number was 1850—an 
average of 46.25 per individual. The total num- 
ber per picture was 257, average 6.42, ranging 
from 150 (average 3.98) for card 7 to 319 (av- 
erage 7.98) for card 38. The peak is at age 15, 
confirming by inference that with growing into 
adulthood there is the growing tendency to be 
secretive about day-dreams, etc. Whether the 
peak is really at 15 is obscured by the conditions 
and circumstances of the research as is also the 
significance of the finding that the boys are more 
productive than the girls. The 1850 themes 97079 
then classified into three large “natural” groups: 
“(1) those of a psychological nature, (2) pns 
of environmental nature, and (3) those derive 

from stylistic qualities of the stories," по atigmpt 
being made “to analyze the grammatical an 
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other structural features of the stories.” Such a 
classification involves much overlapping and 
leads to such label impasses as “‘special fantasies" 
and "equation-symbolism." This inventory led 
to the establishment of norms for "only 30 main 
themes." To really comprehend the scope and 
promises of this contribution, the appendices 
must be referred to constantly: A. Sketches of 
the forty cases, B. Personality traits of the 40 
cases as inferred from the stories and as observed 
in real life, C. Theme headings, D. Frequency of 
norms by pictures, and E. Correlation tables. 
This reviewer has found A. and E. most helpful, 
instructive, and suggestive of problems for 
further research. A —.or correlation of anxiety 
(Symonds) with I.O. as contrasted with a —.27 
of guilt (Symonds) with I.Q., a —.34 of re- 
pentance with I.Q., and a +.26 of depression 
with I.Q. certainly justifies further investiga- 
tion, as do also the sketches, especially the skill- 
ful summation of the case data. This review 
may seem to many readers impertinent. So also 
will seem the cautionings of the author, which 
are too numerous to cite out of context, and his 
constant reminder that the analysis of the data 
is restricted to content, to identifying and classi- 
fying themes. The reader is especially apt to 
disregard Dr. Symonds’ remarks regarding the 
riding of “hobbies.” This is an age in which 
science is being travested into “hobbies” and 
subordinated to their furtherance and suste- 
nance. The relation between science and tech- 
niques is a reciprocal one and appreciation of 
this relation is an art: “It is a man’s imagina- 
tion that devises experiments.” It is the problem 
of technique, which brings about specialization 
in the unity which is science. A technique can- 
not create a science. Under the aegis of the pro- 
jective techniques cult, especially the TAT-and- 
its-derivatives cult (because of the belief that 
the effect must resemble the cause) various hob- 
bies are being ridden and masked by a benevo- 
lent skepticism regarding theories, facts and 
principles which must then await acceptance, 
rejection, or indifference until the constantly en- 
larging jury has passed its verdict. Fantasy, the 
reality of fantasy, is still on trial. The significance 
of Dr. Symonds’ contribution lies in his demon- 
stration of the reality, the existence of fantasy. 
It is hoped that it will serve to terminate the 
endless controversies and riding of “hobbies.” 
Jun Col J 20:230-3 D "49. Robert R. Blake. 
* the report of a piece of research....[which] 
might very well serve as a basic text....in the 
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relatively unexplored realm of adolescent fan- 
tasy life * While the investigation seems to 
have been conducted with care, some possibility 
of bias is present since the same investigator who 
had administered the Picture-Story Method oc- 
casionally participated in the collection of the 
other information as well. This allows the danger 
that the investigator would look for and conse- 
quently see what he thought “should be there.” 
The hazards from this source are probably not 
too great, however, because the investigators do 
not appear to have had preconceived notions as 
to what the data should mean in the final analy- 
sis and interpretation. * No direct demonstra- 
tion of validity is provided for the interpreta- 
tions that are given. The conclusions drawn are 
numerous and important. * This report....should 
prove to be of value to those investigators who 
are concerned with the deeper mainsprings of 
behavior as well as to those who teach, counsel, 
and advise adolescents. 

Psychoanalytic О 19:431-3 Jl "50. Hyman 
S. Lippman. * a painstaking effort to penetrate 
the personality and conflicts of the adolescent * 
a good deal of significant material was stimu- 
lated by the pictures which could be very useful 
in psychotherapeutic interviews. The tests 
would be more revealing in psychotherapy be- 
cause the therapist is in a position to elicit asso- 
ciations and to trace the source of the fantasy, 
after having obtained a trusting relationship 
with the subject. In spite of the fact that there 
were so many revealing fantasies in this study, 
this test has not the advantages of the Ror- 
schach Test in which the subject is totally un- 
aware of the nature of the material he is reveal- 
ing. Its advantage is the stimulation to discuss 
subjects generally avoided which produce con- 
flict. * In several instances references to the 
dynamics involved in the stories seem forced 
and unclear. This may be due to the fact that 
material is omitted for brevity. It would seem 
to the reviewer, however, that such references 
should not be made unless the evidence for them 
is clear and unmistakable. The author wisely 
states in his conclusions that the interpretations 
at best are hypothetical, and subject to revision. 
When the master of a technique concludes with 
such a statement, it is to be hoped that his less 
well-trained followers will use correspondingly 
greater caution. Adolescent Fantasy is a book 
well worth reading. The picture method will 
probably be used extensively and, as time goes 
on, may help to reveal important factors con- 
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cerning the complex fantasy life of young people. 

О R Biol 26:111 Mr "sr. Helen Arthur. * 
impressive study * exceedingly careful and de- 
tailed approach * Throughout the book there 
is a nice balance between the statistical use of 
the data and a reporting of the actual clinical 
material. * 


i [134] 

*Szondi Test. Ages 4 and over; 1937-51; 8 “factors” 
and 4 “vectors” (each vector is the total of 2 factors) : 
homosexual, sadistic, sexual vector, epileptic, hysteric, 
paroxysmal vector, catatonic, paranoic, schizophrenic 
vector, depressive, manic, contact vector; individual; 
BM; 1 form, '47; $5 per manual, Introduction to the 
Szondi Test ('49—sce тт below); $11 per set of 48 
test pictures ('47); separate answer sheets may be 
used; $1.80 per 5o IBM answer sheets (sra led 
Form D); $3 per тоо profile sheets '49—labeled 
Form B); $3 per тоо table-of-tendency-tensions sheets 
("49—labeled Form C); $3 per so folders containing 
the preceding two sheets ('49—labelled Form A); 
postage extra; test must be administered "at least six, 
preferably ten, times with at least one day intervals 
between administrations"; (10-15) minutes per ad- 
ministration; English manual by Susan K. Deri; L. 
Szondi; Grune & Stratton, Inc. * 
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Arpie LuBiN, Statistical Psychologist, Insti-, 
tute of Psychiatry, Maudsley Hospital, London, 
England. 

The aim of this review is to persuade psy- 
chologists to use the Szondi Test only for those 
purposes for which it has been shown to be 
valid. Since the published data to date have not 
shown the Szondi to be unequivocally valid for 
any of its stated purposes, this aim resolves it- 
self into a warning that the Szondi must be used 
for research purposes only. 

Deri states that the purpose of the Szondi 
Test is “to reflect the personality as a function- 
ing dynamic whole. * Its field of application is 
-.Similar to that of other projective technics; 
in other words, as a diagnostic instrument for 
clinical use or for the interpretation of the so- 
called normal personality, vocational guidance, 
experimental social psychology and a variety of 
fields of research" (тт). Deri claims that the 


Szondi is particularly valuable for demonstrat- 
ing changes in personality. 

THE VALIDITY OF THE SZONDI FOR PSYCHO- 
DIAGNOSIS. Studies by David (9, 45), Fosberg 
(49), and Whiteman (64) give, on the whole, 
negative results. Deri concludes that psychotics 
can be distinguished from normals because of the 
greater stability of the normals (22). However, 
when her data were analysed by the reviewer, 
the results were not wholly consistent with her 
conclusions. 

From the data available, it seems that the 
diagnostic signs given by Deri and Szondi are 
invalid. Although David and Whiteman have 
shown that certain empirically derived scores 
do discriminate, David points out that so many 
scores were tested, that some would be expected 
to pass the 5 per cent level by chance alone. 
Obviously the Szondi cannot be used for clinical 
diagnosis without further verification. 

THE VALIDITY OF DERIS NEED-SYSTEM THE- 
оку. Deri has elaborated a theory about the 
Szondi which makes use of the concept of “need- 
systems.” The essence of the theory is that each 
of the eight factors represents a need, and that 
a lack of choice in a factor (known as an “open” 
reaction) reflects a weakness of the particular 
need or indicates that the drive has been “lived 
out" through adequate activity. Conversely, a 
large number of choices in a factor (known as 
a “loaded” reaction) indicates an unsatisfied 
need. For example, an open reaction in the sexual 
vector would ordinarily mean that there had 
been sexual activity. у 

Fosberg (49) has produced the most ex- 
tensive and best designed tests of Deri's theory. 
Although his statistical technique is not the best 
that could have been used, his data show that 
the Szondi Test is not sensitive to sexual ac- 
tivity or electrically induced seizures, ( Deri spe- 
cifically states that the Szondi should be a good 
measure of the tensions and discharges in these 
need-systems.) Deri's data (22) suggest that 
the s, Ay, and Ё factors are affected by a com- 
plete course of electric shock treatment. But the 
changes in the Лу and & factors had not been 
predicted by Deri and contradicted the general 
theory that discharges of need tensions should 
result in decreased loadings. 

CHANCE AND THE SZONDI TEST. The question 
has been raised whether the responses to the 
Szondi are the same ones that would occur by 
chance. Researches by Cohen (42), David (45), 
Fosberg (49) and Guertin (27) show that 


134 THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 256 


————————————————————————————————————————————É9t 


chance is not the sole determiner of Szondi re- 
sponses. This is necessary but not sufficient for 
establishing the validity of the Szondi technique. 

THE VALIDITY OF THE LABELS ON SZONDI'S 
PICTURES. What is the relation of Szondi's pic- 
tures to the eight types of mental patient they are 
supposed to represent? The studies by Klopfer 
and Borstelmann (30), Rabin (34-5) and Fos- 
berg (49) seem to show that psychiatrists and 
psychologists, as well as other subjects having 
some psychological training, can classify the 
photographs into Szondi's eight categories with 
considerably better than chance success. 

THE INTERNAL CONSISTENCY OF THE SZONDI. 
Studies by Ancelin, Duchene and Schützen- 
berger, Lubin and Malloy (56), and Guertin 
(27, 52) show that photographs bearing the 
same Szondi labels do not (except by chance) 
measure the same trait. 

THE TEST-RETEST RELIABILITY OF THE SZONDI. 
Lubin and Sandler (бо) found that the test- 
retest correlations ranged from .20 to .65 for the 
16 like and dislike scores. (The interval between 
test and retest was 5 to 7 days.) АП correlations, 
except the lowest, are significantly different 
from zero. It was found that the test profile of 
Szondi responses could be matched to the retest 
profile with 96 per cent accuracy. 

SUMMARY AND CONCLUSIONS. The Szondi is 
a highly structured, easily administered test; 
however, it must be given over at least six days, 
and it requires a long time for interpretation. 
Szondi's "genotropic" theory has been rejected 
by Deri, who has substituted a need-system set 
of assumptions. Empirical attempts to validate 


the need-system theory have resulted in observa- * 


tions which contradict the theory (except for 
Deri's results which were ambivalent). Szondi's 
and Deri's diagnostic signs cannot be trusted 
for use in clinical diagnosis, but other scores not 
mentioned by Szondi or Deri may be of some 
use. The observed relations between the photo- 
graphs are not in accordance with the theory on 
which the test was constructed, and it is highly 
probable that the majority of photographs 
measure traits other than those indicated by the 
labels they bear. The Szondi has a low internal 
consistency but a high test-retest reliability 
(presumably due to memory). 

Because there is no positive evidence of the 
Szondi's theoretical validity, practical useful- 
ness, or superiority over other widely used tests, 

1 Ancelin, A. E.; Duchene, H.; and Seite aug’ oat M. P. 

. Szondi." 


“Recherches Critiques sur la Theorie et le Test de 
Enfance 3: 65-73 'so. (PA 25:3155) 


such as the Rorschach, it is not recommend- 
ed. 

If further research is to be carried out on the 
Szondi, the reviewer feels that the most fruitful 
approach would be to substitute classification by 
the clinician for the single quantitative scores 
and repeat the studies on the diagnostic utility 
of the Szondi as well as on the validity of Deri's 
theories. 


ALBERT I. Rasin, Associate Professor of Psy- 
chology, and Director, Psychological Clinic, 
Michigan State College, East Lansing, Michi- 
an. 
А This test is a projective technique which does 
not involve any verbal responses. The test ma- 
terial consists of six sets of photographs of 
mental patients in European hospitals. Ir each 
set there are eight pictures representing the fol- 
lowing diagnostic categories: homosexuality, 
sadism, epilepsy, hysteria, catatonic schizo- 
phrenia, paranoid schizophrenia, depression, and 
mania. Each set is placed before the testee who 


is asked to select the two pictures he "likes ` 


most" and the two he "dislikes most." Thus, the 
final selections consist of 12 liked and r2 disliked 
pictures, out of the total of 48. For clinical pur- 
poses it is recommended that the test be ad- 
ministered at least six times with one-day inter- 
vals between testing sessions. It usually takes 
about 15 minutes to give the test. 

The basic assumption of this device is that 
the photographs have meanings which cor- 
respond to need-systems characteristic of all 
persons. The selection or rejection of the photo- 
graphs is indicative of the relative tension reduc- 
tion in the need systems which they represent. 
The test profile that is obtained indicates the 
relative tension present in each need system 
represented by the factors which are named 
after the diagnoses represented in the photo- 
graphs (homosexual, sadist, etc.). A related as- 
sumption is also the equal valence of pictures 
within the same diagnostic category. 

The interpretation is a rather complex ps код; 
since it does not involve only the relative loa à 
ing" (selection and rejection) of the factors, bu 
also the interplay of the need systems and their 
integration. within the personality. Moreover, 
the "process" of the changes from one ай 
tration to another further complicates the tas 
The result is not a cross sectional static analy: "e 
but a dynamic interpretation in the tempor? 
dimension. i 
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Since the publication of the English manual, 
a number of investigations with the Szondi have 
been reported. The acceptance of this test as a 
clinical tool is not widespread at the present 
time. Contrary to the historical trend of other 
projective techniques, clinical reports are few. 
The bulk of the investigations deal with a criti- 
cal examination of the basic assumptions and 
theoretical foundations of the test. 


Some studies have demonstrated that рѕу- ' 


chologically trained persons without familiarity 
with the test are able to identify the diagnoses 
represented in the pictures better than is ex- 
pected by chance (34, 35, 49). This is an indi- 
cation of the “meaningfulness” of the pictures. 
They are not merely neutral stimuli. It was also 
found that the pictures have “а demonstrable 
associational valence" similar to that described 
by the author of the test (30). However, another 
study (63) showed no differences between the 
catatonic and paranoid pictures in traits assigned 
them by several judges. Muscular contractions 
and expressions of friendliness in the pictures 
seem to be important determiners in picture 
selection. Some doubt is, therefore, cast upon the 
picture representation of particular need sys- 
tems. That the pictures are not selected on a 
chance basis and that sex differences may be 
seen in the selection of some of them has been 
demonstrated in another study (19). Yet, arti- 
facts other than the “foreign” appearance (20) 
of the patients must be operating. 

The assumption of equivalence of pictures 
within the same diagnostic categories does not 
hold up in any of the investigations dealing with 
the problem (26, 27, 49, 51, 52)- 

There is ample evidence that the Szondi does 
not differentiate readily between groups of psy- 
chotics and normals (64) or between different 
psychopathological groups such as homosexuals 
and epileptics (46). Lack of relationship between 
sexual discharge, convulsive seizures, and 
changes in Szondi factor loading has also been 
shown (49). Some positive relations between 
Szondi choices and fluctuations on other tests 
in a single case have also been reported (28). 

Many difficulties in the statistical treatment 
of Szondi data have been encountered. Some of 
the reported suggestions (42) may be useful in 
further validation studies. 

Thus far, the evidence shows that the cautious 
approach to the Szondi as a diagnostic tool has 
been amply justified. Pending more positive sta- 
tistical and clinical validating findings, it will 
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have to continue its precarious position in the 
field of projective techniques. 


For a review by Susan K. Deri, see 3:100; 
for related reviews, see 135. 


[135] 


*[Re Szondi Test.] Derr, Susan. Introduction to 
the Szondi Test: Theory and Practice. Foreword 
by Lipot Szondi. New York: Grune & Stratton, Inc., 
1949. Pp. xiv, 354. $5.00. * (PA 23: 3748) 


Am J Orthopsychiatry 20:650-1 Jl ’50. 
Joseph P. Lord. This book, strongly endorsed 
by Szondi as a valuable “supplement to and 
elaboration” of his Experimentelle Triebdiag- 
nostik stems from a well-qualified background ; 
its author has had eleven years of clinical ex- 
perience with this projective technique and 
worked with its originator for four years during 
its inception. * Supporters of Freudian psycho- 
analysis or Lewin's vector psychology will note 
the inclusion of many of their principles; the 
statistically minded will writhe at the large 
amount of nonquantitatively supported data and 
at the necessity of “suspending judgment" until 
employing the technique, although Deri par- 
tially precludes this objection by reference to the 
appendix of Szondi’s cited work; the clinician 
in search of a rapidfire personality prober will 
derive satisfaction from the objectivity of the 
scoring and from the ease of a single administra- 
tion, but must accept the limitations inherent in 
the necessity for six to preferably ten day-spaced 
administrations to the same Subject, as well as 
in the complexities involved in the wide range 
of possible factor and vector combinations. Of 
great import, if the author’s claim is valid, is 
the test’s capacity for delineating levels of func- 
tioning in the personality. 

Am J Psychiatry 107:237 S ’50. Раш E. 
Meehl. This important book appears oppor- 
tunely, as the Szondi test is arousing increasing 
interest among American clinicians, most of 
whom will probably not work their way through 
Dr. Szondi's two volumes in German. * The 
book is clearly an attempt to teach clinicians how 
to use the test, and not in any sense a study of 
its validity or an effort to produce conviction 
about the latter. She says “....acceptance of prac- 
tically any of my statements about the....con- 
stellations is left to the good-will of the reader” 
(p. xi). This remark will strike most American 
psychologists as rather odd, unless the purpose 
of the book is kept in mind. * On the negative 
side, it is dangerous to speak pending satisfac- 


135 


tory validity studies. There is a good deal of 
the “now-you-see-it-now-you-don’t” type of dis- 
cussion, in which the interpretations are loaded 
with enough qualification and references to the 
"rest of the pattern" so that we approach em- 
pirical emptiness. Although an isomorphism 
with the psychoanalytic vocabulary is denied, 
the constant use of it with subtly altered mean- 
ings produces a certain vagueness. In the at- 
tempt to get across the “feel” of a personality, 
enough words are used so that the picture be- 
comes uncomfortably general There is some 
statistical-psychometric naiveté (or else care- 
less formulation). For example, we are told 
that "..these changes always imply some 
change in the subject's attitude toward a particu- 
lar drive" (p. 40). Here we see no apparent 
recognition of the existence of “uniqueness” in 
the factor-analytic sense—each picture is free of 
any contribution from specificity or error. Again, 
the question is not raised of how much of the 
temporal shifting found in schizophrenics could 
be attributable simply to the variability found in 
almost any kind of repeated performance ob- 
tained from these patients. Again, the threshold 
value for the various pictures is not discussed; 
it is apparently assumed that each picture is lo- 
cated at the same “difficulty” level on its own 
factor, and that no other variables produce dif- 
ferences in its stimulus properties. Hence, we 
are asked to accept differences among picture 
sets (absolute values) as confirmatory of the 
theory, e.g., “....minus s is not very frequent be- 
cause it implies ability to sublimate aggression 
which is not common..." (p. 77), or “the fre- 
quency of minus р in the general population... 
is much higher than that of plus ....this would 
mean that most people act according to their 
emotional needs..." (p. 179). In general, all 
responses and shifts in responses are attributed 
to real changes in the named factors, as if per- 
fect validity and factorial purity were estab- 
lished. The factor matrix of the Szondi must be 
unique among all psychological tests ever built, 
if this is true. Finally, even though it is unfair 
to criticize a book for not achieving aims ex- 
plicitly repudiated by the author, it is perhaps 
fair to ask why no systematic validity data are 
presented? This reviewer would gladly have 
exchanged Chapter III, on "factorial associa- 
tion," for a section of similar length presenting 
some validation material. After all, the book is 
not easy to read, and will be a lot of work to 
really master. Surely the reader is entitled to a 
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few pages devoted to establishing validity, be- 
fore taking on such a task. It is not a question 
of his lacking “good-will,” since the best will in 
the world could not convince the scientific clini- 
cian on the basis of this book. Should we be asked 
to accept “....proving it pragmatically by the use 
of the test..." (p. ix) or “....the increasing de- 
mands for Szondi reports..." (p. Xi) as substi- 
tutes for validation studies, to the point of em- 


"barking on a learning task which will only then 


be able to tell us, from our own experience with 
it, whether we spent our time well or not? I do 
not think so. 

Am J Psychol 63:141—2 Ja "50. B. L. Margo- 
let. * The....book runs from a description of the 
test and the proper mode of its administration to 
a polemic upon its significance, a polemic which 
occupies a good part of the eleven chapters 
(Chaps. iv-xi). The point of departure for the 
argument is the test-subject’s selections of 
“liked” and “disliked” pictures from among 48 
serially arranged photographs of “mental pa- 
tients” (8-16). The tenets of the argument 
would be the transition between these selections 
and a divination of the mainsprings of the sub- 
ject’s “personality.” But they are tenets neither 
convincing nor plausible. As here conceived 
“personality” seems to represent a centrifuged 
mixture of orthodox freudian mechanisms and 
forces, psychiatric “malignancies,” lewinian vec- 
tors and tensions, some of Szondi’s factorial pos- 
tulates, and certain products of Deri’s own clini- 
cal and pedagogical experiences (2-4, 7-8, 
25-28, 65-66, 88, 118-120, 167—172, 287). Sanc- 
tion and documentation for this strange product 
are wanting. The author, who has been pressed 
to write now through suspicion of misuse of 
Szondi's work (ix, xi), notes that her exposition 
rests upon assumption and postulate (3, 26, 38, 
45, 66, 286) and states that, because of “the lack 
of rigorous quantitative validating data....the ac- 
ceptance of practically any of my statements 
about the meaning of the various factorial con- 
stellations [the main concern of the book] is left 
to the good-will of the reader” (xi). Some three- 
hundred pages of these “statements” make a 
heavy draft upon “good-will” standing in the 
account of a critical reader. The volume seems 
to be making headway among some clinicians 
in this country. Possibly a simplified Szondi, 
largely relieved of statistics and written in Eng- 
lish, might be expected to arouse hopes among 
clinical testers of personality. 

B Menninger Clinic 15:34 Ja "51. Walter 


259 | 


Kass. * There is much intuitive material of inter- 
est to clinicians, But those who look for valida- 
tional data will see only large claims and little 
evidence. 

J Abn & Social Psychol 45:184-8 Ja’ 50. Roy 
Schafer. The formal introduction of a new per- 
sonality test of any complexity is an extremely 
demanding task. Mrs. Deri’s book, based on 11 
years’ participation in the development and ap- 
plication of the test, meets some of the demands 
but falls far short of meeting many others. Un- 
fortunately, limitations of space in this review 
will make it impossible to cover adequately all 
of the author’s claims for the test or to docu- 
ment all the criticisms to be advanced. * basic 
definitions [are] given on pages 55-56; and 
pages 57—285 are devoted to elaborating their 
general implications as well as their specific 
meanings in different factorial and vectorial con- 
figurations of choices. This task is carefully and 
thoroughly carried out, with due respect for the 
shifting psychological implications of specific 
sub-patterns depending on the total contexts in 
which they occur. Frequently only lip-service 
or a little more is paid in publications to the 
problems of interpretation-within-context, but 
Mrs. Deri takes this quite seriously. * The book 
is easily readable and has a helpful index. There 
are three main headings under which this book 
will be criticized : specific test rationale, valida- 
tion, and the general handling of personality dy- 
namics in the interpretive process. Rationale: 
There is a minimum of rationale in the strict 
sense of the word, i.e., the elucidation of medi- 
ating processes between the stimulus and the 
response. Pertinent mediational processes would 
include all the stimulation of thoughts, attitudes, 
values, and feelings by global and/or fragmen- 
tary perception of the photographs, how these 
are integrated, how the level of attention fluctu- 
ates from one picture to the next, etc. The author 
loosely refers to the principles of interpretation 
as “rationale” and otherwise pretty much limits 
herself to the basic assumption that the psycho- 
pathological dynamics in the members of the 
various diagnostic categories are somehow com- 
municated to and perceived by the subjects 
through the photographs used. This assumption 
actually embraces or conceals a number of as- 
sumptions. (1) The dynamics of patients in each 
psychiatric category are well established, and 
(2) are identical in all patients with the same 
diagnosis. (3) These deep dynamics are ex- 
pressed in the faces of the patients, and (4) these 
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facial expressions are adequately “caught” in the 
pictures used, (5) The underlying dynamics are 
expressed equally well in all six representatives 
of each diagnostic category, pattern analysis 
being a purely quantitative matter and remain- 
ing independent of particular choices; thus (6) 
the sexual identity of these pictured patients 
makes no differences in one's response to them, 
(7) the more or less striking cultural differences 
in appearance (hair-do, clothing, beard) make 
no difference to American subjects, and (8) 
there are no “popular” reactions (pictures 
“liked” or “disliked” with very high frequency) 
which, by being “popular,” require some modi- 
fication of interpretation ; this reviewer's experi- 
ence with the test indicates that there are “рори- 
lars." (9) The expressed dynamics on the faces 
of the patients are equally well perceived by all 
subjects, so that for each picture the “likes” or 
"dislikes" of all subjects have essentially the 
same qualitative implications and quantitative 
weights. (10) Varying choices from day to day 
reflect basic shifts in the subject's need-tensions 
and have little or nothing to do with factors of 
satiation, superficial mood changes, interest or 
disinterest in consistency of reaction, and other 
ego attitudes. (11) The fate of every factorial 
distribution in the test is independently deter- 
mined, and the limited number of choices (12 
plus, 12 minus) has no real bearing on the re- 
sults; thus, for example, given special sensi- 
tivity to pictures in two or three factors with 
resulting loaded distributions in these factors, 
the remaining more or less open distributions in 
the other factors can still be directly interpreted 
as if they were left open by conscious or uncon- 
scious intent alone. All of these assumptions are 
more or less doubtful. The uniqueness of indi- 
vidual perception, particularly in interpersonal 
matters, one of our most valuable clinical psycho- 
logical insights, is implicitly dismissed when the 
above assumptions are made, although its super- 
ficial expression is acknowledged. Validation: 
It goes without saying that in the introduction 
of a new test with as bold and deep interpre- 
tations as are made from this, careful presen- 
tation of validating techniques and data is in- 
dispensable; The author frequently mentions 
interfactorial correlations or significant group 
differences with no mention of the size and se- 
lection of the samples, the statistical techniques 
used, or the magnitude and reliability of the re- 
sults obtained. Strict validation of psychological 
interpretations of test results is admittedly an 
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extremely complex, uncertain, and as yet by no 
means satisfactorily worked out procedure, but 
. when big claims are made, one would like to 
know more about their justification. With re- 
gard to her presentation of the genetic and oc- 
cupational studies as validational material, even 
if it were adequately presented, it would not be 
entirely satisfactory as validation, because (1) 
it makes the dubious assumptions that there are 
common dynamics in the members of any occu- 
pation or age level and that these dynamics are 
well established, and (2) since the "established" 
test patterns do not cover all the members of the 
group, it must be, but has not been, demonstrated 
that the interpretations apply to just those mem- 
bers of the group who have the patterns in ques- 
tion. Indirect validation of the type attempted, 
even when statistically significant differences are 
obtained, does not truly validate the deep inter- 
pretations. The author claims, however, that 
pragmatically the test works, that is, skillful in- 
terpretation produces a personality description 
that corresponds closely with the clinical picture 
as derived from case history and psychiatric or 
psychoanalytic study of the individual. Indeed, 
in discussing the absence of formal validational 
material in the book, the author asks for the 
reader's “good will" acceptance of the interpre- 
tive principles, and assures the reader that he 
will convince himself of their validity by the use 
of the test. It is here that the basic weakness of 
the entire presentation is found. It is difficult to 
imagine how, at the present stage of sophistica- 
tion in clinical psychological validation tech- 
niques, one can validate the claims made for the 
test. The author acknowledges this difficulty but 
then claims convincing pragmatic validation— 
as if this could be something other than a more 
or less systematic correlation and ordering of 
experience and still retain meaning. The extreme 
difficulty of validation comes about, first of all, 
because scant attention is paid by the author to 
the phenotypical referents of many of the deep 
dynamic trends. Without carefully specifying 
these referents, and as would probably be neces- 
sary, the context or configuration of referents 
from which the underlying genotype may be in- 
ferred or ruled out, the author's claims are not 
subject to validational tests. In this regard, the 
author treats variables like “need to fuse into 
environmental objects" relatively casually, as if 
their observation is a more or less routine matter 
to any psychoanalytically sophisticated observer. 
Altogether an atmosphere of certainty is cre- 


ated around a loose mixture of clinical psycho- 
analytic findings and hypotheses, which though 
highly provocative and penetrating, are often 
equally abstract and speculative. The author 
writes as if always presenting clinical facts, while 
the fact of the matter is that on theoretical points, 
on the general dynamic characterizations of di- 
agnostic entities, and on the interpretation of 
specific case material, there is as a rule quite a 
bit of variation of emphasis if not outright dis- 
agreement in the field, even when those con- 
cerned are numbered among the more orthodox 
Freudians. A second source of validational diffi- 
culty, related to the difficulty of establishing con- 
crete indications and contra-indications, is that 
if one proceeds to a sufficiently abstract level of 
dynamic description and does not strictly and 
consistently define corresponding behavioral ref- 
erents one reaches a point where all things are 
true of all people; that is to say, we all have im- 
pulses or aspects of impulses which may be char- 
acterized as oral or clinging, anal or possessive, 
homoerotic, narcissistic, exhibitionistic, repres- 
sive, projective, hostile, and the like. The life 
of a human being is so complex that by the 
proper selection or interpretation one can set 
out, if he wishes, and find behavior items to con- 
firm almost any interpretation. Case material of 
any detail is enormously plastic and not to be 
trusted in a clinical experiment unless there are 
guarantees of independence of validating criteria 
and techniques, the precise nature of which are 
stated, and unless weighted estimates or predic- 
tions, however gross, are made and then checked. 
Careful reading of the chapter оп the “exper 
ment of factorial association,” in which some 
randomly selected descriptions of pictures by 
different types of subjects are presented as tenta- 
tive validation material, will illustrate how 
blurred the line can become between careful, 
controlled analysis and facile, often glib inter- 
pretations which smack of sophisticated psycho- 
logical fortunetelling. The same can be said for 
the author’s hasty and omniscient rationalization 
of the genetic, occupational and diagnostic find- 
ings presented all through the 228 pages on 10- 
terpretation. Further mistrust is engendered by 
the hedging, I’m-never-wrong quality in state- 
ments to the effect that indications of lack © 
tension in any need-system do not mean that 
there is no such tension in the subject, but only 
that such tension is not indicated in the test 
results. Theoretical aspects: (1) Though the 
author leans heavily on Freudian psychoanalytic 
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knowledge and theory to support her arguments, 
in many places she comments on the happy coin- 
cidence of Szondi Test and psychoanalytic find- 
ings as if these were two independent systems, 
and in still other places she indicates that the 
Szondi Test has an independent psychology of 
its own, though it overlaps much with psycho- 
analysis. This shifting use of personality theory 
is confusing, and the introduction of a relatively 
independent psychology of personality and psy- 
chopathology as a frame of reference for a new 
test inevitably implies a closed, instrument- 
centered system: personality becomes what the 
Szondi Test measures. Although the book gen- 
erally adheres to the Freudian psychoanalytic 
viewpoint of the last 30 years, it also refers fre- 
quently to Lewinian concepts and sometimes, 
where this is convenient for her argument, to 
Jungian and Rankian concepts, and even to some 
early ideas of Freud’s which, as the author ac- 
knowledges, he subsequently repudiated. There 
is a quality of opportunistic eclecticism and se- 
lectivity in all this. Further material of this sort 
includes relatively frequent, casual, and in the 
main unacknowledged revisions of accepted psy- 
choanalytic theory, as where she maintains that 
introjection and repression are dynamic oppo- 
sites, and, as in the test report quoted above, 
that what she considers to be an essentially orally 
fixated and apparently quite neurotic patient has 
“basically healthy heterosexual drives.” (2) It 
is a questionable proposition that it is the ten- 
sions in the loaded factors which underlie the 
symptom formation or overt behavioral charac- 
teristics indicated in the open factors ; this prin- 
ciple implies, for example, that tension indicated 
by a loaded paranoid factor may underlie symp- 
tom formation in the oral area, or vice versa. (3) 
It is a questionable proposition that the distribu- 
tions in the ego vector reflect dynamics (the 
general mode of handling needs) at a more gen- 
eral level than those of the other three vectors. 
That the mediational processes can vary 50 
(reaching different levels of personality organ- 
ization) while the stimulus situations remain 
the same (photographs) requires considerably 
more explanation and validation than we find. 
The same point holds for the paroxysmal vector 
which has to do chiefly with defenses, while the 
sexual and contact vectors have more to do with 
specific needs. (4) There is a tendency to reify 
some of the constructs used, particularly those 
in the ego vector. (5) There is a questionably 
expansive and fluid use of the key terms “subli- 
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mation” and "projection." The author ends by 
promising a second volume to follow soon with 
more detailed case illustrations..It is to be hoped 
that there will be a third and less omniscient 
book presenting discussion and data pertaining 
to rationale and particularly to sound valida- 
tional techniques and their results ; the magnifi- 
cent claims made for the test are badly in need 
of support. A convincing validational technique 
for this test and for the variables described by 
the author will represent a great methodological 
contribution to clinical psychology and psychia- 
try. It is to be doubted that this will be forth- 
coming: a test in which process is so concealed 
and only the end results observable is inherently 
weak as a projective technique; interpretation 
is inevitably dictionary-style. It is difficult to 
see what aspects of the response process the in- 
terpreter can project himself into—a psychologi- 
cal act, according to the author, indispensable in 
adequate interpretation. 

J Consult Psychol 13:307 Ag’ 49. Laurance F. 
Shaffer. The long-awaited text on the Szondi 
method contains the instructions for its adminis- 
tration, which are simple, and those for its inter- 
pretation, which are not simple at all. * The 
most astonishing aspect of the book is the total 
absence of evidence attesting to the method’s 
value other than by reference to the experience 
of the author, and the lack of any reference to 
research literature beyond Szondi’s three books 
in German. The absence of public evidence for 
the test’s merit is a matter of preference or phi- 
losophy, rather than of neglect or ignorance. Mrs. 
Deri appeals to private evidence. The acceptance 
of her statements is “left to the good-will of the 
reader,” who should keep his judgment “sus- 
pended until he can convince himself about the 
clinical validity....of the test." But, alas, the per- 
sonal feeling of validity has been an unhappy 
guide since Mesmer; most of us prefer more 
public and communicable testimony. It may be 
hoped that skillful and hard-headed research 
workers will not be repelled by the implausibility 
of the Szondi until it has been given a thorough 
and public evaluation. 

J Ed Psychol 41:447-8 № '50. George К. Ben- 
nett. * To the reader who is accustomed to look 
for experimental data in support of claims of 
usefulness or validity, this volume is wholly in- 
adequate. Elaborate explanations of the signifi- 
cance of various factors are given without even 
frequency tables. Sums, differences and ratios 
are computed and interpreted without thought 
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for the problems of reliability involved. Shifts in 
type preference or aversion from one day to the 
next are regarded as indications of changes in 
the needs and tensions of the subject rather than 
errors of measurement. Validity in the usual 
sense appears to be dismissed in the following 
paragraph: 


The superficial appearance of normalcy is responsi- 

ble for the extreme difficulties inherent in the problem 
of validating studies on the basis of observable behavior 
or verbal or written questionnaires. Many basically un- 
happy individuals who are unable really to become 
emotionally attached to any person or object would 
rate extremely high on a written adjustment inventory, 
or on the basis of observation. 
This reviewer can only conclude that, on the 
basis of Mrs. Deri’s book, the Szondi test is one 
more unproven instrument proposed as an aid 
in personality diagnosis. Pending the publica- 
tion of adequate supporting evidence, the psy- 
chologist using this method is proceeding on the 
basis of faith and should be willing to recognize 
the peril of his course. 

Psychoanalytic Q 19:112-4 Ja’50. William F. 
Murphy. * Criticism of this book cannot be sepa- 
rated from criticism of the test and it is difficult 
to know where to begin. Szondi’s theories of the 
genesis of mental disorders are omitted, and to 
this reviewer they appear to be simply a varia- 
tion of the universally accepted belief that consti- 
tutional factors play an extremely important role 
in determining the strength of the instinctual 
needs. The main bone of contention, however, 
is whether or not these needs are expressed in a 
photograph and are responded to in the test situ- 
ation. The author expresses the problem very 
well : *Without the assumption that these eight 
types of mental disturbances imply well defin- 
able extreme manifestations of generally known 
psychological mechanisms, the functioning of the 
test would be inconceivable. We also have to as- 


sume that the presence of these extreme and ex- . 


aggerated psychologic drives is somehow ex- 
pressed through the corresponding photographs 
and further that the subject's liking or rejection 
of the pictures is based on an unconscious identi- 
fication or counteridentification with the proc- 
esses depicted." She appears to be perfectly well 
aware that all of this is somewhat dubious and 
nonvalidated and spends a great deal of time and 
energy. digging into psychoanalytic literature in 
an effort to make it sound plausible. There ap- 
pear to be some doubts as to whether or not this 
has been accomplished, and she begs the read- 
.er's indulgence: “Аз it is, the acceptance of prac- 
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tically any of my statements about the meaning 
of the various factorial constellations is left to 
the good will of the reader....until he can con- 
vince himself about the clinical validity of the 
statements by the actual use of the test." With 
the leeway given an examiner with this type of 
test, any connection between results and validity 
is purely coincidental. The author feels that the 
coincidence of the student's interpretations with 
independent clinical evidence is a constant source 
of validation as is also the increasing demand for 
Szondi reports. In connection with this, another 
of her statements is interesting. "I am fully 
aware of the autistic nature of this reasoning 
since actually in this manner I am the only one. 
who has received the accumulative evidence of 
all these individual clinical validations.” This 
type of reasoning does not need criticism. She 
feels that the fact that the test works has to be 
accepted anyway as a pragmatic proof that 
something essentially characteristic is expressed 
and reacted to in the picture used as a stimulus 
material. No one will quarrel with this state- 
ment. However, one wonders what the subject 
is reacting to. An individual's reactions are 50 
unique and complicated that a great deal of work 
must be done fully to comprehend his likes and 
dislikes, which may turn more on a dislike for 
a beard or a bald head than on the fact that the. 
subject is a catatonic praecox, more on the size 
of the nose or the presence or absence of teeth 
than whether the subject is hysteric or epileptic. 
The subject may repeatedly choose adjacent 
photographs. Some may be interested in small 
details in the pictures and react to them accord- 
ingly. Others consider the picture as a whole as 
in the Rorschach Test. As yet, these reactions 
have not been integrated into the text. The ad- 
ministration of the test is adequately described 
and, despite the theoretical deficiencies, the ma- 
terial and clinical examples are interesting 
enough to warrant reading by those intereste 
in research in the evaluation of personality. 
Psychol В 47:360-1 Jl” 50. Helen D. Sargent. 
The publication of Susan Deri's long awaited 
book on the Szondi method will, in the opinion of 
this reviewer, be to the Szondi what Beck's Man- 
ual was to the Rorschach in 1937. Both books 
were written by ardent disciples of а master; 
both bear the stamp of their author’s original 
extension of a creative idea; and both were pro” 
duced in response to the intense interest of a few 
at a stage of apparent readiness for sharing wit 
the many. Since Deri began her seminars in 
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1945, research under her leadership has grown 
in an atmosphere of mingled fascination and 
skepticism. American psychologists have found 
it impossible to take seriously Szondi's theory of 
choice based on affinity between the genes of a 
subject and the genes of a patient whose photo- 
graph is selected as "liked" or “disliked.” The 
test has nevertheless had strong appeal, not only 
because of its ease and simplicity of administra- 
tion and recording, but because the experience of 
many clinicians suggests that clinically valid in- 
terpretations of choice profiles can be derived 
even without fully acceptable explanation of the 
assumptions developed from this theory. In her 
book, Deri loyally states Szondi's views, includ- 
ing the typology which appears in his own un- 
translated book. Then, with a tact which is un- 
usual in professional writing, she sets his theory 
aside and proceeds to the description of her own 
rationale of interpretation. To this task she 
brings her experience asa practicing analyst, and 
her training under Lewin. The determinants of 
choice are discussed not in terms of genetic char- 
acteristics of pictured patients falling in eight 
nosological categories, but rather in terms of 
stereotypes which the pictures represent to sub- 
jects who select or reject them in terms of their 
own needs. Choice, according to Deri, is the re- 
sult of tension or satiation in eight need-tension 
_ systems, grouped into four vectors, which pro- 
yide “ап octagonal gauge for the understanding 
of personality dynamics." Although the need 
systems are reconciled both with Freudian con- 
structs and with topological conceptualization of 
personality structure, the vectorial approach of- 
fers a new set of dimensions within which to 
conceive the dynamic processes of sexuality and 
aggression, cycles of control and discharge, ego 
structure and function, and object relationship. 
To the extent that the clinical validity of the test 
proves verifiable, the Szondi test, as Deri de- 
scribes it, promises several unique contributions. 
Through serial administration, it is possible to 
study test-retest change, thus capitalizing on a 
form of unreliability which in more cumbersome 
methods is an obstacle. Second, the interpreta- 
tion of “loaded” and “open” factors, in terms of 
discharge and discharge readiness, allows the 
prediction of symptoms and overt behavior pat- 
terns in relation to stable, “root” factors in 
which may be sought the deeper, inner springs 
of action. Although Deri’s book does not in it- 
self clear the mystery of why the test appears to 
“work,” it offers a clear formulation of assump- 


tions which should lend themselves to experi- 
ment. It is to be hoped that clinicians will not 
use the book as a manual of signs for the de- 
ceptively easy identification of syndromes and 
traits, but will utilize it as a source book of hy- 
potheses for systematic exploration. 
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Thematic Apperception Test. Ages 7 and over; 
1943; also called ТАТ; individual; 1 form; no data 
on reliability ; $6 per set of testing materials; cash or- 
ders postpaid; nontimed (120) minutes; Henry A. 
Murray; Harvard University Press. (Oxford Univer- 
sity Press, London: 1944; 28s.) 
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AnTEUR L. Benton, Professor of Psychology, 
State University of Iowa, Iowa City, Iowa. 


In the Third Mental Measurements Year- 
book, appraisals of this instrument were gener- 
ally favorable. However, at least two of the re- 
views carried the implication that the clinical 
utility of the test lay in the opportunity for eval- 
uation both of the specific productions and of 
general behavior which it afforded the experi- 
enced clinician rather than in any distinctive 
merits it possessed as an objective psychodiag- 
nostic instrument. Thus Rotter concluded that 
*the TAT cannot be considered as a clinical in- 
strument apart from the person who uses it. 
The value, significance, nature, and validity of 
the tests are dependent upon the interpreter, his 
experience, and his approach to the field of per- 
sonality." And the present reviewer commented 
that “the general validity of the test is fairly well 
established if one means that in the hands of a 
competent examiner the test data permit true, 
meaningful inferences. At the present time the 
greatest hindrance to its usefulness is the lack 
of normative data on the test productions." 

Recent research on the test has been devoted 
both to the problem of making it a more objec- 
tive procedure and to communicating clinical ex- 
perience with it. Shorr (146) proposes a new 
scoring system which apparently possesses a 
high degree of inter-examiner agreement and 
which apparently yields data amenable to statis- 
tical treatment. Lyon, after reviewing different 
systems of interpretation (Murray, Wyatt, 
Henry, and Lasaga), illustrates his own proce- 
dure which is more or less a blend of previous 
systems, Stein (147) offers a useful manual for 
the test’s administration and interpretation. 
Aron (154) presents a new, highly developed 
scoring system which is objective in character 
and which is supported by preliminary evidence 
suggesting clinical validity. In an experimental 
study, Hartman (261) demonstrates that many 
TAT categories are significantly related to clini- 
cally important personality variables. 

That the Thematic Apperception Test enjoys 
wide use in clinical facilities is undeniable. Yet, 
one has the impression that for the most part it 
is extensively utilized only in those setups where 
the psychologist’s functions are restricted to diag- 
nostic testing with evaluative interviewing and 
psychotherapy being the province of the psychia- 
trist and the psychiatric social worker. Indeed, 
even the friendly observer cannot escape the im- 
pression that in many instances the utilization 
of this rather tedious and time-consuming tech- 
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nique serves no purpose that could not be accom- 
plished more efficiently by personal interview. 
This implies that the TAT has no special merits, 
ie. that it does not provide data which are not 
readily elicitable in other situations and that the 
data it does provide are no more objective than 
ordinary interview data. Happily, the indica- 
tions of the newer investigative work are that 
both implications are unwarranted. There is real 
promise that, with the development of useful 
scoring systems, such as that of Aron, and the 
establishment of empirically verified principles 
of interpretation, the test will achieve distinctive 
value as a psychodiagnostic instrument. 


For reviews by Arthur L. Benton, Julian B. 
Rotter, and J. R. Wittenborn and an excerpt 
from a review, see 3:103; for related reviews, 
See 137-41, 3:104, and 3:104a. 


[137] 
Ae Thematic Apperception Test.] Bellak TAT 
Blank. Ages 7 and over; 1947; a form for recording 
and analyzing responses to the Thematic Apperception 
Test; $1.10 per 10; $1.50 per 100 analysis sheets; 356 
per manual; 60€ per specimen set; postpaid; Leopold 
Bellak; Psychological Corporation. * 


J Consult Psychol 12:126 Mr—Ap ' 48. Except 
for almost blank pages for recording and for final 
summary, the Bellak TAT blank consists en- 
tirely of the analysis page, which provides a 
check-list differing considerably from the 
method introduced by Murray. The 14 compo- 
nents of the analysis аге: main theme, main hero, 
attitudes to superior figures and to society, fig- 
ures introduced, objects introduced, objects 
omitted, attribution of blame, significant con- 
flicts, punishment for crime, attitude to hero, 
signs of inhibition, outcome, pattern of need 
gratification, and structure of plot. The manual 
gives directions for the administration of the 
TAT, and describes the scoring categories. 


[138] 

X*Thematic Apperception Test: Thompson Mod- 
ification. Negroes ages 7 and over; 1949; individual ; 
I form; no data on reliability and validity; no norms; 
$6 per set of testing materials; cash orders postpaid; 
nontimed (120) minutes; Charles E. Thompson; Har- 
vard University Press. (Oxford University Press, 
London: 1949; 325. 6d.) 
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Underlying the Ni Version of the TAT.” J Abn & Soci 
Psychol 45:700-9 $ 'so. * (PA 25:2466) cial 
5. SCHWARTZ, EMANUEL ; Riess, BERNARD F.; AND Сог. 
TINGHAM, ALICE. "Further Critical Evaluation of the Negro 
Version of the TAT.” J Proj Tech 15:394-400 S °51, * 


Am J Orthopsychiatry 21:208 Ja’ 51. Eugenia 
Hanfmann. * the author....concludes that identi- 
fication is likely to be greatest when the pictorial 
material reflects the culture of the individual. 
Regardless of whether this conclusion is justi- 
fied in this generalized form, it is certainly ad- 
vantageous to have a parallel series available for 
further experimentation in this field. In com- 
paring the results of the two series it is essential 
to keep in mind that the redrawn pictures differ 
from the original ones not only in regard to the 
race of the figures, but also in some other detail. 
In particular the change of color of the figures 
from light to dark often necessitates the change 
of background from dark to light, which in 
some cards modifies considerably the immediate 
emotional effect produced by the scene. * 

Brit J Med Psychol 24:79-80 pt І '51. Н. 
Phillipson. * This modification, which is ex- 
tremely well drawn and produced, sets a pattern 
for ТАТ pictures of use in testing minority cul- 
tural groups within a white culture. The social 
situation and the objective environment of the 
picture limits their usefulness with other "negro 
groups. With the T-TAT is published a very 
brief manual adapted and abridged from the 
manual accompanying the original ТАТ. There 
is no mention in the manual of the effect of the 
“white” examiner in the test situation, except 1 
general terms of rapport which means friendli- 
ness and encouragement, or in the case of diffi- 
cult children the promise of a present. One feels 
also that there is too much emphasis on the use 
fulness of TAT as a method of getting mater! | 
“in spite of the patient"—swhich suggests tha 
the author may be making too little use of the 


"therapeutic situation" in clinical testing t0, 


supply the motivation for good co-operation by 
the subject. $ 
J Consult Psychol 13:311 Ag '49. * The s 
thor's clinical impression is that richer recor | 
are secured from Negro subjects by the use “i 
the new cards. A further suggestion might 


ial 
- the use of the two sets of cards to explore raci? 


attitudes. 


Ls Test.] Aron, BETTY. 
*[Re Thematic Apperception Test. ON, ES 
A Manual for fuac of the Thematic р. 
ception Test: А Method апа Technique Sanford: 
sonality Research. Foreword by R. Nevitt bg 
Berkeley, Calif.: Willis E. Berg, 1949. Pp. д 
Lithotyped. $3.50. * (РА 24: 644) 
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J Clin Psychol 6:106 Ja ’50. * an important 
contribution, both theoretically and in terms of 
practical clinical applications. 

J Consult Psychol 13:381 O °49. Laurance F. 
Shaffer. Since its first appearance fourteen years 
ago, the TAT has remained an elusive technique, 
despite its wide use. Although Murray originally 
proposed a somewhat quantifiable scoring in 
terms of needs and presses, many users have 
preferred а "global" or "intuitive" interpreta- 
tion. The apparent unsatisfactoriness of the 
original scoring might be due to either of two 
defects: that it is faulty in principle, or that it 
is insufficiently elaborated and developed. Miss 
Aron’s work supports the latter hypothesis. 
Her manual presents a further development of 
the Murray-Sanford scoring of content, which 
may be checked for scoring reliability, and which 
permits the quantitative comparison of individ- 
uals and groups, a first step toward experimental 
validation. Although complicated and not rapid, 
the scoring technique is clear enough to be mas- 
tered, and is at least practicable for research. As 
yet, the research procedures do not permit statis- 
tical tests of significance, which may lead to un- 
justified conclusions. For example, in one of Miss 
Aron's sample research studies, she concludes 
that “а general decrease in expression is found 
at the pre-puberty stage,” on the basis of a low- 
ered output in three out of four groups of chil- 
dren 11 to 12 years old, numbering 4 or 5 in 
each group. Any step of true research in the 
projective field is encouraging, as is Miss 
Aron's study, but much of the path of progress 
lies ahead. 

TAT Newsletter 4:1-5 su "50. Robert R. 
Holt. The greatest single obstacle to the wider 
use of the TAT is the lack of a scoring system 
comparable to Rorschach’s in simplicity, the ease 
with which it may be learned and applied, and 
in the significance of the distinctions it makes. 
It seems unlikely that such a system will ever 
be worked out, due to differences in the kinds 
of data yielded by the two tests; in spite of the 
anchoring effect of the pictures, the TAT story 
has (so to speak) many more degrees of free- 
dom than the Rorschach response, whence its 
peculiar combination of richness and impalpa- 
bility. It is no discredit to Miss Aron, therefore, 
to say that her book does not succeed in provid- 
ing the philosopher’s stone. Just what ores, 
precious or base, are locked within the stonily 
forbidding difficulties of her method it is diffi- 
cult to say on the basis of merely reading her 
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book. It would be necessary to learn and to 
apply the system she outlines, and preferably to 
compare the differentiations it yields to those 
afforded by other methods of analyzing stories. 
Judging from its careful, psychologically-based 
construction and from the suggestive results of 
the sample studies she reports, it is well worth 
such a trial by those interested in personality re- 
search, As the subtitle explicitly states, this is a 
method for research and the author is careful to 
make it clear that she does not expect much im- 
mediate clinical application of it. She is careful 
throughout to avoid sweeping claims, and she 
does not underestimate the difficulties of the 
way she is pointing. It is presented as having 
gone through enough stages of revision to be 
shared with other workers, while still further 
revisions will be made. It is as a working docu- 
ment, oriented toward research, that it will be 
evaluated here. It is too bad that a method of the 
importance I believe this one will prove to have, 
had to be presented with such a constant eye to 
economy. That probably accounts for the book’s 
over-brevity. In about a hundred pages of text, 
Miss Aron presents in a condensed manner an 
explanation of the steps through which a scorer 
must go to use her system; she lists and defines 
the many variables used and the notations that 
modify them in ways indicated above as well as 
a few others; she gives fifteen stories drawn 
from quite different populations of subjects, with 
complete scoring; she describes certain techni- 
cal procedures for recording and summarizing 
the scores and concludes with detailed citation 
of four “sample studies” which illustrate the 
kinds of data provided by the system and the 
kinds of interpretations that can be made on 
their basis. It seems to me that 100 pages is too 
narrow a compass within which to attempt so 
much, though she wastes little of the space. 
There are seven steps through which the scorer 
goes. First, he sets down on a worksheet an ab- 
breviation designating each character or object 
in the study to which any variable is attributed— 
including “The Environment," if necessary. 
Each such notation is put at the head of a col- 
umn, in which are listed all variables relevant to 
it. Second, the scorer identifies the hero or he- 
roes. Third, he reads through the story, noting 
all variables under the appropriate columns. 
When a striving is the hero’s, it is need; when 
it is someone else’s, it is recorded as press. 
Fourth, “the source of complementary variables, 
i.e., the source of the precipitating press or the 
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precipitated need, is recorded in front of the 
variable in question.” Fifth, note is made (by the 
symbols for kinds of defenses) of the ways scor- 
able trends are described when not freely ex- 
pressed by the hero. Sixth, the scorer assigns nu- 
merical “intensity values” to each variable in 
the main story content. Finally, he records the 
outcome of each episode. It is clear, from this 
description, that the technique has certain limi- 
tations, most of which are admitted. It deals only 
with content, deliberately leaving the question 
of formal aspects aside, for the most part. It is 
somewhat atomistic, in that each dynamically 
meaningful subpart of a story is scored sepa- 
rately, yet with an ingenious system of notations 
that preserves many of the important aspects of 
its context. Miss Aron started out to revise the 
Murray-Sanford method of scoring needs and 
press to make it more useful, and she found that 
it could be simplified only by making it more 
complex. The many clinicians who have dis- 
missed Murray’s approach as impracticable will 
undoubtedly feel the same way about this one 
when they spy the scores of distinctions that are 
now required, and will go back to their global, 
intuitive analyses. For daily clinical use, they 
are quite justified, but if this attitude is ex- 
pressed by a researcher, he can only be advised 
to choose a less complex field of inquiry than is 
personality. To do justice to the multifariousness 
of human motives, a handful of concepts simply 
will not do. Those familiar with the Murray- 
Sanford system of analyzing TAT stories will 
recognize only the second, third, sixth and (to 
some extent) seventh steps as carry-overs, In 
modifying the older scoring method, the author 
says that she was guided by both theoretical and 
empirical considerations: the distinctions made 
were those suggested by psychoanalysis and by 
Murray’s personology, but when they could not 
be reliably scored by different raters they were 
changed until they could be, or were dropped. 
How is one to evaluate the changes? The ulti- 
mate test can only be an empirical one: the dem- 
onstration that the new system differentiates 
better between groups known to be diversely 
constituted psychologically, or that the new 
quantitative scores correlate more highly with 
indices of known validity—in each case, as com- 
pared to the earlier system, or as compared to 
any other scoring schemes (for example, Wyatt's 
or Henry's). One gets the impression that the 
failure of the Murray-Sanford scoring to differ- 
entiate prejudiced groups in the Berkeley Pub- 
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lic Opinion Study was responsible for the 
changes, and that the revision did provide good 
differentiation. But in the present book Miss 
Aron tells us nothing of all this, possibly because 
she has discussed it in her chapter in the forth- 
coming report of the total research project on 
which she was working, The Authoritarian Per- 
sonality. Some citation of results from the stud- 
ies of anti-semitism would have helped the pres- 
ent book, nevertheless. In the lack of empirical 
data, then, the modification will have to be eval- 
uated on theoretical bases and by rule of thumb. 
I think that the first modification— recording the 
figures and objects to and from which dynamic 
trends emanate and are directed—is prima facie 
an excellent one. In totalling the scores for Nur- 
turance in mother-son stories (card 6BM), for 
example, it is desirable to be able to separate the 
son's nurturance towards the mother from the 
mother's towards the son. In large part, this dis- 
tinction is made by differentiating needs and 
press, but even so it is often lost. The second 
major innovation sounds like a valuable one, too: 
the introduction of special notations to record 
certain kinds of defenses that operate in relation 
to the trends expressed: conflict, denial, rejec- 
tion (ie, unacceptability), displacement. of 
identification (which includes a special notation; 
f, for fantasy, memory, wish or dream instead 
of action) and "restriction of ego expression by 
describing static [i.e., characterological] features 
rather than behavior" of a figure. The exact psy- 
chological significance of all these defenses 18 not 
easy to determine, but research on this problem 
should be well worth the doing. A further con- 
tribution of the present manual is that it provides 
relatively objective and understandable criteria 
for performing some of the more judgmental 


‚ aspects of scoring : identifying the hero, and es- 


pecially quantifying ratings of the intensity of 
needs and press. Regrettably, it does not do so 
wellin defining and making clear the differences 
between all of the variables used. Miss Aron $ 
suggestions about procedures to be used in re- 
search are knowing and helpfully concrete. 
further set of differences from Murray is in tht 
specific variables used. Here my prejudices ап 
preferences do not always agree with the au- 
thor’s. The additions of p Blamavoidancé, 
Infavoidance and n Blamescape (the latter (ш 
Sanford) sound good to me, and I think i7 ; 
was lost by dropping п Exfosition, and or 
many purposes) m Excitance. But why St 
Nutriance (the taking of food and drink) ? y 
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no n Passivity? These are often very significant 
features of TAT stories. Likewise, I should have 
retained the distinction of n Aggression: De- 
struction from n Aggression: Physical: as it is, 
the same score is given when the hero fights with 
somebody and when he smashes his car in an 
auto accident—events with quite different sig- 
nificance. Combining n Order, n Achievement 
and n Construction under the last heading loses 
frequently useful distinctions. A few other minor 
criticisms may be levelled at specific variables : 
in addition to the “endogenous benefit” of Men- 
tal Health, there is another called Mental Health- 
Intelligence, which is scored for any kind of tal- 
ent. With the present headings, one would get 
into an absurd fix trying to score the statement: 
“The boy is bright though rather neurotic.” 
Then, it seems forced, to say the least, to score 
thinking, planning, and studying as "Mental 
Sentience" and then to combine it, in summariz- 
ing, with other "sensory aspects of behavioral 
trends"— physical, sexual and esthetic sentience! 
These are not basic objections, by any means. If 
they have weight, they can be incorporated into 
a revision of the Aron scheme which would not 
change it in principle to the slightest extent. The 
same is true of other minor criticisms that might 
be made of the book itself, which contains a 
rather annoying number of typographical errors. 
Only one of them is of much consequence: on 
pp. 79-84 and 136-141, the good old boy-and- 
violin that has always led off is persistently called 
Card 7 instead of Card 1. The book could have 
been further improved by the addition of a sorely- 
needed glossary for the great number of new 
abbreviations. To conclude these minor criti- 
cisms, I cannot resist the temptation to point 
out a rather funny “interpretation” of the short- 


comings in the pioneering efforts of two practis- 


ing psychoanalysts, Murray and Sanford, to 
quantify dynamic content: they, “like many 
other personality psychologists who have been 
influenced by research with lower animals, 
tended to overlook mechanisms, many of which 
are specific to man, that make for a vast range 
of personality differences!" Whenever a com- 
plex scheme for the analysis of qualitative data 
is proposed, the interested researcher may well 
ask about “observer reliability’—how well do 
different scorers agree? It is not easy to answer 
such a question when one’s system contains so 
many dimensions of variability. Miss Aron has 
approached the problem with the intent of find- 
ing one or two measures that will demonstrate 
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the overall reliability of the method. This is an 
essentially defensive approach, and its results in 
this case are not very helpful. By a complicated 
system of rating all differences between raters, 
she arrives at a set of overall figures which claims 
from 69% to 85% "complete agreement,” on 
different samples, with from 9% to 12% full 
errors. But it cannot be concluded that these fig- 
ures mean that two raters assign identical scores 
a majority of the time. Many differences are 
considered to be “по difference,” because they 
have approximately the same interpretative sig- 
nificance—which makes one want to ask, why 
make such distinctions in the first place, then? 
Other differences are considered separately as 
“lack of acuity” on the part of one rater: where 
one rater scores something that the other simply 
failed to see. The result of using this method of 
reporting reliability is to make anyone who has 
a skeptical turn of mind suspect that there was 
not actually a very high degree of agreement on 
many categories, especially considering the fact 
that the scorers had been trained and closely su- 
pervised by the author. It is quite sensible and 
legitimate to distinguish between differences 
that are serious and those that are less so, but 
it would have been much better to have kept 
separate the actual agreements and the insignifi- 
cant differences. And, of course, overall relia- 
bility doesn't mean much in such a system; the 
important question about reliability is: which 
ones of the many discriminations called for can 
be made with a satisfactory degree of agreement, 
which ones are difficult or impossible to make? 
Improvement of the method demands the latter 
approach. As far as validity is concerned, we get 
a number of encouraging hints in the chapter on 
Sample Studies, but nothing definite. Of course, 
the problem of validating such a method as Miss 
Aron’s is a very thorny one. But it would have 
been more convincing to have enlarged the N on 
one or two of the four miniature studies so that 
a few significant differences between groups 
could have been reported instead of a larger num- 
ber of interesting, often plausible, but very pos- 
sibly unreliable trends. What the chapter does 
show is that the method yields quantitative group 
results that lend themselves easily to psychody- 
namic interpretations with a minimum of far- 
fetched inferences. This fact alone is a weighty 
endorsement of the Aron method as a promising 
research instrument. Looking back, I see that I 
have found more to criticize than to blame. Yet 
I feel that the criticisms are mostly minor, the 
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good points mostly major, and that in the bal- 
ance Miss Aron has made a contribution of solid 
importance to research with the TAT. 


[140] 
*[Re Thematic Apperception Test.] Ѕтегм, Mor- 
ris I. The Thematic Apperception Test: Ап In- 
troductory Manual for Its Clinical Use With 
Adult Males. Foreword by James G. Miller. Cam- 
bridge, Mass.: Addison-Wesley Press, Inc., 1948. Pp. 
viii, 95. Lithotyped. $3.00. * (РА 22:4959) 


Am J Psychiatry 106:945 Je ’50. Helen 
Thompson. * a welcome publication * The tech- 
niques of administering and analysing the test 
are succinctly, clearly, yet fully outlined * This 
compact, concise book should prove valuable to 
all students of clinical psychology whether in 
formal course instruction or in self-education. 
For the psychiatrist who does not have available 
the services of a psychologist, it might be a help- 
ful guide to the use of the TAT in his own prac- 
tice for both diagnostic and therapeutic work. 

Int J Psycho-Analysis 30:209-10 pt 3 '49. 
Joseph Sandler. * There is an obvious parallel 
between the interpretation of the T.A.T. phanta- 
sies and the interpretation of dreams. On this 
account T.A.T. analysis should be of theoretical 
interest to the psycho-analyst, and indeed Mur- 
ray's need-theory, on which the test is based, de- 
serves a little more attention from psycho- 
analysts. On the other hand, precisely because 
of the closeness between T.A.T. interpretation 
and dream productions, the psycho-analyst must 
regard any T.A.T. manual in rather the same 
light as a manual of dream analysis designed for 
the use of non-analysed, and perhaps non- 
analytical, psychologists. The T.A.T. is perhaps 
the best of the projective tests, in terms of use- 
fulness to the psycho-analyst, and Stein's manual 
should prove as valuable as the two other books 
reviewed here. : 

J Abn & Social Psychol 44:429-31 Jl'49. Roy 
Schafer. * in the main, a bold and praiseworthy 
effort to meet the greatly increased demand for 
training by providing a supervisory conference 
for mass consumption. The last 41 pages of this 
9I-page manual are taken up with a detailed, 
sentence-by-sentence analysis of a complete ver- 
batim TAT record. The presentation of the ana- 


_lytic process is easy, clear reading with an ad- 


- ik i 


venturous quality: Expectations (hypotheses) 
are stimulated, subsequently strengthened, dis- 
couraged, qualified, amplified, and integrated, 
and slowly and surely a living personality takes 
shape before us. The author’s reasoning is never 
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flashy or obscure, but rather is cautious, modest, 
sensitive, and conscientious. (It may be men- 
tioned here, however, that despite his strong 
warning to the contrary, the author himself 
makes historical predictions that often seem far 
too concrete and specific; this is evident in the 
summary of his test analysis on pages 87-90.) 
If Stein could have published 20 such analyses, 
using a representative variety of clinical cases, 
he would have given us a true manual for the 
TAT; his present volume, because it takes up 
only one case in detail, can be considered only 
introductory. A further limitation on this level 
is the restriction of the entire discussion to only 
the set of 20 pictures administered to adult males. 
Stein....provides concrete, helpful examples from 
actual stories to illustrate how each of the press 
and need variables may be expressed. There fol- 
lows a brief supplementary chapter on dynami- 
cally significant formal—as opposed to content 
—aspects of the test responses. (This topic 
should have been treated more fully, since formal 
aspects of stories are crucial in locating and 
identifying significant needs, conflicts, and goals; 
in the lengthy case discussion, Stein makes lib- 
eral use of these formal, non-content indica- 
tions.) Finally, before the case discussion, there 
is a three-page discussion of “syndrome or clus- 
ter analysis.” It is here that Stein will be most 
disappointing to his readers, for syndrome analy- 
sis is nothing but the construction of an inte- 
grated personality picture from the fragmentary 
hypotheses suggested by single themes, percep- 
tions, and significant verbalizations; and the 
construction of an integrated personality picture 
is the stumbling block that persists longer m the 
development of skillful handling of the TAT 
than any other. A discussion of how to establish 
need-hierarchies is missing; the presentation 0 
the Murray need-press orientation does not har 
for it unfortunately has a catalog quality an 
not a hierarchic quality. For example, ^ Sex an 
n Aggression are listed as if they had the same 
depth, breadth, and genetic significance as ” 
Playmirth, and n Change, Travel, and Adven- 
ture, Stein thinks hierarchically in his case dis- 
cussion but does not instruct his reader how to 
do so, beyond emphasizing frequency, intensity, 
and duration of expression of need in the Um 
as keys to the place of the needs in the subjec 
hierarchy. Actually, however, these aspects n 
need expression tell us more about the cado 
present picture of his need-hierarchy, how ds 
experiences or is able to communicate his needs, 
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and may not be dynamically accurate. Here is 
where the main outlines of the Freudian psycho- 
analytic theory or personality dynamics could 
have been of great help to Stein as a teacher— 
for they do implicitly help him when, as a clini- 
cian, he grapples with concrete case material. 
Psychoanalytic findings and theory have to a 
great extent centered on this problem of need- 
hierarchy or "psychic structure." Particularly 
clarifying would have been invoking the distinc- 
tion between deep-lying needs and defenses 
against these needs, a distinction that Stein 
treats rather casually. When interpreting a TAT 
story, or апу clinical material for that matter, we 
must answer the question, “Does this material 
express a consciously accepted need, a rejected 
and to some degree unconscious need, or a de- 
fense against an unconscious need ?" Many sub- 
jects tell us much more about their defenses in 
their TAT stories than about deeper-lying needs 
which they cannot integrate with their preferred 
self-image. The intensively discussed case, for 
example, again and again shows how he contin- 
ually tries to resist consciously acknowledging 
to himself and to others his very strong depend- 
ent needs. (Stein points this out in his final sum- 
mary interpretation, but gives no real place to 
this aspect of personality dynamics in his pre- 
vious rationale of interpretation.) Often the 
underlying need does not show itself at all clearly 
and we can infer its presence and strength only 
by the rigidity and overemphasis on the defenses 
commonly associated with it—for example, de- 
nial of and reaction formation against hostility. 
Often we see little more than the defensive struc- 
ture with little idea about the underlying needs. 
The play of intra-psychic forces is, then, not 
only indicated in the stories, as is Stein’s chief 
assertion in this connection, but they also select 
and shape, and thereby limit, the content of the 
very stories themselves. Limitations of content 
can be understood in terms of ego-defenses and 
the scope of interpretations is thereby broadened 
beyond the limits of the Murray approach. It is 
not implied here that a manual adequately in- 
corporating psychoanalytic insights can now be 
written by anyone; this isa task waiting on much 
more research and conceptual analysis. The cru- 
cial role of the ego in the perceptual-associative 
interplay that goes into the development of 
stories has yet to be studied. But Stein might 
һауе advised the student more fully as to where 
the answers to the problems of integration of in- 
ferences might most hopefully be sought. Never- 
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theless this little volume is well worth careful 
study by the beginning TAT worker. 

J Consult Psychol 13:147 Ap '49. Laurance F. 
Shaffer. A brief manual, giving a condensed but 
well-illustrated account of the Harvard Psycho- 
logical Clinic's method of administering, analyz- 
ing and interpreting the TAT. * The use of the 
ТАТ has remained an art rather than а science, 
depending more on a broad knowledge of per- 
sonality and on clinical insights than on distinc- 
tive methods of scoring. For the student who 
already has some grasp of dynamic psychology, 
Stein’s manual seems an economical and suffi- 
cient introduction to the TAT, to be enriched 
more by further clinical experience than by addi- 
tional psychometric complications of the tech- 
nique itself. 

J Mental Sci 96: 549-50 Ap ' 5o. Ralph Heth- 
erington. * Stein’s introductory manual....is 
rather disappointing since its data are confined 
to adult males * As well as analysing the intra- 
individual material, the author suggests that 
comparisons between individuals might be of 
value with the caveat that the former type of 
comparison has the greater value. He provides 
a “common story” for each of the 20 pictures for 
men. These stories are "based on clinical ex- 
perience" and it would be of great interest to 
learn the method used to arrive at these com- 
mon stories. This is of vital importance if they 
are to be used in any way as norms, but the au- 
thor does not apparently regard them as such, 
but simply intends that they should stand as ex- 
amples of the type of material that may be ex- 
pected from the use of the test. It seems that an 
opportunity has been missed. If a thoroughly 
systematic method of arriving at the common 
story had been employed, it would have been 
possible to make more exact comparisons be- 
tween individuals. In other words, some kind of 
standardization would have been achieved. This 
would have enhanced the usefulness of the test. * 
A sample protocol is analysed and intercepted to 
illustrate the technique. Clinicians will note that 
this analysis of a single case covers 40 pages of 
the book, and the patient’s first remarks: “Well, 
first of all,....” and “as far as I'm concerned,....” 
take two pages for their interpretation! The 
author states, however, that this analysis is more 
detailed than would normally be necessary, since 
he has endeavoured to include all the possible 
steps in the procedure in order to illustrate the 
method. It is interesting that the author suggests 
a blind analysis of the protocol should be under- 
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taken, that is, that no more of the patient should 


two short chapters describing the method and 


be known than his age, sex, occupation, marital - materials of interpretation, and finally a com. 


status, whether he has any siblings and whether 
his parents are living or dead. This book will not 
be as useful as the more comprehensive manual 
the author has promised us. = i 

J Nerv & Mental Dis 110:263 S ’49. (Zyg- 
munt A.) Piotrowski. * The author, less re- 
strained than Murray, makes the assumption 
that "the same set of principles which are uti- 
lized in analyzing and interpreting daily be- 
havior may be utilized in working with the ТАТ 
material.” It seems doubtful that words can be 
treated as actions. This assumption appears to 
be an evasion rather than a solution of the prob- 
lem to what degree the TAT reflects overt be- 
havior and to what extent it gives fantasies that 
are not acted out. 
| TAT Newsletter 2:4-7 Ад "48. Robert R. 
Holt. In a compact little volume, Dr. Stein has 
given the beginning student of the TAT a very 
valuable manual which will set his feet properly 
on the path to effective professional use of the 
test. He does not pretend to have furnished an 
exhaustive or complete monograph, but .ex- 
plicitly states that it is limited to the application 
of twenty cards to the testing of one popula- 
tion, adult males. The author is also careful to 
state that the book is not for beginners in psy- 
chology ; “it is designed as а manual for those 
who have had some training in the analysis of 
personality and who are adequately oriented to- 
wards clinical practise." Taken for what it is, 
then, the manual is the best book I know of for 
the clinician who knows something about his 
trade but who needs an introduction to the ТАТ. 


` Murray's and Bellak's otherwise excellent pam- 


phlets are too brief for the purpose; Rapaport's 
chapter in Vol. II of Diagnostic Psychological 
Testing is on а level of technical complexity 
suited to the more advanced worker, and Tom- 


kins’ book, like Aron's forthcoming manual, 
' presents a E primarily oriented towards 


than clinical practise. Written. 


pego 
in a clearly structured way—at times it is almost 


an outline, stri of everything but essential 
points—the book actually contains a good deal 
more than its ge hwould suggest. In suc- 
cessive chapters, there are descriptions of the pic- 


tures and stories most often told to them, a good 
summary of the main oe be learned aon 
admini istration, al comments on the 
technique of analysis with a presentation of a 
list of variables useful in approaching the stories, 


plete protocol with a blow-by-blow account of 
how it was analyzed and interpreted. A few com- 
ments are in order on each chapter. Dr, Stein 
notes that his accounts of the typical stories 
yielded by each picture and the kinds of prob- 
lems that each touches on are based on his own 
clinical experience. He points out the need for 
research to determine the “popular” stories for 
each picture, and the slight discrepancies be- 
tween his experience and that of others under- 
lines this need. In my experience, for example, 
it is not “usually said that the figure at the win- 
dow (in card 14) is spending a sleepless night." 
Fortunately, the attention of researchers is turn- 
ing to this problem, and after the central file of 
ТАТ stories has been set up it will not bea very 
difficult matter to solve it. Meanwhile, state- 
ments of experience such as the one given here, 
for Set D, and by Rapaport for Set B, are useful. 
In the chapter on administration, most of the 
familiar rules and admonitions are repeated in 
a clear, succinct manner. In addition there is an 
excellent discussion of the various possible meth- 
ods of recording, with the pros and cons of each. 
About inquiry there seem to be two rather dis- 
tinct schools of thought and practise, to one of 
which Stein and Tomkins belong, while the 
other is mainly composed of Rapaport and his 
students. The former method is one of consider- 
able caution, particularly with respect to what 
Stein calls “intermittent inquiry." One who has 
tried both methods may get the impression that 
this caution is really timidity, or that it is based 
only on the acknowledged fact that clumsy 1n- 
quiry while the test is going on can disrupt the 
situation and invalidate the results just as much 
in the case of the TAT as with any other test. 
“It is possible, nevertheless, to develop skill in 
‘non-directive inquiry so that the usefulness of 
total material obtained is considerably en- 
hanced. The example given of pushing a patient 
too hard is indeed a lesson in how not to inquire, 
but the same situation could have been handle 
adequately, without abandoning inquiry until 
the end of the test, at which time the various 
chains of thought have been broken. One addi- 
tional (rather petty) point: I have yet to $ 
anyone bring forward evidence that inquiry 
about sources of stories yields anything of value; 
ikewise for the procedures recommended ne 
choosing which pictures are liked and dislike у 
which are remembered, or other flourishes. 


* 
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know it is hard to produce convincing evidence 
on such matters and I concede that my criticism 
is a little unfair, but T still think the reader is 
entitled to see something more than just the 
assertion that such procedures "frequently yield 
valuable data." Students of the test would be 
better advised to stick to essentials and master 
straight administration and interpretation ; 
thereafter they can experiment with variations 
which in the early stages will only confuse things. 
The author begins his discussion of the technique 
of analysis by a sensible discussion of diagnosis 
vs. label-giving, with a warning against trying. 
to predict symptoms. The last point could have 
been made a little more strongly. It too often 
happens that clinical psychologists play games 
with themselves and psychiatrists, particularly 
when they practise "blind analysis," amazing 
everyone with their cleverness in deducing from 
tests which take hours to administer simple in- 
formation which can be obtained in five min- 
utes’ direct questioning. Not only will the psy- 
chologist err often when he tries to "predict" the 
presence of an ulcer, or a particular. obsession, 
or what not, but he will be wasting his time and 
missing the point of the kind of contribution that 
he can make to the study of the patient. АП of 
which raises the point of blind diagnosis. Stein 
presents the main points in favor of it: it gives 
an independent judgment on the case, the'tester 
will often notice aspects of the case which would 
have been overlooked if he had approached it 
with a preconception (I think this is the point 
being made, though it is not clear) ; and doing 
blind analysis sharpens the psychologist's wits 
and perceptions for the many times when he does 
have to work in more or less ignorance about 
the patient. These and similar arguments are 
valid in support of doing considerable blind test- 
ing as training, and doing it under special cir-- 
cumstances, but not for making it regular prae 
tise. A psychiatrist who is secure enough in his 
own methods of studying the patient can get 
more out of a psychologist’s services when he 
tells the latter what is established about the pa- 
tient and enables the psychologist to use these 
data in interaction with his test findings to yield 
information and hypotheses about areas that are 
unclear or inaccessible to him. * Another impli- 
cation of the position taken by Stein seems to 
be that one should analyze the TAT without 
reference to other psychological tests, though he 
does not say so in so many words. Such a pro- 
cedure is again good training, but not good clini- 
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cal practise. A TAT is often much more mean- 
singful in light of the Rorschach and Szondi re-. 
sults, which can help a good deal to establish, 
{ог example, what is overt and what is repressed, 
or whether a stimulus-bound stereotypy is or- 
ganic or depressive in nature. It is clear that the 
author'operates in such a way himself; it is a 
pity that he does not discuss the use of the TAT 
in a battery of tests. The assumptions underlying 
his technique of analysis, the author states next, 
are that people reveal much about themselves in 
telling stories ostensibly about others, and that 
the people, situations, and problems in the stories 
are the same as those in everyday life. From 
these he draws the quite acceptable conclusion 
that the same principles and variables used to 
analyze real behavior should be used in test in- 
terpretation, and the rather surprising one that 
“the patient telling the story should be regarded 
as participating in or having participated in the 
situations described.” That this is merely an un- 
clear wording of his meaning is immediately 
plain when he warns in the next paragraph 
against the common error of too-literal interpre- 
tation. What is lacking in the quoted statement 
is an addition such as the following: “on some 
level of generalization : principally, with respect 
to the. emotionally meaningful aspects of these 
situations.” The trickiness of the TAT lies in 
the very fact that sometimes stories are direct 
autobiography (the subject really was helped by 
his parents in a career as violinist, as the story 
says) ; sometimes they need to be generalized 
only in minor aspects (the parents did help him 
along in his career, though the subject didn’t 
study the violin) ; sometimes the story repre- 
sents a wishful fantasy of situations the subject 
never experienced in reality (he was really an 
orphan and always longed for such helpful par- 
ents) : or it may be related to a conscious plan 
(the subject has a son of his own and intends to 
help him along). We are given a list of principles 
which help in avoiding over-literalism and in 
finding the appropriate level on which a story 
"holds true of its author. To it however might be 
added Tomkins’ concepts of the level of action, 
and the remoteness of the picture or of the set- 
ting; also the subject's ap] rent attitude of ac- 
ceptance or rejection of E: action in question, 
and its consistency with inferences drawn from 
other stories and from other tests. The list of 
factors (variables) involved in clinical analysis, 
is straight out of the tradition of the mimeo- 
graphed manuals by Murray, Sanford, White 
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and Bellak. Press, needs, cathexes (including 
attitudes and sentiments), inner states, "the 
manner in which the behavior is expressed" 
(Tomkins' levels), and outcomes, all from the 
standpoint of the hero—these are old friends, 
dressed here in simple but becoming garb. The 
specific factors (e.g., p Aggression: Physical, 
Social) are defined and illustrated by quotations 
Írom actual stories. Stein states straightfor- 
wardly that these factors are useful chiefly in 
training, so that one may become sensitized to 
all of the important aspects in the stories, and 
that in clinical practice they are used quite in- 
formally and unsystematically. A short chapter 
takes up succinctly additional (non-contentual) 
factors to be studied: the patient's behavior 
during the test, the use of time sequence and 
emphasis on different temporal epochs, aspects 
of the patient's perceptual response to the pic- 
ture, and other formal characteristics of the ТАТ 
performance. The section on the use of language 
is good as far as it goes, but it is disappointingly 
brief. A potentially rich vein of ore has here been 
little more than scratched. The definitive treat- 
ment of formal aspects of the TAT is still that of 
Rapaport, Schafer and Gill. The section on sym- 
bolism, however, is something not found in most 
other sources, and is quite good. Symbolism here 
refers to the interpretation of certain stories ac- 
cording to psychoanalytic symbol-translation, 
not the conscious use of symbolism by the subject 
in his stories (a matter which is not discussed). 
A three-page chapter on the technique of inter- 
pretation illustrates with examples the basic 
principles of abstracting similar elements from 
a number of different stories to reach a conclu- 
sion. Most of what the book has to tell us about 
interpretation is contained in the final chapter ; 
which makes up nearly half of the book. Here 
in great detail we are shown how the author ap- 
plies all that he has told us to a concrete case.— 
‘something that Tomkins unfortunately never 
did. All twenty stories, with the examiner’s in- 
quiry in full, are printed on one side of the page, 
while a commentary of analysis and interpreta- 
tion is on the other. One could wish for more such 
cases, illustrating a variety of nosological types, 
but the comments made about this one have a 
gratifyingly broad range of relevance to other 
problems. The analysis itself is a thorough and 
painstaking one, in which almost everything in 
the stories has been exploited. Every clinician 
will of course have his own criticisms and addi- 
tions to make, as always, but they will deal with 
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quite minor matters. The steps in the reasoning 
are very explicitly set forth and should be very 
helpful to those who are relatively unfamiliar 
with the test. The only criticism I would make 
of this chapter is that the author seems to make 
even less use of the need-press variables than he 
had led us to expect. The sceptic who has already 
looked with a jaundiced eye on the long list of 
more or less neologistic terms and their defini- 
tions that he is supposed to learn, will probably 
decide to skip it after reading this final chapter, 
I don't think that Stein has given an adequate 
picture of how useful the variables can be in 
everyday application. One closes the book with 
a certain sense of incompleteness. It is good that 
there is another manual for the TAT for this 
one comes closest to being a workable hand- 
book. But it is very short—only 91 pages—and 
one wonders why it was not much longer. The 
author's ease and assurance in dealing with 
subtle interpretative problems in the one case 
given show that he has actually a great deal more 
of clinical experience than he has seen fit to 
share with us here. We are promised further 
publications dealing with "the application of the 
technique to the variety of cases that are en- 
countered in clinical practise." Meanwhile, how- 
ever, the reader is likely to get the impression 
that the TAT is useful only in affording descrip- 
tions-of personality and dynamic formulations 
(important though they are). After he has mas- 
tered this practical and sound introduction, he 
will do well to go on to Rapaport's Diagnostic 
Psychological Testing if he wishes to use the test 
in clinical diagnostic practise, to Tomkin's The 
Thematic Apperception Test if he is interested 
more in research and the finer analysis of per- 
sonality in normal and neurotic subjects. 
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[Re Thematic Apperception Test.] Towxixs, Sit- 
VAN S.; WITH THE COLLABORATION OF ELIZABETH J: 
Tomkins. The Thematic Apperception Test: 
The Theory and Technique of Interpretation 
New York: Grune & Stratton, Inc., 1947. Pp. xi, 297: 
$5.00. * (PA 22: 1658) 


Am F Med Sci 216:716 D '48. E. В. * a care. 
ful, systematic approach * Most clinicians shoul 
find the method of TAT analysis found here 
quite useful. The author begins with an-all-too- 
brief history of the development of the test, men- 
tioning only vaguely its relation to other prO 
jective techniques and to the concept of ment 
testing in general. He then proceeds with а dis- 
cussion of reliability and validity, largely 17 


or to his enterprise in looking up the references 
given. In a short chapter on administration, gen- 
eral instructions are given. The scoring system 
suggested by Tomkins is admittedly cumber- 
some and time-consuming and will probably 
prove more useful as an instrument for research 
than as a practical clinical tool. The elements in- 
cluded in the system, however, undoubtedly con- 
stitute a helpful guide and should be very useful 
if kept in mind while working with the test, 
even though most clinicians will find it inexpedi- 
ent to use the system formally. * There is a dis- 
tinct gap in the relationship between the scoring 
system and the interpretation, which seems not 
to depend to any great extent on quantitative 
findings. The bulk of the book is devoted to the 
problem of interpretation and includes many 
helpful sample stories. * One of the most valu- 
able chapters is that devoted to “Level Analysis” 
in which the relation between the story and the 
story-teller is probed. 

Am J Orthopsychiatry 18:369-70 Ap *48. 
Susanne Reichard. As the first published text 
devoted entirely to the Thematic Apperception 
Test, the present volume constitutes a milestone 
in the development of this test. After an histori- 
cal introduction the author presents his own 
scoring scheme which, however, is still closely 
wedded to Murray’s system of needs and press. 
This is unfortunate, inasmuch as the Murray 
scoring has not been widely used by clinical psy- 
chologists because of two basic disadvantages ; 
atomization of personality, and the use of a 
special terminology not meaningful to the other 
members of the clinical team—psychiatrists and 
social workers. For this reason the book fails to 
fulfill the need for an authoritative manual based 
on a system of interpretation communicable to 
all clinicians. However, in discussing actual 
case material, the author discards esoteric lan- 
guage and presents good practical interpreta- 
tions of personality dynamics that are meaning- 
ful alike to the professional worker and to the 
layman. This is not to say that a somewhat sys- 
tematized approach to the problem of interpre- 
tation lacks value. For instance, the author’s 
application of Mill’s deductive methods to the 
precise determination of the conditions under 
which a subject is likely to react in a certain 
manner is quite ingenious. Also his classifica- 
tion. of themes into four main areas of life- 
adjustment; namely, family, love and sexual 
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is developed in regard to "regions" of (a) the 
family, (b) love, sex, and marital relationships, 
(c) social relationships, and (d) work and vo- 
cational setting. This reviewer was impressed 
particularly with the author's comments regard- 
ing the family region, and with the emphasis 
given to the varying significance of the age of 
the central figure in the stories. * Throughout, 
the author's points are well illustrated by origi- 
nal protocols, and the author is to be commended 
for careful organization of his material accord- 
ing to his theoretical positions. This reviewer 
feels that some discussion of the ТАТ as but 
one of several projective techniques would have 
been pertinent. The book is well indexed and 
presents a good bibliography of TAT studies 
(though these are discussed briefly in the text). 

Am J Psychol 61:439—40 Jl'48. R. M. Ogden. 
[The review excerpted also includes a review of 
Personality by Gardner Murphy.] * The con- 
clusion that the "origins and structure" of per- 
sonality, as conceived by Murphy, lack a definite 
scientific meaning, is even more insistent when 
we consider the contribution of the Tomkins 
book. This *work of one who turned from psy- 
chology to the study of philosophy" when “the 
theory of value seemed....an inquiry more per- 
tinent to the study of man than did any investi- 
gation of either the higher or lower mental proc- 
esses” (p. viii), is simply bewildering. Although 
described as “а work-book and not a compila- 
tion of established doctrine” (p. vii), the unini- 
tiated reader might expect some concession to 
his ignorance of what TAT is and tries to do; 
especially so, when the chief author is a pro- 
fessed philosopher who returned to psychology 
after the rewarding experiences which he gained 
in the Harvard Psychological Clinic. One gath- 
ers, though he is never precisely told, that TAT 
has to do with a set of pictures about which 
subjects are asked to write stories. These pic- 
tures, however, are not reproduced, nor even 
described. Instead, the book is filled with stories 
by unknown persons, sometimes vaguely called 
“X, Ү and Z.” The content of the stories reveals 
that the same picture suggests different themes 
to different persons, yet no attempt is made 
to classify either the subjects or the themes. 
Instead, we are given a complicated and, to 
the reviewer, an incomprehensible "Scoring 
Scheme," followed by an "Introduction to the 
Technique of Interpretation" and a discussion 
of "Level Analysis." The four succeeding chap- 
ters are on the "Diagnosis of Personality," in 
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which stories are recounted and discussed with 
reference to “The Region of the Family,” “The 
Region of Love, Sex and Marital Relation- 
ships," “The Region of Social Relation- 
ships" and “Work and Vocational Setting." 
Even the final chapter of the book on “Diag- 
nosis and Psychotherapy" fails to do more than 
give a few cautious suggestions for a possible 
application and utilization of the method. As 
Murphy describes this test, along with other 
“projective methods," like the Rorschach, one 
gains an impression that they are all devised for 
the purpose of securing symptomatic data, 
When interpreted, these data serve in the same 
way that the knowledge and insight of a family 
physician, or a sympathetic parent, friend, or 
teacher will serve to gain understanding of hu- 
man desires and motives that are often hidden 
or incompletely expressed by the person under 
examination and treatment. Yet, without a 
guiding philosophy of life, or at least some 
scheme of logically definable terms of reference, 
it is hard to see the use to which these data may 
be put. In a clinical or an educational situation 
where a patient or pupil is known to have cer- 
tain difficulties of adjustment and achievement, 
the analysis of his personality and character ts 
a primary consideration. Both formal and in- 
formal means of analysis are then indicated, and 
the subsequent behavior which the counselor or 
teacher may direct or induce can be recorded in 
ways that demonstrate the utility of the diag- 
nosis and method of treatment. What one misses 
most in all of these books is any statement of 
concrete problems and their solution. Supply- 
ing, though they do, a storehouse of human va- 
garies, one learns very little of educational or 
therapeutic outcomes. 

В Menninger Clinic 12:109 My ’48. Rudolf 
Ekstein. Dr. Tomkins has done more in this 
volume than ably describe the test and review 
conscientiously 15 years of T.A.T. literature. 
Being one of the main contributors in researc 
on projective techniques he has, for example, 
introduced a new scoring system and has deep- 
ened the meaning of the psychoanalytic concept 
of repression in order to adapt it to T.A.T. lev 
analysis. His book can be looked at as а pro- 
jection, a “story” response of the psychologi- 
cal profession, and thus shows the many differ- 
ent uses of the T.A.T. structure in different 
branches of psychological and sociological "fr 
quiry and application, in the framework of э] 
ferent psychological ideologies. A comparatively 
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small part of the volume concerns itself with 
the use of the T.A.T. in therapy which again is 
perhaps a reflection of the present situation in 
clinical psychology. Dr. Tomkins' own efforts 
seem to indicate a shift in emphasis, a more 
useful occupation with diagnosis for therapy, 
with aspects of the psychotherapeutic relation- 
ship, and the problem of therapeutic change. 
This volume is highly recommended as a basic 
text and as an asset to the expert. - 

Digest Neurol & Psychiatry 16:580 S '48. 
As the first manual to be published, devoted 
exclusively to the Thematic Apperception Test, 
this book meets a long-felt need in an area of 
clinical psychology still at a highly experimen- 
tal and fluid stage—that of statistical analysis 
and standardization of the qualitative material 
derived from projective test techniques. * If, 
from the chapters devoted to principles of scor- 
ing and interpretation, the reader gains the im- 
pression of a somewhat atomistic approach to 
the study of personality structure, this impres- 
sion is agreeably dispelled in subsequent chap- 
ters where the theoretical constructs are applied 
to analysis of illustrative case material in the 
regions of the Family, of Love, Sex and Mar- 
riage, and of Work and Vocational Setting. It 
is this section which will probably be most re- 
warding to psychiatrists, for the author shows 
an appreciation of dynamic relationships that 
carries his concepts beyond the rather esoteric 
terms in which they are couched in earlier chap- 
ters, The author lists four college groups among 
the populations he has studied, and upon which, 
apparently, his methods of scoring and inter- 
pretation are based. The impression gained that 
Tomkin’s case material is weighted with sub- 
jects functioning on a relatively high intellec- 
tual level is supported by the nature of the illus- 
trative protocol reports. 

J Abn & Social Psychol 43:403-6 Jl '48. 
Robert R. Holt. * of all the techniques for diag- 
nosis and personality study used today, the Ror- 
schach and TAT are the most complex, having 
the most to offer, but making the most demands 
on the interpretative skill of the examiner. 
Popular though the TAT has become, its wider 
acceptance has been delayed because of the lack 
of a definitive book on its interpretation. In the 
last few years, two books have appeared which, 
complementing each other, have done much to 
fill the gap. The first was the second volume of 
Rapaport, Schafer, and Gill’s Diagnostic Psy- 
chological Testing, in which a long chapter was 
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devoted to the diagnostic use of the TAT with 
patients, the emphasis being primarily on the 
formal aspects of the stories. The second is the 
new volume by Tomkins, in which extended 
consideration is given to the analysis of the con- 
tent of stories, The experience on which his 
book is founded, the author tells us frankly 
enough, has been mainly with college students. * 
The book contains a short section on problems 
of administering the test (in which he has little 
to add to Murray’s suggestions), a proposal for 
a scoring scheme, and a short final section on 
the TAT as an adjunct to therapy ; there is also 
an excellent short summary of most of the im- 
portant literature. By far the bulk (225 pages) 
of Tomkins’ work, however, is devoted to prob- 
lems of interpretation, or diagnosis of person- 
ality. The introductory survey of literature is 
organized around one of the most lucid and 
comprehensive analyses of the problems of re- 
liability and validity of a projective test that 
this reviewer has yet seen. The point is well 
taken that validity is “a characteristic of infer- 
ences based on the TAT rather than a char- 
acteristic of the test itself,” and that it is de- 
pendent upon the maturity of the science of 
personality. The breadth and catholicity of out- 
look expressed in this chapter are admirable; 
unfortunately the rest of the book does not have 
these particular virtues to so great an extent. 
The scoring scheme (Chapter III) proposed by 
Tomkins, like some Rorschach scoring systems, 
seems to imply that the clinician has only one 
test, on which all conclusions are to be based. 
It is a cumbersome and unwieldy mechanism 
for squeezing every last drop of juice from no 
matter how small an orange. Each story must 
be rewritten to bring out the orderly sequence 
of ideas; each phrase is examined for the vec- 
tors (patterned after Е. Н. Erikson’s more in- 
telligible list), levels (eg. behavior, wish, 
memory), conditions (e.g., lack, danger, gratui- 
ties) and qualifiers (e.g., temporal characteris- 
tics, intensity, contingency) expressed in it— 
also the objects of the first three classes of 
variables. The exposition of the method, which 
is applied to a simple two-sentence story, takes 
up over six pages, and then it is still not clear 
just what one does after having dissected each 
phrase and strung its innards up on a complex 
two-dimensional frame. The author is aware 
that no busy clinician will give his system more 
than a curious glance. He claims, however, that 
it has great merit as a training device and as 
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a way of extracting meaning from seemingly 
empty stories. The first of these claims has some 
justification; surely it would be good practice 
in sensitivity to easily overlooked aspects of 
stories to work through a few records by this 
method, though the reviewer has not yet found 
this reward tempting enough to make him try 
the job. And it would be unfortunate to train a 
neophyte by this method only; he would later 
have to unlearn the language (for who is going 
to understand reports in terms of “а high de- 
gree of the vector ‘on’ "?) and readjust his per- 
spective rather drastically. As far as the second 
claim is concerned, a clinical psychologist 
would usually be better advised to leave the 
refractory protocol after an hour or so of poring 
and turn to a variety of other techniques, some 
of which would circumvent the subject's resist- 
ances and in turn throw light retrospectively 
on the stories. If one is going to be exhaustive 
in the evaluation of stories taken alone, then 
there are many other approaches which can be 
used fruitfully, especially analysis of formal as- 
pects, which unfortunately the book entirely 
ignores. А good case can be made for training 
in many approaches, since one of the most im- 
portant things for the TAT analyst to learn is 
flexibility of approach, so that he may choose 
his tools to fit the nature of the job set by each 
unique protocol. It is perhaps the author's very 
neglect of the formal aspects of TAT produc- 
tions that is responsible for the absence of any 
information about the use of the test in the diag- 
nosis of mental disorder in the chapter on diag- 
nosis and psychotherapy. Here the limitations 
of the book will be most disappointingly appar- 
ent to the diagnostic tester who has come to it 
for help. It is good to look, as the author does 
in this chapter, at the dynamics underlying an 
illness rather than be content just to christen it, 
but a less extreme rejection of current nosolo- 
gies (inadequate though they are), and a bal- 
anced point of view in which dynamics and 
diagnosis are integrated with personality de- 
scription, would have been preferable. The dis- 
cussion of the TAT in therapy contains some 
interesting suggestions, while omitting refer- 
ence to the new technique of self-interpretation. 
Tomkins has used this test to assess attitudes 
toward therapy, to elicit repressed memories, 
to ventilate repressed grief, and as what he 
calls a kind of play therapy with children. He 
has also used the test in psychotherapy in an 
experimental, exploratory way, and is com- 


mendably frank in discussing and learning from 
his mistakes. Now for the main issue: inter- 
pretation. Tomkins' method is an ingenious 
and highly rational one, in constructive contrast 
to the usual don’t-ask-me-how-I-know-this in- 
tuition of many "experts." He shows through- 
out a psychoanalytic orientation, but does not 
plunge into deep symbolic waters. By his cir- 
cumspection in this respect he sets a good ex- 
ample to beginners. In each of four chapters 
dealing with different "regions" of personality 
(the family, love and sex, morality, and work) 
he starts by delineating the most important di- 
mensions of the region, describing with exam- 
ples the range of variation in each. He then tells 
us to examine the variability of each dimension 
—say, the intensity of a love relationship— 
throughout any particular set of stories, and 
discover to what it is causally related, according 
to the principles laid down in two general chap- 
ters on interpretation. In numerous case histo- 
ries, the techniques are exemplified and the 
TAT findings checked against the clinical his- 
tory. These concrete cases are among the book's 
most valuable features ; one can only wish there 
were more of them, and that full protocols had 
been included. Interpretation is, then, primarily 
a means of uncovering the cause-and-effect re- 
lations that form the structure of the individ- 
ual's dynamics. Tomkins is not primarily in- 
terested in assessing the strength of needs or 
other motivational variables, nor in discerning 
the ego-structure or the principal mechanisms 
of defense. He does not attempt, either, to get 
at what might be called by analogy with dream 
interpretation the latent content of the stories. 
Following J. S. Mill's logical rules, he describes 
the methods of agreement (in effect the propo 
sition that if A is always accompanied by В 
while other things change, there is a causal re- 
lationship between them), of difference (if two 
stories are alike in all but one respect and in 
outcome, the latter is the effect of the former), 
and the method of concomitant variation (in 
effect the proposition that correlation implies 
cause). Used separately or in combination 
these logical analytic approaches make plain 
the complex causal relationships that exist con- 
sciously or unconsciously in the subject’s pr 
vate world. There is a beautiful simplicity ™ 
this approach to interpretation which does 
much to remove the nimbus of false mystery 
that too often hovers over TAT analysis. Yet it 
seems to promise too much, for it is presente 
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without any indication of its limitations. In 
practice, there are not always enough stories, 
for example, which deal with the themes one 
wants to understand better. Conversely, one 
can do much more by way of sketching a total 
personality picture with many TAT's than 
Tomkins’ method would allow. In the light of 
Horn's findings, which Tomkins quotes in his 
first chapter—that only in interaction with 
other data, such as Rorschach's results, may 
the TAT be of greatest help in understanding a 
case—best of all would have been some cases 
in which the subtle interplay of reasoning back 
and forth from one test to another could have 
been expounded. The chapter on analysis of 
levels is a solid contribution to TAT interpre- 
tation. There is a sophisticated criticism of the 
assumption that quantified need-ratings from 
TAT stories are measures of covert strivings: 
many levels may be represented in the stories, 
and a sensitive, qualitative analysis is necessary 
before one can determine whether a fantasy 
trend is a reflection of overt behavior, conscious 
wish, unconscious need, or something else. Fur- 
thermore, the author gives clear and under- 
standable criteria by which one may approach 
this most subtle interpretative problem with 
some success. The most advanced worker with 
the test can profit from a close reading of this 
chapter, with its systematic exposition of how 
much can be gained from careful attention to 
the level (that is, wish, behavior, daydream, 
etc.) on which dynamic trends appear. There 
is still the complaint to be made that it is made 
to seem a little too simple. Every subject, it 
appears, really knows and will tell you the con- 
ditions of his own happiness, for example, or 
of success in love or work, and you have only 
to look carefully in the stories to find them. 
Alas, the test often disappoints its staunchest 
adherents (as all tests do), and is silent at the 
point where we wish it to speak loudest ; or the 
subject is self-deceived unconsciously as well 
as consciously. A section is devoted to the ex- 
position of a revised theory of repression, and 
its application to TAT analysis. To oversim- 
plify somewhat: any need may be repressed by 
almost any other need with which it conflicts. 
If the energy (called by Tomkins “pressure” ) 
of the repressing need is in considerable excess 
of the repressed, no serious effects will come 
about unless the combined energy of both makes 
up a large part of the total energy available to 
the personality. The most pathogenic conflicts 


occur when both needs are strong and evenly 

matched, comprising between them a large pro- 

portion of the total energy. Such a conflict 

would be found in a prepsychotic condition. In 

its neglect of the problems of ego structure, of 

other defense mechanisms and their relations to 

repression and in its implications for a number 

of theoretical problems, this theory leaves a 

good deal to be desired. But as applied here to 

the study of unconscious conflicts by means of 
the TAT, it leads to impressive results. They 

are the more remarkable in that they require 
a number of highly questionable assumptions 

(such as that the sum of the forces manifested 
in 20 stories equals the total “pressure” of the 
personality). With all of these reservations, 
one must still admire, in his three illustrative 
cases, the way reasonably rigorous derivations 
are made, predictions about the nature of cer- 
tain stories and the effects of special conditions 
are recorded and the experiments successfully 
carried out. Of course, much more work of this 
sort would be necessary to validate the pro- 
cedure, but Tomkins has set future workers on 
the problem of repression a good example of 
an approach which is at the same time experi- 
mental and thoroughly dynamic. When one 
turns to the application of these general prin- 
ciples of interpretation set forth so well in 
Chapters IV and V to specific problems in the 
next four chapters, he suffers a disappointment. 
There is much unnecessary quoting of stories 
(many of them over and over, to serve different 
purposes) to illustrate most of the possible re- 
actions that are so exhaustively catalogued un- 
der each “region.” One might call this padding ; 
a more serious criticism would be that the cas- 
ual reader can easily get the impression that the 
textual remarks which preface each such quoted 
story are its interpretation, and that they refer 
to the teller of the story in some direct way. 
Since the reviewer fell at first into the assump- 
tion that some of these comments were deduc- 
tions about the life histories of the story-tellers, 
it is perhaps in order to supply other readers 
with warnings which the author has neglected 
to provide in enough quantity. Other minor 
criticisms could be added, but taken as a whole, 
the book is an important and useful piece of 
work, and will be required reading for every- 
one who uses the TAT. He must not mistake it 
for a complete guide to the use of the test, but if 
he reads it in conjunction with a judicious se- 
lection of the other published TAT material, 
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he will have most of what can be given at pres- 
ent by the printed word. There still is not, how- 
ever, and there probably can never be, any sub- 
stitute for extended clinical experience. 

J Am Med Assn 137:217 My 8 48. * clear, 
concise, well written * an excellent bibliog- 
raphy and an adequate index. The book is rec- 
ommended to all those who are interested in 
personality study, particularly to psychologists 
and sociologists, who could well use this volume 
as a textbook for study and teaching. 

J Nerv & Mental Dis 109:466-7 My ’49. 
(Zygmunt A.) Piotrowski. * Tomkins’ book is 
the first large and important work on the test. 
The methodological basis of the TAT has been 
scrutinized by Tomkins and modified by him; 
the applications and implications of his ap- 
proach have been made explicit in a very thor- 
ough, consistent and lucid manner. * The au- 
thor describes the precursors and the history 
of the TAT, the technique of administration in 
great detail, the scoring scheme and a very ac- 
curate procedure of interpretation. The manner 
of applying the test to problems of personality 
description, psychiatric diagnosis and therapy 
is fully illustrated. * Tomkins states that the 
determination of the relationship between story 
and storyteller is the keystone of TAT interpre- 
tation. He agrees with others that when stories 
are verbalized in the indicative mood and pres- 
ent tense, they are likely to reflect drives which 
are not repressed but are realized directly in 
action ; on the other hand, if the stories are pre- 
sented as mere probabilities, they are said to 
reflect unrealized or repressed drives. This can 
hardly be considered a satisfactory solution if 
for no other reason than that the vast majority 
of stories sound like reports of actual happen- 
ings. Inhibitions are not as rare as expressions 
of doubt in the TAT. In his thorough applica- 
tion of Mill’s principles Tomkins, in many a 
case, unwittingly treats the stories as if they 
described the subject’s actual behavior. This 
shift from phantasy production to reality de- 
scription is facilitated by the author’s psycho- 
logical viewpoint, e.g., he states “there is no 
reason to doubt that the individual can project 
himself into the past as easily as he can divulge 
his present preoccupations.” This optimistic 
belief in the ease and directness with which the 
attitudes of the past as well as many other ten- 
dencies allegedly are expressed in the TAT 
would lead to handling the TAT as a descrip- 
tion of genuine tendencies, hardly disguised or 
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displaced. In spite of oversimplifications Tom- 
kins’ contribution is of great value because it 
makes the reader face the fundamental prob- 
lems of the TAT and teaches him a great deal 
about the test. Even the controversial parts help 
the reader in clarifying his thinking. The book 
is a fruit of a great effort and of many years of 
experience. 

J Proj Tech 14:321-6 5 'so. Frederick 
Wyatt. * The first chapter....presents an excel- 
lent, well-organized account of the results and 
of the problems of research in the TAT, up to 
1947. It prepares the ground and arraigns all 
the material necessary for the ideas subse- 
quently expounded. In addition it provides one 
of the best topical summaries in recent clinical 
writing. * The fourth chapter, as an introduc- 
tion to the technique of interpretation, adapts 
J. S. Mill’s Theory of Induction to the needs 
of diagnostic inference. While a little startling 
at first, the suggestion as such is original and 
fruitful and worth a good deal of reflection. 
There is a real (and much neglected) need for 
a canon of procedures how to argue from the 
raw data of testing and clinical observation to 
the complex pattern of personality. One won- 
ders though, whether Mill’s Logic is particu- 
larly suited for the purpose at hand. Generally, 
the problem of individual meaning paramount 
in everything that concerns fantasy, will ren- 
der futile any categorical approach. * In order 
to sharpen the sensitivity of the test for re- 
pressed material, Tomkins undertakes a revi- 
sion and extension of Freud's theory of repres- 
sion. He assumes personality to be a closed 
system with a finite quantum of energy. Any 
wish within this closed system may have any 
force if it does not exceed a total available 
energy. “Deeply” repressed wishes, Tomkins 
points out, contrary to common opinion, are 
less pathogenic than others held under smaller 
pressure. Conflict between wishes is pathogenic 
either if it claims the entire energy of the sys 
tem or if the forces on either side are near equal. 
Imagination, as manifested in the TAT, allows 
the individual to reduce the force of repression 
by moving at a distance what otherwise wou 
have to be repressed more firmly. Conia 
the behavior of figures acting under “norma 
conditions, that is reasonably near the d 
day reality of "m. tee, with those remote fro 
this reality by dint of the setting will give us an 
idea of the force of repression. Anti-social be- 
havior undertaken by figures and in situations 
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similar to those of the testee's life, indicates that 
the wishes underlying it are subject to no great 
repression. Tomkins' theory of repression thus 
becomes the gauge of the most important ques- 
tion that can be asked of the TAT. While ap- 
pealing through its clarity, the theory itself, 
nevertheless, invites a good deal of doubt. The 
fundamental deficit of his concept is that the 
ego is missing. Ego-psychology which has be- 
come the center of organization in our under- 
standing of man's behavior, suggests that re- 
pression is only one of several mechanisms 
which the ego uses in order to avoid conflict. 
Conflict, again, will be of different origin at 
different occasions and consequently will ap- 
pear differently in fantasy. Nor will it be suffi- 
cient to account for fantasy by a simple com- 
bination of wish and the force that opposes it. 
Tomkins argument for the benign quality of 
deeply repressed wishes is, I think, based on a 
semantic misunderstanding. A distinction must 
be made between early repression, that is, at a 
period when the ego is relatively weak, and suc- 
cessful repression which may have taken place 
earlier or later. It is finally the lack of apprecia- 
tion of the organismic or field quality of any 
psychological event, or its projection into fan- 
tasy, which seems to vitiate the explanatory 
value of Tomkins’ schema of repression. No 
wish can be determined as to how much it is 
repressed or how much harm it will do when 
it emerges, unless one would consider at the 
same time under what conditions repression 
took place, what function it had, and what the 
psychological meaning of its emergence is. The 
measure of remoteness hence applies condition- 
ally only. * The scoring scheme as described in 
Chapter 2 and the theory of repression are the 
focal points and Tomkins’ most personal con- 
tributions. The scoring scheme itself is proba- 
bly the most complete and elaborate of its kind. 
Only Henry's methodology is equal to it in 
subtlety and in the number of provisions made 
for grasping what will escape the examiner who 
is not armed with so precise an instrument. It 
might be argued that this scoring scheme, al- 
though it proceeds on a less minuscule basis 
than that of the Need-Press analysis inaugu- 
rated by Murray, and systematized first by 
Sanford and White, then by Bellak, still loses 
sight of the forest for too much logical attention 
to the trees, Yet, if we want to be accurate, it 
seems that we have to go into cumbersome de- 
tail; and if ме want to hold on to the organis- 


mic, inseparable unit of mood and thought and 
desire expressed in a story—a true reflection 
of the integrative function of the ego—then we 
have to sacrifice accuracy for perceptiveness. 
Reliability studies afterwards will have to ren- 
der objective what could be accomplished only 
through an act of subjectivity. It is a more spe- 
cific question whether in Tomkins’ scheme the 
conative aspects of behavior have been focussed 
at the expense of the feeling tones of subjective 
experience and subjective reaction. All these 
arguments, however, seem to me secondary to 
a purely pragmatical опе: the number and vari- 
ety of categories designed by Tomkins for the 
analysis of TAT responses, that make it diffi- 
cult to keep them in mind and to handle them 
properly. Specific training would be necessary 
for it—in and by itself a completely justifiable 
demand, Were this system now used for re- 
search purposes, let us say, for the study of a 
quality peculiar to a certain group of patients, 
then a very large number of variables would 
have to be manipulated continually in order to 
arrive at a few meaningful correlations. If the 
use of this scheme might be cumbersome for 
research purposes, the same reason would make 
it prohibitive for the daily routine of the clinic. 
Interpretation does not immediately come from 
analysis, in fact, it begins only after the mate- 
tial has been properly fitted into the analytic 
scheme. Yet there is no demonstration in the 
book how this should be done. The second part 
of the book presents many interesting hints in 
this direction, but does not yield a procedure. 
What Tomkins shows, actually, is nothing but 
that the vast variety of human experience is 
really reflected in the responses to the TAT, 
which is as much as to say: fantasy is valid as 
a source of studying man. As the author pointed 
out in a spirited polemic, we may well take that 
for granted by now. There is a pervasive im- 
pression, too, that what these stories voice, is 
essentially taken at face value. Yet it would 
seem that the display of attitudes and tenden- 
cies in TAT stories is sometimes entirely de- 
fensive in its meaning. When one attempts 
finally to evaluate Tomkins’ book, one should 
make it clear that its merits are his own, its 
shortcomings those of an inchoate field of inves- 
tigation. This book abounds with interesting il- 
lustrative material. It makes a number of stim- 
ulating cases available for the careful perusal 
of the student. It presents a lot of good ideas 
and if they are open to controversy they are no 
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less worth careful pondering especially as the 
style of the book is exemplary for its clarity and 
control. The shortcomings of this book (as has 
been indicated before) may be seen in the ab- 
sence of a theory of fantasy upon which all in- 
terpretation could be based. What is the func- 
tion of fantasy? What are its workings? Only 
from such premises can we hope to derive di- 
rectives as to the meaning of the individual 
example. The disavowal of the picture stimu- 
lus, explicitly stated by Tomkins, in itself 
makes for a grave limitation. What should we 
say about a directed fantasy if we would not 
compare first the stimulus from which it took 
off, with the direction in which it went? АП 
this, one would suspect, is related to the basic 
limitation of this book, the variety of which is 
dependent upon one kind of material without 
the benefit of enlargement through psycho- 
pathology. With a very few exceptions only, 
the group from which these records were taken 
has the features and problems of an above- 
average college group. It differs significantly 
from any average population in intelligence, 
socio-cultural level, and articulateness. All this 
makes for a fantasy material which will give 
fascinating insight at the price of distortion. 
Clinical patients will be less cooperative and 
more suspicious, less articulate and more inhib- 
ited in fantasy and self-observation. If we want 
to study their fantasies, which, I am convinced, 
is theoretically and practically as fruitful as it is 
with the highly selected group from which the 
first experiences with the TAT were gleaned, 
then we have to make the necessary adjust- 
ments in our methods. When this correction 
has not taken place, suggestions about diagnos- 
tic procedures will remain one-sided and con- 
ditional. Thus, with all the author's diagnostic 
adroitness, the book appears to the retrospec- 
tive reader sometimes a little remote and a little 
academic. All these remarks imply standards 
for the realization of which we can hope and 
work, but which we cannot expect to be fulfilled 
in any single book. Tomkins’ book is too diffi- 
cult and too complex, and at the same time too 
specific and too conditional to be used as a text 
for the diagnostic beginner. It will be read with 
pleasure by the experienced worker and will 
stimulate him to transcend its premises or set 
him again to reflect critically and productively 
on the newly developed clinical usefulness of 
man's most indigenous faculty. 

Mental Hyg 33:476-81 Jl '49. Helen Oexle 


Pierce. * Tomkins includes a very fine chapter 
on the analysis of work needs and vocational 
setting. * 

Psychoanalytic Q 17:552-4 О '48. Adolf G. 
Woltmann. * There exist over one hundred 
publications on various phases of this test. Dr. 
'Tomkins has assembled the materials available 
in the first comprehensive textbook about this 
test, removing some of the major obstacles in 
the way of a more universal application of this 
projective technique. Among these are the 
thorny problems of reliability and validity. 
Scoring and interpretation were other neg- 
lected areas. The rich documentation of various 
test aspects with clinical case material is a tre- 
mendous help to the student of human behavior 
and to the professional worker. * excellent 
chapter on Level Analysis * The TAT might 
be one projective technique which the psycho- 
analyst may adapt to his own professional needs. 
He is trained to analyze dream material and 
free associations, and should not, therefore, en- 
counter difficulties in eliciting and analyzing 
stories patients create around the TAT picto- 
rial situations; however Rapaport, in his 1943 
paper, states that “...ап interpretation of the 
TAT should not be considered a dream inter- 
pretation; nor can it, in the majority of cases, 
be handled as material allowing for symbolic 
interpretation; nevertheless, the clinical and 
psychodynamic interrelations of attitudes, striv- 
ings, etc., have to be utilized in order to come 
to a meaningful understanding of how the world 
of thoughts of the subject is organized, and how 
the subject himself envisages his world and en- 
vironment.” The book is well written, well or- 
ganized and richly illustrated with case material 
which adequately covers all the points raised. 
A comprehensive up-to-date bibliography is ар- 
pended, Dr. Tomkins’ volume is sincerely rec- 
ommended as a textbook and instruction manua 
to the professional worker who deals with hu- 
man, dynamic behavior, its deviations and cura- 
tive efforts. A 

Psychol В 45:461-2 S '48. Helen Sargent. 
Tomkins’ book is the work of a philosopher 
and scholar, as well as an able clinician. It con- 
tributes an original approach to the problems 
of scoring, interpretation and research, pa 
upon a rationale which considers the instrumen 
in the larger setting of scientific methodology 
with special reference to the science of и 
ality. It is presented as “a workbook, гаће» 
than as a compilation of established doctrine 
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and offers a system of interpretation which in 
many respects is essentially new. Perhaps a 
peculiarity of the TAT is the fact that no sys- 
tem so far presented has gained general use, 
and it seems unlikely that this contribution will 
supplant eclecticism, or replace more flexible 
approaches of the type set forth by Rapaport. 
The scoring is based upon a set of variables 
(“vectors,” "levels," “conditioners,” and “qual- 
ifiers") which, in the hands of anyone but their 
originator, are likely to prove unduly labored. 
Of particular interest from the standpoint of 
methodology are the opening chapters which 
undertake an analysis of the analytic process 
engaged in by the interpreter, for the purpose 
of demonstrating that “The interpretation of 
TAT stories may employ canons of inference 
long accepted by other sciences.” This exami- 
nation of the logic of deduction is a valuable cor- 
rective to our sometimes myopic research for 
substantiation of hypotheses in a complex ma- 
trix of validating criteria. Nevertheless, some 
students of personality are likely to find in 
Tomkins an over-emphasis upon reductionistic, 
case-effect analysis, impressive in its grammar- 
ian thoroughness, but likely to obscure, espe- 
cially for beginners, less abstract and more il- 
luminating interdependencies in the data. The 
approach is from microscopic to macroscopic 
analysis, whereas the reverse is the method of 
preference for many. The practicing clinician 
is likely to find greatest satisfaction in the later 
chapters in which, after the author has de- 
scribed his tools, he demonstrates their use. 
The sections on personality diagnosis, in which 
the various regions of family, love and sex, so- 
cial relationships, work and vocational settings 
are discussed in significant dimensions, are 
rich in case material, with interpretation mas- 
terfully handled, and new diagnostic insights 
supplied. In the writer’s opinion, a highly im- 
portant contribution is Tomkins’ discussion of 
repression. Here he takes issue with orthodox 
psychoanalytic conceptions regarding the path- 
ogenic nature of “deep” repression, enunciating 
a theory by which the seriousness of conflict is 
judged not by depth of repression, but by the 
intensity and extensity of conflicting wishes 
and the degree of deadlock between them. This 
concept has import for therapy and for theory 
of personality, as well as for the prognostic po- 
tentialities of the TAT. The book opens with a 
useful summary of important research in the 
Thematic Apperception Test and closes with 
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a discussion of the test as an adjunct to therapy. 
Although it is unlikely to fill the need for an 
integrative text in the field, it marks the com- 
ing of age of an important psychodiagnostic 
device, and by reason of its thorough exposi- 
tion and systematic contribution deserves to be- 
come an essential reference in TAT research 
development. 

О R Biol 23:187 Je '48. Stanley B. Williams. 
* Most of the pages of this book are taken up 
with illustrative case material, which is more 
voluminous, perhaps, than is necessary. The 
historical introduction is too brief to be satis- 
factory. Of chief interest are two sections: the 
first, an application of Mill’s principles of logi- 
cal analysis (methods of agreement, concomi- 
tant variation, etc.) to the story material; the 
second, a novel statement of the theory of re- 
pression, cast in quantitative terminology. 
Neither contribution is sufficiently integrated 
with the TAT material. However, despite these 
shortcomings, the book is the best over-all 
treatment of the TAT and will for that reason 
alone find a place in the library of the clinician 
as a useful reference. 


[142] 
*The Travis-Johnston Projective Test: For the 
Exploration of Parent-Child Relationships. Ages 
4-15; 1949; individual; separate sets of 44 pictures for 
boys and for girls; no data on reliability and validity 
with test (for data prepared under the direction of the 
authors and presented elsewhere, see 1—3) ; no manual ; 
no normative data; $19 per test, postage extra; (90) 
minutes; Lee E. Travis and Joseph J. Johnston; 
Griffin-Patterson Co., 544 W. Colorado Blvd., Glen- 
dale 4, Calif. * 
REFERENCES 

т. Harris, Lester Ler. A Clinical Study of Nine Stuttering 
Children: in Group Psychotherapy. Doctor's thesis, University 
of Sothern California (Los geles, Calif.), 1950. (PA 
veis abe oo; Donato Murray, A Study of the Personalities 
of "Stuttering Children and Their Parents as Revealed Through 

rojection Tests. Doctor's thesis, University of Southern Cali- 
fornia (Los Angeles, Calif.), 1950. Qa 

3. Сметни Арта а, aa Слатине Sling 
P namics in Д 
раа бей, University of Southern California (Los ne 
geles, Calif.), 1951. 


Rozert R. Horr, Director, Psychological Staff, 
The Menninger Foundation, Topeka, Kansas. 

Here is another of the all too numerous fam- 
ily of prematurely published, unvalidated pro- 
jective tests, in this case a variation of the The- 
matic Apperception Test for children. The child 
is asked to tell a story (obviously only a very 
brief one) about each of 44 pictures, the stories 
to be interpreted in whatever way the examiner 
chooses or finds possible. 

For a very stiff price, the prospective user 
gets a handsome box filled with 88 pictures, 
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printed on heavy paper (not on stiff cardboard, 
like the TAT). That's all. No manual, no 
norms, no data on reliability, validity, or any 
other published research is furnished ; and only 
a single, dittoed page is included of off-the-cuff 
comment about the'test’s purpose, possible uses, 
and the method of administration, all of which 
is so brief as to be nearly useless. It seems to be 
assumed that the user will be trained in the ad- 
ministration and interpretation of the TAT 
(which is nowhere mentioned, even to acknowl- 
edge any indebtedness of the authors to it). 
There is a brief note to the effect that “stand- 
ardization of test results is proceeding on a 
sampling of approximately 400 children.” The 
least the authors could have done would have 
been to hold up publication until they could pre- 
sent the results of this preliminary study. 

The pictures, then, must stand or fall on their 
own merits. They are not very attractive, rough 
black and white sketches of children and adults 
with a minimum of detail. They do not seem 
very interest provoking—not nearly as much 
so as the gay animal pictures of the Children’s 
Apperception Test. They are a curious compro- 
mise between the contrasting ideals of maxi- 
mum ambiguity and a high degree of structur- 
edness. On the one hand, their sketchiness and 
the stolid formlessness of the faces is apparently 
intended to allow any expression to be pro- 
jected and to “be interpreted the same for any 
culture or socio-economic status"—the last a 
highly dubious, naive, and unproved assertion. 
On the other hand, situations assumed to be 
crucial in socialization, involving “sibling ri- 
valry, child-parent rivalry, discipline, eating, 
sleeping, toilet training, cleanliness and sexual 
development,” are presented in generally un- 
mistakable fashion and usually with all combi- 
nations of ages and sexes rather mechanically 
spelled out. Thus, for boys there are pictures 
of a little boy going to the toilet alone, with a 
woman, with a man, and with a girl watching, 
and of a boy watching a woman, a man, and a 
girl, respectively,—a total of seven pictures in- 
volving flush toilets (which are of course a fea- 
ture of all cultures and socio-economic levels!). 
These are paralleled by a similar series of seven 
for girls. Each situation is treated in this un- 
imaginative, repetitious way. 

The result is a large series of pictures, which 
probably contains a few useful ones not dupli- 
cated in the other competing tests, all of which 
are better established by research: the Murray 
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TAT (series for children), the Michigan Pic- 
ture Test, Symonds’ Picture-Story Test, the 
Children’s Apperception Test, and the Blacky 
Pictures. The prospectiye user who feels 3 
need of pictures not found in these tests, how- 
ever, would do better to draw his own rather 
than buy this portfolio; the results would prob- 
ably be in every way as satisfactory, as well as 
a lot cheaper and simpler. 

Anyone can put together a set of pictures to 
which stories may be told. Before marketing it 
as a test, however, he should furnish some evi- 
dence that his pictures contribute something 
new to the study of personality or something 
more than do the established instruments. 
There is no indication that the Travis-Johnston 
test makes any such contribution. 
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*Twitchell-Allen Three-Dimensional Apperce - 
tion Test. Ages 5 and over; 1948-51; 1 form (48); 
no data on reliability; no norms, manual (748), dittoed 
supplement (’51) ; $37.50 per complete set of 28 plastic 
forms, rd). 10 summary blanks, and 50 recording 
sheets; $1.10 per 10 summary blanks; $1 рег 50 Te. 
cording sheets; postpaid; (40-120) minutes; Doris 
Twitchell-Allen; Psychological Corporation. * 


REFERENCES 

т. TWITCHELL-ALLEN, Doris. "A 3-Dimensional Appercep- 
tion Test: À New Projective Technique." ‘Abstract. Am Psychol 
2:271-2 Ag "47. * (PA 21:4667, title only) 

EDWARD JOSEPH SHOBEN, JR. Associate Pro- 
fessor о} Education, Teachers College, Colum- 
bia University, New York, New York. 

The purpose of this test, presented as a pro 
jective technique, is “Чо elicit the overt expres 
sion of the subject’s interests, needs, goals, sen- 
timents, feelings, and emotions, to simu 
him to project his inner self into the externa 
environment." 

The approach is through the use of 28 am- 
biguous plastic figures varying from geometric 
forms to generalized organic forms to more 
concrete human or animal forms. Subjects are 
confronted with two tasks. In the first, they arè 
asked to choose one or more of the forms gh 
“make up a story" about them, using them ү 
any way they see fit to dramatize their T 
This aspect of the procedure is referred to а 
the Psychodramatic Test. The second task S 
quires the subject to name the various рт 
and explain the determinants of his label ү 
This is called the Naming Test. Recording 
both instances is in terms of reaction time a 
the kinds of gestures and verbalizations е 5 
ject employs. 


5 t 
The recommended analysis of the resulta" 


28) ] TESTS & REVIEWS: 


data shows the relationship of the: Three- 
Dimensional Apperception Test to other pro- 
jective devices. In the Naming Test, the sub- 
ject’s responses are categorized as to content, 
determinants (movement, color, form, texture, 
and size), form quality (F+ and F—), and 
originals. These modalities conform, of course, 
to those used in analyzing Rorschach responses. 
In the case of the Psychodramatic Test, the 
responses are interpreted as dramatized the- 
mata, reflecting characteristics of the subject’s 
personality. This corresponds roughly to the 
type of analysis used with the Thematic Apper- 
ception Test, psychodrama, and various forms 
of play interviews with children. 

If one considers the Three-Dimensional Ap- 
perception Test as a diagnostic test, the first 
question that comes to mind is that of validity. 
There seems to be little helpful data bearing on 
this crucial question. The only reports are of 
the I-know-a-case variety, describing the re- 
sponses of subjects whose pathological charac- 
teristics were known in advance. These re- 
sponses are certainly different, but two queries 
are relevant: (a) Are the differences between 
individuals (or diagnostic groupings) greater 
than the differences within individuals on re- 
peated administrations? and (b) Are the vari- 
ous responses of individuals consistent with 
those of other individuals ordered to the same 
class, such as aggressive, anxious, neurotic, 
schizophrenic, or delinquent? Until answers to 
such questions are provided, it seems unlikely 
that the test can be used diagnostically as any- 
thing other than a kind of informal observa- 
tional situation from which experienced clini- 
cians may be able to draw significant inferences 
about the personalities of their subjects. This 
is not necessarily to suggest that the instrument 
is devoid of diagnostic utility, but it does raise 
doubt as to whether it reduces {һе sources of 
error implicit in clinical judgment. It should be 
pointed out that the function of the psychologi- 
cal examination in the differential diagnostic 
situation has been traditionally to do just this. 

On the other hand, if one regards the Three- 
Dimensional Apperception Test as an ingeni- 
ous stimulus situation, the response correlates 
of which are yet to be determined, a large num- 
ber of fruitful research problems emerge. For 
example, do the individuals who produce large 
numbers of movement responses оп the two- 
dimensional Rorschach also produce large num- 
bers of moyement responses with these three- 
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dimensional stimuli? If they do, it seems much 
more probable that movement is a consistent 
mode of perceptual response and that it is likely 
to be related to other significant characteristics 
of personality. Likewise, interesting questions 
can be raised about the ways in which various 
classes of people respond to the varying degrees 
of structuredness of the stimulus forms. This 
could throw much light on the relative contri- 
butions of external stimulus conditions and 
(eg.) pathological states to the kinds of re- 
sponses obtained. Жл" ' 

Thus, as a research situation and as a con- 
dition under which the skilled clinician may 
possibly make significant observations, the 
Three-Dimensional Apperception Test prom- 
ises a good deal. As a test which differentiates . 
with statable degrees of accuracy among defined 
classes of respondents, it hardly exists, although 
future research and refinement may bring it 
into being. 
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*Visual Motor Gestalt Test. Ages 4 and over; 
1938-46; individual; 1 form, '46; complete manual A 
Visual Motor Gestalt Test and Its Clinical Use (38); 
instructions for administering (’46) ; $1 per set of cards 
and booklet of instructions; $3.50 per complete man- 
ual; postpaid; (10) minutes; Lauretta Bender; Amer- 
ican Orthopsychiatric Association, Inc. * ($1.80 per 
pad of 50 scoring forms (51), $6.50 per manual The 
Bender-Gestalt Test. (51) ; postage extra; Gerald R. 
Pascal and Barbara J. Suttell ; Grune & Stratton, Inc.) 


REFERENCES . 


1-8. See 3:108. E й 
$. BENDER, LAURETTA. “Principles | of. Gestalt in Copied 
Form in entally Defective and Schizophrenic Persons." 
Arch Neurol & Psychiatry 27:661-86 Mr '32. * Pa 8:5498) 
то. BiLLINGSLEA, Евер Y, “Тһе Bender-Gestalt: Ап Ob- 
jective Scoring Method and Validating Results," Abstract. 
‘Am Psychol 1:286 Jl '46. * (РА 20:3688, title only) i 
тї. BintINGsLEA, Екер Y. Reproduction of Geometric De- 
signs and Their Relation to Personality Salients in Humans, 
ponas thesis, Ohio State University (Columbus, Ohio), 
1946. M a 
"S Wiscuner, GEORGE J.; ROTTER, Jurian B.; AND бил, 
ман, Rosert D. Chap. 9, “Projective Techniques,” рр. 144—57. 
In The Psychological Program in AAF. Convalescent Hospitals, 


Edited iy Sidney W. Bijou. Army Air Forces Aviation Psy- 
chology Program Research Reports, Report No. 15. Wash- 
ington, D.C.: U.S. Government Printing Office, 1947. Рр. 


viii, 256. 
13. iu Јонм Experxin, Chap. 16, *'Visual-Motor Tests,” 
pp. 341—9. Та his Errena Techniques; A Dynamic Approach 
to the ersonality, New York: Longmans, Green 
y xvi, 533. * (London: Longmans, Green 
23:1284 
Faro Y. The Bender-Gestalt: An Ob- 
ethod and Validating Data. Clinical Psy- 
New Orleans, La.: the Author, Tu- 


Jane University, 1948. Pp. 27. Paper. (АНД from J Clin 


E. LAKIN; BERMAN, ISABEL R.; AND HANSON, 
Personality. Factors | Associated 
Children. Monographs 
of the Society for Research in Child Development, Vol. 12, 
ial No. 45. Evanston, Hl.: Child Development Publi- 
cations, the Society, 1948. Pp. vii, бо. Paper, lithotyped. * 
17. SEARS. RICHARD, “Castration Anxiety in an Adult as 
Shown b; 
48. * € 


= b. ud 


144° 


18. SoLowoN, Josern C. “Adult Character and Behavior 
Diofdert- J cna Or 93-35 А m = 
19. BARKLEY, BILL J, '* 
‘Kinestheti 


20. BELL, Joun Expverxin, “The Case of G : Psycho- 
logical Test Data." Rorsch Res Exch & J Proj feck 13:155- 
5 по 2 '49. 


hological Principles of the 
N Y Acad Sci 11:164-70 
Mr '49. * (PA 23:5515) 


22. Burteson, Derwoon Eppie. A Personality Study of 
Fourth, Fifth, and Sixth Grade Stutterers and Non-Stutterers 
Based on the Bender Visual Motor Gestalt Test. Master's thesis, 
ptu Ea Pu. (Pittsburgh, Pa.), 1949. (PA 24: 
uu HALPERIN, $рмкү L. “A Study of the Personality Struc- 
tan ы - ‘risoner in Hawaii." (purae I ge 
roj Tech 13:243 no 2 '49. 24:270 е on 
24. Horr «MAX E R rhe Bender Gestalt” Ты. The Case 
-of Greno Interpretation of Test Data: Symposium Presented 
at American Psychol iation Meeting, Denver, 1M» 
orsth Res Exch & J Proj Tech 13:443-6 b 'a9. (PA 24: 


30. Млрроск, Малме E. The Bender-Gestalt Vísual Motor 
Test as an Institutional- 


Instrument in the Discrimination of а 
ized Boys From Those Noninstitutionalized, aster's thesis, 
Fordham University (New York, N.Y.), 1949. 
те éd ti ARGARET J. e Che yin yos 
est Assists „рр. 1 . In Clare: ge Read- 
ing Conference, Foutteench HERE 1949: Conference Theme: 
The Problems and Techniques Involved in Reading Social Re- 
lationships. Claremont, Calif.: Claremont College Curriculum 
Laboratory, 1949. Pp. 191. Paper. * 

27. BALDWIN, MARCELLA ViG. “А Note Regarding the Sug- 
теа Use of the Bender Visual Motor Gestalt Test as a 

easure of School Readiness." J Clin Psychol 6:412-5 О E d 
(PA 25:8375) ч, 

28. Barnes, T. CUNLIFFE. 1 picctroencephalographic Valida- 
tion of the Rorschach, Hunt and Bender Gestalt Tests.” Abstract, 
Am Psychol 5:322 JI 'so. * (PA 25:1 title ) 

зр ANVIK, LzO J., AND ANDERSEN, LLovp. “The Effect 
of Focal Brain Lesions on Recall and on the Production of 
Rotations in the Bender Gestalt Test.” J Consult Psychol 
14:197-8 Je 'so. * (PA 25:503) 
ILDRED, AND HARRIMAN, PHILIP Law- 
Visual Motor Gestalt Test as a Measure 

Deum Readiness." J Clin Psychol 6:175-7 Ap 'so. * (PA 
24:6353 


‚31. NELLY, E. LoweLL, AND Fiske, рохлір W, “The Pre- 
diction of Success in the VA Training Program in Clinical 
Psychology.” Am Psychol 5:395-406 Ag 'so. “А 25:2183) 
,3^ KITAY, JULIAN I. “Ihe Bender Gestalt "l'est as a Pro- 
jective Technique.” J Clin Psychol 6:170-4 Ap 'so. * (PA 


24:6356) 
PASCAL, GERALD R. “Quantification of the Bender Ges- 
Am J Orthopsychiatry 20:418-23 
Ap 'so. * (PA 25:2465) 3 
к 3}, Witson, RoserT С. A Study of Expressive Movements 
in Three Groups of Adolescent Boys, Stutterers, Non-Stutterers 
Maladjusted and Normals, by Means of Three Measures of 
Per. , Mira's Myokinetic Psycho jagnosis, the Bender- 
Gestalt, and Figure Drawing. Doctor's thesis, Western Re- 
ris ‘University (Cleveland, Ohio), 1950. (P. 25:6584, title 


о 
5. WOLTMANN, Арогғ С. “The Bender Visual-Motor Ges- 
nc 


t Test," 322-56. (PA 25:2473) In Projective Psychol- 
ogy: Cli Pproaches to the Total Personality. Edited by 
Lawrence Edwin Abt and Leopold Bellak. New York: Alfred 
A. Knopf, Inc., 1950. Pp. xvii, 485, xiv. * 


36. CALDEN, GEORGE, AND CARP, ABRAHAM, “А Compari- 
son of Three Projective Techniques in a Longitudinal Study 
of a Post-Lobotomized Patient." Abstract. 4m sychol 6:343-4 


Лаг. * 

37. HaLPERN, FLORENCE. Chap. 11, “The Bender Visual 
Motor Gestalt Test,” ФР. 324-40. In An Introduction to Pro- 
ective Techniques. Edited by Harold H. Anderson and Gladys 

ew York: Prentice-Hall, Inc., 1951. Pp. xxiv, 


Anderson, 
720. 

38. Hanvrx, Leo J. “A Note on the Limitations of the Use 
of the Bender Gestalt Test as a Diagnostic Aid in Patients 
With a Functional Complaint." J Clin sychol 7:194 Ap 's1. * 


(PA 25:8215) 
. Нотт, Max L. Chap. 23, “Bender-Gestalt Drawings,” 
pn 227-33. In Thematic Те Analysis, By Edwin S. Shneid- 
i Walther Joel and Kenneth B. 


e. 
man with the collaboration of 

itle. Foreword by. Henry A. Murray. New York: Grune & 
320. * (PA 26: 


Stratton, Inc., 1951. Pp. xi, 3422 a 
40. Lum, Vernon К. A Modified Use of the Visual Motor 
rojective Instrument With Neuropsychiatric 


Gestalt Test as a P. 
thesis, University of Michigan (Ann Arbor, 


TA Doctor's 
ich. 
41. GrRALD R., AND SUTTELL, BARBARA J. The 


), 1951. 
Pasca: E 
Bender-Gestalt Test: Its Quantification and Validity for Adults. 


Foreword by David G. right. New York: Grune & Stratton, 
Inc, 1951. Pp. xiii, 274. * (PA 25:8106) 
Томкі, лхо Fiske, Рохлір W. The Pre- 


42. KELLY, 
diction of Performance in Clinical Psychology. Ann Arbor, 
Mich.: University of Michigan, 1951. Pp. xv, 311. Lithotyped. * 


on of the 
ic Gestalt Test." J Clin Psy- — 


ARTHUR L. Benton, Professor of Psychology, 
State University of Iowa, Iowa City, Iowa. 
The early status of this test is well described 
by the reviews appearing in The Third Mental 
Measurements Yearbook. Line remarked that 
the author's "theoretical statements are hard 
to summarize, and at this stage, to evaluate" 
and the test's “validity and practical value.,.as 
well as its classification, must await further re- 
search." He concluded that it nevertheless “rep- 
resents a very stimulating contribution to a chal- 
lenging and complex field." Rickers-Ovsiankina 
pointed out that "the theory expressed is cer- 
tainly not a theory in the sense that it leads to 
predictions and conclusions or even factual state- 
ments of a specific nature.” She too felt that “the 
study is a stimulating one and the test has con- 
siderable promise for clinical purposes." j 
Now, ten years after these reviews were writ- 
ten, it may be asked how well the promise has 
been performed, During the war, reports of the 
wondrous efficacy of this instrument emanated 
from some military clinical facilities. It was 
therefore with keen interest that clinical psy- 
chologists looked forward to seeing it in action 
in the postwar period. The outcome was most 
disappointing. Watching an "expert," trained at 
one of the military installations, interpret à 
"Bender" was to witness the чен: sort ч 
crystal gazing. It was particularly dismaying t 
] ditio come уне witihe fact tha this 
‚ kind of operation was taking place in many en 
versity clinical facilities, those presumed ies 
of the new “dynamo-scientific” clinical prn 
ogy. Reaction on the part of competent сше 
psychologists was swift and also somewhat d 
generalized. The scorn which was justifia 5 
poured on the crystal gazing extended to the 
instrument itself. PEE S 
However, this general attitude TE a ў 
the part of the more competent clinical psy 9 
ogists has not deterred a few serious attemp ox 
validation, the two major efforts in this dig M 
being the studies of Billingslea (74) and E 
and Suttell (47). Billingslea's results were vt 
most completely negative. Most of the seat 
categories utilized in the application of t den 
during the war proved to have insufficient à di. 
bility for diagnostic purposes and the e "i 
vanced by Hutt? as characteristic of ys it 
formances of psychoneurotics failed to su 
criminative. The findings of Pascal and es 
1 Hutt, Max L. А Tentative, Guide for the Admin tiyadi- 


and Interpretation of the Bender-Gestalt 
tributed by the Author, 1945. 
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were more positive in nature but in general not 
highly impressive. The authors are cautious in 
their interpretations and in many ways this de- 
tailed study is an exploration of possibilities 
rather than a validation in the strict sense. Minor 
studies are reported by Glueck ? and by Hanvik 
and Andersen (29). Glueck found that test per- 
formance failed to discriminate between normal 
subjects and psychotic patients. Hanvik and An- 
dersen found rotation of figures to be character- 
istic of brain-injured patients as compared with 
a control sample. The two groups did not differ 
significantly in respect to number of figures re- 
called correctly. к КЕ. ‹ 

The reviewer's evaluation ОЁ the present 
status of the test is as follows: It belongs to a 
class of test procedures, that of visuomotor and 
visual memory tests, which have been demon- 
strated to possess distinctive clinical merits, par- 
ticularly in the evaluation of cerebral injury and 
disease. Where disturbances in visuomotor be- 
havior and visual perception exist, performance 
on the test should be able to reflect these disabil- 
ities. That it possesses any power to identify 
psychogenic disturbances, as in the psychoneu- 
roses, remains to be demonstrated. The chief 
deterrent to its sagacious use at the present time 
is the confusion engendered by the extravagant 
claims made for it by uncritical enthusiasts. 


Howarp R. Warre, Chief Clinical Psychologist, 
Veterans Administration Hospital, Jefferson. 
Barracks, Missouri. Wee Н, 
The Bender-Gestalt is becoming a frequently 
used test, but only recently has some of the 
needed basic experimental data been appearing. 
One group of workers is concerned with the 
problem of maturation. There is general agree- 
ment that the significant maturation occurs 
within the age range of 4-11 years, with fairly 
stable patterns (5, 35, 41). There is some rela- 
tion to reading factors, but this is limited by in- 
telligence and emotion (7, 27, 30). à 
In the records of adults, those with organic 
brain damage are impressive (5, 35, 41). The 
Bender-Gestalt can be used for screening and 
for prognosis for therapy (41). Significant re- 
lations to schizophrenia are reported (5, 35, 41), 
although it is not primarily a test to establish 
psychiatric diagnosis. There is difficulty in set- 
ting a significant scoring pattern for neurotics 
(14, 38). Significant relations with certain Ror- 
vios. p Pau Ps Lara MEN 


Archivio de Psicologia, Neurologia e Psichiatria 1: 603-64 '40. 
(PA 20:2499) 


schach factors can be found (32). Hutt has 
failed to publish more data since his army re- 
ports, but apparently many workers are apply- 
ing and expanding his suggestions (without 
publishing). „= 

The reviewer concludes that the test has merit, - 
but that it still cries for adequate experimental . 
development. On the basis of current literature, 
it is apparently useful in measuring certain а5-. 
pects of maturation. It is useful in measuring the 
presence of organic brain involvement. It is fre- 
quently useful in recognizing the schizophrenic, 
and at present somewhat less useful with the 
neurotic. It is limited by age and, possibly, by 
low intelligence. The role of cultural influences 
has not been explored. 


For related reviews see 145, 3:109, and 40: 
B643. 

[145] 

*[Re Visual Motor Gestalt Test.] PASCAL, GER- 
ALD R., AND SurTELL, Barsara J. The Bender- 
Gestalt Test: Its Quantification and Validity for 
Adults. Foreword by David G. Wright. New York: 
Grune & Stratton, Inc., 1951. Pp. xiii, 274. $6.50. * 

J Clin Psychol 7:388 О ' 51. * This is the most 
objective research work which has thus far ap- 
peared in this field. 

J Ed Psychol 42:438-9 N '51. Lee J. Cron- 
bach. * Until the publication of this manual, the 
test has been inadequately supplied with scoring 
standards and norms, and validation has been 
much needed. Pascal and Suttell administered 
the tests to hundreds of cases and identified fea- 
tures most likely to occur in records of patients 
as contrasted with normals. These features are 
embodied in a careful set of scoring guides of a 
near-objective sort. Unfortunately the report on 
item validity (Table т) involves faulty statistical 
methods, but the total scoring method appears 
acceptable. Two scorers use it independently, on 
a sample of very wide range, with a correlation 
between scorings of .90. For norms and validity 
studies, the investigators rely on hit-and-miss 
samples, testing whatever groups could be ob- 
tained. Granting that availability is a requisite, 
it is nonetheless unfortunate that so much effort 
should be expended on unplanned samples in 
this and other clinical research. The authors pre- 
sent their findings with due caution and with 
commendable modesty; they often admit to 
having no idea what certain findings mean where 
less restrained clinicians would write several 
pages of rationalization. Yet they are not con- 
stricted: at one point (p. 23) they squeeze a 
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hypothesis out of data from three cases. The 
evidence for the Bender-Gestalt test is seemingly 
solid. Not only does it distinguish patients from 
community persons with bi-serial validities 
about .70; much more valuable in practice, it 
discriminates substantially between patients who 
improve and those who do not, tested at admis- 
sion. The authors discuss the qualitative indica- 
tions that their experience suggests considera- 
tion of, illustrate records of children, organics, 
and psychogenic cases with diagnostic discus- 
sion, and provide samples for training scorers. 
There are one hundred four pages of text, apart 
from appendices, which will be of great assistance 
to those who use the test. Both the style and 
organization of the material are rambling, but 
the authors' ideas advance the usefulness of the 
Bender-Gestalt appreciably. 

Psychiatric Q Sup 25:144 pt 1°51. Pascal and 
Suttell have presented....an objective scoring de- 
vice for the Bender Visual Motor Gestalt Test. 
The method proposed is exhaustively illustrated 
with the scorable deviations in test performance. 
* It is to the authors' credit that the inherent 
limitations of any such device are well recog- 
nized ; and their method is presented in a factual 
manner with a minimum of speculation and 
generalization. The book is of value to the clini- 
cal psychologist ; and, although it is hoped that 
there will be refinement regarding diagnostic 


categories, nevertheless, it lends an objective - 


element toa test that has always been surrounded 
by subjectivity. 


[146] 
X*The Vocational Apperception Test. College ; 
1949; individual; separate sets of pictures for men 
['49] and for women ['49]; no data on reliability in 
manual (fe data presented elsewhere by the authors, 
see I below): mimeographed manual; $7.25 per set of 
18 picture cards (8 for men, 10 for women) and man- 
ual, postage extra; (бо) minutes; Robert В. Ammons, 
Margaret N. Butler, and Sam A. Herzig; Southern 
tae Press, 2122 Confederate Place, Louisville 
y. i 
а) [FORM FoR MEN.] Preference п B areas: teacher, 
executive or office worker, doctor, lawyer, engineer, 
personnel or social worker, salesman, laboratory tech- 
nician. 
b) [ForM FoR woMEN.] Preferences in 10 areas: labo- 
ratory technician, dietician, buyer, nurse, teacher, artist, 
secretary, social worker, mother, housewife. 
REFERENCES 

т. AMMONS, Ковевт В.; BUTLER, MARGARET NEWMAN; AND 

Herzic, SAM А. “A Projective Test for Vocational Resea: 


and Guidance at the College Level.” J Appl Psychol 34: 
205 Je 'so. * (PA 25:38506 wed piss Red 


BENJAMIN Batinsky, Assistant Professor of 


Psychology, City College, New York, New 
York. 
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This test was devised to elicit vocational in- 
terests in the context of personality character- 
istics. The standard paper and pencil measures 
of interest patterns, such as the Kuder Prefer- 
ence Record and the Strong Vocational Interest 
Blank, measure the degree of interest in voca- 
tional areas or in kinds of work without relating 
the interests to personality characteristics. In- 
terest patterns can Ъе considered part of the 
total personality, and measuring them in a test 
that also allows for personality description may 
make the interest patterns more meaningful. 
This is essentially the rationale for the Voca- 
tional Apperception Test. 

In order to measure vocational interests as 
related to personality characteristics, a projec- 
tive technique is used resembling that of the 
Thematic Apperception Test. The situations, 
however, are primarily vocational rather than 
social. 

The test actually consists of 18 line drawings 
on fairly sturdy cardboard 8% by 5% inches 
in dimensions. Ten of the cards are for females 
and eight for men. Each card shows an indi- 
vidual (male or female) in a specific occupation: 
males are shown as teacher, executive or office 
worker, doctor, lawyer, engineer, personnel or 
social worker, salesman, and laboratory techni- 
cian ; females as laboratory technician, dietician, 
buyer, nurse, teacher, artist, secretary, social 
worker, mother, and housewife. In most cards 
other people are depicted so that there is oppor- 
tunity to express social interaction. The, most 
prominently pictured, however, are those in 
the specific occupations. ^ 

The test is individually administered with а 
verbatim record taken. An inquiry would seem 


“necessary after the test to determine the te 


sons for the subject's responses. Clinical inter" 
viewing skill would be required for this pur 
pose. "unt pb: 
The contents of the cards are simple and г А 
tively unambiguous. The cards were pup 
drawn that way after "five preliminary PRT 
for five occupations were used to test ane T 
college men and women informally” (2): у, 
pretesting would by itself be inadequate. A Eis 
rigidly controlled experiment or series 0 li 
periments should have been undertaken to ite 
the content of the cards. Perhaps the an 
In the current projective tests, for example, ри 
Rorschach, Thematic Apperception, Incomp of 
Sentences, the content is not too simple or Ч 


TW 
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ambiguous. This is so to allow for greater range 
and depth of expression. 

Although it is not mentioned by the authors 
of the test, I would assume from my own ex- 
perience that the simplicity and unambiguity 
were believed necessary to obtain expressions of 
vocational interest in the occupations depicted. 
Otherwise the interest patterns might be lost in 
the larger context of personality. 

The manual for the test states that validity 
was indicated by “the clear relationship observed 
between stories and personal information al- 
ready available." This statement is not elaborated 
nor is any data presented in the manual to sub- 
stantiate it. In a later report on the test (1) a 
similar statement is found, but again the evi- 
dence is not delineated. We must assume that 
the evidence exists although it is not specifically 
reported. A brief story given by a male subject 
to one card is offered as an illustration of what 
the test can elicit. This may pass for some evi- 
dence on validity. The story shows evidence of 
personality characteristics such as not wanting 
too much responsibility, wondering if he is go- 
ing to succeed, and not being sure of himself. As 
for vocational interests, the subject would seem 
not to like the kind of work depicted. The re- 
viewer assumes these characteristics were 
checked against other information. Much more 
validity data needs to be cited. 


The manual has no case reports. Scoring and . 


interpretation include many definitions of terms 
in various scoring areas. The areas include gen- 
eral preference for an occupation, reasons for 
entering, those of concern to the individual, 
mechanisms used in solution of conflict and out- 
comes. Interpretation in these terms demands a 
high level of clinical training with many case 
reports studied. 

The authors of the test require users of the 
test to be familiar with vocational problems and 
methods of projective testing. This is rightly 50. 
The consistency of scoring was approximately 
86 per cent for experienced scorers rescoring 
after a week and 69 per cent between experienced 
and inexperienced scorers. These percentages 
are indications of the reliability of the test but 
show the need for caution as well as for experi- 
ence. 

A few more comments need be included. The 
test was standardized on only 40 female and 35 
male subjects, all college students. These are 
small numbers. Many more subjects need to 
be tested. The test is for college people but not 


necessarily for the whole collegiate range. It 
could be that other occupations might be added 
to those already in the test. The cards are not 
numbered. They should be if for no other reason 
than to make identification of the stories easier. 

The Vocational Apperception Test seems to 
rest on a sound rationale. The test itself can be 
considered to have good potential for eliciting 
vocational interests related to personality char- 
acteristics but is in its early stages of develop- 
ment. 


WiLLiAM E. Henry, Associate Professor of 
Psychology, The University of Chicago, Chi- 
cago, Illinois. f 

This test attempts to take cognizance of the 
fact that specific vocational choices, and indi- 
vidual success or failure in those vocations, are 
closely related to broader personality attributes. 
The testmakers here have assumed that the kind 
of personality data elicited by the TAT can be 
similarly elicited by a test which portrays a 
number of specific vocations. It is assumed, how- 
ever, that the personality data so derived will 
have direct applicability to vocational attitudes, 
rather than being of the more general sort de- 
rived from pictures which portray a wider range 
of basic life situations. In so doing, they have 
placed their reliance upon an area of technical 
projective technique method which is at best 
in its early stages of formulation. One may ask. 
to what extent it is possible to determine in ad- 
vance, by picture specificity, the particular area 
of personality or constellation of personality re- 
lated attitudes which will be stimulated. While 
there is little question but what this can indeed 
be done to some extent, I would wonder whether 
the Vocational Apperception Test does not as- 
sume a specificity of attitude formation which 
in fact does not exist in the ordinary person and 
does not rely upon a further untried assumption. 
This latter assumption is that when a subject 
talks about a picture showing, let us say, a 
housewife, she is indeed giving you her attitudes 
toward the vocational aspects of the housewife 
role. Or is she telling you about her image of 
the adult female? Or are these two the same, 
possibly? If they are not the same, then are 
you getting specific attitudes towards specific 
vocations (the housewife) or are you getting 
deeper personality data? If you are getting 
deeper personality data, what evidence is there 
that it is the deeper personality data which struc- 
tures the response toward vocation? If the latter 
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is the fact, then this test will indeed reveal the 
relationship between vocational attitude and 
deeper personality attributes. It is this which 
the test claims to do. 

The real test as to whether the VAT is doing 
what it proposes to do lies in further research 
in this complex area—an area of which the test 
authors are by no means unaware. The system 
for categorizing answers which the authors pro- 
vide seems to me not to be the area upon which 
they should rest their case. It is rather in the 
very qualitative flexibility which the authors 
themselves observe. Further work should take 
the direction of exploring on a broader basis, 
possibly in established vocational groups rather 
than the college population previously used, the 
various relationships between responses to the 
test, vocational attitudes, and deeper personality 
attributes. In a sense, I feel that "scores" have 
been set up prematurely, before the basic nature 
of the data derived from this very promising test 
is understood. 


J Proj Tech 15:534-6 D "sr. George S. 
Rhodes. * the first projective test specifically 
designed to elicit information regarding the 
counselee's vocational goals. In my own ex- 
perience for the past several years, I have felt 
keenly the same inadequacies in vocational coun- 
seling instruments which the authors experi- 
enced and which led to the development of this 
test. First, they recognize that vocational prob- 
lems are but one facet of the individual's total 
adjustment, so the important variables of per- 
sonality and the individual's total social adjust- 
ment must be evaluated in relation to the voca- 
tional problem; second, they describe the 
apparent weakness of objective vocational in- 
terest tests and the questionnaire personality 
tests which are the usual measures of these 
variables in the vocational guidance clinic; 
finally, they attempt to solve this problem by 
adapting the TAT technique to the vocational 
counseling setting. As they state, their purpose 
is, "to provide a projective test that would meas- 

` ure vocational attitudes and interests, and at 
the same time, give information concerning re- 
lated psychological forces operating within the 
individual's personality." The materials of the 
test consist of two sets of 514 X 814 line draw- 
ings. For men, the eight cards portray persons 
engaged in the following occupations: teacher, 
executive or office worker, doctor, lawyer, en- 
gineer, personnel or social worker, salesman 
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and laboratory technician. For women the ten 
pictures depict the occupations of laboratory 
technician, dietitian, buyer, nurse, teacher, ar- 
tist, secretary, social worker, mother and house- 
wife. The plates were designed to portray spe- 
cific occupational situations and yet make the. 
stimulus figures as ambiguous as possible in the 1 
facial expression, posturing and emotional feel- 
ing. Judging by a couple of trial administra- 
tions, the authors have accomplished these ob- 
jectives in plate design. The content of the 
pictures limits the usefulness of the test, par- 
ticularly the set of plates for men. If the test is 
intended to measure vocational attitudes and 
interests it should sample both the longitudinal 
hierarchy of occupations and the breadth of 
the occupational world. For example, without 
unduly lengthening the test, adding pictures of 
a typical outdoor occupation, of a skilled crafts- 
man and of a laboring man should provide more 
adequate information about the client's level of 
aspiration, his attitude toward people in lower 
level occupations and his relative interest in 
the large area of outdoor occupations. The in- 
structions for administering the test are simple 
and concise. The preface to the specific task 
assignment in emphasizing "understanding the 
behavior of others in getting along with them" 
closely resembles the instructions for Sargent's 
Insight Test. The specific directions require the 
client to tell how the person came to be in the 
situation portrayed, how he feels about it, and 
what the future holds in store. The type of pic- 
torial material presented and the slanting of the 
directions, in the vocational counseling setting 
at least, should eliminate most of the resistance 
not infrequently encountered in administering 
the TAT. The administration of the test, the 
recording of the stories and the inquiries are 
essentially the same as for the TAT. For each of 
the two trial cases the total administrative time 
was about one hour. The scoring system pre- 
sented by the authors consists of a qualitative 
scheme for classifying the following informa- 
tion: Vocational attitude or interest. A five 
category scale ranging from “like,” to “dislike, 

with ambivalence added as a final class, is pro" 
vided for rating the client's interest in each vo- 
cation. Reasons for entering the occupation. 
Some 21 different reasons are listed. Areas of 
concern. Eight major areas of conflict with 
seven subareas under personal conflict provid? 
a basis for classifying the problems expresse 

in the stories. Mechanisms used in solution 0 
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conflicts. Ten mechanisms which fairly com- 
pletely cover the responses of individuals to 
conflictful situations are provided. Outcomes. 
Seven story endings ranging from success to 
disaster are given for men and three solutions 
involving marriage are added for women. Some 
figures are given on the reliability of these cate- 
gories: self-agreement for two scorers on five 
protocols of 86%, and inter-observer agreement 
between four graduate students and the experi- 
enced examiners of 69%. Reliability was best 
for general attitude toward occupations and for 
outcomes, worst for areas of conflict. In using 
this scoring scheme I found it occasionally dif- 
ficult to classify the story elements into the 
established categories since the category de- 
scriptions are brief and are not uniform in termi- 
nology. Experimental use of the test will doubt- 
less lead to more adequate systems of scoring 
the responses. Standardization data for the test 
are so limited as to be quite inadequate. The 
authors report experience with only 40 college 
women and 35 college men. The characteristic 
responses of this group in terms of the rough 
scoring scheme have been tabulated, but the test 
user is expected to write to the A.D.I. for micro- 
film copies of the tables, which do not appear 
in the manual! The A.D.I. reference is given 
in the authors’ article in the Journal of Applied 
Psychology, a reprint of which is furnished as 
a kind of supplementary manual. Statistical evi- 
dence of validity is scanty, being suggested only 
in a comparison of general attitude toward 
several occupations and ratings on the Strong 
which yielded Chi-Squares significant at the 
10% level of confidence for women and at the 
2% level for men. In its present level of develop- 
ment, then, the test is little more than an inter- 
esting experiment. If it is to prove useful enough 
in the counseling of clients—especially those 
who are not college students—to justify the time 
it takes to administer and interpret, a great deal 
more work needs to be done on it. The scoring 
categories need to be clarified and their useful- 


ness to the counselor demonstrated. The evi- 
dence presented by the authors suggests that the 
test has merit for the clinically trained voca- 
tional counselor, and my trial experiences with 
it have been intriguing. I wonder, however, 
whether the TAT and the standard measures 
of vocational interest, used along with counsel- 
ing interviews, might not provide more per- 
sonality information and at least as valid meas- 
ures of vocational interest as does the VAT. My 
analysis of trial tests both according to the au- 
thors’ scoring scheme and by my abbreviated 
version of Tomkins’ Scoring of the TAT, in 
the work setting, provided interesting and use- 
ful results, but the variety of information about 
personality obtained does not seem to be as 
great as the TAT affords, because of the struc- 
turing of the pictures. 


[147] 
*The World Test. Ages 2 and over; 1941-50; a re- 
vision of The World Test: A Measurement of Emo- 
tional Disturbance originally published by Psycho- 
logical Corporation in 1941; the earlier test was an 
adaptation of Margaret Lowenfeld’s World Technique; 
individual; 1 form, '49; 160- or 300-item sets; manual 
(749) ; record blank (50) ; $60 per 160-item box of test. 
materials; $100 per 300-item box of test materials; $3 
per 100 record blanks; (20-45) minutes; original edi- 
tion by Charlotte Buhler and Gayle Kelly ; revised edi- 
tion by Charlotte Buhler; published by Charlotte 
iudi 1127 North Sweetzer Ave, Los Angeles 46, 
if. 
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[148] 

ЖА.С.Е.Е. English Usage Tests. Ages 10-13; 1951; 
2 parts; Form C ['51]; no data on reliability and va- 
lidity ; 3s. 6d. per manual ['51]; 6d. per scoring key; 
4s. 6d. per specimen set; cash orders postpaid within 
Australia; 30(40) minutes; Australian ncil for 
Educational Research. * 

г) PART I, WORD USAGE. 25. Id. per 10. 

b) PART 2, SENTENCES. 2s. 6d. per 10. 


[149] 
*Barrett-Ryan English Test. Grades 7-13; 1929- 
vU IBM; Forms I ('29), П ('29), III ('29), IV (40), 
(744), VI (744), 1948, 1949, 1950, 1951; Forms 

1944-47 out of print; manual ('43) ; $1.15 per 25; 35¢ 

г specimen set; separate answer sheets may be used; 
Lom 25 IBM answer sheets; 25% per stencil for 
соеп "answer sheets; postpaid; 50(55) minutes; 
E. R. Barrett, Teresa M. Ryan, E. К. Wood (I-IV), 
and H. E. Schrammel 94-3 ; Bureau of Educa- 
tional Measurements, Kansas State Teachers College 
of Emporia. * 
T [150] 
*Basic Language Skills: Iowa  Every-Pupil 
Tests of Basic Skills, Test C, New Edition. 
Grades 3-5, 5-9; 1940-47; for complete battery, see 
15; 5 (Advanced Battery) or 6 (Elementary Battery) 
scores: punctuation, capitalization, usage, spelling, 
sentence sense p Decker Battery only), total; IBM 
for grades 5-9; 2 levels; Forms L ('40), М (41), N 
('42), О (43) ; manual ('45) ; battery manual ('47) ; 
g* ES specimen set of any one level; postage extra; 

. F. Spitzer in collaboration with Ernest Horn, 
Maude McBroom, H. A. Greene, and E. F, Lindquist; 
Houghton Mifflin Co. * 
а) ELEMENTARY BATTERY. Grades 3-5; $1.60 per 25; 
46(60) minutes. 
b) ADVANCED BATTERY. Grades 5-9; IBM; $1.75 per 
25; separate answer sheets may be used; 756 per 25 
IBM answer sheets; 40 per set of stencils for machine 
scoring of answer sheets; 55 or 58(70) minutes. 


For reviews by Miriam M. Bryan and Anton 
Thompson of the complete battery, see 15; for 
reviews by Frederic L. Ayer, Gustav J. Froeh- 
lich, and Ralph C. Preston of the complete bat- 
tery, see 3:10; for reviews by William A. 
Brownell, J. Murray Lee, and Charles W. Odell 
of the 1937 form of the complete battery, see 
38:872. 
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*California Language Test. Grades 1—4.5, 4-6, 7-9, 
9-14; 1933-50; a revision of Progressive Language 


Tests (see 40:1292) ; a subtest of California Achieve- 
ment Tests (see 2); 3 scores: mechanics of English 
and grammar, spelling, total; IBM for grades 4-14; 
4 levels; Forms AA ('so), BB ('s0), ce ('50), Db 
(750) ; manuals ('50); postage extra; 35€ per speci- 
men set of any one level, postpaid; Ernest W. Tiegs 
and Willis W. Clark; California Test Bureau. * 
а) PRIMARY. Grades 1-4.5; $1 per 25; 16(25) mint 
b) ELEMENTARY. Grades 4-6; уо per 25; ү 
answer sheets may be used; 4¢ per IBM answer sheet; 
7% per scoreze answer sheet; 20€ per stencil for ma- 
chine scoring of answer sheets; 20€ per stencil for 
hand scoring of answer sheets; 26(35) minutes. 

C) INTERMEDIATE. Grades 7-9; prices same as for Ele- 
mentary; 28(35) minutes. 

d) ADVANCED. Grades 9-14; $1.50 
swer sheets may be used; 46 per 
7f per scoreze answer sheet; боё рег stencil for ma- 
chine scoring of answer sheets; 20€ per stencil for 
hand scoring of answer sheets; 31(40) minutes. 


r 25; separate an- 
BM answer sheet; 


GrnALD V. LANNHOLM, Project Director, Edw 
cational Testing Service, Princeton, New Jersey. 
This series of language-correctness tests Ш- 
cludes a test at each of four different educational 
levels from grades 1—14. Four forms are avail- 
able at each of the first three levels, primary, ele- 
mentary, and intermediate. Statements in the 
manual conflict with regard to the number of 
forms (three or four) for the advanced level. 
A separate manual is provided for each of the 
levels. Although much of the material is identi- 
cal in all of the manuals, the material in certam 
sections is modified from one level to another. 
Major emphasis is given to the use of the results 
for identifying specific strengths and weaknesses 
of individual pupils. However, suggestions are 
also presented for the use of the results for 
survey or administrative purposes. 
Each manual contains a fairly detailed de 
scription of the test content, a brief section 0" 
reliability and validity, instructions for prep! 
ing and using a “diagnostic profile,” a discus 
sion of the uses of the test results, directions for 
administering and scoring the test, and a sect 
on norms. In comprehensiveness the manua 
for this series of tests excel most of those whi 
have come to the attention of this reviewer. 
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table of contents would increase its usefulness 
for the busy teacher. 

Each of these language tests is a part of the 
California Achievement Tests, but may be used 
without the other tests in the battery if desired. 
At each of the four levels, the language test is 
divided into two parts. For the primary level, 
the two parts are designated as Mechanics of 
English, and Spelling. At each of the other three 
levels, the parts are Mechanics of English and 
Grammar, and Spelling. To assist the teacher 
in evaluating the pupil's handwriting ability, a 
handwriting scale is also provided at each of 
the four levels. A. sample of each pupil's hand- 
writing is obtained by having him write three 
words which are dictated at the end of the testing 
period. 

In the primary test, the authors decided wisely 
to have the pupils record their answers to the 
exercises in the test booklet. At each of the 
three higher levels, the teacher may have the 
pupils enter their responses in the test booklet 
or on a specially designed “self-scoring” separate 
answer sheet. A single scoring key which may 
be used for any of the forms is provided for each 
of the three tests above the primary level. This 
is made possible by having the pupil select a 
coded response instead of actually writing the 
correct usage. This arrangement probably re- 
duces the possibility that а teacher will use the 
wrong scoring key. 

Despite the claims of the test authors that the 
primary purpose of this test series is to aid in 
the identification of specific strengths and weak- 
nesses of individual pupils, it is this reviewer's 
opinion that the tests will serve better as meas- 
ures of general achievement than they will as 
diagnostic-analytic instruments. This conclu- 
sion is based on the opinion that most learning 
of skills is specific rather than generalized and 
that ability to handle a given sampling of skills 
is not sufficient evidence to demonstrate ability 
to handle a different sampling of skills, even if 
drawn from the same family of skills. It is 
doubted, for example, that one can confidently 
assume from the fact that a pupil can handle 
correctly the use of a comma to set off words 
or phrases in apposition, that he can also use 
an apostrophe correctly to form the possessive 
case of nouns. It should be apparent, therefore, 
that for a test to be effective in identifying the 
specific skills which a pupil possesses or does 
not possess, each of the skills in which one is 
interested should be covered in the test. Further- 


more, a reliable measure of a pupil’s mastery of 
a given skill can be obtained only if the test 
contains an adequate sampling of situations 
which require the pupil to use that particular 
skill. None of the forms of the California Lan- 
guage Test satisfies these requirements, For ex- 
ample, even at the advanced level, the punctua- 
tion test contains only 10 items. This constitutes 
a small sampling, even for obtaining a rough 
indication of a pupil’s general level of punctua- 
tion ability. For obtaining adequate information 
upon which to base a program of remedial in- 
struction, it is entirely unsatisfactory. The ad- 
vanced level test contains four items involving 
the use of commas. If all four items covered 
the same usage, some indication might be ob- 
tained regarding a pupil's ability to handle that 
particular use. But the test would yield no in- 
formation about his ability to use the comma in 
other types of situations. This reviewer laments 
the practice of test authors to label as diagnostic 
nearly any test that samples more than one 
aspect of some field of study, whether or not 
the test includes an adequate sampling of each 
of the specific elements of learning or skill in- 
volved. He considers it equally unfortunate - 
when teachers accept as diagnostic any test which 
is so designated by the author or publisher. 
The authors of this test series are to be com- 
mended for their use of simpler testing tech- 
niques at the lower levels than at the higher 
levels and for their caution to be particularly 
careful about giving directions to primary pupils. 
In the primary level test, the nature of the exer- 
cises resembles more closely real life situations 
than do the exercises used in the higher level 
tests. In the capitalization tests at the primary 
level, pupils are asked to underline each letter 
that should be a capital letter; at higher levels, 
pupils are asked to indicate which of the num- 
bered letters should be capitals. In the primary 
level punctuation test, pupils are asked to place. 
needed punctuation marks in the proper places 
in the material printed in the test book. The 
higher level tests require the pupil to consider 
numbered places in the copy and decide which; 
if any, of four different punctuation marks is 
needed at each place. His decision is recorded as 
a mark under the punctuation selected. Word 
usage and sentence recognition exercises are 
included in each test above the primary level. 
The intermediate and advanced level tests also 
include a section requiring the pupil to classify 
each word in a sentence according to the part 
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of speech it represents. A section covering mis- 
cellaneous elements of grammar and syntax ap- 
pears in the advanced level test. 

The spelling test at the primary level con- 
sists of 25 words which the authors wisely have 
dictated to the pupils who actually write them 
in the test booklet. At the other levels, the 
spelling test consists of 30 groups of words, the 
pupil being required to study each group and 
decide which, if any, of the words is misspelled. 
It is doubted that this technique provides the 
most dependable measure of a pupil's ability to 
spell words correctly in his own writing. Such 
a test would appear to yield more helpful in- 
formation for survey purposes than for diag- 
nostic or analytic functions. 

Three estimated reliabilities are given for each 
of the test levels, the authors stating that the 
coefficients were obtained by "the Kuder-Rich- 
ardson formula" (the manual does not indicate 
which K-R formula was used) for a single grade. 
Separate reliability estimates are given for Me- 
chanics of English, Spelling, and Total Lan- 
guage. In view of the emphasis given in the 
manual on the importance and use of scores on 
the subsections (capitalization, punctuation, 
etc.), it is disappointing to find that information 
is not provided regarding the reliabilities of 
these subsections. No data are given to indicate 
the degree of correlation between scores on pairs 
of the several forms available at each of the four 
levels. 

Several types of norms are given for each of 
the four levels for which the tests are available. 
Percentile norms are given for separate grades, 
for each of the three scores reported on each 
level of the test. For grades one through nine, 
two tables of percentile norms are given for each 
grade. One of these applies to pupils tested 
during the first half of the grade and the other 
to those tested during the second half. Grade 
and age norms are also presented in each of the 
manuals. Finally, a table is provided for use in 
adjusting the norms in relation to intelligence 
quotient medians given for each grade sepa- 
tately. The standardization and norms samples 
appear to be sufficiently large with respect to 
the number of pupils involved. No details are 
given concerning the selection of the schools 
in which these pupils were enrolled. 

The California Language Test is a series of 
tests, available in several forms at each of four 
educational levels. The manuals are quite de- 
tailed and are definitely above average in the 
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amount and value of the information presented, 
Provision is made at each of the three upper 
levels for the use of rapid scoring methods, The 
testing techniques employed have been used 
quite widely, Elaborate directions and sugges. 
tions are given for the use of the results for the 
diagnosis of learning difficulties, It is the opinion 
of this reviewer, however, that in view of the 
limited sampling of the many specific elements 
of usage in this field, more reliance should be 
placed on the test results for determining the 
general level of competence in correct writing 
than for diagnostic and analytic purposes, 


Ковевт C. Роогкү, Professor of English, and 
Chairman, Department of I ntegrated Liberal 
Studies, University of Wisconsin, Madison, 
Wisconsin. 

PRIMARY GRADES 1—3. There are four equiva- 
lent forms of this simple test, making possible 
testing and retesting of the same group with a 
minimum of practice effect. Three elements of 
English written skills are measured : capitaliza- 
tion, punctuation, and spelling. A brief narrative 
contains nine exercises in capitalization and 
leads into a poem of four lines. Correctly capi- 
talizing the first word of each line is the tenth 
exercise. The same narrative and poem are re- 
peated in an adjacent column for punctuation. 
Five exercises are furnished: two terminal 
periods, one comma after "said," one pair of 
quotation marks, and one terminal question 
mark, Spelling is tested by 25 dictated words 
from the first 500 words used in writing. For 
the first grade 15 of these words are used; for 
the second grade, 20. у. 

Clear and easy directions for administration, 
and equally clear instructions for scoring, make 
this test easy to use. The time, 16 minutes, 15 
appropriate to the age and attention span of the 
pupils, and the content of the exercises is x 
rectly derived from the written language skills 
of grades 1—3 and lower 4. The title is incor 
rect; this is not a language test but a test 0 
three skills in written language. Handwriting 
is also measured on a scale, but does not form 
a part of the score. 

An ingenious diagnostic profile chart d 
Possible a quick and simple representation А 
the pupil's achievement in terms of grade bee 
ment and percentile rank. Diagnostic ener 
is provided by a parallel chart. Because dí di 
very few items in the test, the validity of is 
diagnosis may be questioned. Altogether this 
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a simple, specific, and convenient test of writing 
skills. у 

ELEMENTARY GRADES 4-6. Like the primary 
test, this test contains exercises in the mechanics 
of writing and spelling but adds a new section on 
English usage and sentences. Capitalization and 
punctuation are tested in sentences, I5 correct 
responses possible in each. Credit is given for 
correct responses, but no penalty is exacted for 
overcapitalization and overpunctuation. Thus 
the child loses points for failing to make a correct 
response but loses no points for any number of 
incorrect responses. This provision makes the 
test easier to score at the cost of accuracy in 
measuring the child's skill in mechanics. 

The usage section employs the familiar two- 
choice technique: e.g. “Mary is (* awful ? very) 
pretty," in ten exercises. Some usage choices 
are invalid for the elementary grades, such as 
the distinctions between received and got, may 
and can. Sentence knowledge is measured by 10 
exercises, in which the child is to distinguish 
between complete sentences and fragments. The 
spelling test at this level is not dictated, but it 
is a five-choice multiple response exercise in 
which the child selects one of four words as mis- 
spelled or indicates that none is misspelled. 

Except for one or two dubious usage items, 
this test appears to be a convenient, easily scored, 
and reasonably valid test of mechanical skills in 
written English. Handwriting is also scaled but 
does not form a part of the test score. 

INTERMEDIATE GRADES 7-9. The four equiva- 
lent forms of this test are similar in structure 
to the test for grades 4-6, with the addition of 
a new exercise testing knowledge of the parts 
of speech. A sentence of 20 numbered words is 
furnished. In a five-choice multiple response 
exercise the student must indicate the part of 
speech for each word in the sentence. This exer- 
cise is what used to be called “parsing,” a teach- 
ing technique which properly fell into disrepute 
many decades ago. It is distressing to find it 
revived in this test. Since there is absolutely no 
Objective evidence to support the assumption 
that knowledge of parts of speech contributes 
to a child's command of language, this section 
is invalid in a language test and therefore in- 
validates the score of the entire test as a measure 
of language skill. 

ADVANCED GRADES 9-14. This test, prepared 
in three equivalent forms, consists of six sec- 
tions with a total of 110 points distributed as 
follows: capitalization 15; punctuation 10; 
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words and sentences 25; parts of speech 17; 
syntax 13; spelling 30. The first four parts and 
the spelling are constructed on the patterns used 
in the tests for previous grades and’ have been 
described above. The new section, called syntax, 


‘consists of the identification of grammatical 


forms or the classification of kinds of sentences, 


‘Among the items tested are case of pronouns, 


degree of comparison of adjectives, principal 
parts of verbs, and recognition of verbals. This 
section, with the section on parts of speech, de- 
votes more than one fourth of the total points 
of the test to formal grammar. Unless there is 
proof that knowledge of grammar is a factor 
in command of language, these 30 points are 
invalid in a language test, and therefore 
the test is invalid as a measure of language 
skills. 

An evaluation of these tests as a series reveals 
the astounding fact that children are presumed 
to learn nothing more about language other than 
grammatical facts after the fourth grade. No 
item in the advanced tests indicates any advance 
over the elementary tests in English content 
other than in grammar. It is difficult to believe 
that anyone could consider the difference be- 
tween a fourth grade and a twelfth grade pupil 
in written English to lie exclusively in the 
latter's ability to name a noun or select a sub- 
ject, yet these tests force one to that conclusion. 
What factors mark growth in English skill? 
Surely among them are an advance in unity 
and the power to sustain an idea; an advance in 
sentence structure featuring increasing subordi- 
nation ; an increase in the power to organize and 
present materials logically ; a growth in diction 
leading to the use of exact and colorful words 
in effective places. These factors are what good 
English teachers teach; it is upon these and 
similar factors that pupils should be tested. Since 
the California Language Tests ignore these basic 
factors, the use of the tests above the fourth 
grade can yield returns increasingly unrelated 
to the measurement of skill in language. 


For reviews by Warren G. Findley, Alvin 
W. Schindler, and J. Harlan Shores of the 
complete battery, see 2; for reviews by Harry 
А. Greene and J. Paul Leonard of an earlier edi- 
tion, see 40:1292; for reviews by C. W. Odell 
and Hugh B. Wood of an earlier edition of the 
complete battery, see 40:1193; for a review by 
D. Welty Lefever of an earlier edition of the 
complete battery, see 38:876. 
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*College English Test: National Achievement 
Tests. Grades 12-13; 1937-46; 7 scores: punctuation, 
capitalization, language usage, sentence structure, 
modifiers, miscellaneous principles, total; Forms A 
('42—same as test copyrighted in 1937), В (41) ; no 
norms for part scores; manual ('46); series manual 
[44] ; $2.50 per 25; 15€ per manual; 35 per specimen 
set; postage extra; 45(50) minutes; A. C. Jordan; 
Acorn Publishing Co. * 

For reviews by Constance M. McCullough 
and Robert W. Howard of Form A, see 40: 
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[153] 
College Placement Test in English. College; 
1941-43; 8 scores: grammar, punctuation, sentence 
structure, reading, syntax, vocabulary, theme, total; 
Forms А ('41), B (43); no data on reliability and 
Dey AS ан species oot бог пап. огиз MA 
Жн Hector Илл: Turner E. Smith & Co. * я 
CnanLorrE Croon Davis, Test Research Serv- 
ice, 12 Normandy Road, Bronxville, New York. 

In plan and type of item the first half of this 
test is similar to the College Preparatory Test 
in English by the same author. In appearance 
it is much improved since it is spread out into 
twice as much space. The second half of the 
test, to which 60 minutes is allotted, consists of 
the writing of an impromptu theme on one 
of ten reasonably varied and stimulating topics. 
Although it is stated that this composition is the 
most important part of the examination, no di- 
rections for rating it or for combining the rating 
on it with the scores on the other parts are given. 
The condensed norms, which are based on 1,400 
(Form A) or 1,500 (Form B) high school 
graduates, apply only to the total score on the 
first six parts. — 

As a measure of achievement in English, this 
test is open to some of the same criticisms as the 
College Preparatory Test. It seems odd to have 
30 items on punctuation and none on capitaliza- 
tion. Also, reading is such an important and 
complex skill that it would normally be better to 
measure it adequately in a separate test. On the 
other hand, if this test is to be used only for 
placing college entrants in English classes, the 


inclusion of nine reading items may contribute 


to its efficiency. 

Most of the items are satisfactory. More care- 
ful editing might have eliminated such slips as 
defining "aggregate" as "in mass" when the 
context reads "people in the aggregate." Re- 
placing "aggregate" by “їп mass” leads to “in 
the in mass." The word "capital" in "writers 
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make capital out of spicy gossip" probably can- 
not be defined in a single word. Certainly the 
keyed response "profit" is not close enough to 
be acceptable. There are a few inconsistencies 
in format: Part VII of Form A is called 
“Theme” on the cover page and "Composition" 
inside; the various sections are called “Parts” 
on the cover page and “Tests” on the key. 

In summary, this test may have some value 
in college placement in English. However, most 
colleges would hesitate to include theme writing, 
with its attendant difficulties of scoring and un- 
reliability of scores. Normally, a college would 
make much more efficient use of two hours’ 
testing time by giving the Cooperative English 
Tests (either lower or higher level), which are 
completely objective, machine scorable, based 
on a carefully planned outline, well edited, and 
furnished with differentiated norms based on 
thousands of cases accumulated over several 
years of widespread use. 
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College ану Tai in English. High 
school; 1943; 8 scores: grammar, sentence revision, 
comprehension of reading, syntax, vocabulary, sen- 
tence structure, punctuation, total; 1 form; no data on 
reliability and validity; no manual; no norms for part 
scores; $1.75 per 25, postpaid; specimen set not avail- 
able; nontimed (40-50) minutes; Hector H. Lee; 
Turner E, Smith & Co. * 
CuanLorrE Croon Davis, Test Research Serv- 
ice, 12 Normandy Road, Bronxville, New York. 

This test is composed of seven parts, as listed 
above, crowded into seven 514 by 814 inch pages. 
The author has tried to cover so many areas in 
the time and space at his disposal that the items 
are spread very thin over some of them; for 
example, the reading comprehension score 15 
based on four items. Perhaps the author did not 
intend the part scores to be interpreted sepa- 
rately ; however, the total score is also open to 
question as an index of achievement in den: 
preparatory English.” This subject, even the 
part of it that can be measured in tests of M 
type, includes so many important skills that i 
is not possible to cover them adequately in T 
minutes. Nevertheless, a better balanced distri 
bution of items could have been provided. uod 
are I5 items on punctuation and none on E 
talization, 24 on grammar and none on spel il 
Thirty of the 86 items are concerned with "Y 
tence structure. The writer wonders how qu 
fully the author considered the makeup of 


test. ill In 
The individual items are run-of-the-mr^. 
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the first item, the testee is to choose the word їп 
parentheses that makes this sentence correct, 
*He left the plate (sitting, setting) under the 
table." Better than either would be, simply, *He 
left the plate under the table." An occasional 
item is rather silly; for example, Item 18, Part 
6, reads, “Апа I have felt a presence that dis- 
turbs me with the joy of elevated thoughts." 

Some items are badly expressed. Item B, Part 
3, when completed by choice 2, reads, “Coura- 
geous impartiality is hardly to be expected in 
Victorian biography because their natural emo- 
tion was to praise the dead." (Whom does 
"their" refer to? Is praising the dead an emo- 
tion?) The sentence is not much better when 
completed by the keyed answer, “....because it 
was an era of established values accepted with- 
out question." (What does "it" refer to?) Item 
D, Part 3, when completed by choice 4, reads, 
“The author implies that courageous impar- 
tiality is a quality of truth in biography which 
scholarship and patient industry tried to culti- 
vate in earlier times." (Industrious and patient 
scholars may have tried, but not scholarship and 
industry.) It seems to the writer that the author 
of a test in English expression should be es- 
pecially careful to avoid loose and fuzzy sen- 
tences like those cited above. Throughout the 
test there is little evidence of ingenuity in de- 
vising the items or care in editing them. 

The scoring is objective, though several vari- 
ations from the key would surely be permissible 
in Part II, in which the testees write in their 
own revisions of defective sentences. The test 
is not machine scorable. Condensed norms based 
on 2,300 “high school graduates” are provided. 
No additional information about these graduates 
is given. 

, In summary, this test has little to recommend 
1, 

[155] 

*Cooperative English Test: Lower and Higher 
Levels, Grades 7-12, 11-16; 1940-51; IBM; 3 tests 
available separately and in a single booklet edition; no 
specific manual; descriptive folders [Mechanics and 
Effectiveness of Expression, '50; Reading Compre- 
hension, '51]; general Cooperative manual ('51); 
norms ['40]; separate answer sheets may be used; 


cash orders postpaid; Cooperative Test Division, Edu- 


cational Testing Service. * 

4) SINGLE BOOKLET EDITION: LOWER AND HIGHER LEV- 
ELs. Grades 7-12, 11-16; 7 scores: mechanics of ex- 
pression, effectiveness of expression, vocabulary, speed 
of comprehension, level of comprehension, total_com- 
prehension, total; 2 levels; Forms S (Lower Level, 
42—some printings bear 1949 or 1950 copyright; 
Higher Level, ’50—same as test copyrighted in 1942), 
T (о or 'si—same as test copyrighted in 1943), Y 
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(Lower Level, ’51—same as test copyrighted in 1948; 
Higher Level, '48), RX ('49); Forms Q, R out of 
print; $3.90 per 25 of any one level; 50€ per specimen 
set of any one level, postpaid; $1.70 per 25 IBM an- 
swer sheets; 45€ per set of stencils for scoring answer 
sheets; 120(130) minutes. 

b) TEST A, MECHANICS OF EXPRESSION. Grades 7-16; 
Forms S ('42—some printings bear a 1951 copyright), 
T C43), X (47), У ('50—ѕате as test copyrighted in 
1948) ; Forms Q, R out of print; $2.25 per 25; 50¢ per 
specimen set, postpaid; 80€ per 25 IBM answer sheets; 
15¢ per stencil for scoring answer sheets; 40(45) min- 
utes; Geraldine Spaulding, Herbert Danzer (X), and 
W. W. Cook (Y). 

C) TESTS BI AND B2, EFFECTIVENESS OF EXPRESSION: 
LOWER AND HIGHER LEVELS. Grades 7-12, 11-16; 2 lev- 
els; Forms S ('42), T (Lower Level, '43; Higher 
Level, '51—same as test copyrighted in 1943), X (47), - 
Y (Lower Level 'so—same as test copyrighted in 
1948) ; Higher Level, '48; Forms Q, R out of print; 
$2.25 per 25 of any one level; 50€ per specimen set of 
any one level, postpaid ; 80€ per 25 ІВМ answer sheets ; 
15¢ per stencil for scoring answer sheets; 40(45) min- 
utes; Geraldine Spaulding, Miriam May Bryan, and 
Janet Afflerbach (Y). i 
d) TESTS CI AND C2, READING COMPREHENSION : LOWER 
AND HIGHER LEVELS. Grades 7-12, 11-16; 4 scores: 
vocabulary, speed of comprehension, level of compre- 
hension, total; 2 levels; Forms R ('5o—same as test 
copyrighted in 1941), S (Lower Level, '42; Higher 
Level, 'si—same as test copyrighted in 1942), Т 
(Lower Level, 43; Higher Level, 'so—same as test 
copyrighted in 1943), Y (748); Form Q out of print; 
$2.50 per 25 of any one level; 50@ per specimen set of 
any one level, postpaid; 80¢ per 25 IBM answer sheets ; 
15¢ per stencil for scoring answer sheets; 40(45) min- 
utes; Frederick B. Davis, Harold V. King (S), Mary 
Willis (T), Clarence Derrick (Y), Harry R. Neville 
(Y), Jeanne M. Bradford (Y), and Geraldine Spaul- 
ding (Y). 
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such skills may be important in their own right 
and as such constitute legitimate educational 
objectives. If so, then progress in the develop- 
ment of these skills should be appraised peri- 
odically in order to determine the effectiveness 
of those parts of the program that are designed 
to promote these skills. The other is that such 
skills may be related to the ability to use English 
effectively in speech and writing. If so, then 
measures of proficiency in these skills may pro- 
vide an important index of the "real" ability. 
The validity of these Cooperative tests of me- 
chanics of expression and of effectiveness of ex- 
pression must be considered from both these 
points of view. 

The description of the construction of the me- 
chanics and effectiveness tests indicates sources 
that were consulted in preparing the outline of 
topics to be covered and the types of "errors" or 
infelicities to be included. The point of view of 
the testmakers is "modern"; they include as 
sources the Experience Curriculum of the Na- 
tional Council of Teachers of English, gram- 
mars by Curme and Jespersen, and studies of 
present-day usage. This point of view is evident 
in their statement: “All so-called ‘errors’ in 
written English which are considered by most 
linguistic scholars to be good modern usage 
were eliminated." A multiple choice item form 
is used throughout the tests. In the section on 
usage, the subject is asked to judge the cor- 
rectness of four underlined parts of each sen- 
tence and to designate which, if any, of the four 
parts is an incorrect or unsuitable usage. In the 
punctuation section, the subject is asked to select 
from specified alternatives the appropriate punc- 
tuation to be inserted into a designated section 
of an incompletely punctuated passage. Thus 
the materials on punctuation—and those on 
capitalization as well—present incomplete rather 
than mutilated copy to the student. Teachers who 
object to presenting mutilated copy to students 
on the grounds that it tends to reinforce errors 
will prefer the presentation in the Cooperative 
tests to that in other tests, such as the Tests of 
General ‘Educational Development. Those who 
regard the presenting of mutilated copy as neces- 
sary if the actual task of proofreading is to be 
approximated in a test will not. 

The sentence structure and style section poses 
problems of comparing sentences or parts of 
passages and selecting the one that is best ex- 
pressed. In the more recent forms, the active 
vocabulary type of diction item which has been 


criticized hy several persons has been replaced 
by items that require a choice among given 
words of the most appropriate term to insert 
into a given sentence. The later design is prefer- 
able to the earlier one for this section ; however, 
it is possible to quibble with the key since a 
single sentence often is an insufficient context 
to establish one response as the most suitable 
of those given. Also, in some of the items, 
clichés, such as “inhuman” treatment of pris- 
oners, no "extra" charge for engraving, etc., 
are keyed as correct responses. The organization 
section poses problems of classification and or- 
dering of topics and statements. 

The Cooperative tests of mechanics of ex- 
pression and effectiveness of expression are gen- 
erally well-made tests that should be useful as 
measures of the kinds of skills suggested in this 
analysis, Whether or not these are important 
skills must be answered by the prospective user 
of the test. The question of the relationship of 
this type of performance to actual use of lan- - 
guage is not answered unequivocally. The au- 
thors вау: 


The Cooperative English Expression Tests provide 
objective measures of many of the various factors en- 
tering into composition ability. In developing the out- 
line for the tests, an analysis was made of the different 
types of skill and knowledge which go to make up the 
complex ability to express oneself in English, and the 
tests were organized so as to measure as many of these 
types as possible. The tests should therefore provide 
reasonably accurate measures of many of the factors 
involved in skill in written expression. 


That this is likely can readily be granted ; that. 
it is so, is not established. The one validity study 
referred to in the description of the tests is the 
1939 study by McCullough and Flanagan! 
in which Form OM and the 1937 Form were 
correlated with several different criteria, one 
of which was the teacher's estimate of excellence 
in using oral and written English; the median 
of the product-moment coefficients of correla- 
tion between the tests and teachers' estimates 
was .53. As McCullough and Flanagan point 
out, the reliability of these estimates is probably 
low and the estimates were made on a five-point 
scale which introduces an upper limit for that 
is below unity ; this median value, then, is prob- 
ably conservative. However, this is a study ot 
forms of the test that are out of print, and the 
current forms differ in many particulars from 
those studied. 


ohn C. “Тһе 


, Const: M., and Flanagan, 
1 McCullou ‘onstance b gi К Teu 


Validity of the Machine-Scorable Cooperative 
J Exp Ed 7:229-34 Mr '39. * (PA 13:5943) 
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For reviews by J. Paw Leonard, Edward S. 
Noyes, and Robert C. Pooley of Forms R, S, 
and T, see 3:120; for reviews by Robert Murray 
Bear and 7. B. Stroud of Tests Ст and C2, see 
3:497. 


[156] 
*Correct English Usage Test. Grades 9-12; 1944; 
6 scores: nouns-pronouns-verbs, adjectives-adverbs- 
prepositions-conjunctions, capitalization and punctua- 
tion, contractions-spelling-usage, vocabulary, total; 
Forms I, II, III, ТУ; no data on reliability and valid- 
ity; no norms for part scores; no description of norma- 
tive population; manual ['44]; 1-4 copies, 10€ each; 
5-24, 66 each; 25 or more, sé each; 256 per set of 
manual and keys (manual and keys free with orders of 
25 or more) ; specimen set not available ; postage extra; 
e Go minutes; Doris Taylor; Harlow Publishing 
rporation. * 


[157] 

*Correctness and Appropriateness of Expres- 
sion: Iowa Tests of Educational Development, 
Test 3. Grades 9-13; 1942-51 (first published as a 
separate in 1951) ; for complete battery, see 17; Form 
Y-2 ('51—same as 1949 edition) ; manual ('s1); gen- 
eral manual ('s1) ; $3.75 per 25; separate answer pads 
or answer sheets must be used; $1.95 per 25 answer 
pads; $3 per тоо IBM answer sheets; per scoring 
stencil; $2.50 per 25 first semester (46) or second 
semester ('48) profiles for oy one of grades 9-12; 
$1 per 25 self-interpreting profiles for students 51); 
25€ per school summary report ('44); 4€ per pupil 
score sheet ('48) ; 50¢ per specimen set; cash orders 
postpaid; 60(70) minutes; edited by E. F. Lindquist ; 
John ber; Science Research Associates, Inc. * 


For a review by Eric F. Gardner of the com- 
plete battery, see ту; for reviews by Н, enry 
Chauncey, Gustav J. Froehlich, and Lavone A. 
Hanna of Forms X-1 and Y-1 of the complete 
battery, see 3:12. 


[158] 

Hab IX-XII: Achievement Examinations 
for Secondary Schools. Grades 9, 10, II, 12; 1051; 
4 levels; 1 form; no data on reliability and validity ; 
no manual; Minnesota norms (median and quartile 
deviation) available; similar norms for other regions 
by special arrangement with publisher; 7@ per test, 
Ше extra; 60(65) minutes; edited“ y Walter W. 
Cook; prepared by a curriculum committee of high 
school teachers for use in the Minnesota State Board 
Achievement Examinations Program; Educational 
Test Bureau, Educational Publishers, Inc. * 


[159] 

*English—9th Grade: 20th Century Test. Grade 
9; 1048-49; Form А (’48); no data on reliability and 
validity; no norms—author recommends the use of 
local norms ; manual (49) ; IO or more copies, 5¢ each, 
Postage extra; 35¢ per specimen set, postpaid ; 40(45) 
pit i Атав Sanders; Benton Review Publishing 
" [160] 1 

X*English—10th Grade: 20th Century Test. Grade 

I0; 1950; I form; no data on reliability and validity ; 

no norms—author recommends the use of local norms; 
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10 or more copies, Ба еасһ, on extra; 35¢ per 
specimen set, Он; 40(45) minutes; Ardis San. 
ders; Benton Review Publishing Co., Inc. * 
[161] 

X*English Test: Municipal Tests: National 
Achievement Tests. Grades 3-6, 6-8; 1038-39; a 
subtest of Municipal Battery (see 20) ; 5 scores: lan- | 

usage-sentences, language usage-words, punc- 
tuation and capitalization, expressing ideas, total; 2 
levels; Forms А (738), В (’39) ; no data on reliability 
and validity and no description of normative population 
in manual; no norms for part scores; Form A manual 
(38), Form B manual (’39); $1.25 per 25; 356 per 
specimen set; postage extra; 30(35) minutes; Robert 

. Speer and nd Smith; Acorn Publishing Co, * 

[ 162] 

English Test: National Achievement Tests. 
Grades 3-8, 7-12; 1936-44; 2 levels; Forms А ('38), 

B ('38); no norms for part scores; manuals (38); 
series manual ['44]; 35€ per specimen set of any one 
level; postage extra; nontimed (40) minutes; Robert 

К. Speer and Samuel Smith; Acorn Publishing Co. * 

G) GRADES 3-8. 7 scores: capitalization, punctuation, | 
language usage-sentences, language usage-words, ex- 
ressing ideas, letter writing, total; $1.65 per 25. . | 
b GRADES 7-12. 7 scores: word usage, punctuation, 
yocabulary, language usage, expressing ideas, express- 
ing feeling, total ; $2.25 per 25. | 
WiuirnED L, Post, Instructor in English, Dana 
Hall, Wellesley, Massachusetts. ЛАН 

GRADES 3-8. Parts I and II test thoroughly, j 
yet with a minimum consumption of time, h 
student's knowledge of important principles of 
capitalization and punctuation by asking him to 
indicate needed capitals and punctuation Ш 
ten sentences for each section. 

Parts III and IV, Language Usage, ЫТ. 
total of 50 multiple choice items in each of whic 
the student is to select from three or four sen- 
tences the one sentence which is correct. These 
two sections would single out children Top 
illiterate or non-English speaking homes. T id 
value for use in all but underprivileged e: 
munities would be lessened by the large num т 
of options either of the conventionally ШИ 
sort or bearing the stamp of the testmaker aT did 
facturing "bad" English; for example: 4 i 
not ran away,” or “I be home every day, 

"It aren't my book." i Чеш 

Part V, Expressing Ideas, contains 20 HW | 
each giving a sentence in capitals ооң Г 
three other sentences from which the stu jm. 
to select the one which “means the same Wie 
as the sentence in capital letters. a^ Hua 
directions run into inevitable snags a nt 
guage in their assumption that two gi E | 
sentences can ever mean the “same t 2 d 
takes no semantic subtlety to question the p 


"i 
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“means the same” as applied to “Ours is a gov- 
ernment by the people and for the people" and 
“The people rule our country for their own 
good." The difficulty would be obviated by 
changing the directions to: “Draw a line under 
the one sentence which is closest in meaning to 
the sentence in capitals." 

Part VI, Letter Writing, offers ten four- 
response, multiple choice items dealing with 
questions of correct form in social and business 
letters. Aside from misgivings about insistence 
on “Cordially yours" as the closing for a social 
letter to a friend or “Affectionately” as the 
proper signing off of a letter to one's best friend, 
the reviewer questions whether or not objective 
testing is a wise form for this kind of material 
and whether or not the material itself is vital 
enough to justify its place in this kind of test. 

The main strengths of this test are its ease of 
administration and scoring, the variety of skills 
tested in approximately 40 minutes of working 
time, and the swiftness with which it would give 
a teacher an insight into specific strengths and 
weaknesses in a pupil's command of the mechan- 
ics of composition. The most conspicuous lack 
in the test is that it provides no measure of vo- 
cabulary—probably the most significant single 
criterion of a student's background, training, 
and native ability in English. A 25- or 30-item 
multiple choice vocabulary test might, in the 
opinion of the reviewer, take the place of a letter- 
writing section with a resulting increase in the 
worth of the test as a whole. The other general 
weakness of the test is the inevitable presence 
of a considerable number of items either too 
easy for an eighth-grader or beyond the reach 
of a third-grader. This is a weakness inherent 
in any test which seeks to measure people who 
differ so widely in age, training, and experience. 

GRADES 7—12. Part I, Word Usage, consists 
of 30 multiple choice items ranging from a 
choice between literate and illiterate English to 
such finely drawn distinctions as exist between 
“frightened by” and “frightened at.” Except for 
a few seemingly nonfunctioning items such as 
“She believed it to be my,” the test deals dis- 
cerningly with common and important usage er- 
rors, Its keyed answers enforce conservative 
standards of “correct” English. 

In Part II, Punctuation, 20 sentences are to 
be punctuated. As in Part I, conservative prac- 
tice is enshrined by insistence upon a comma 
before and in: “Му sister's headache disap- 


peared, and failed to come back.” Form B, but 
not Form A, does recognize as optional the use 
of the comma before a conjunction joining two 
brief clauses. , 

In its meager 20 items, Part ПІ, Vocabulary, 
secures a range of difficulty from the matching 
of “speak” with “talk” to recognizing “exigen- 
cies” and “requirements” as synonyms. A test- 
ing technique, however, which requires the read- 
ing of six sentences in order to sort out “start” 
and “begin” as synonyms seems needlessly cum- 
brous and time-consuming. 

Part IV, Language Usage, like Part I, se- 
cures range and variety by presenting 45 skill- 
fully devised sentences with directions to un- 
derline those which are correct. 

Part V, Expressing Ideas, contains 15 items 
similar to those in the same part of the ele- 
mentary test and suffering from the same de- 
fects. To imply that ALL MEN ARE CRE- 
ATED EQUAL and “Everyone is born en- 
titled to a fair chance in life” mean “the same 
thing” is linguistically naive. Otherwise, how- 
ever, this part of the test does a difficult and 
valuable job of testing thinking power and grasp 
of such abstract concepts as “Freedom means 
an option between alternatives" or “Courage is 
a mean between cowardice and foolhardiness," 

Part VI, Expressing Feeling, is at once the 
most challenging and the most questionable sec- 
tion of the test. The emotional intensity of 
words is inextricably linked with the tone in 
which they are uttered and with the total context 
out of which they come. From four sentences 
ripped from context, who can select with assur- 
ance or precision the one expressing the most 
anger, regret, confidence, courage, or despair? 
To do so would ignore the sometimes all-per- 
vasive force of tone, understatement, facial ex- 
pression, or temperament and circumstances of 
the speaker. Subtle nuances of language and 
emotion will perhaps always slip through the 
meshes of objective testing techniques. 

Ease of administration, simplicity of scoring 
directions, variety of skills tested in a suggested 
time limit of about 40 minutes are the undeniable 
strengths of this test. The most disturbing ques- 
tion is the degree of finely discriminating differ- 
entiation possible in a test designed alike for 
seventh grade and twelfth. The inherent difficul- 
ties of measuririg by the same instrument people 
of such widely differing ages and experience 
become acute in Parts III, V, and VI. The 
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worth of sections containing only 15 to 20 items 
is impaired by the inevitable presence of items 
too easy for a twelfth grader and items beyond 
the reach of a seventh grader. 


For a review by Harry A. Greene, see 3:126. 
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*English Usage: Every Pupil Test. Grades 3-6, 
7-9, 10-12; 1920-51; new form usually published each 
April and December; 3 levels; form April 1951; no 
data on reliability and validity; no manual; norms 
(’51); 2166 per test; 16 per answer key; postpaid; 
40(45) minutes; Ohio Scholarship Tests, Ohio State 
Department of Education, * 
2) GRADES 3-6. 1930-51. 
b) GRADES 7-9. 1929-51. 
с) GRADES 10-12. 1929-51. 

For a review by J. R. Gerberich of the April 
and December 1946 Forms, see 3:127. 
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Iowa Grammar Information Test. Grades 7-12; 
1935; Forms A, B; $4.50 рег 100; 30f per manual; 
postage extra; 256 per specimen set, postpaid; 25(30) 
minutes; Fred D. Cram and Harry А. Greene; Bureau 
of Educational Research and Service, State University 
of Iowa. * 

Ковевт C. Poorev, Professor of English, and 
Chairman, Department of Integrated Liberal 
Studies, University of Wisconsin, M adison, 
Wisconsin. 

The purpose of this test is to measure quanti- 
tatively the product of grammar teaching in 
terms of the information about grammar ac- 
quired and retained by the student. The authors 
indicate that there is divided opinion concerning 
the value of grammar teaching for the improve- 
ment of English skills, but that an interest in 
grammar for its own sake persists. Most gram- 
mar tests, they point out, are aimed at the func- 
tional results of grammar teaching, or usage. 
This test deals exclusively with the materials of 
grammar as a body of factual information. 

Each form of the test consists of 80 exercises 
of the 3-answer multiple response type (eg. 
Grammar is a science that has to do with (1) 
the earth (2) plants (3) language). The cate- 
gories included, with the number of exercises in 
each, are: kinds of sentences, 4; classes of sen- 
tences, 3; subject and predicate, 2; parts of 
speech, 8; antecedents, 2; phrases and clauses, 
20; complements, 6; comparison, 3; classifica- 
tion of verbs, 8; tense, 3; mood, 3; person, 3; 
number, 4; case, 7; gender, 3; independent ele- 
ments, 2. : 

The tests are completely objective and are 
easily scored with a simple key. The authors em- 
phasize that each form of the test must be 


Bs 
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treated as a single part, to be taken at a single 
sitting and to be scored as a single measure. The 
test is neither diagnostic nor analytic, nor are 
there part scores dealing with separate phases 
of grammar. The single score is an objective 
measure of the student's retention of information 
about grammar. An examiner's manual explains 
the construction of the test forms, presents evi- 
dence for the validity and reliability of the tests, 
and furnishes directions for administering and 
scoring. Percentile norms for grades 7 through 
12 are furnished for the beginning, middle, and 
end of the school year. 

The test is honestly named as a test of gram- 
mar information, and there is every reason to 
believe that it measures what a student has as- 
similated and retained from instruction in formal 
grammar. The authors scrupulously avoid mak- 
ing any claim for the test other than a measure of 
retained information, and they do not. attempt 
to evaluate in any way the place of grammar in 
the teaching of English skills. This restraint is 
highly commendable in a field where tests are 


frequently built on false assumptions and un- # 


sound premises. ' 
The weaknesses of this test derive from the 
inherent weaknesses of English grammar rather 


than from any lack of skill on е. рагі of the au- 4 


thors. How can a test of grammar information 
present crystal clear exercises to the candidate 
when the grammar books themselves are full of 
muddy definitions, confused or inexact terme 
ogy, and contradictory rules? In each form 0 
the test, 10 or more items, or one eighth of the 
test, are invalid for reasons of inexact or faulty 
terminology, bungling definitions, or just domm 
right bad grammar. I am sure the authors coul 
defend each of these items by citation to a gram 
mar book; nevertheless, erroneous information 
should not be listed, no matter what the pii 
For example, the student who takes this tes 
must learn that “Ап infinitive phrase is a phrase 
that begins with to, expressed or understa 
when the żo is not used as a preposition. No 
that the student, in order to identify an inini 
phrase, must look for to; but if it isn’t there, : 
must know that it might be there. Farther mar 
he must be skillful enough to determine whet 
the to which isn't there is or is not a prep 
When he has gone through all this, he is per Ар 
able to discover an infinitive phrase. Has kis 
thing more muddled and clumsy ever been P 
petrated on innocent children ? 5 
Confused апа confusing terminology obsct 
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the value of several exercises. For example, “The 
pronouns J, you, and she illustrate a property of 
pronouns known as (1) number (2) case (3) 
person." Are the pronouns to be considered in- 
dividually or collectively? Individually they il- 
lustrate all three properties; collectively no 
rule can be formed, for you is singular and plural, 
nominative and objective. The three, under spe- 
cial conditions, illustrate "person," but the 
special conditions are not named in the exercise. 
Again, what clarity of grammatical distinction 
is served by an exercise worded like this: “A 
word derived from a verb and serving as a noun 
or as an adjective is a gerund, an infinitive, or a 
(т) subjunctive (2) participle (3) preposi- 
tion"? The only virtue in teaching such terms as 
gerund and participle is to distinguish clearly be- 
tween the noun and adjective functions of words 
otherwise similar. To lump them together this 
way is to create "confusion worse confounded." 
To do this test successfully the student must 
have learned specific and technical meanings for 
such phrases as "related group," "manner of 
assertion," *make complete sense," "base," and 
others. He must know that “А sentence that 
expresses emotion is exclamatory." If he claims 
that the sentence “The little girl sobbed as if her 
heart would break" is exclamatory, who would 
blame him? Not this reviewer, at any rate. 
[165] 

X*Iowa Language Abilities Test. Grades 4-7, 7-10; 
1948, c1946-48; IBM; 2 levels; 2 editions; 256 per 
interpretation manual; postage extra; 35¢ per speci- 
men sét of any one level (does not include interpreta- 
tion manual), postpaid; H. A. Greene and H. L. Bal- 
lenger; World Book Co. * р 
а) ELEMENTARY TEST. Grades 4-7; 6 scores: spelling, 
word meaning, language usage, capitalization, punctua- 
tion, total; 48(55) minutes. 

1) Hand Scoring Edition. Forms A, B; $2.10 per 


25; separate answer sheets may be used; $1.40 per 
25 IBM answer sheets; 80¢ per set of stencils for 
scoring answer sheets. 
2) Machine Scoring Edition. Forms AM, BM; 
$2.70 per 25; separate answer sheets must be used; 
$1.40 per 25 IBM answer sheets; 8о@ per set of scor- 
ing stencils. 
b) INTERMEDIATE TEST, Grades 7-10; 8 scores: same 
as Elementary Test plus grammatical form recogni- 
tion, sentence sense; 46(53) minutes. 
1) Hand Scoring Édition. Forms A, B; $2.65 per 
25; separate answer sheets may be used; $1.40 per 
25 IBM answer sheets; 80€ per set of stencils for 
scoring answer sheets. 
2) Machine Scoring Edition. Forms AM, BM; $3.10 
per 25; separate answer sheets must be used; $1.40 
per 25 IBM answer sheets; 808 per set of scoring 
stencils. 
Marcaret С. McKim, Associate Professor of 
Education, University of Cincinnati, Cincinnati, 
Ohio. 


With the exception of the omission of a sub- 
test on sentence sense at the elementary level, 
these tests cover the aspects of language usage 
typically taught in the upper elementary and 
high school grades. The length of the total test 
allows for at least 50 items in each subtest except 
that of grammatical form recognition at the in- 
termediate level, and for 75 items in the language 
usage section. The latter has separate groups of 
items calling for identification of correct word, 
correct word form, and faulty expressions, but 
provides no separate scores. The authors urge 
teachers to secure a tally of the number of pupils 
missing each separate item, but give no informa- 
tion to show what specific points of usage are 
tested, or how heavily each point is emphasized. 
Inspection of the test suggests that certain types 
of items may recur more frequently than others. 
The most noticeable weakness is in the test of 
sentence sense at the intermediate level where 
the only skill tested is the ability to tell a sentence 
fragment from a complete sentence. Since each 
subtest is given under a short time limit, the test 
as a whole should be thought of primarily as a 
measure of general achievement, not as a diag- 
nostic instrument. 

Although the elementary test is proposed for 
grades 4 through 7, its use with a typical fourth 
grade is open to question. The difficulty of the 
test for children of this grade is indicated by 
the fact that a grade score of 5.0 can be secured 
on three of the five subtests with two fifths or 
less of the items correct. Most open to criticism 
are the spelling and word meaning subtests 
where an attempt to provide for a wide grade 
range has resulted in a limited number of words 
appropriate to the reading and spelling ability 
of the typical fourth grader. The short time 
limits for the subtests are likely to penalize the 
slow reader at any grade, but may work a par- 
ticular hardship on younger children. 

Any language test in which the child chooses 
one of several forms has limitations, in that it 
fails to indicate what the child’s typical pattern 
of response would be were no answers suggested 
to him. The authors recognize this difficulty 
and point to the need for other types of diagnostic 
procedures. The attempt to set up a test which 
can be completely machine scored and easily 
hand scored has resulted in complicated direc- 
tions. On each subtest a child marks his answers 
on a somewhat different basis. On the word 
meaning subtest he must work with two col- 
umns, one for indicating a synonym and the 
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other an antonym of the key word. Where punc- 
tuation marks are to be indicated, the layout 


of the scoring sheet is such that the same mark 


is not always found in the same column. At the 
intermediate level, the test of capitalization calls 
not only for indication of words needing capitals, 
but also for words incorrectly capitalized. For 
elementary children, the wording of the direc- 
tions seems unnecessarily difficult. This will not 
be an easy test to administer, particularly if chil- 
dren have had little experience with standard- 
ized tests. 

The norms are based on a relatively small 
number of cases as yet, but the care with which 
the present standardization seems to have been 
done gives promise of a strong test from the sta- 
tistical point of view. Standard scores allow for 
comparison of subtest with subtest and for di- 
rect comparison with the Jowa Silent Reading 
Test. Both grade norms and percentile scores 
are provided. Corrections have been made for 
the somewhat superior nature of the group on 
whom the tests were standardized. Split half 
reliability quotients for the subtests, based on 
approximately 155 cases at the elementary level 
and тоо cases at the intermediate level are for 
the most part in excess of .88. The lowest relia- 
bilities, at both levels, are for the language usage 
subtests. 

The manual of directions and a supplementary 
manual for interpreting the tests are written 
with care and should be of considerable practical 
use to teachers. Of particular value is a series of 
remedial charts in the supplementary manual 
for interpreting the tests. These give typical 
causes of low test scores and suggest ways of 
securing additional evidence of the deficiency 
and typical remedial procedures. Since teachers 


are urged to study class responses to specific. 


items, a chart showing the distribution of the 

types of items included in the usage, punctua- 

tion, and capitalization subtests would be useful. 
School systems looking for an 


tensive lan- bi jue. : 
guage test will find this one heli eri. $ to TIME how many questions 
above the fifth grade, Examiners should becare- $ п 
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ter and G. D. Stoddard; Bureau of Educati a 
search and Service, State University of low " ү; 
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CLARENCE Derrick, Head, English Section, 
Test Development Department, Educational 
Testing Service, Princeton, New J ersey. 

Even good achievement tests, like most base- 
ball players, have outlived their usefulness after 
20 years of service. Although the Iowa English 
Aptitude Test has been revised since Form À 
first became available in 1925, inspection of the 
present forms suggests that the revisions were 
minor, A quarter of a century ago this test may 
have had some virtues; today it has none. Its 
stated purpose is to differentiate between train- 
ing in English and natural aptitude or fitness 
for English studies. When one analyzes the 
skills measured and considers that the test is 
only 43 minutes long, he must conclude that such 
a pretentious claim must have been made when 
true believers had implicit faith in the efficacy 
of the "new type" tests. 

The test has four parts. In Part 1 the student 
indicates whether given sentences comply with 
a stated and illustrated rule of grammar. The au- 
thors seem to assume that the ability to recog- 
nize whether a pronoun is in agreement with its 
antecedent measures “aptitude,” not “training. 
In Part 2 a reading passage is followed by 10 

_ groups of three sentences. In each group, the 
task is to select the one sentence that pus 
an exact idea given in the passage. A usefu 


technique in evaluating a set of reading items 


s can be an- 


red without reading the passage. The re- 


pos, B f 
fully prepared before attempting to administer viewer obtained perfect scores on Part 2 0 


the test. $71 XV Wort s 

пети 
Iowa Placement Examinations: English Apti- 
tude: Series EA1, Revised. Grades “12-13; 1925- 
26; Forms A (’25), В (’26); no specific manual; se- 
ries manual ['26]; $5 per IOO, postage extra; 60€ per 
specimen set (includes the other ro tests in the series), 
postpaid; soo minutes; prepared under the direc- 
tion of C. E. shore and G. M. Ruch; M. F. Carpen- 


i 
+ 


402 
o) 


- Forms A and B without reading the selection. 


The following typical item shows why such a 
performance was possible : 


es 
Directions: Read the passage as often as neces” 
sary. Do not guess. Put an X in front of the а 
tence in each group which expresses an idea 
. in the passage. ETE 
— Direct quotations should be used only in st 
ries and jokes. TN 
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— Direct quotations are often quite useful in 
humorous stories. 
— A person cannot write a good story without 
using direct quotations. 

Part 3, apparently designed to measure the abil- 
ity to understand poetry, employs the technique 
familiar to users of the Jowa Silent Reading 
Tests. A poem is printed with lines and phrases 
bracketed and numbered. The task is to locate 
the bracketed portions which give the answers 
to unimportant factual questions. The format 
makes an understanding of the poem extremely 
difficult. A student could answer all the ques- 
tions correctly and not have the slightest com- 
prehension of the poem or the least appreciation 
of its value as a work of art. Part 4 is the proto- 
type of a kind of objective exercise that has re- 
ceived rather extensive use since its appearance . 
in the USAFI effectiveness of expression tests. 
A theme is printed in a left-hand column with 
questions about the theme as a whole in a right- 
hand column. For example, the student indicates 
whether commas are never used incorrectly for 
periods, a comma is once used incorrectly for a 
period, or commas are several times used in- 
correctly for periods. Many of the items are time 
consuming and inefficient. More recent uses of 
this item type relate the items to underlined and 
numbered portions of the theme. 

There is no real manual for this test; the gen- 
eral Manual of Directions serves all 11 tests in 
the series. From this source, the test user can 
compare test scores on the English Aptitude 
Test with percentile norms based on a popula- 
tion of 12,056. He has to conjecture whether 
the norms represent high school or college stu- 
dents, When these normative data were gathered 
is as complete a mystery as the reason why this 
test continues to be offered years after it should 
have been respectfully interred. 


W. C. Kvaraceus, Professor of Educational 


Measurement, Boston University, Boston, Mas- _ 


7 
к; 


sachusetts. i №, 

This test is designed to measure, in entering. 
college students, those more specific mental abil- 
ities which constitute factors determining subse- 
quent success in English. This instrument also 
presumes to be an intelligence test of a new 
type which might serve as a special supplement 
to the traditional tests of intelligence since its 
test items are aimed to cover only abilities in the 
area of English. 

This specialized aptit 
four sections: Part 1 (20 items, 


ude test is made up of 
8 minutes) 


measures the ability to comprehend and apply 
two rules of grammar ; Part 2 (10 items, 8 min- 
utes) tests ability to read several paragraphs of 
a college text for accuracy of detailed informa- 
tion; Part 3 (15 items, 12 minutes) involves 
reading comprehension for accuracy in drawing 
inferences or general ideas; and Part 4 (20 
items, 15 minutes) measures ability to evaluate 
various features of a written composition. 

The series manual reports a reliability co- 
efficient of .82 for the total test. Normative data 
include percentile equivalents on more than 
12,000 cases, but the source and nature of the 
normative group are not disclosed. Prediction 
data are reported in terms of an average correla- 
tion of .50 between course grades and tests in 
the aptitude series, but without any mention of 
the types of classes or samples involved. 

Sufficient data are not present in the 4-page 
manual to answer routine questions such as: 
What is the correlation between this test and 
the English training test? Between this test 
and traditional intelligence tests? What are the 
reliabilities of the subtests? What are the inter- 
correlations between subtests? Since this test, 
and the others in the series, have seen rather 
extensive use and tryout in many colleges, a 
substantial bibliography exists to provide some 
answers to these queries—if one has access to 
the primary sources and the inclination to use 
them. The series manual is inadequate as a 
source of information which will enable a test 
purchaser to evaluate the test for personal use. 
It is, however, a good example of the constant 
need for publishers to revise and to supplement 
accompanying test manuals with new validation, 
reliability, and normative data which begin to 
accrue in the literature after the test has been on 
the market for some time. Incidentally, the 
manual suggests 10 uses of the test results 
which, if followed literally, are likely to result 
in 10 abuses. i 

In view of the very restricted coverage of this 


—— instrument, the low reliability of the total test 
_ and particularly of the subtests, and the unrelia- 


bility of the college instructors’ marks used as 
' criteria, it is not surprising that correlations no 
higher than .45 апа .50 have been reported be- 
tween test scores and college grades. 
For a review by Robert C. Pooley, see 3:115. 
[167] 
Iowa Placement Examinations: English Train- 


ing. Grades 12-13; 1925-44; IBM ; 2 editions ; no spe- 
ч manual; M. F. Carpenter, С. D. Stoddard, and 
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L. W. Miller (New Series) ; Bureau of Educational 
Research and Service, State University of Iowa. * 
а) SERIES ETI, REVISED. 102 -26; Forms A ('25), D 
(726) ; series manual ['26] ; $s per 100, postage extra; 
60€ per specimen set (includes the other то tests in the 
series), postpaid ; 40(45) minutes; prepared under the 
direction of C. E. Seashore and G. M. Ruch. 
b) NEW SERIES ET-2, REVISED. 1925-44; also called 
Quick-Scoring Edition; IBM; Form M (41) ; $175 
рег 25; 25€ per series manual (’44) ; 50€ per specimen 
set (includes the other 4 tests in the series), Е 
separate answer sheets may be used ; $3.50 per 100 BM 
answer sheets; 35€ per stencil for machine scor: of 
answer sheets; 25¢ per stencil for hand scoring of an- 
swer sheets; postage extra; 4&5 minutes; revised 
by M. F. Carpenter and D. B. Stuit. 
REFERENCES 

E vc PM Study of the Iowa Place- 
ment Examinations, Master's thesis, University of Iowa (Iowa 
Ci, Тота), yh E.; Larsen, R. P.; AND м, 
aK. "Factors Contributing to Achievement in We Beate at 

lementary German.” Mod Lang J 27:254-62 Ap '43. * 

18. WiTrENBORN, J. R., AND Lamsz, R. P. “А Factorial 
Study P MT ч РШ German." J Ed Psychol 
dr NS KzxNETH Н, Y Predicting Academic Success 


1 
i issions Work.” Jun Col J 19: 8. * 
vid. Оназаос "RICHARD Srepmen, Construction of a Selec- 


do Trat (БСО ев Graduates." J General Psychol 41 1153-93 
CranENCE Derrick, Head, English Section, 
Test Development Department, Educational 
Testing Service, Princeton, New Jersey. 

Tests of the mechanics of English are cur- 
rently in disfavor. The modern English teacher 
is told that, since the function of the gram- 
marian is to describe, not prescribe, he cannot 
provide firm standards of acceptable usage. The 
ability to recognize errors is not perfectly cor- 
related with the ability to avoid such errors in 
written composition; students who obtain high 
scores on a mechanics test frequently write un- 
imaginatively. While all of these things are true, 
to the teacher who still would like to know some- 
thing about the ability of his students to spell, 
punctuate, and recognize common usage errors, 
a good mechanics test can be useful, However, 
the earlier edition of the Iowa English Training 
Test is not a good mechanics test: the later edi- 
tion has limitations but can be used. 

Even if the skills measured were comprehen- 
sive, which they are not, even if all the items 
were beyond cavil, which they are not, the earlier 
edition (Series ETr, Revised) could not be 
recommended. The fundamental reason is that 
there are several mechanics tests which will ac- 
complish the modest objectives of this test with- 
out introducing its limitations—inferior format 
and type, almost exclusive reliance on true-false 
items, cumbersome scoring, an inadequate man- 
ual, and almost meaningless norms. 

In the New Series (ET-2, Revised) many 
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of the inadequacies of the earlier edition haye 
been corrected: the type is more legible, the test 
is set up for quick hand scoring or for use with 
an IBM answer sheet, the section on recogni- 
tion of effective and ineffective sentences is 
eliminated, the manual is greatly improved, and 
percentile norms based on described populations 
are given, Subscores measuring spelling, punc- 
tuation, and usage are obtainable. Spelling is 
tested in the Revised Edition by giving four 
spellings of a word with instructions to select 
the one correct spelling. A more efficient spelling 
item type requires the student to select the one 
misspelled word from five given words, With this 
method many more words can be tested per unit 
of time. The chief criticism of the punctuation 
and usage sections is that, since the items are 
true-false, there is no assurance that the student 
has identified the error and could correct it. 
The New Series ET-2 is to be preferred to 
the earlier form, Series ET1. However, since 
the revised form tests a restricted number of 
skills, with equal weights given to spelling, punc- 
tuation, and usage, and since the norms are 
solely on college freshmen populations, the Eng- 
lish Training Test, New Series, falls short of 


_ being a best buy. P: 


W. C. Kvaraceus, Professor of Educational 
Measurement, Boston University, Boston, Mas- 
sachusetts. 

This is the sister instrument built to accom- 
pany the English aptitude test (see 166) in ji 
Iowa series of 11 instruments for placement 0 
entering college students. It offers a very nat- 
row coverage of a few skills in the language 
grammar area. Forms А and B are made up 0 
four subtests: spelling, punctuation and sef- 
tence structure, language usage, and sentence 
sense. The test purports to determine how W : 
the student has mastered the fundamental e 
tent of the high school English course of study, 
but few teachers of English will grant it апу 
high degree of curricular validity. Little Е А 
ence seems to have been made to any study o 
the actual content of the high school haer 
this area. Dependence for validity in bui ivit 
this test was placed on the experience of co di 
teachers whose chief contact had been w^ a 
select high school group that elected to g 
college. i 

The manual reports a reliability co : 
-90 for Form A. Normative data are in the ci 
of percentiles on a substantial but anonym 


fficient of 
im form 
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sample. In view of the tremendous increase in 
college enrollments in recent years, together 
with a change in the complexion of the college 
population, the use of these norms, gathered in 
the twenties, is hardly defensible today. Under 
the general heading "Predictive Power of the 
Examinations" an average correlation of .60 
is reported for the examinations in the training 
series against the implied criterion of college 
grades. However, lower correlations averaging 
-49 оп 3,450 cases in 28 colleges are reported for 
the English training test in the research litera- 
ture (8, 16). It is interesting to note that average 
correlations between tests in the training series 
and criteria of scholastic success are reported 
in the manual as higher than the correlations 
between tests in the aptitude series and the same 
criteria. In other words, the training tests are 
superior predictors when compared with “ргор- 
nostic" tests in the same battery. However, it 
might be nearer the truth to say that neither 
predicts very effectively. 

"This test and the others in the series have out- 
lived their usefulness long ago. They remain sig- 
nificant historically as we view the development 
of differential aptitude batteries. 

A revised version of this test is now available 
in a new series (the third) in the form of a 
machine scored edition. This last item is in 
three parts: spelling, punctuation, and English 
usage. Normative data are available in terms of 
scale scores and percentiles. 


For a review by Robert C. Pooley, see 3:131. 


[168] 
Los Angeles Diagnostic Tests: Language (A 
Test in Capitalization, Punctuation and Lan- 
guage Usage). Grades 3-9; 1927; discontinued: 
Forms 1-4; no data on reliability and validity; no de- 
scription of normative population; manual [27]; 
30 (35) minutes; Caroline Armstrong; California Test 
ureati, 


GERALD V. LANNHOLM, Project Director, Edu- 
cational Testing Service, Princeton, New Jer- 
sey. 

The major purpose of this test appears to be 
the diagnosis of the individual pupil's difficul- 
ties in correct writing. To assist the teacher in 
classifying and tabulating the errors made on 
the test by each pupil, a diagnostic record sheet 
is provided. 

The test contains 10 short sections which may 
be described briefly as follows. The first section 
presents II sentences and a 4-line stanza from 
a poem for children. Nearly all of the capitaliza- 


tion is omitted, and the pupil is directed to un- 
derline each letter which should be a capital 
letter. In the second section, the pupil is to in- 
sert correct punctuation in six short sentences 
presented without any punctuation. Following 
these are a 9-item section on sentence recogni- 
tion, another capitalization section, a word usage 
section of 20 two-response items, a general punc- 
tuation section, a section on the use of the apos- 
trophe, a section requiring the pupil to form 
adjectives from each of nine proper nouns, an 
exercise involving the writing out of all parts 
of a friendly letter excepting the body, and a 
section directing the pupil to identify the part ` 
of speech of several underlined words or phrases. 
The various sections of the test are not timed 
separately. 

The mechanical features of this test would 
discourage present-day teachers from using it. 
АП responses are entered in the test booklet. 
While such a practice is desirable in the early 
grades, the materials can be arranged to mini- 
mize scoring difficulties without increasing ap- 
preciably the complexity of the pupil's task. As 
this test is designed, the scoring must be accom- 
plished by comparing the pupil's marked booklet 
with a correctly marked copy or with a separate 
sheet which indicates the correct responses and 
the weight each item should receive in the total 
score. This scoring sheet does not have the re- 
sponses arranged with the same spacing as that 
employed in the test booklet. 

For the most part, the usage situations se- 
lected for inclusion in the test are important ones. 
However, the number of different skills tested 
within each area is rather small, especially for 
a test intended to be diagnostic and to be suit- 
able for such a wide range of grade levels. The 
lack of any measure of spelling ability, except 
for the special and limited case of forming ad- 
jectives from a few proper nouns, is surpris- 
ing. The inclusion of an exercise on letter writ- 
ing is commendable, but the scoring of that sec- 
tion will certainly cause difficulties. Testing the 
pupil's recognition of parts of speech as such 
rather than measuring his ability to use them 
correctly is not in conformance with current 
practice in either teaching or testing. The rela- 
tive emphasis given to some of the elements 
tested also appears questionable. In the first sec- 
tion dealing with capitalization, for example, 
about half of the 30 points allowed are for capi- 
talizing the first word in a sentence. While this 
might be justified on the basis of frequency of 
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usage in free writing, it is difficult to defend in 
terms of the percentage of errors. 

This test also has some technical and editorial 
defects. In the first capitalization section, the 
personal pronoun “I” appears uncapitalized in 
some places and capitalized in others, enabling 
the uncertain but observing pupil to make cor- 
rect decisions on a basis other than his mastery 
of capitalization. Other clues to correct capitali- 
zation may be obtained from the punctuation 
section on the same page, the material in that 
section being presented with correct capitaliza- 
tion. Similarly, when the pupil takes the punctu- 
ation section, he may obtain clues from the 
capitalization section. In a later section on capi- 
talization which gives a list of words and terms 
out of context, the pupil's failure to capitalize 
some which carry capital letters on the scoring 
key may result from his interpreting the word 
or term differently from that intended by the 
test author. . 

Even though the primary function of the 
test is purportedly that of diagnosis of specific 
strengths and weaknesses, norms based only on 
total score are presented for each of grades 2—12. 
No description is given of the nature or size of 
the normative population or of the methods used 
in deriving these norms. The norms show exactly 
equal score increments from one grade level 
to the next for the entire range of 11 grades. 
Such results are so unusual that one wonders if 
these data were not intended to represent sug- 
gested standards.rather than summary statis- 
tics based on the actual test scores received by 
pupils at these grade levels. 

The identification of specific difficulties of 
individual pupils is of major importance in assist- 
ing them to master the elements of correct writ- 
ing. Efficient discovery of those difficulties is 
facilitated by one or more good diagnostic tests. 
With many improved tests available in this 
field, it is not believed that this test, unchanged 
for nearly a quarter of a century, will be among 
those chosen for that purpose. 


J 
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*Mechanics of Written English: State High 
School Tests for Indiana. Grades 9-12; 1940-50; 
6 scores: punctuation, recognition of grammatical er- 
rors, sentence structure, vocabulary, spelling, total; 
Form B ['49]; Form A out of print; mimeographed ; 
no data on reliability and validity; no manual; no 
norms for part scores; norms ['50] ; 516€ per test; 156 

r specimen set; postpaid: 50(55) minutes; Ethel 
Toa. J. H. McKee, and G. S. Wykoff; State High 
School Testing Service for Indiana, Purdue Univer- 


sity. 
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*Morgan Test of Modern Engli HR 
9-16 d adults ; 1949-50; title a met bei is 
ern English Usage; IBM; Form A (49) ; manual 
(^50) ; per 25 sets of test and answer sheets; sepa- 
rate answer sheets must be used; 36 per IBM answer 
sheet; $1 per specimen set; sample test free to quali- 
fied test users; cash orders postpaid; 40(50) minutes; 
Antonia Morgan and William J. Morgan; Aptitude 
Associates. * 

Warrer N. Dunosr, Director, Test Service and 
Advisement Center, Dunbarton, New Hamp- 
shire. é 

A competent and critical reviewer of a test 
looks for certain standard information in the 
test manual, as well as for certain characteris- 
tics in the test itself, in determining his evalua- 
tion. These things include such matters as valid- 
ity, reliability, adequacy of norms, utility, etc. 
The manual for the present test appears to be 
singularly lacking in information on these points. 

There is no evidence of the validity of the test 
except the evidence of face validity. A far more 
adequate statement of the purposes for which 
such a test would be used is needed than that 
given in the manual. Certainly, some usages 
which are colloquially acceptable are considered 
to be wrong, although this is said to be a test of 
conversational as well as written English usage. 
It would be interesting and helpful to know what 
authorities were consulted or what the author's 
philosophy is concerning English usage. 

No data whatsoever are given concerning the 
reliability of the instrument. It is difficult to 
account for such an omission for surely some 
such information must be available. | 

The norm population is said to be national 
in scope and to be based upon 3,283 cases, but 
no statement is given as to where these cases 
came from or the conditions under which the 
tests were given. Norms are given separately by 
sex, and there seems to be a slight edge in favor 
of girls over boys in these norm tables. Provid- 
ing the information separately in this way 1$ 
to be considered praiseworthy, but to evaluate 
the differences between boys and girls by means 
of formulas for the standard error of the dif- 
ference is a highly questionable practice. First 
the samples are not random, and formulas, 
strictly speaking, do not apply. Secondly, an 
more important, the question is not whe er 
there is a statistically significant difference but 
rather whether there is an instructionally sig- 
nificant difference. There is no statistical test 
for this. 
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Not much can be said in favor of the mechani- 
cal arrangement of the test or of the manual. 
The manual is badly organized and the state- 
ments concerning the uses of the test leave much 
to be desired. The answer sheet is very difficult 
to use. 

The cost of the materials is excessively high 
considering the nature of the test. 

This test seems to represent an attempt of 
amateurs to play in a professional league. As 
such it is to be discouraged. If someone wanted 
a test of English usage which he would evaluate 
and interpret for himself, he might purchase this 
one to save himself the bother of duplicating the 
materials locally. There would seem to be little 
other justification. 


[171] 
X*Objective Test in Grammar, Grades 10-12; 1950; 
1 form; no data on reliability and validity ; no manual ; 
no norms; 1-4 copies, 15¢ each; 5-1000, 6¢ each; 15¢ 
per key (free with orders of 24 or more) ; as¢ per 
specimen set; postage extra; nontimed (60-90) min- 
utes; Nellie Е. Falk; Perfection Form Co. * 


[172] 
*Objective Tests in English [Grammar]. Grades 
7-9; 1950; 1 form; no data on reliability and validity ; 
no manual; norms ('50); 1-4 copies, 156 each; 5- 
1000, 6¢ each; 156 per key (free with orders of 24 or 
more) ; 256 per specimen set; postage extra; nontimed 
(1000) minutes; Gunnar Horn; Perfection Form 
о. 


[173] 
X*The Purdue Placement Test in English: The 
Clapp-Young Self-Marking Tests. High school 
and college; 1931-41; 8 scores: punctuation, gram- 
matical classification, recognition of grammatical er- 
rors, sentence structure, reading, vocabulary, spelling, 
total; IBM for Forms A, B; Forms A (31), B C32), 
C ('41) ; manual [41]; $1.35 per 25 (Forms A or 
only) ; $2.20 per 25 sets of test and answer booklet; 
separate answer sheets or booklets must be used; $1 
per 25 answer booklets; 756 per 25 IBM answer 
sheets; боф per set of stencils for machine scoring of 
answer sheets; 276 per specimen set; yen d extra; 
Forms A, B: go (4) minutes; Form Crs i min- 
utes; С. S. Wykoff, J. Н. McKee, and Н. Н. Rem- 
mers; Houghton Mifflin Co. * 
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for a Trade and Industrial Education Curriculum," pp. 9-14. 
In Motives and Aptitudes in Education: Four Studies, Edited 
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J Ed Psychol 42:52-8 Ja °51. * (PA 25:6486) 
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*Stanford Achievement Test [Language Arts]. 
Grades 4-6, 7-9; 1941-46, c1940-46; an adaptation of 
the reading, language usage, and spelling tests of Stan- 
ford Achievement Test (see 25) ; 5 scores: paragraph 
meaning, word meaning, language usage, spelling, 
total; IBM; 2 levels; Forms DM (лт), EM (43), 
FM (46); the Manual for Interpreting referred to in 
the directions for administering has not been pub- 
lished; directions for administering ('41); $2.70 per 
25 of any one level; separate answer sheets must be 
used; 856 per 25 IBM answer sheets; 406 per set of 
machine scoring stencils for any one form of any one 
level; 556 per 25 profile charts (’42) ; postage extra; 
356 per specimen set of any one level, postpaid ; 60(65) 
minutes; Truman L. Kelley, Giles M. Ruch, and Lewis 
M. Terman; World Book Co. * 

а) INTERMEDIATE LANGUAGE ARTS TESTS. Grades 4-6. 
b) ADVANCED LANGUAGE ARTS TESTS. Grades 7-9. 


REFERENCES 


1. Eaton, Merritt T. A Survey of the Language Arts 
Achievement of Sixth Grade Children in 18 Counties and Six 
Cities in Indiana. State of Indiana, Department of Public 
Instruction, Research Bulletin No. 3. Indianapolis, Ind.: the 
Department, 1942. Pp. vii, 75. Paper. * (PA 17:3245) 


For reviews by Paul R. Hanna and Claude 
E. Norcross and Vergil E. Herrick of the com- 
plete battery, see 25; for reviews by Walter W. 
Cook and Ralph C. Preston of the complete bat- 
tery, see 3:18. 
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*Test of English Usage. High school and college; 
1950; 4 scores: mechanics of writing, accurate use of 
words, building sentences and paragraphs, total; IBM; 
Forms A, B; $2.50 per 25; 35¢ per specimen set, post- 
paid; separate answer sheets mI used; 76 per 
Scoreze answer sheet; 3¢ per IBM answer sheet; 60€ 
per set of stencils for machine scoring of answer 
sheets; 20€ per set of stencils for hand scoring of an- 
swer sheets; 2€ per 26 or more individual diagnostic 
records; postage extra; 100(110) minutes; Henry D. 
Rinsland, Raymond W. Pence, Betty S. Beck, and 
Roland L. Beck; California Test Bureau. * 


CuanLorTE Croon Davis, Test Research Serv- 
ice, 12 Normandy Road, Bronxville, New York. 

This test consists of 150 four-choice items, 
distributed as follows : Test I: capitalization, 16 
items; use of the apostrophe, 9 items ; and punc- 
tuation, 30 items. Test II: grammar, 52 items 
[33 of these are selecting the correct verb form, 
(e.g., go, going, went, gone) to use in a given 
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sentence]. Test III : sentence structure, 8 items ; 
and paragraph organization, 35 items. 

It is questionable whether sections on sen- 
tence structure and paragraph organization be- 
long in a test of usage. However, the skills tested 
in these sections are important, and it may be 
that the title of the test is too narrow. It might 
be desirable to cut the number of items on para- 
graph organization and to increase the number 
on sentence structure. Too large a portion of 
the test (over one fifth) is devoted to verb 
forms, especially since the type of item used 
does not seem particularly valuable. If this sec- 
tion were substantially cut, more items could 
well be included in the other grammar section, 
as there are many common grammatical errors 
not covered, and perhaps a new section could 
be added. The writer would like to see included 
a section on diction—using the proper word to 
express one's meaning. These items would not 
depend on grammatical principles but would 
test such distinctions as the difference in mean- 
ing between “uninterested” and “disinterested,” 
etc. 

The difficulty level seems about right, since 
much of the material is what might be called 
minimum essentials which every college fresh- 
man, and indeed every tenth grader, ought to 
have mastered. The test might be improved by 
using a few more difficult items in some of the 
sections. 

The test is intended for diagnostic as well as 
survey purposes and is accompanied by an in- 
dividual diagnostic record, for which the manual 
gives detailed instructions. 

The test has been carefully prepared and 
analyzed. Where appropriate, the items are 
based on error counts in pupils’ writings and 
Thorndike’s counts of frequency of use. Kuder- 
Richardson reliability coefficients are supplied 
for the part scores and the total score at grade 
levels 10, 11, 12, and 13. The median coefficient 
for the part scores for a single grade is about 
.85. The coefficients for the total score are all 
.90 or over. The raw correlation between total 
score and grade in English for one sample of 
125 college freshmen is .37. Percentile norms, 
based on 3,841 students in grades 10 through 
I3 in representative school districts and col- 
leges in all parts of the country, are provided 
for each grade level for the three part scores and 
the total score. 

The scores are not corrected for guessing, and 
the directions for administration do not touch 


on this point. It may be noted that a relatively 


ignorant or slow reading testee who guesses. 


freely will tend to have an advantage over one 
who does not. For example, an eleventh grader 
who marks correctly the first 27 of the 43 items 
in Test III and omits the rest, perhaps because 
he does not have time to read them carefully, 
will have a percentile rank of 40. Another eley- 
enth grader who also marks correctly the first 
27 items in this section and rushes through the 
remainder guessing at random will be most likel 
to have a percentile rank of 65. # 

The writer did not notice any ambiguous or 
incorrect items. The punctuation section would 
be improved by testing occasionally for situa- 
tions where no punctuation is needed. Many 
pupils overpunctuate. The capitalization section 
would be improved by placing the words that 
are to be considered for capitalization in the 
order in which they occur in the sentence. Also, 
in this section and others a, b, c, d are used as 
the choice letters for some items and e, f, g, h 
for others; this peculiarity of format introduces 
unnecessary (and unwanted) clerical facility 
into the scores. In this connection, would it not 
be better to use numbers instead of small letters 
for the choices? This might speed up the testing 
(it quickly becomes habitual to call the choice 
in the third position 3, etc.) and might prevent 
difficulty in scoring in the situation where the 
testee writes his answers in the booklet. Certain 
small letters when hastily written are easily con- 
fused, e.g., a and d, f and h. з 

The appearance of the test would be improved 
by spreading the material out and using the two 
blank pages. In the writer's opinion, the green 
printing on pale green paper is not an improve 
ment over the customary black on white. 

In summary, this test appears to be carefully 
constructed and standardized and should be а 
useful measuring and diagnostic instrument. 


[176] 
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*Tests of Language Usage: Active Vocabular 
and Expression: ЖСцоретанув Inter-Americat 
Tests. Grades 8-13; 1950; 3 scores: active io ny 
lary, expression, total; IBM; 2 editions; Englis AS. 
tion: Forms AE, BE; Spanish edition: Forms ive 
BS; no data on reliability and validity; poA 
norms; $2.50 per 25; separate answer sheets m УМ 
used; 80¢ рег 25 IBM answer sheets; 15€ рег Ped te 
stencil; 50¢ per series manual; 50¢ per speci 
(does v — series manual), е ars J 

stpaid ; minutes; prepar 
ue Candi on Modern Languages of ш gos 
can Council on Education, Herschel T. Manua orvision, 
tor of Test Construction; Cooperative Test 
Educational Testing Service. * 
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These tests are described as being 


designed primarily for measuring ability to use, as dis- 
tinguished from ability to understand, English and 
Spanish. * The ability to think of words and the de- 
velopment of a "feeling" for idiomatic expressions are 
especially significant in measuring progress toward 
the mastery of the language. A comparison of scores in 
the second language with scores on similar material 
in the native language provides a basis for interpre- 
tation which is lacking in ordinary tests of foreign lan- 
guages. 

In keeping with these objectives, Part I, Ac- 
tive Vocabulary, directs the students to "find 
the first letter of the [English] word that has 
been omitted from each sentence. The number 
in parentheses is the number of letters in the 
word." The students then record their selection 
of the letter on a separate answer sheet suitable 
for scoring either by means of a perforated 
stencil, or by machine. 

Ё i ; Тат his —(3)—HSFR ү 
t RS edes (OF AHCI 

As shown by these examples, a greater de- 
gree of "active" recall is required than in the 
usual multiple choice type of test. In taking the 
examination, however, the bilingual student, 
like this reviewer, is likely to experience the 
following limitations : у 

а) In completing Item 4 above, а girl might, 
under the pressure of time, be tempted to look 
for the letter D (for daughter) or H (for hija), 
and then, after a little confusion, realize that 
she is his three-letter American son. This might 
involve a little loss of time through confusion— 
especially since this test is for students who are 
presumably not very secure in English. 

b) In completing Item 5 above, the student 
who is more at home in Spanish than in English 
would be “baited into error” by the tempting 
choice of F for frio—a perfect 4-letter word— 
unless he realized that, after all, the English 
word, not the Spanish word is called for. Since 
these momentary but time consuming confusions 
beset the reviewer throughout the test, the com- 
plete avoidance of letters suggestive of possible 
Spanish equivalents is highly desirable in short 


176 


tests for bilinguals in which time becomes a 
pressing factor. 

c) Since many of the vocabulary definitions 
contain words of rarer frequency than the key 
word that is to be identified, the test inevitably 
becomes as much a reading test as a test of ac- 
tive vocabulary. For example, in Item 20, Part 
I, Form BE, the word implement is not found 
among the first four thousand words of highest 
frequency in English, while the key word, plow, 
occurs among the first three thousand: 

A —(4)— is a farm implement for breaking the 
ground. PTS FM 

Apparently only the spelling plow (not 
plough) is acceptable here, although even 
abridged dictionaries usually give both. A su- 
perior student who knows both would be more 
confused here than the less able who know only 
one. 

d) Item 10 of the same section does not legiti- 
mately rule out the word jewel, even though 
only the word crown is acceptable : 


The queen wears a —(5)— on her head. J W Y 
CM 


A more careful tryout and evaluation of the 
individual items would have eliminated this kind 
of ambiguity long before the test reached the 
publication stage. 

e) Item 16 of the same section has a quaint 
foreign flavor in its use of prepositional phrase 
in preference to the almost universal use of 
the possessive construction in real life: 

The son of my uncle is my —(6— PAJSC 

Similar tendencies toward classroom English 
are found in the upper reaches of the test of 
expression, as, for example, in Part II, Item 
43. In this item, the student does not have much 
chance to make a selection in terms of a sentence 
“that is best from the point of view of simple 
speech.” All choices here are rather poor by any 
standards of effective speech or writing. 

Inasmuch as the Spanish forms of the ex- 
amination are merely a translation of the Eng- 
lish forms, limitations of a more serious nature . 
are compounded here. Although the description 
of the Cooperative Inter-American Tests recog- 
nizes the fact’ that “a simple translation of a 
standardized test from one language to another 
was obviously not adequate because of differ- 
ences in speech idiom and in culture," the lan- 
guage tests are nothing else. Each is a direct 
translation from the English without regard for 
the relative difficulty of the same words and ex- 
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pressions in the two languages. For example, 
in Form BS, Part I, the key word corona does 
not occur among the first thousand words of 
highest frequency in Spanish, but in English, it 
does. Although awareness of the existence of 
scientific frequency lists for vocabulary is evi- 
dent from the descriptions of the tests, the re- 
viewer can find little evidence of a serious use 
of them in Spanish as a basis for phrasing the 
items. Neither have the items in the Spanish 
forms been arranged in order of difficulty as 
verified by an actual tryout and item analysis 
of student responses. It is inconceivable that 
the separate tryouts of the forms on different 
language populations would yield exactly the 
same scale positions as in every single form 
of the present tests. Because of this bias, the 
reliability of the Spanish forms is probably lower 
than that of the English forms, though no co- 
efficients of reliability are, as yet, available for 
any of the examinations. 

The nature of the Spanish usage tests (Part 
II in each case) limits their usefulness almost 
exclusively to American students of Spanish or 
to bilingual Mexican-Americans who, through 
long residence in this country, have begun to 
live in a linguistic “no-man’s-land” in which a 
half-breed Spanish or a half-breed English is 
spoken, comparable to the dialect Pennsylvania 
Dutch. 

A native of any Spanish speaking country 
would, by the time he reaches grade 8 abroad, 
be able to score perfectly on Part II simply by 
eliminating usages that jar his ear as badly as 
an off-key note in a well known song. If the 
test is for bilinguals in our Southwest whose 
Spanish has been subject to adulteration, its 
usefulness is likely to be confined to those who 
can read Spanish. Among the majority, how- 
ever, it is a spoken language only. Hence an 
aural comprehension test and an aural vocabu- 
lary test measuring ability to distinguish be- 
tween legitimate Spanish and its adulterated 
forms (e.g., patinar vs skatiar, frenos vs braques, 
etc.) would serve a more useful diagnostic pur- 
pose in their case. 

Although it is true that a direct comparison of 
scores in a second language with scores on sim- 
ilar material in the native language is not pos- 
sible in terms of previously available tests, rela- 
tive percentile ranks on substantial tests of 
known reliability and validity provide a safer 
basis for comparison than direct scores on brief 


tests whose rather superficial construction does 
not as yet meet the standards required for the 
purpose. Ж 
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*Tools of Written дермі: State High School 
Tests for Indiana. Grades 7-8; 1941-51; Forms A 
Га], B ['з1]; Аарц; no data on reliabili 
and validity ; no manual ; Form A norms ['45], Fame 
norms ['51] ; 514€ per test; 15¢ per specimen set; post- 
paid; 27(35) minutes; W. R. Thompson (A), NL 
Gage (A), and Margaret Hackett (B); State High 
School Testing Service for Indiana, Purdue Univer- 
sity, * 
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*College Entrance Examination Board Achieve- 
ment Test in English Composition. Candidates for 
College entrance; 1943-51; available only in College 
Entrance Examination Board Admissions Testing 
Program (see 526); 60(70) minutes; prepared by 
College Entrance Éxamination Board Committee of 
Examiners in English Composition in cooperation with 
the Staff of Educational Testing Service: 1951 mem- 
bership: Edward S. Noyes (Chairman), R. Stanley 
Peterson, Charles M. Rice, Priscilla Tyler, and Edna 
R. Williams; program administered by Educational 
Testing Service for the College Entrance Examination 
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During the 1930's a number of ingenious 0 3 
jective techniques for measuring various сй 
ресіѕ of the ability 'to write correctly йү 
developed, and by'1940 tests employing com 4 
nations of these techniques were widely eat 
Validity data showed that some of them boit 
quite promising. However, until 1947 eni 
techniques were not made use of in the Ee 
Composition Test of the College Entrance a 
amination Board. In fact, as late as 1948, 


TUM 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ jd 


315 ] 


ward S. Noyes, a member of the College Board 
Committee of English Examiners, wrote of some 
of these techniques, “one should beware ac- 
cepting their results as a true indication of 
‘effectiveness of expression,’ which, after all, can 
only be discovered by giving the student a blank 
page and a pencil and setting him to write” 
(3 : 120). 

The attitude represented by this statement, 
which is composed of a straw man (no techni- 
cian accepts any test result as a "true" indica- 
tion of ability) and a misleading statement 
(setting a student to write a single composition 
under highly artificial conditions cannot yield a 
"true" indication of his effectiveness of expres- 
sion), did not prevent the CEEB Committee of 
Examiners in English from trying out an ob- 
jective section in their April 1947 examination. 
Their findings indicated that a full-length (60- 
minute) test composed of essay material would 
have “markedly less predictive value [of teach- 
ers’ ratings of ability to write expository prose, 
and course grades in English] than a full-length 
test composed entirely of objective material" 
(5). Since 1947 the English Committee has de- 
veloped or adapted, tried out, and used a variety 
of objective approaches. 

Five forms of their English Composition Test, 
appearing between August 1950 and May 1951, 
were received for review together with analyses, 
descriptive statistics, and reports of research 
studies. No two of the forms are alike in 
make-up, though the same type of exercise and 
even a few of the same items appear in more 
than one form. This is unfortunate since the 
most compelling reason for using these tests is 
that they are not likely to have been seen by 
examinees in advance of the examination period. 
If items are reused, this merit is to some extent 
lost. 

The content of the five forms reviewed was 
as follows: 


Form WAC4: Correcting Usage and Punctuation 
in wA (35 e^ ге (65 items), and Para- 
graph Organization (34 items). mer 

orm YAC4: Correcting Usage and Punctuation in 
Sentences (45 items), Poetry Completion (15 items), 
and Paragraph Organization (41 items). } 

Form 7АСт: Literary Comprehension (10 items), 
Editing for Usage and viii uia (50 items), and 
Paragraph Organization (39 items). $ 

For P AC: Editing for Usage and Punctuation 
(47 items), and ESTA Sentences (not objectively 
scorable, 42 "items"). j; 

Form ZAC. з: Editing for Usage and Punctuation 
(45 items), and Identification of Errors in Grammar 
and Usage (30 items). 
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Some of the types of items do not seem to 
measure primarily aspects of English composi- 
tion. One section, for example, presents several 
four-line stanzas of poetry from each of which 
one line has been omitted. For each stanza, the 
examinee is to select from five single lines the 
one that best fits into it. These items, though in- 
teresting and well-constructed, appear to meas- 
ure aspects of literary appreciation and compre- 
hension rather than composition. 

Close inspection of the individual items indi- 
cates that considerable thought and ingenuity 
have gone into their preparation. However, 
some criticisms may be made of them. The items 
designed to test ability “to understand the or- 
ganization of a piece of writing" seem distinctly 
below standard. Some of them appear unrelated 
to the organization of the material but test the 
ability to make inferences from the content ; e.g., 
*We may infer that the writer is (1) a biolo- 
gist, (2) a physicist," etc. Other items have 
more than one defensible answer ; e.g., the stem, 
“Tf the passage were written for newspaper pub- 
lication, it might well begin," is followed by five 
choices, any one of which might, under certain 
circumstances, be an answer. Other items are 
ambiguously phrased; e.g., "The author's own 
position first becomes apparent in (1) para- 
graph т, (2) paragraph 2," etc. The answer 
hinges on whether the item means, “When is 
the position later identified as the author's first 
indicated ?" or “When does it first become clear 
what position the author subscribes to?" 

Another highly ambiguous item of a different 
type is the following, in which the examinee is 
to indicate which, if any, of the numbered words 
is incorrect: "Happiness is one of the intangi- 
bles which is wanted by almost everyone." Those 

lig 3 4 5 

who object to the use of "which" in a restrictive 
clause will argue that the answer is r. Others 
will take the sentence to mean, “Happiness is 
one of those intangibles which are wanted by 
almost everyone," and will mark 2. Still others 
will contend that the sentence means, “Наррі- 
ness is that particular one of the intangibles 
which is wanted by almost everyone,” and will 
mark choice C, indicating that the sentence is 
correct as it stands. 

The instructions about guessing are inade- 
quate; the directions do not say whether the 
scoring includes a penalty for incorrect re- 
sponses. Unless there is, time spent in “hap- 
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hazard guessing" is far from wasted—a fact 
widely known among high school students and 
almost universally known among preparatory 
school students especially coached to take the 
College Board examinations. 

The test items and the response spaces on the 
answer sheets could be geared to each other in 
such a way that the examinee would not have 
to give so much attention to keeping his place. 
The directions for the paragraph organization 
sections (in which the sentences of a paragraph 
are scrambled and the examinee is to arrange 
them in proper order) are not easy for the in- 
experienced examinee to follow. 

It is gratifying to find in recent bulletins and 
memoranda of the Educational Testing Service 
that staff members are now devoting time and 
energy to the preparation and assembly of Eng- 
lish composition tests for the College Board. 
This task ought to be easier than that of pre- 
paring tests for general school use. The CEEB 
tests have an assured market and are constructed 
for a narrow, well-defined purpose—that of 
assisting in the selection of students for entrance 
to certain colleges. They need not be suitable for 
diagnostic use or for discriminating among pu- 
pils at all levels of ability. The examinees are 
relatively homogeneous. 

When one considers that the Board has been 
setting examinations in English for a half cen- 
tury, the five forms under review are not im- 
pressive. Any one of them must be regarded 
as unnecessarily inadequate as a measure of 
ability in English composition, despite some 
statistical data indicating moderate correlations 
with previously obtained grades in English and 
with teachers' ratings of ability to write. If the 
rank and file of teachers of college-preparatory 
English could have the same opportunity as the 
reviewers to study these examinations, it is hard 
to believe that a chorus of disapproval would 
not arise. 

"The reliability data that the reviewers have 
seen were computed by means of Kuder-Rich- 
ardson formula No. 20 except in two instances. 
Coefficients obtained for total scores on the 
various forms are reasonably high: .9r for 
Y ACA, .92 for ZAC1, .93 for ZAC2, and .84 
for ZAC3. All of these coefficients may be 
spuriously high, in varying degree, because in 
no instance did every examinee have time to 
finish, although for most parts of the various 
forms a large proportion of the examinees did 
consider every item. 
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Hudelson's Typical Composition Ability Scale, 
Grades 4-12; 1923; І form ['23] ; no data on reliabili 
and validity and no description of normative po 

tion in manual (for data presented elsewhere by 
author, see 1); 6¢ per scale (only 1 copy needed per 
class) ; 12¢ per manual; postage extra; 24¢ per speci 
men set, postpaid; 15(25) minutes; Earl Hude á 
Public School Publishing Co. * * 


REFERENCES 

1. Ниркїзох, Eart, English Composition: Its Aims, Meth 
ods, and Measurement. Discussion by James F, Hosic, M, H, 
Willin; b and S. A. Courtis, The Twenty- 
Second Yearbook of the National Society for the Study of 
Education, Part 1, Bloomington, Ill.: Public School Publishing 

172. 

UDELSON, EARL, “The Effect of Objective Standards 
B» Composition Teachers’ Judgments.” J Ed Res 12:329-40 

25. 


з. Hutren, C. E. “The Personal Element in Teachers’ 
Marks.” J Ed Res 
4, Нилтех, C. E. Ex, 


Upon 
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5. KNUDSEN, билка Ww. D Test of the Praia 


Wonrn J. Оѕвовх, Professor of Remedial and 
Experimental Education, University of Wash- 
ington, Seattle, Washington. [ 

According to the author, the aims of this scale 
are: 


(1) to test impartially the various methods of ae 
ing composition by measuring their results: (2) 
measure those results in accurate, objective, stable, fu 
understandable terms; (3) to furnish a common id 
for comparing the writing proficiency of обеси р» 
pils within the same class or school or that o pup; 
in different classes or schools; (4) to classify pup! 
fairly in compositions; (5) to grade them justly ыз 
their group; (6) to enable teachers to discover P 
reliability in judging the general merit of Englis elf 
position; and (7) to furnish pupils an incentive to 
competition. 


The paragraphs in the scale were selected 
from “thirty-two representative theme assig" 
ments which inspired responses most typical si 
the average quality of composition work dm 
on all assignments" by nearly 800 шуу И 
senior high school pupils in the states of iue 
Virginia and Indiana. Each pupil wrote ГӨ 
compositions on related subjects and these a : 
rated independently by eight experienced jud a 
In order to get the best possible compos! ed. 
from each pupil, motivation was pro 
Stories were read to the pupils for complet! З 
Stimulus words were given to incorporate M { 
composition. Thought provoking titles were A 
gested such as “А Trip to Heaven, A з 
versation with Mars” and the like. Pract! 
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paragraphs were provided for use by the inex- 
perienced teacher. 

Work on this scale, which was constructed 
28 years ago, was done by using the best tech- 
niques and procedures available at that time. 
Hudelson must have done the best possible job 
because no advance has been made upon his 
work since that time. Unfortunately, however, 
the scale suffers from two very serious disad- 
vantages which seem to be inherent in all scale 
making. In the first place, scales are not diag- 
nostic. If a pupil gets a low score the teacher 
has no means of knowing what is lacking or 
what to do about it. The second disadvantage is 
lack of reliability. The correlation of the median 
judgments of the eight experienced judges on 
the paired compositions ranged from .69 to .84. 
Had there been only one judge (as is the case 
when one teacher uses the scale) the reliability 
coefficient could hardly have been more than 
.40. This is not much better than sheer guess- 
ing. 

The probable error of a single estimate, ap- 
proximately .5, is very high. According to the 
standards a 4th grade pupil is supposed to write 
a composition of value 3.0; a 12th grade pupil, 
one of value 6.7. The entire difference is 3.7. The 
average growth per year for the eight years is 
46. But the probable error of a single rating by 
an experienced judge is .5. This means that 
even an experienced judge will misplace a child's 
composition by less than a year on the scale half 
of the time, During the other half he will mis- 
place it by more than a year. Results more reli- 
able than these can be obtained by simply rank- 
ing the pupil's composition in terms of the num- 
ber of running words which he writes. This large 
unreliability of experienced judges should re- 
mind English teachers everywhere of what a 
stupid waste of time it is for them to assign 
grades to pupil's compositions. Perhaps it should 
also spur some brilliant genius to devise a meth- 
od of rating compositions that really is worth 
while. 

In spite of these serious defects the scale is 
worth the 20 cents that it costs. Using it will 
focus our attention on the tremendous impor- 
tance of English composition and upon the ur- 
gent need for a better means of rating it. Per- 
haps the best thing of all to do is to buy the scale 
and require each member of a class to rate the 
composition of each of his classmates. An aver- 
age of these ratings is the most accurate measure 
that is now possible. 
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*Ability to Interpret Literary Materials: Iowa 
Tests of Educational Development, Test 7. 
Grades 9-13; 1 51 (first published as a separate in 
1951) ; for complete battery, see 17; Form Y-2 (’51— 
same as 1949 edition) ; manual ('51); general manual 
(51) ; $3.75 per a separate answer pads or answer 
sheets must be used; $1.95 per 25 answer pads; $3 per 
100 IBM answer sheets; 50€ per scoring stencil ; do 
per 25 first semester (/46) or second semester ('48) 
profiles for any one of grades 9-12; $1 per 25 self- 
interpreting profiles for students ('51) ; as per school 
summary report ('44) ; 4€ per pupil score sheet ('48) ; 
50¢ per specimen set; cash orders postpaid; 50(55) 
minutes; edited by E. F. Lindquist; Julia Peterson; 
Science Research Associates, Inc. * 


For a review by Eric F. Gardner of the com- 
plete battery, see 17; for reviews by Henry 
Chauncey, Gustav J. Froehlich, and Lavone А. 
Hanna of Forms X-1 and Y-1 of the complete 
battery, see 3:12. 
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*American Literature: Every Pupil Test. High 
school; 1934-51; new form usually published each 
April and December; form December 1951; no data on 
reliability and validity; no manual; norms (’51) ; 20 
per test; 1$ per answer key; postpaid; 40(45) min- 
utes; Ohio Scholarship Tests, Ohio State Department 
of Education. * 


[182] 
*An Awareness Test in 20th Century Literature. 
Grades 9-16; 1937-40; Forms A (37), B ('40); no 
data on reliability and validity; no manual; norms 
(37) ; $2.25 per 25, postpaid; specimen set not avail- 
able; 45(50) minutes; edited by Harold H. Bixler; 
Elmer R. Smith; Turner E. Smith & Co. * 


For reviews by Н. Н. Giles and Ann L. Geb- 
hardt, see 40: 1296. 
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X*Check List of Novels. High school and college; 
1942-50; 15 scores: 3 scores (novels read, movies seen, 
novels liked) for each of 4 categories (difficult novels, 
standard novels, best-sellers, light novels) and total 
group; I form, '50; no data on reliability and validity 
in manual (for data presented elsewhere, see 1-2 
low); manual ('50) ; norms (742) ; $2.50 per 25; sep- 
arate answer sheets must be used; 756 per 25 answer 
sheets; 256 per specimen set; cash orders postpaid ; 
nontimed (50) minutes; developed by the Cooperative 
Study in General Education (Ralph W. Tyler, Direc- 
tor); Cooperative Test Division, Educational Testing 
Service. * 
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About Fiction," pp. 121-71, 302-8, 309-11. In his General 
Education in the Humanities. Washington, D.C.: American 
Council on Education, 1947. Pp. xix, 323. * (PA 22:434) 
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Joun S. Diexnorr, Director, Center for the 
Study of Liberal Education for Adults, 940 East 
58th St., Chicago 37, Illinois. 

With reference to 300 novels, students are 
asked to indicate whether they have read the 
novel, seen a movie based on it, read and liked 
it, or never read it. The list includes 30 novels 
classed as “difficult,” 90 as “standard,” 90 as 
"best sellers," past or present, and 90 as “light.” 
There is not likely to be any quarrel with the 
classification. Such writers as Dostoevski, Mere- 
dith, and Proust are difficult; Austen, Scott, 
and Tarkington are standard; Hervey Allen, 
Margaret Mitchell, and Arthur Koestler are best 
sellers; Temple Bailey, Erle Stanley Gardner, 
and Zane Grey are light. 

For its purpose, which is to inventory the 
reading already accomplished by high school 
(or college) students, the Check List of Novels 
is no doubt useful. The check list was originally 
designed for use with another instrument (In- 
ventory of Satisfactions Found in Reading Fic- 
tion). It is described in the manual as “helpful 
to instructors in planning reading assignments, 
finding ‘common denominators’ for classroom 
discussion, and guiding individual reading pro- 
grams.” There is a risk (perhaps not great) that 
the check list will encourage students to think 
that to read a book—any book, Temple Bailey 
equally with Fielding—is somehow an action 
worthy of praise. 
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“Cooperative Literary Comprehension and Ap- 
preciation Test. Grades 10-16; 1935-51; IBM ; Pro- 
visional Forms R (41), Т (43) ; no specific manual ; 
descriptive sheet (’51); general Cooperative manual 
('51) ; tentative high school norms ['38] ; tentative col- 
lege norms ['38]; $2.25 per 25; 50¢ per specimen set, 
postpaid; separate answer sheets may be used; 80¢ per 
25 1BM answer sheets; 15¢ per stencil for scoring an- 
swer sheets; cash orders postpaid; 40(45) minutes ; 
Mary Willis, Hyman Eigerman (R), Frederick B. 
Davis (R), and Н. A. Domincovich (Т); Cooperative 
Test Division, Educational Testing Service. * 


REFERENCES 
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p 
. HUMBER, Watson J. “The Relationship Between Reading 

Efficiency, and Academic Success in Selected University Cur- 

гісша.” J Ed Psychol 35:17-26 Ја '44. * (PA 18:2581) 


For a review by Holland Roberts, see 3:142. 
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*English Literature: Every Pupil Test. High 
school; 1934-51; new form usually published each 
April and December; form April 1951; no data on 
reliability and validity; no manual; norms (51) ; 216€ 
per test; 1¢ per answer key; Sorted 40(45) min- 
utes; Ohio Scholarship Tests, Ohio State Department 
of Education, * 
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*Graduate Record Examinations Advanced Lit. 
erature Test. Senior year college through graduate 
school and candidates {ог graduate school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 180(220) minutes; prepared by the 
Advanced Literature Test Committee appointed by 
Educational Testing Service: Don Cameron Allen 
(Chairman), Bradford A. Booth, E. K. Brown, John 
W. Spargo, and Harry R. Warfel; Educational Test- 
ing Service. * 
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*Interpretation of Literature Test: General 
Education Series, High School ; 1936-50; a revision 
of An Interpretation of Literature: Test зг (36); 
original test published by Evaluation in Eight Year 
Study, Progressive Education Association; 7 scores: 
understanding of the story, recognizing of a persons 
point of view, understanding human motivation, weigh- 
ing evidence supporting an ending, weighing evidence 
discrediting an ending, analyzing literature from a 
technical standpoint, total; 1 form, '50; no data on 
reliability and validity; no norms; manual ('50) ; $2.50 
per 25; separate answer sheets must be used; 75f per 
25 answer sheets; 20€ per set of scoring {еп ind 
orders postpaid; 50© per specimen set, postpaid; non- 
timed (75) minutes; Evaluation Saf (Ralph W. 
Tyler, Director) of the Eight-Year Study of the Pro- 
ressive Education Association; published in 1930 by 
раа Test Division, Educational Testing Serv- 
ice. * 
Joun S. Diexuorr, Director, Center for the 
Study of Liberal Education for Adults, 940 East 
58th St., Chicago 37, Illinois. A 

This test is based upon a summary of A 
Henry's story, 4 Municipal Report. Althoug 
the summary is excellent, one wishes that the 
student might be required to read the entire 
story. Since the test time is already rather long, 
the choice was probably made between basing 
it on a summary or basing it on a story aem 
as outside reading. Nevertheless, the кас Ў 
he prefers, may assign the story in ep 
Directions for scoring and interpreting the m 
sults suggest that a comparison of the Md. 
with the whole story would be an Шш ji 
exercise, presumably after the test has 
administered. al 

The suggestion is characteristic of t 
in which the test has been construct Ы 
manual states that "in its present ip 
instrument is probably more useful as a teac ^" 
device than as a test." Its usefulness e р; 
former is apparent, and to a reviewer "€: e “i 
in techniques of testing, it seems an aded 
test also. 

r 

Each section of the test approaches the ia 
from a different point of view. The first req ен 
the student to understand what has happen 


he spirit 
ed, The 
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the story (or its summary). The second requires 
him to understand the attitude of the narrator 
and to explore his own attitudes. Searching 
moral questions are raised as a basis for future 
class discussions and as an indication of the ex- 
tent to which imaginative literature may be re- 
lated “to the problems of living." A third section 
continues this exploration of the relation of lit- 
erature to life by exploring the motives of the 
agents in the story and asking the student to 
judge them. Less directly (apparently not as 
part of the intention of the examiners) it illumi- 
nates the interplay of character and incident in 
a well constructed fable. Other sections test 
ability to weigh evidence and ability to make a 
literary analysis of the story in the conventional 
terms of setting, purpose, classification (e.g., 
moralistic story, love story, story of incident 
and plot, etc.), style, action, and author. 

If new editions of the test are contemplated, 
two sections would benefit from revision. (a) 
The narrator in A Municipal Report is a fic- 
tional character. Section II of the test, “Are 
You Learning a Person's Point of View?," 
identifies the fictional narrator with O. Henry 
and asks the student to indicate O. Henry's 
point of view toward various moral problems. 
Unsophisticated readers are too prone to attrib- 
ute the opinions of fictional characters to their 
authors. It should be made clear that O. Henry 
is the inventor of the narrator as well as of other 
agents in the story. (b) Section IV on weighing 
evidence is obscure. At any rate, the reviewer 
found close rereading necessary. There is the 
risk that it measures ability to understand a 
complex exposition rather than ability to weigh 
evidence. 

The final section of the test, made up of some- 
what leading questions, asks the student to evalu- 
ate the test itself. No assurances that "answers 
to this part of the test in no way affect your 
score on the test" will keep most students from 
indicating "correctly" the item which reads, 
“This test caused me to like the story more than 
I did upon first reading it" ; and it seems hardly 
fair to entice unwary pupils into saying, "It 
would be interesting to take more tests in litera- 
ture similar to this one." Even in this section, 
however, students may give the desired answers 
honestly as well as tactfully, for the test, attempt- 
ing both to measure the student's ability to read 
imaginative literature and to teach while it tests, 
is a good one, 


Joun M. Sratnaxer, Director of Studies, 
Association of American Medical Colleges, Chi- 
cago, Illinois. 

In the accompanying manual of instructions, 
it is pointed out that this test, developed by the 
Eight Year Study, had long been. out of print, 
but that interest in it continued. In response to 
such interest the Educational Testing Service 
reprinted the test. 

This is not a test in the usual sense but a 
teaching device. It was designed to aid the 
secondary school teacher of literature in showing 
students how fiction can be analyzed and in 
giving students some experience in relating their 
reading to their own lives and problems. The 
test should be used in connection with class 
discussion; its effectiveness depends upon the 
skill of the teacher in using it. 

As a measurement of achievement this “test” 
cannot be recommended. Indeed, it was not de- 
signed to be so used. The teacher of literature 
might well examine a specimen set of this test 
and determine for herself whether it is a suit- 
able instrument to facilitate her instruction. 
Some teachers will find it of value while others 
will wish to make an entirely different approach 
in their teaching. 
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*Inventory of Satisfactions Found in Reading 
Fiction: General Education Series. High school 
and college; 1942-50; II scores: satisfactions of relax- 
ation and pastime, escape, associational values, intimate 
personal relations, socio-civic, philosophy, miscellane- 
ous informational, total informational, technical-criti- 
cal, self-development, dislikes; Forms A ('50), B 
(50) ; manual ('50) ; norms ['44?]; $2.25 per 25; sep- 
arate answer sheets must be used; боф per 25 answer 
sheets; cash orders postpaid; 50€ per specimen set, 
postpaid; nontimed (о minutes; developed by the 
Cooperative Study in General Education, (Ralph W. 
Tyler, Director) ; Cooperative Test Division, Educa- 
tional Testing Service. * 
REFERENCES 

з, COOPERATIVE STUDY IN GENERAL EDUCATION, EXECUTIVE 
Commirtee, RALPH W. Tyrex, DIRECTOR. "Second Project: 
The Inventory and the Check List in Fiction," Pp; aiu In 
Cooperation in General, Education. Maint, (f 531439, 
cin Comel on HAROLD Baier. General Education inthe the 

itii = © , 1r, im ing a вес! 

TURO и Meares 11:3). fees eH b» “Ralph W. Tyler. 
Washington, D.C.: American Council on Education, 1947, рр. 
xix, 321. * (РА 22:434) 


HorrAwp Ковектѕ, Director, California Labor 
School, 321 Divisadero St., San Francisco, Cali- 
fornia. 

This revision of the inventory developed by 
colleges participating in the Cooperative Study 
in General Education is an ambitious effort “to 
help school and college instructors improve the 
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pattern of student reading." It is organized in 
two parts. Part I consists of 90 statements of 
satisfactions reported by readers of fiction, with 
which the student is asked to agree, disagree, or 
to mark "uncertain." Most of the statements 
are individual personal responses such as "try- 
ing to guess what various characters will do" 
and “being reminded of places I have been," but 
there are a number that penetrate beneath the 
level of superficial generality. The authors have 
listed the 9o items in 6 broad categories: re- 
laxation and pastime (10 items), escape (8), 
associational values (8), information (33), 
technical-critical (15), and self development 
(16). The 33 items on information are subdi- 
vided into 4 parts: intimate personal relations 
(9), socio-civic (10), philosophy of life and 
religion (8), and miscellaneous (6). 

As the inventory is intended to fulfill a key 
function in building and guiding literature pro- 
grams, the authors' approach to the field is im- 
portant. They have observed "that instructors 
attempt to work in terms of material only rather 
than in terms of the satisfactions which the stu- 
dent derives from reading" and that students 
frequently confine their reading of fiction within 
narrow bounds, limiting themselves to adven- 
ture stories, historical fiction, the works of a 
few favorite authors, and the like. They believe 
that through the use of the inventory it is “роз- 
sible to determine the chief satisfactions a stu- 
dent now obtains from fiction" and that with 
this knowledge the instructor can then present 
his students with a program designed to im- 
prove their reading level. They believe that, if 
a student is told that the satisfactions he finds 
in his reading can be gained in greater measure 
in better novels, “the chances are good that he 
will change his reading habits for the better." 
Then, if the instructor points out new satisfac- 
tions to be found only in the recommended books, 
and the student confirms the instructor's pre- 
dictions, "the change in reading habits should 
be permanent." To complete their program of 
reading guidance, the authors recommend the 
use of a related inventory, the Check List of 
Novels. The instructor finds out what books 
gave the student the satisfaction he reported 
and makes specific recommendations to improve 
the student's pattern of reading. 

Clearly, this inventory is planned as part of 
a serious attempt to map out a major advance 
in teaching literature—one of the oldest estab- 
lished fields of creative activity. But the pub- 
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lisher’s catalog statement makes even greater 
claims, stating that the inventory “provides in- 
formation about...the amount and kind of ma- 
terial he (the student) is likely to read" (italics 
mine), and reveals not only the kind but the 
“degree” of satisfactions students believe they 
obtain from reading fiction. Of course there is 
no objective evidence that the use of the in- 
ventory will enable teachers and students to 
do any of these things. Some of them may be 
possible, but the major question is whether even 
then the purposes for which this instrument was 
devised cannot be achieved more effectively 
through discussion, reading records, and other 
ways. Until measurement of satisfactions in 
literature is demonstrated, we have no sure 
knowledge in this important field. The authors 
have themselves recognized certain limitations 
in their work, and others will be apparent to 
experienced teachers of literature. First, it is 
assumed that the data from the inventory repre- 
sent reality and are reliable; yet teachers recog- 
nize that students are under grade pressure to 
give the kinds of responses that teachers want. 
Can it be assumed that students will respond 
spontaneously to the items in this inventory? 
Again, use of the inventory is entirely based 
upon the introspective analysis of inexperienced 
young people. They are asked to report whether 
they gain satisfaction from “feeling the beauty 
of the author’s style” or “getting my mind off my 
troubles." It is possible for them to get satisfac- 
tion in these ways without being able to recog- 
nize it, or they may connect their satisfaction 
with an unrelated cause. Can youth who are just 
finding their way in the world isolate their pres- 
ent feelings from past experiences or from those 
they anticipate or desire? Р 
A more significant weakness is the passivo 
static character of the majority of the questions. 
There is none of the enthusiasm and fire ui 
covering a great book, unless "getting ! v4 
about manners and etiquette" and "feeling t T 
I am developing my personality" can be so co? 
ceived. Not only is the use of literature a5 wr 
action lacking, but practically all sense of do! Й 
in the inventory is individual. The student m 
his book are alone in a great void. There e 
suggestion of the power of group action, W sth 
tides have made our history, nor any SURE E 
or conception that great literature unites ii 
and nations and gives them security апдар di 
in a friendly universe. Literature is treat er 
though it were so much baled hay, and the tea 
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ing of literature a mechanical process. The no- 
tion that human emotions should be deyeloped 
according to any set of formal categories may 
fit a compartmentalized psychology, but teach- 
ers of literature must work in a field whose 
growing complexities and depth range beyond 
such a concept. 

Part II is a series of 25 statements of “dis- 
likes,” beginning with “I dislike fiction," with 
which the student is asked to agree, disagree, 
or indicate uncertainty. The approach is nega- 
tive, and the emphasis in this final section of 
the inventory is unrelieved by any positive sug- 
gestion. 

Experience in using Part I of this inventory 
may disclose some positive values, particularly 
in stimulating discussion about specific books 
and thus gaining exact information from stu- 
dents. To the extent that it may encourage a 
teacher to find out more about students, it may 
have its uses. But other types of responses such 
as autobiographies of student reading, reading 
histories, interviews, and the students’ own 
statements about their reading interests and sat- 
isfactions would seem to meet the needs for 
reading guidance more exactly and less danger- 
ously—particularly in the hands of inexperi- 
enced or formal teachers. Certainly more experi- 
ence and further careful experiment by able 
teachers is necessary before the inventory can 
be recommended for general use. 


[ 189] 

*The Jones Book-A-Day Tests: For Checking 
Outside Reading of High School Pupils. High 
school; 1946-52; over 3,000 tests on books commonly 
recommended for high school pupils; 1 form for each 
test; 1-99 mixed titles, 7¢ each; 100-499, 6¢ each; 25 
or more of any one title, 5¢ each; postpaid; 3(4) min- 
utes per test; Book-A-Day Series. * r 


[190] 

*Literature Appreciation Tests. High school; 
1936-40; I3 tests; I form; no data on reliability and 
validity; no manual; tentative norms—authors recom- 
mend the use of local norms; $1.75 per 25 of any one 
test, postpaid; specimen set not available; nontimed 
(40-50) minutes per test; edited by H. H. Bixler; 
Turner E. Smith & Co. * у 
а) A MIDSUMMER NIGHT'S DREAM. 1937; Salibelle Roy- 

ster. 
b) As YOU LIKE IT. 1936; Elmer R. Smith. 
c) EVANGELINE. 1937; Evelyn L. Stovall. 
d) HAMLET, 1938; Salibelle Royster. 
е) IDYLLS OF THE KING. 1937; Luella B. Cook. 
iy IVANHOE, 1938 j Eimer T PW 
g) MACBETH. 1936; Mary J. J. Wrinn. 
h) SILAS MARNER. 1936; Luella B. Cook. 


i) THE ADVENTURES OF TOM SAWYER. 1940; Elmer R. . 


. Smith. 
j) THE COURTSHIP OF MILES STANDISH. 1937; Evelyn 


L. Stovall. 
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k) THE HOUSE OF THE SEVEN GABLES, 1936; Holland D. 
Roberts. 

1) THE MERCHANT OF VENICE. 1936; Elmer R. Smith. 

m) TREASURE ISLAND. 1936; Luella В. Cook. 


For a review by Paul B. Diederich, see 38: 
976. 


[191] 

*Literature Test: Municipal Tests: National 
Achievement Tests. Grades 3-6, 6-8; 1938-39; a 
subtest of Municipal Battery (see 20) ; 3 scores: mo 
tives and moods, miscellaneous facts, total; 2 levels; 
Forms A (738), B (739); no data on reliability and 
validity and no description of normative population in 
manual; no norms for part scores; manual ['38] ; $1.25 
per 25 of any one level; 356 per specimen set of n 
one level; postage extra; 20(25) minutes; Robert K. 
Speer and Samuel Smith; Acorn Publishing Co. * 


[192] 

*Literature Test: National Achievement Tests. 
Grades 7-12; 1937-44; 5 scores: recognizing effects, 
recognizing qualities, analyzing moods, miscellaneous 
facts, total; Forms A (’44—same as test copyrighted 
in 1937), В (39) ; no norms for part scores; Form 
manual (38), Form B manual ('39); series manual 
['44]; $2.25 per 25 ESA per specimen set; postage 
extra; nontimed (4 ) minutes; Robert K. Speer 
and Samuel Smith; Acorn Publishing Co. * 


For reviews by H. H. Giles and Robert C. 
Pooley, see 40:1304. 


[193 ] 
X*Objective Tests in English [Perfection Form 
Co.]. High school ; 1929-50; 41 tests; no data on relia- 
bility and validity; no manual; no norms; 6¢ per test; 
20@ per specimen set; cash orders postpaid; (20-40) 
minutes per test; Alpha Hobbs Darlington (b, g, r, t, i 
cc, ee, ii), Sarah E. Dorn (jj, mm, nn), Nellie M. Falk 
(h, k, v, x, bb, 11, oo), Margaret Leeney (c, d, w, g Dy 
Hannah Van Nostrand (p), Garland Miller Taylor 
(a, f, i, j, 1, n, о, s, u, 2, aa, dd, ff, hh, kk), Maye Alex- 
ander Wilson (m), and LaDuskie Wood (e, q) ; Per- 
fection Form Co. * 
a) AS YOU LIKE IT. 1947; I form. 
b) DAVID COPPERFIELD. 1931—48; І form, 48. 
c) ENOCH ARDEN. 1047-49; 1 form, '47. 
d) EVANGELINE, 1929; I form. 
e) HAMLET, 1931-48; 1 form, '48. 
f) HENRY у. 1931; 1 form. 
g) THE HOUSE OF THE SEVEN GABLES. 1947; I form. 
А) HUCKLEBERRY FINN. 1950; 1 form. ; 
i) IDYLLS OF THE KING. 1020-47; 1 form, '47. 
j) IVANHOE. 1948; 1 form. 
k) JOHNNY TREMAIN. 1950; I form. у 
1) уоілиѕ CAESAR. 1948-49; 1 form, '48. 
m) KENILWORTH. 1932; I form. 
n) KIDNAPPED. 1948; 1 form, 
д) KING LEAR. 1929; 1 form. 
P) THE LADY OF THE LAKE. 1929- 
q) MACBETH. 1947-48; 1 form, '48. 
ғ) THE MAN WITHOUT A COUNTRY. 1932; I form. 
5) THE MERCHANT OF VENICE. 1929-49; I form, '49. 
{) A MIDSUMMER NIGHT'S DREAM. 1931-48; 1 form, '48. 
и) MOBY DICK. 1949; I form. 
v) OLIVER TWIST. 1950; 1 form. 
w) OTHELLO, 1929; I form. 
х) THE RETURN OF THE NATIVE. 1950; І form. 
» THE RIME OF THE ANCIENT MARINER. I947; I form. 
т) ROMEO AND JULIET. 1929-49; I form, '49. 
аа) THE SCARLET LETTER. 1932; І form. 


; 1 form, '48. 


M 
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bb) SHE STOOPS TO CONQUER. 1949; 1 form. 

сс) SILAS MARNER. 1948; І form. 

dd) SIR ROGER DE COVERLEY PAPERS. 1931; 1 form. 
се) THE SKETCH BOOK. 1937; I form. 

ff) sNow-pouND. 1931-48; 1 form, '48. 

99) SOHRAB AND RUSTUM. 1929; I form. 

hh) THE SPY. 1931-47; І form, ’31. 

ii) A TALE OF TWO CITIES. 1930-48; 1 form, '48. 
jj) THE TEMPEST, 1948; 1 form. 

kk) TREASURE ISLAND. 1947 749; 1 form, '47. 
П) TWELFTH NIGHT, 1950; І fo 

mm) THE VICAR OF WAKEFIELD. 1947; 1 form. 
nn) THE VISION OF SIR LAUNFAL. 1948; 1 form, 
00) THE YEARLING. 1950; I form. 


[194] 

*Objective Tests in English [Turner E. Smith 

and Co.]. High school; 1926-41; 47 tests; no descrip- 

tion of normative population; manual ('30) ; $1.75 per 

35 of any one test, postpaid; specimen set not avail- 

able; nontimed (30-40) minutes per test; edited by 

George A. Rice; Mable S, Satterfield and others; Tur- 

ner Ё. Smith & Со. * 

а) A MIDSUMMER NIGHT'S DREAM. 1927-30; I form, '29. 

b) AS YOU LIKE IT. 1926-30; 1 form, '28. 

с) ^ TALE OF TWO CITIES, 1927-30; 1 form, '28. 

d) BURKE'S SPEECH ON CONCILIATION WITH AMERICA. 
1926-30; 1 form, '28. 

€) THE CANTERBURY TALES. 1928-30; 1 form, '29. 

f) CARLYLE'S ESSAY ON BURNS AND BURNS' POETRY. 
1926-30; 1 form, '29. 

9) DAVID COPPERFIELD. 1928-30; І form, ’28. 

h) DR. JEKYLL AND MR. HYDE. 1926-31; I form, '31. 

i) ENOCH ARDEN. 1929-30; І form, '29. 

j) FRANKLIN’S AUTOBIOGRAPHY. 1930; I form. 

k) HAMLET. 1926-30; І form, '28. 

1) HENRY ESMOND. 1930-34; I form, '34. 

m) IDYLLS OF THE KING. 1927-30; 1 form, '30. 

п) IVANHOE. 1926-30; 1 form, '28. 

0) JULIUS CAESAR. 1926-30; I form, '28. 

ф) KING HENRY THE FIFTH. 1928; 1 form. 

4) KING LEAR. 1930; out of print. 

r) LORNA DOONE. 1929-30; I form, '29. 

5) MACAULAY'S ESSAY ON THE LIFE OF SAMUEL JOHN- 
SON. 1927-30; 1 form, '29. 

!) MACBETH, 1926-30; 1 form, '28. 

и) MILTON'S MINOR POEMS AND OTHER LYRICS. 1927- 
30; 1 form, '29. 

v) OLIVER TWIST. 1930-34; out of print. 

w) POE'S POEMS AND TALES. 1927-30; I form, '29. 

х) QUENTIN DURWARD. 1927-30; out of print, 

y) ROMEO AND JULIET. 1930-34; I form, '34. 

2) SHE STOOPS TO CONQUER. 1929-30; I form, '29. 

аа) SIR ROGER DE COVERLEY PAPERS. 1927-30; I form, "27. 

bb) SILAS MARNER. 1926-40; І form, '40. , 

CC) SOHRAB AND RUSTUM. 1929-30; 1 form, '29. 

dd) THE DESERTED VILLAGE. 1930; I form, 

ee) THE HOUSE OF SEVEN GABLES. 1927-31; І form, ’31. 

ff) THE LADY OF THE LAKE. 1926-41; І form, '41. 

20 THE LAST OF THE MOHICANS. 1927-30; 1 form, '29. 

hh) THE MERCHANT OF VENICE. 1927-30; I form, '30. 

ii) THE ODYSSEY. 1928-30; І form, '29. 

jj) THE OREGON TRAIL. 1930-34; 1 form, '34. 

kk) THE RIME OF THE ANCIENT MARINER. 1020-30; I 


form, '29. 
Il) THE RISE OF SILAS LAPHAM. 1929; 1 form. 
mm) THE SKETCH BOOK. 1929-30; І form, "29. 
nn) THE SPY. 1928-30; І form, '28. 
00) THE TEMPEST. 1928-30; 1 form, '28. 
йр) THE VICAR OF WAKEFIELD. 1930; I form. 
qq) THE VISION OF SIR LAUNFAL. 1930; I form. 


` Fr) TRAVELS WITH A DONKEY. 1930; I form. 


$$) TREASURE ISLAND. 1927-30; I form, '28. 
it) TWELFTH NIGHT. 1927-30; 1 form, '29. 
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иц) WEBSTER'S FIRST BUNKER HILL ORATION AND 

WASHINGTON’S FAREWELL ADDRESS. 1929-30; 1 
form, '29. 


[195 ] 
*Stanford Achievement e]. 
Grades 4-9; 1941-43, c1940-41; an adaptation of the 
literature tests of Stanford Achievement Test (see 
25); IBM; Forms DM ('41), EM ('43) ; the Manual 
for Interpreting referred to in the directions for ad- 
ministering has not been published; directions for ad- 
ministering ('41) ; $1.65 per 25; separate answer sheets 
must be used; 70€ per 25 IBM answer sheets; 20¢ 
scoring stencil for any one form; 55€ per 25 profile 
charts. (42) ; postage extra; 35 per specimen 
p" d; 15(20) minutes; Truman L. Kelley, Giles 
. Ruch, and Lewis M. Terman; World Book Co, * 
Winirrep L, Post, Instructor in English, Dana 
Hall, Wellesley, Massachusetts. 

Each form contains 70 three-choice items, 
with a total working time of 15 minutes, The 
term literature includes material ranging from 
Little Red Riding Hood and Rumpelstiltskin to 
Moby Dick or The Merchant of Venice. The 
test achieves a wise balance between books of 
special interest to boys or to girls, with a ju- 
dicious number of items drawn from classic or 
Norse mythology. A range of difficulty in the 
test items is secured not by the type of informa- 
tion demanded but rather by wide variations 
in the likelihood of a child’s exposure toa 
given book. Each item asks for a single piece 
of factual information, with the emphasis pre- 
dominantly on ability to spot the correct name 
of a character for a given book title or to attach 
the right book title to fit a briefly described 
character. For example: “The girl who lived 
on Sunnybrook Farm was (1) Roselle (2) 
Betsy (3) Rebecca.” 

The strong features of this test are: (a) the 
brevity and ease of administration; (b) the 
adaptability to objective test techniques of the 
material; (c) the rapidity and precision with 
which the resulting scores would yield informa- 
tion on the reading backgrounds of numbers 0 
students too large for more individual and ome 
subtly discriminating treatment. And this leads 
me to the limitations of the tests—limitatione 
closely linked with or growing inevitably Wn 
of the strengths I have enumerated. The mon 
significant limitation is the purely factual m 
of question—a type of question which preda p 
the possibility of getting at the quality 0 
child's reading or understanding. Unless L^ 
with caution, this kind of test can further 
quiz-mindedness which is already tending i- 
wipe out the distinction between ап. undis 
natingly retentive memory and an inquiring 


Test [Literatur 


| 
| 
| 


323 ] 


TESTS & REVIEWS: ENGLISH—SPELLING 


198 


synthesizing intellect. From the standpoint of 
sound English teaching—a place where most of 
all this distinction needs to be kept alive—I 
should recommend these tests chiefly as time- 
savers. If used with a clear realization of their 
limitations and with critical faculties alert for 
the deceptive immediacy, precision, and com- 
pleteness of information yielded with the ap- 
parent omniscience of an objective test score, 
these tests would seem to me serviceable, prac- 
tical, and informative, especially when supple- 
mented with tests of vocabulary, language skills, 
and comprehension. 


For reviews by Paul К. Hanna and Claude 
E. Norcross and Vergil E. Herrick of the com- 
plete battery, see 25; for reviews by Walter W. 
Cook and Ralph C. Preston of the complete 
battery, see 3:18. 


[196 ] 

Survey Test in English Literature. High school 
and college; 1940; 1 form; no data on reliability and 
validity; no manual; no norms for part scores; no de- 
scription of normative population; $2.25 per 25, post- 
paid; specimen set not available; 50(55) minutes; 
edited by Harold H. Bixler; Elmer R. Smith; Turner 
E. Smith & Co. * 

Јонх S. Drexnorr, Director, Center for the 
Study of Liberal Education for Adults, 940 East 
58th St., Chicago 37, Illinois. 

If one wishes or is required to teach what this 
test tests, perhaps it will measure the students’ 
rote learning adequately. There is a prior ques- 
tion, for it is one of the values and one of the 
dangers of externally prepared tests that they 
direct the teaching of those who use them. The 
teacher who selects this test is confirmed in a 
misunderstanding of the purpose of secondary 
school English courses; the teacher upon whom 
it is imposed is required to seek the wrong out- 
comes. For this test has little relevance to what 
the pupil may have read and no relevance to 
his understanding of what he has read. It is a 
test of his retention of information he might 
have gathered from an outline syllabus of Eng- 
lish literature. : 

"The pupil need have read nothing of Richard- 
son, Fielding, Smollett, or Sterne to identify the 
author of Tom Jones. He need not have read 
L'Allegro to know that it was not by Moore, 
Landor, or Blake; indeed, he need never have 
heard of Moore, Landor, and Blake. He need 
not know what a heroic couplet is to know that 
Pope was skilled in its use. He need have read 
nothing but a textbook to know or to guess that 
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Ralph Roister Doister is an early comedy by 
Nicholas Udall, to have the misinformation that 
"in the 17th century came the first critical lit- 
erary essays from the pen of John Dryden,” or 
to know that “the Gothic novel" is "progenitor 
of the modern mystery tale." Since the word 
"progenitor" is not likely to be in his vocabu- 
lary, he must guess the answer to this question. 

These illustrations suffice to show that there 
is no pretence that the pupil is being examined 
on what he has read. Surely no high school lit- 
erature courses require pupils to read Pamela, 
Tom Jones, L’Allegro, Ralph Roister Doister, 
The Rape of the Lock, The Preface to the Fables, 
and The Castle of Otranto—the handful of ti- 
tles with which we have illustrated the range of 
the test. If any introductory college courses do, 
they ought not to. The test can do nothing but 
harm. 


[197 ] 

*Ullman-Clark Test on Classical References 
and Allusions. Grades 8-12; 1938; 1 form; no data 
on reliability and validity; no description of normative 
population; mimeographed manual; per 100, post- 
age extra; 156 per specimen set, postpaid ; 20(25) min- 
utes; L. Ullman and Grace W. Clark; Bureau of 
ашан Research and Service, State University of 
owa. 
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xAyer Standardized Spelling Test. Grades 9-12; 
1950; Forms 1, 2; no data on reliability and validity ; 
$1 per 40; rof per specimen set; postpaid; nontimed 
(30) minutes; Fred C. Ayer; Steck Co. * 
REFERENCES 

1. Aver, Евер С. “An Evaluation of High-School Spelling.” 
Sch R $9:233-6 Ар '51. * 

Навор Н. Bixrem, Professor of Education, 
Western Carolina Teachers College, Cullowhee, 
North Carolina. 

DESCRIPTION. This test is designed for use in 
high schools, norms being given for grades 9 
to 12. Each pupil is to be given a test sheet, on 
which is printed 30 sentences, Pupils are to 
write the words dictated by the examiner in the 
blanks found in the sentences. The brevity of 
the test makes it possible to administer it in less 
than a class period. 

vaLiDITY. No data on validity are reported in 
the manual, but inspection shows that the words 
chosen are common words such as are found in 
existing spelling scales. The words are not ar- 
ranged in order of difficulty. 

mELIABILITY. No data are reported on the 
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reliability of this test. It is regrettable that the 
author did not include at least 50 words, as 
found in other standardized tests such as the 
Metropolitan Achievement Tests. The inclusion 
of 20 additional words would doubtless have 
improved the reliability of this instrument. 

USABILITY. Norms are based on tests of 35,- 
ooo high school students in 84 different cities, 
scattered throughout the United States. Pre- 
sumably, they are also applicable to rural schools. 
Norms are reported in terms of averages for 
grades 9, 1o, 11, and 12, respectively. In addi- 
tion, the scores of the highest and the lowest 
schools are reported. A helpful feature is a table 
showing the percentage of pupils in grades 9, 10, 
II, and 12 spelling each word correctly. 

SUMMARY. This is an excellent short test, use- 
ful in situations where a quick spelling survey is 
desired. Teachers should compute the percent- 
ages of pupils spelling each word on the test, 
since norms in terms of per cents are provided. 
Users may find it interesting to compare results 
obtained on this test, where context is provided 
on the test sheet, with results obtained in column 
spelling without context. 


[199] 

Davis-Schrammel Spelling Test. Grades 1-9; 1935- 
ELS ДС 
p per class) то materials е for examiocers 
minutes; Vera Davis and HL E, Schram Busts 
of Educational Measurements, Kansas State Teachers 
College of Emporia, * 

ANTON Тномрѕом, Supervisor of Research, 
Long Beach Public Schools, Long Beach, Cali- 
fornia. 

Although the authors have not stated the 
purposes for which this test was designed, the 
reviewer assumes that it is intended to serve 
as a measure of general spelling ability for pu- 
pils in grades 1-9. The tests for each grade con- 
tain 20 words. Four "equivalent" forms of the 
test are available. Because the words used in 
testing pupils of one grade overlap with those 
used in the next higher grade, each form in- 
cludes a total of only 96 words for use in test- 
ing nine separate grades. The testing procedure 
is a common one: the teacher reads a sentence, 
states which word in the sentence is to be writ- 
ten, and the pupils then spell the word on locally 
prepared blanks. It is an inexpensive test to 
administer and one which requires very little 
class time. 

As a measure of the pupil's ability to spell 
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the more commonly written words, this test has 
doubtful validity. The words were selected from 
the Buckingham Extension of the Ayres Spell- 
ing Scale and they have apparently not been 
checked against the more accepted frequency- 
counts now available. The reviewer's analysis 
of the words in one form of the eighth grade test 
shows that 9 of the 20 words are not included 
among the first 5,000 of the Horn list—in fact, 
5 of these 9 words are not to be found іп Нот? 
complete list of the 10,000 words most commonly 
written by adults. A second check of this same 
eighth grade test shows that 11 of the 20 words 
are not included among the 5,000 words which 
Rinsland's research shows are most commonly 
written by eighth grade pupils. Typical of 
words whose inclusion is questionable on the 
basis of frequency of child and adult usage in 
writing is the word "financier." 

It is well known that spelling textbooks differ 
greatly in the words assigned to a given grade. 
Consequently, the validity of this test asa meas- 
ure of progress in spelling achievement during 
a specified period must be determined by à 
school system through a check of the test words 
against those included in the local course of 
study. 

On the basis of giving Forms A and B to 1,506 
pupils in three Kansas school systems, the au- 
thors report that the reliability coefficients for 
the nine grades tested ranged from .84 to 93, 
with an average of .87. The average probable 
error of measurement reported is 1.1 words. Al- 
though the reported reliability meets minimal 
requirements, a longer test would not inconven- 
ience the teacher and would yield a more reliable 
measure for the individual pupil. 

Midyear and end-of-year percentile norms 
supplied for each grade. According to the tes 
catalog, these are based upon “more than 49990 
pupils in a large number of representativ 
schools.” The manual states that these scores 
were obtained in 1935 and 1936 from the 25 
tion-wide Every Pupil Scholarship Frog 
which are not further described. A test ti 
chaser working in, say, a New Jersey city 5 Ron 
system would be interested in knowing ii 
many of the 49,000 pupils in the norm group à 
tended public schools in the urban cente 
New Jersey and how their performance CO 
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pared with that of the total norm group. The 
*more than 49,000" cases include all pupils in 
any grade from one through nine tested at mid- 
year or at the year's end. The test purchaser 
who plans to give the test at midyear to a ninth 
grade class should note that the norms he will 
be using are based on only 342 of the pupils in 
the total norm group. 

According to the authors' manual, the per- 
centile equivalent of a pupil's score means the 
percentage of the pupils in the norm group 
whose performance has been exceeded. This ex- 
planation may be confusing to the teacher who 
gives the test at midyear to first graders and 
concludes from the norms table that a child who 
spells none of the words correctly has exceeded 
the performance of 25 per cent of the norm 
population ! f 

The manual provides the teacher with no 
answers to many practical questions which arise 
after a spelling survey. How shall the scorer 
mark the spelling of a word if the pupil arranges 
the letters in correct sequence but incorrectly 
capitalizes it? What procedure is advised when 
the average percentile equivalent for a sixth 
grade class tested at midyear is only 15? 

This reviewer can see no important educa- 
tional purpose served by giving a standardized 
spelling test to pupils in grade one. For testing 
the spelling ability or spelling achievement of 
pupils above the first grade, this test cannot be 
strongly recommended. It is a short test, includ- 
ing a number of words infrequently written by 
children or adults, which has been normed on a 
fairly large population that has not been ade- 
quately defined. 


For reviews by Walter W. Cook and Joseph 
C. Dewey, see 40:1311. 
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Gates-Russell Spelling Diagnosis Test. Grades 
2-6; 1937; 9 scores: spelling words orally, word pro- 
nunciation, giving letters for letter sounds, spelling one 
syllable, spelling two syllables, word е spelling 
attack, auditory discrimination, visual-auditory-kinaes- 
thetic and combined study methods ; individual; 1 form; 
no description of normative population; $5.50 per 100; 
75% per manual; 75€ per specimen set; postpaid; Ar- 
thur I. Gates and David H. Russell; Bureau of Publi- 
cations, Teachers College, Columbia University. * 
REFERENCES 

i. Russert, Davip H. Characteristics of Good and Poor 
Sbellers. Columbia University, Teachers College, Contribu- 
tions to Education, No. 727. New York: Bureau of Publica- 
tions, the College, 1937. Pp. 103. (РА 12:3186) 


GroncE $рАСНЕ, Head, Reading Laboratory 
and Clinic, University of Florida, Gainesville, 
Florida. 
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This is a battery of nine tests offered for use in 
cases of spelling disability. Considered as a 
whole, the battery will contribute some under- 
standing of the role of certain factors in this 
type of academic disability. It will, for example, 
give some ideas concerning the student’s meth- 
ods of spelling, of word analysis and blending, 
and of studying spelling. It will enable some ob- 
servation of his error tendencies, his knowledge 
of letter sounds, his auditory discrimination, the 
extent of reversals in his reading, and his success 
with various methods of learning new words. 
Such knowledge will be helpful to the teacher in 
understanding some of the reasons for the stu- 
dent’s difficulties with spelling. 

However, neither the research (1) which re- 
sulted in the publication of these tests in 1937 
nor the research evidence available from other 
studies at that time or later justifies the use of 
some of the individual tests of the battery. The 
tests were chosen by a priori reasoning and re- 
tained in the published form without any critical 
examination of their validity or reliability. These 
criticisms may be justified in an examination of 
the separate tests of the battery. 

Three tests are offered for observing a stu- 
dent’s methods of spelling and his error tenden- 
cies: (a) spelling words orally (a spelling scale 
of 25 words), (b) spelling one syllable (10 non- 
sense monosyllables), and (с) spelling two syl- 
lables (8 nonsense 2-syllable words). Aside 
from the fact that the three are offered for the 
same purposes, it is very doubtful that they con- 
stitute an adequate sample of spelling perform- 
апсе. Space does not permit the citation of the 
extensive research evidence which is relevant 
here; this has been summarized elsewhere.* 
There is ample evidence that a minimum of 100 
misspelled words is necessary for a reliable sam- 
ple of error tendencies. Furthermore, the system 
of classifying spelling errors suggested in the 
battery is superficial and misleading in that it 
uses only gross categories of errors and fails to 
reveal the finer differences between the error 
tendencies of good and poor spellers. 

A fourth test on giving letters for letter sounds 
attempts to measure the student’s ability to give 
the name of the letter represented by a single let- 
ter sound. In effect, the purpose of this is identi- 
cal with that of the preceding tests. Certainly, 
ET UT 2 IP 
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separate tests of the spelling of single sounds, 
two- or three-letter sounds, and five- or six-letter 
sounds as well as common words seem unneces- 
sary duplication and may lead to rather artificial 
discriminations. Knowledge of how the student 
attempts to spell sounds can be easily obtained 
from an examination of his errors in an adequate 
size sample of misspelled common words of mod- 
erate difficulty, without this circuitous approach. 

The authors felt that the inclusion of a test 
of word reversals is justified because of the 
marked frequency of this kind of reading error 
among poor readers. Russell (r) shows the 
same tendency among poor spellers. However, 
these facts are no proof of a causal role in disa- 
bility. The phenomenon of reversals is universal 
among learners of all ages in the initial stages of 
their study of new materials. It is specific to the 
particular material being learned and not trans- 
ferred to other activities. It spontaneously di- 
minishes and disappears as orientation to the 
new forms increases. These facts imply that re- 
versals are errors due to immaturity of orienta- 
tion, not a cause of disability. Thus the test as 
used here has little or no relationship to reversals 
in spelling. Russell’s own data (т) and that of 
others confirm the lack of relationship between 
reversals in reading and spelling. 

The four remaining tests although some are 
of doubtful adequacy and reliability, are justified 
by the studies of poor spellers. A test of word 
pronunciation for evaluating sight word vocabu- 
lary, word analysis, and blending skills should 
aid in revealing the difficulties of poor spellers 
in these areas. A brief test of auditory discrimi- 
nation should help to detect the extent of audi- 
tory inaccuracy in poor spellers. Two tests called 
“Learning to Spell Hard Words” and “Visual, 
Auditory, Kinaesthetic and Combined Study 
Methods” should give some indication of the 
pupil’s own methods and his success with dif- 
ferent learning procedures, even though the 
tests are very brief and based on the learning of 
words of unequal difficulty. 

This battery of tests may be useful as the ini- 
tial step in the observation of the difficulties of 
a poor speller. Some of the tests will reveal cer- 
tain tendencies and skills, others will need to 
be supplemented by more adequate testing, as 
we have indicated. Several probably should not 
be used as they yield no additional information 
or ‚because they bear little relationship to the 
pattern of difficulties present in spelling dis- 
ability. 5 
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For reviews by John C. Almack and Thomas 
G. Foran, see 38:1159. 
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*Graded Word Spelling Test. Ages -I5; 1950; 
Forms A, В; no data on reliability and validi d cs 
12; 4d. per single сору; 18s. 6d. per copy of Diagnostic 
and Attainment Testing ('5so—see 1 below) which 
serves as the manual and includes full reproductions of 
both forms; postage extra; nontimed (1 40) min- 
utes; Fred J. Schonell; Oliver & Boyd Lik 
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1, ScmowxLL, Faro J., Амр ScmowxLL, F, ELEANOR, ^ 
esting: Including a Manual Tur 


nostic and Attainment h 
ir Nature, Use, Recording and Interpretation, Edinburgh, 


Scotland: Oliver & Boyd Ltd., 1950. Pp. viii, 168. 
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*Lincoln Diagnostic Spelling Test. Grades 8-12; 
1941-51; II scores: erroneous pronunciation, #-@ 
rule, y to i rule, final e before suffix, double consonant 
before suffix, demons, English prefixes and suffixes, 
endings and bases derived from Latin, homonyms and 
words frequently confused, possessives-contractions- 
solids-hyphens, total; Forms 1 ['41], 2 ['42], 3 [43], 
4 [44] ; manual ['51]; norms: Forms 1 ('48-50),2 
e 3 (040-51), 4 (49751) i к pen ден Mis 
ra; 25¢ per specimen set, postpaid; 
кеу Pp A. L. Lincoln; Educational Records Bu- 
reau, * 


REFERENCES ) 

1. TOWNSEND, Аслтил. “A Study of (ће Lincoln DA 
Spelling Test," pp. 49-53. (PA 38:31 3 title ont He fas 
Supplementary Studies. Educational Records Bulletin, No. $ 
New York: Educational Records Bureau, June 1943. Pp. xh 
53. Paper, lithotyped. * 
WALTER SCRIBNER GUILER, Professor of Edw- 
cation, and Director of Remedial Education, 
Miami University, Oxford, Ohio. 

The chief purpose of this 100-word spelling 
test is “to attack three elements of diagnosis- 
pronunciation, enunciation, and the use of pu 
Although the term “diagnostic” appears in te 
title, it seems to the reviewer that the test 15 
analytical in nature, for it aims to identify phases 
of spelling which cause difficulty. ў ai 

The content of the test was derived сше 
from two sources: (a) Lester’s list of Evi 
spellings gathered from College Entrance E 
amination Board English examination pape 
and (5) Simmons and Bixler’s The New Sta 
ard High School Spelling Scale. The Ж 
states that some words were derived from J от } 
“Опе Hundred Spelling Demons" and that ME. 
ers were "added in order to provide а fes à 
fairly advanced difficulty.” No mention 15 p 
of the source of the added words. Меш 
the manual provide any data showing the E 
of difficulty and familiarity of the added Vo 
or of the words derived from the English d 
nation papers. In the absence of such oem s 
reviewer checked Form 1 of the test with LM 
A Basic Writing Vocabulary and found tha 


327 ] 


TESTS & REVIEWS: ENGLISH—SPELLING 


202 


of the 100 words are beyond the 5,000 and that 
тт are beyond the 10,000 words in most common 
use. These findings lead one to question the cur- 
ricular validity of the test. 

No norms based on pupil performance in pub- 
lic schools are available; however, percentile 
rank norms based on the records of private 
school pupils have been established both for 
part and total scores. No mention is made of 
the geographical distribution of the schools that 
were used in establishing the norms. 

In spite of its limitations, the test possesses 
many good features. It represents a praiseworthy 
effort to discover some of the causes of spelling 
deficiency. Then, too, the manual provides timely 
suggestions for a remedial program based on 
the test results. 


GrorcE $рРАСНЕ, Head of the Reading Labora- 
tory and. Clinic, University of Florida, Gaines- 
ville, Florida. 

This test represents an attempt to evaluate 
the spelling performances of secondary school 
pupils. The 100 test words are divided into sub- 
groups according to the spelling rules or con- 
ventions governing their spelling. A few in which 
mispronunciation, confusion of homonyms, or 
word elements such as prefixes appear signifi- 
cant are included. This use of rules is a con- 
venient system for classifying words for testing 
and, perhaps, for teaching procedures. It is 
doubtful, however, that the method really sam- 
ples error tendencies or reveals probable causes 
of misspelling. It ignores the real bases of spelling 
disability, which lie in study methods, auditory 
discrimination, phonic skills, handwriting, and 
other common factors. 

Selection of the test words was made largely 
on the basis of frequency of misspelling. Little 
attention was paid to the studies of actual errors 
within the test words. This results in the use of 
test words in which the commonest and most 
frequent misspellings bear no relationship to 
their placement in subgroups in the test. For 
example, in Form 1, Item 7, awful, is cate- 
gorized as an example of the spelling of the 
suffix, ful. Research studies of actual misspell- 
ings of this word show that about 88 per cent 
of the errors occur on the w. Thus this item is 
very unlikely to function in the manner claimed 
for it. 

Since the method of selection of test items may 
often result in misspellings of other types than 
those expected by the author, the teacher using 


the test should certainly be given instructions 
in classifying the actual errors in some mean- 
ingful and diagnostic system. Or, at least, the 
items should be chosen so that they will operate 
within the classification system suggested in 
the test. 

This faulty method of item selection is prob- 
ably responsible for the relatively poor results 
found in the statistical analyses cited by the au- 
thor. The reliability of the entire test is appar- 
ently satisfactory for discrimination of the gen- 
eral spelling levels of individuals. However, the 
low reliabilities of the subscores indicates the 
tests are inadequate for the discrimination of 
types of errors or for comparisons among indi- 
vidual pupils. On some of the subscores, class 
averages might conceivably be used for com- 
parative purposes. Despite the probable lack of 
discriminatory value in the subscores, the au- 
thor offers norms for these as well as for the total 
scores on all four forms of the test. These norms 
are based on use of the test in private schools 
in grades 8-12. The value of these norms 
for public school use remains to be demon- 
strated. 

The tests are administered by the modified 
list method in which each test item is inserted 
in an illustrative sentence. These sentences are 
printed on the test blank in order to standardize 
the administrative procedure. Research studies 
indicate that this is the most valid type of 
spelling test for assessing general performance. 

Itis apparent that these tests will serve certain 
useful purposes in evaluating spelling abilities. 
They will indicate with a high degree of accuracy 
the grade level of spelling ability of individuals. 
They may be used to determine class tendencies 
to observe certain spelling conventions. These 
class performances might then be used in plan- 
ning group instruction in these same conven- 
tions. The tests will reveal what words from the 
test list the pupils misspell, and the correct spell- 
ing of these might then be taught to the group 
if the words are deemed essential for the spelling 
curriculum. 

At the same time, the tests will not reveal with 
any great degree of accuracy why the pupils 
misspell, or the error tendencies of individuals. 
They will not indicate the knowledge or use of 
spelling rules by individuals or permit compari- 
sons among the individual pupils in types of 
errors. Their greatest service to the teacher lies 
in their indications of group needs and of possible 
emphases in the spelling curriculum. 
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*Lincoln Intermediate Spelling Test. Grades 5-8; 
1947-51; II scores: erroneous pronunciation-number 
of syllables, ie-ei rule, y to i rule, final e before suffix, 
double consonant before suffix, demons, English pre- 
fixes and suffixes, erroneous pronunciation-phonic qual- 
ity of syllables, homonyms and words frequently con- 
, possessives-contractions-solids-hyphens, total; 
Forms А ['47], B ['48], C [48], D ['49] ; no norms for 
E scores of Forms B, C, D; manual s ; norms; 
orms A ('51), В ('50), C ('50), D ('51) ; 6€ per test, 
postage extra; 25€ per specimen set, postpaid; non- 
timed (30) minutes; A. L. Lincoln; Educational Rec- 

ords Bureau. * 


REFERENCES 


WALTER SCRIBNER GUILER, Professor of Edu- 
cation, and Director of Remedial Education, 
Miami University, Oxford, Ohio. 

This 100-word spelling test is similar in struc- 
ture and in purpose to the Lincoln Diagnostic 
Spelling Test. It is designed to identify phases 
of spelling in which pupils encounter difficulty ; 
it emphasizes pronunciation and the use of rules. 
The words are arranged in approximate order of 
difficulty, a feature which should enhance the 
validity of the test. The method employed in 
administering the test is the word-used-in- 
sentence list dictation. 

The manual states that the principal sources 
of the selection of the words were Betts’s Spell- 
ing Vocabulary Study, Gates’s A List of Spell- 
ing Difficulties, and the New York State Spelling 
List; however, no mention is made of the other 
sources that were used. The curricular validity 
of the test is open to question because of the 
sources from which the words were selected. 
When there is available such a word list as Rins- 
land's А Basic Vocabulary of Elementary School 
Children, based on a wide sampling of children's 
own writings, one wonders why the author 
should select words for an elementary school 
spelling test from such sources as Betts's compi- 
lation of words from 17 spelling books and from 
the New York State list, the sources for which 
are unknown. 

No data are presented showing the degree of 
familiarity of the test words. On checking Form 
I of the test with Horn's A Basic Writing Vo- 
cabulary, the reviewer found that 14 of the 100 
words are beyond the 5,000 and that 3 words 
are beyond the 10,000 most commonly used 
words. It would be difficult to justify the in- 
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clusion of words above the 3rd thousand and 
certainly words above the 4th thousand in a 
test designed for use in the upper grades of the 
elementary school. 

There is a serious question as to whether a 
test designed for use in the upper level of the 
elementary school should cover four school 
grades. Children in these grades differ widely in 
maturity. For this reason, a test should be de- 
signed for use in one or, at most, two school 
grades, Norms are available for private school 
pupils; however, no norms exist for public 
school pupils. Since the test was designed as an 
aid to teaching rather than as a means of select- 
ing best spellers, the lack of norms of the latter 
type is not a serious limitation. 

Although certain shortcomings in the test 
have been cited, the reviewer feels that the au- 
thor has produced a very useful measuring in- 
strument and that our profession is greatly in- 
debted to him for constructing a type of test 
designed to reveal some of the major causes of 
spelling deficiency. 


GroncE $РАСНЕ, Head of the Reading Laboro- 
tory and Clinic, University of Florida, Gaines- 
ville, Florida. à 
This test is an effort to evaluate the spelling 
performances of elementary school pupils. It 
consists of тоо test words commonly taught it 
the intermediate grades. These test words are 
divided into ten subgroups according to the 
spelling rules or conventions governing n 
spelling. Each of four subgroups presumably 
measures knowledge of a particular spelling 
rule, while the remaining subgroups give oppor" 
tunity for the demonstration of such confusion 
as mispronunciation, spelling demons, Би 
nyms, etc. This technique of grouping т 
according to the spelling rules they exemplily, 
or according to some of the elements that m 
contribute to misspelling is a very conveniet, 
device for classifying words for testing x 
perhaps, for teaching procedures. It is dou d 
however, that the method really samples €" id 
tendencies or reveals probable causes of d dy 
spelling unless there is considerable further $ dl 
of individual pupils. Furthermore, such be F^ 
proach, while highly logical, would fail nu 
veal some of the very common bases of pi^ 
disability. There is considerable епа 0 
show that the more important causes of Ti К 
disability lie in such areas as study cn rit- 
auditory discrimination, phonic skills, han 
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ing, etc. Unfortunately, these primary causes of 
difficulty are not revealed by such a formal ap- 
proach to the study of misspellings. 

Another faulty assumption is present in the 
nature of the selection of the test words, These 
words were chosen largely on the basis of fre- 
quency of misspelling. Little or no attention was 
paid to the many research studies of the actual 
errors made in spelling the test words. This 
results in the use of some test words in which 
the commonest and most frequent misspellings 
bear no relationship to their placement in the 
various subgroups in the test. For example, in 
Form A of the Intermediate test, Item 87, an- 
nually, is categorized as an example of the mis- 
spelling of certain prefixes and suffixes. The 
research studies of actual misspellings of this 
word show that almost 70 per cent of the errors 
occur in the omission of the double n. Thus, 
this item is not likely to function in a manner 
claimed for it in the spelling test. 

The teacher using this test will often find that 
misspellings are of other types than those ex- 
pected by the author. Since this is the case, he 
should be given instruction in classifying the 
actual errors in some meaningful and diagnostic 
system. The author neither gives such instruc- 
tions for the teacher nor suggests how errors 
are to be interpreted when they do not fall within 
the narrow confines of each subgroup or type. 
In the statistical analyses that have been made 
ofthe test, there appears to be little justification 
for the division into part scores. The high inter- 
correlations of the subscores and their low re- 
liabilities indicate that the subgroups are prob- 
ably not measuring distinctly different spelling 
situations. Moreover, these results indicate that 
the part scores are inadequate for the discrimina- 
tion of types of errors or for comparisons among 
individual pupils. The reliability of the entire 
test, on the other hand, is apparently satisfac- 
tory for discrimination of the general spelling 
levels of individuals. Norms based on the doubt- 
ful part scores as well as on the total scores are 
available for grades 5-8 in private schools. 

Despite these adverse criticisms, there are 
several practical uses for this spelling test. It 
may be employed with a high degree of accuracy 
to indicate the grade level of spelling ability of 
individuals. It may be used to determine group 
or class tendencies to observe certain spelling 
conventions. These group performances might 


then be used in planning group instruction in 
these same conventions. The test will reveal 
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what words from the test list the pupils misspell, 
and then the correct spelling of these might be 
taught to the group if the words are deemed 
essential for the spelling curriculum. At the same 
time, however, the test will not reveal with any 
great degree of accuracy details as to why the pu- 
pils misspell, or as to the error tendencies of in- 
dividuals. It will not indicate the knowledge or 
use of spelling rules by individuals or permit 
comparisons among the individual pupils in the 
types of errors they commit. The greatest value 
of the test to the teacher lies in its indications 
of group needs and in its desirable emphases in 
the spelling curriculum. 
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*Morgan Spelling Test for Business and Indus- 
try. Applicants for stenographic and secretarial posi- 
tions; 1948; Forms A, B; no data on reliability and 
validity; manual ['48] ; $2.50 per 25; 50€ per specimen 
set; sample test free to qualified test users; cash orders 
postpaid; 20(25) minutes; Antonia Morgan and Wil- 
liam J. Morgan; Aptitude Associates. * 

Harop Н. Brixrm, Professor of. Education, 
Western Carolina Teachers College, Cullowhee, 
North Carolina. 

DESCRIPTION. This test is designed for use in 
employment or guidance offices and in high 
school commercial classes where only a short 
time is available for testing. In the left-hand 
column will be found a list of 75 words, some 
correct and some misspelled. If a word is not 
spelled correctly, the correct spelling is to be 
written on the line opposite. This test, therefore, 
is self-administering. 

vaLIDITY. The authors report no data on va- 
lidity, but inspection of the items reveals that 
they include such words as “ай right," “Wed- 
nesday,” and “recommend” as well as a few busi- 
ness terms, such as "correspondent," "receipt," 
and "merchandise." It is unfortunate that all 
words are printed with initial capital letters. 
This presents an unusual situation and may well 
affect the validity of the results. 

RELIABILITY. No data are reported on relia- 
bility, but the test includes 75 items. It should, 
therefore, be fairly reliable. : 

vusabiLITY. Two kinds of norms are reported. 
A percentile table is given for a random sample 
of 421 office workers, including file clerks, typ- 
ists, secretaries, stenographers, and administra- 
tive assistants. It would have been more helpful 
if stenographers and secretaries could have been 
reported separately. In addition, a table for in- 
terpretation is given, showing scores judged to 
be needed for stenographers classified as su- 
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perior, average, and below average. Obviously, 
norms are based on a very small number of cases 
in comparison with other standardized tests on 
the market. Furthermore, no indication is given 
to show how the sample was selected. 

SUMMARY. This test may be of some value to 
employers who wish to use a self-administering 
type of test for screening applicants. No data are 
available to prove that the two forms are equiva- 
lent. Hence, prospective users would be well 
advised to use one of the standard tests in spell- 
ing rather than this instrument. 
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‘Spelling Evaluation With Standardized 
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ANTON THOMPSON, Supervisor of Research, 
Long Beach Public Schools, Long Beach, Cali- 
fornia. 

This 1923 spelling scale is still being marketed. 
Printed in booklet form, it consists of a manual 
of directions and eight word lists of equivalent 
average difficulty, All lists are so words in 
length. Each list contains words which range in 
difficulty from wor judged easy enough for 
primary grade pupils to words considered suit- 
able for testing capable junior high school pupils. 
A complete list is used in testing any grade from 
2 through 8. A teacher having pupils of more 
than one grade classification in her classroom 
can test them simultaneously. d 

The testing procedure is famili 1 “i the teacher 
pronounces a word, uses it in a sentence, repeats 
the word, and the pupils write the word. A 
pupil’s score is interpreted by referring to the 
manual which provides data for converting the 
raw score to a T score, a G score (grade status), 
a C score (midyear classification in spelling), a 
“В score (brightness in spelling), and a spelling 
age.- 

Words in this scale were selected from the 
Buckingham Extension of the Ayres Spelling 
Scale. Only words that were among the first 
5,000 in Thorndike’s original Teachers Word 
Book were included. The reviewer checked one 
of the lists against Horn's list of 10,000 words 
most commonly written by adults, and the Rins- 
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land list based upon children's writings? The 
Horn and Rinsland studies were, of course, un- 
available at the time this scale was constructed, 
Of the 5o words analyzed, 96 per cent were 
among the first 5,000 on Horn's list and 78 per 
cent were among the first 5,000 words which 
Rinsland's research shows are used by eighth 
grade pupils. This lends support to the соп. 
sion that the test words are, for the most part, 
those commonly used in the writing of adults 
and upper grade children. 

The manual provides explicit directions for 
administering, scoring, and recording the re- 
sults of the test. The tables of norms are based 
on scores from 57,331 pupils in the rural and 
village schools of New York State. The norm 
group included about 8,000 pupils in each grade _ 
from 2 through 8, and about 1,000 ninth graders, 

Balanced against the scale’s desirable features 
are several deficiencies. Many lower grade pu- 
pils will be disturbed when given a wide range 
test that includes words considered difficult for 
junior high school students. To make matters 
worse, the manual sternly forbids any “unstand- 
ardized introductory remarks" through which a 
skilled tester might hope to build up the courage 
of the younger pupils. The reviewer believes the 
authors were somewhat naive when they wrote 
these lines: “If, at the close of the test, younger 
pupils should seem bewildered and confused, 
the examiner should smilingly encourage кет 
by explaining that he didn't expect en 
spell all the words. He should endeavor to 
the test period leaving the children unifo 
happy." s, 

Although the principal reason for the e 
ner's reading a sentence containing a test s" 
is to help pupils understand the word, per 
the sentences offer little assistance. Illustra! dif- 
of such sentences are these " “There is little 
erence.” "I am not particular.” — d 
: The manual provides no statistical data e 
cerning the test's reliability. And 
fault is the lack of discussion of the possible | 
of the test results. y 

Five pages of the manual are devoted d 
plaining how the teacher can convert pup! éco 
scores to T scores, B scores, and other T s 
mended units. Only those teachers — ‘ 
had recent or extensive training in statistic эй 
understand such sections аз the following: 

: College _ 
graphs in Education, Series 1 No. 4. Iowa ciy paer, 
of Education, University of lowa; spula of Elementary 


ee 
i asic Voci ry ‚636. 
School Children: New Work: Macmillan Co. i949. PP 
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то units of T-score represent т SD (standard 
deviation) of distribution of 12-year-olds. Any 
pupil or class, for example, whose Ts is 70 has 
an ability that is 20 T (or 2 SD) above the mean 
ability of 12-year-olds." 

Because the same words are used in testing 
pupils widely separated in grade classification, 
the norms reflect the sharp scaling made neces- 
sary. This places great emphasis upon the cor- 
rect spelling of a single test word. Thus, a pupil 
correctly spelling 41 words earns a grade status 
of 8.0 while a pupil spelling two more words 
correctly earns a grade status of 8.8. 

A serious weakness of the norms for most edu- 
cational purposes is the fact that they are based 
upon rural and village pupils of one state, tested 
more than a quarter century ago. Comparison 
of the spelling ability of an urban school of the 
1950's with the norm group might be interest- 
ing as a research project; however, the usual 
purposes for which standardized tests are given 
require a comparison of scores with those 
earned by a standardization group with similar 
background and preparation. 

This test has undoubtedly been a valuable 
measuring instrument. In certain "then and 
now" comparisons, it may still have value al- 
though the reviewer would not recommend its 
use in the lower grades. For ordinary educa- 
tional purposes, this scale, normed on rural and 
village pupils tested before 1923 in but one state, 
has limited usefulness. 


[206 ] 
*The New Standard o School Spelling Scale. 
Grades 7-12; 1925-49; a 66-page book (40) consisting 
of 2,560 words with difficulty values in each of grades 
9-12; $1.32 per copy, postage extra; Ernest P. Sim- 
mons and Harold H. Bixler; Turner E. Smith & Co. * 


[207 ] 

*Rich-Engelson Spelling Test. Grades 9-13; 1947; 
IBM; Forms A, B; mimeographed manual; $1.05 per 
25; 20€ per specimen set; postpaid; separate answer 
sheets may be used; 75€ per 25 IBM answer sheets; 
25% per stencil for scoring answer sheets; postage ex- 
tra; 30(35) minutes; Vernita Rich, Ieleen Engelson, 
and H. E. Schrammel; Bureau of Educational Meas- 
urements, Kansas State Teachers College of Empo- 
ria, 

Henry D. RixsrAND, Professor of Education, 
The University of Oklahoma, Norman, Okla- 
homa. 

The validity of this spelling test is in serious 
question because of (a) the age of the research 
on which the test is based—namely, the Buck- 
ingham Extension (1918) of the Ayres Spelling 
Scale (1915), the Iowa Spelling Scales (Ander- 
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son's research, 1921), and the Thorndike Word 
List (1921, 1931, or 1944—which one of the 
three Thorndike lists was used, the authors do 
not state) ; and (b) the simple fact that none 
of this research provides information concern- 
ing the specific spelling needs of high school 
students and college freshmen. The Ayres, Buck- 
ingham, and Iowa scales are measures of the 
achievement of elementary school children ; all 
Thorndike lists are made up of words from the 
writing of word-talented adults. Between these 
two there is an unknown gap. It might as well 
be admitted that there are at present no facts 
upon which to build a valid spelling scale for 
the grades for which this test is intended. 

The test is in the form of a list of 150 words, 
some correctly and some incorrectly spelled. The 
student is to decide for each word whether the 
spelling is right or wrong. Is successful accom- 
plishment on a test of this type an indication of 
spelling or proofreading ability ? 

With validity so seriously questioned, re-, 
liabilities of from .76 to .92 are not particularly 
reassuring; nor do norms based on 8,477 cases 
provide any accurate representation of what the 
spelling achievement of high school students 
and college freshmen should be. 


[ 208] 

Spelling and Vocabulary: Every Pupil Test. 
Grades 3-0; 1948-51; new form usually published each 
April and December; form April 1951; no data on 
reliability and validity; no manual; norms (751); 2% 
per test; 14 per answer key; stpaid; 40(45) min- 
utes; Ohio Scholarship Tests, Ohio State Department 
of Education. * 


[209] 


*Spelling s Test. Grades 2-4, 5-6, 7-8; 1942- 
52; I3 scores: silent letter omission, sounded letter 
omission, doubled letter omission, addition by doubling, 
single letter addition, transposition, phonetic substitu- 
tion-vowel, phonetic substitution-consonant, phonetic 
substitution-syllable, phonetic substitution-word, non- 
phonetic substitution-vowel, non-phonetic substitution- 
consonant, unrecognizable; 3 levels; 1 form ['42]; 
mimeographed ; no data on reliability ; ihid man- 
wal ['52]; distribution gratis; nontimed (30-40) min- 
utes; George D. Spache; Reading Laboratory and 
Clinic, University College, University of Florida. * 


[210] 

: Seven Plus Assessment: Northumber- 

1951; for complete battery, 
see 24; I form ['51]; no data on reliability ; manual 
[51]; 7s. 6d. per 25; ба. per single copy; Is. per man- 
ual; 2s, 6d. per specimen set (includes the other 2 tests 
in the series); postage extra; (60) minutes; C. M 
Lambert; University of London Press Ltd. * 


For a review of the complete battery by Stan- 
ley D. Nisbet, see 24. 


*Spellin, 
land Series. Ages 7-8; 
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[211] 
*Spelling Test for Clerical Workers. Steno- 
graphic applicants and high school; 1947; Forms A, 

; manual ['47]; $2 per 25, postage extra; 50€ per 
specimen set, postpaid; nontimed (15-20) minutes; 
Jay L. Otis, David J. Chesler, and Irene Salmi; Per- 
"os Research Institute, Western Reserve Univer- 
Harotp Н. Brxter, Professor of Education, 
Western Carolina Teachers College, Cullowhee, 
North Carolina. 

Since this is a self-administering test, it is 
especially adapted for use in employment and 
guidance offices. The test is organized in three 
columns: (a) how word sounds, (5) definition, 
(c) correct spelling. The words are to be writ- 
ten in the third column. Each form includes 73 
items. 

The authors do not report the basis for the 
selection of the test items other than the state- 
ment that the test consists of “commonly mis- 
spelled business words." They do report cor- 

; relations ranging from .64 to .82 with the spell- 
ing sections of the Cooperative English Test 
and the Progressive Language Test. These cor- 
relations give some indication of the validity of 
the test. Inspection of the items reveals that the 
words include both commonly used words and 
business terms. 

Reliability of the test is satisfactory, as shown 
by correlations between the two forms of .96 
and .90, respectively. Usability of the test may be 
judged by the fact that it may be taken by an 
individual on a self-administering basis, or it 
may be given to a group. Less than one class 
period is required. 

Norms, in terms of deciles, are reported for 
high school graduates applying for admission to 
college and for employed clerical workers. These 
norms are based on 196 and 57 cases, respec- 
tively. Obviously, these numbers of cases are 
rather small for satisfactory norms. The manual 
gives no indication as to how these cases were 
chosen nor do the authors report the types of 
clerical jobs in which the workers were em- 
ployed. The difficulty of the test may be judged 
by the fact that the means (norms) are 59.2 and 
63.7 for the high school graduates and clerical 
workers, respectively, as compared with a per- 
fect score of 73. 

The authors have used a clever idea, phonetic 
spelling, as the basis for this test. It is to be 
hoped that they will assemble additional data 
on such groups as high school graduates apply- 
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ing for stenographic jobs, thus developing more 
usable and more dependable norms. 


[212] 
Traxler High School Spelling Test. Grades 9-12; 
1937-40; Forms т [37], 2 [737], 3 [39]; mimeo- 
graphed manual ['37]; norms: Form І (40), Form 2 
(38), Form 3 с ; 4€ per test; 25¢ per specimen set; 
stage extra; (20) minutes; Arthur E. Traxler; 
ducational Records Bureau. * 
Henry D. Rrnsvanp, Professor of Education, 
The University of Oklahoma, Norman, Okla- 
homa. 

This spelling test is based on the old research 
of Ayres (1915), Buckingham (1918), and 
Horn (1926). Recent research has shown that 
spelling lists for the elementary grades based on 
this older research bear little relation to the 
vocabulary that children in those grades actually 
use. Accordingly, high school spelling lists based 
on the same research are also suspect. Until in- 
formation comparable to that presented in 
Rinsland's Basic Vocabulary of Elementary 
School Children? has been secured for high 
school pupils, there is no way of knowing 
whether this test or any other spelling test ade- 
quately samples “the active vocabulary of high 
school pupils." к 

Apart from this fundamental consideration, 
the Traxler High School Spelling Test gives 
evidence of having been carefully planned and 
constructed. The test is designed to be taken on 
spelling sheets on which are printed illustrative 
sentences. The spelling word is pronounced by 
the examiner; and the pupil reads the illustra- 
tive sentence and then writes the word in the 
blank provided. This form of presentation ap- 
pears to be superior to that which requires only 
simple recognition of right and wrong spelling 
Forms 1 and 2 of the test are directly compa- 
rable in difficulty; Form 3 was purposely pre- 
pared to be more difficult than the first two 
forms. 

Reliability coefficients for the test, secured by 
correlating the scores of 393 public high school 
pupils on Forms т and 2, range from .91 to .95: 
Percentile norms are provided for each 0 
the three forms. These are based on score 
obtained by several hundred independent 
school pupils in three successive programs d 
the Educational Records Bureau, in each iD 
which a different form of the test was adminis 
tered. 


tary 
1 Rinsland, H D. Basic Vocabulary of Eleme" 
School Children. New York: Macmillan Co., 1945. PP- 636- 
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[213] 
*Cooperative Vocabulary Test. Grades 7-16; 
1940-51; IBM; Forms О ('40), 2 (49); Forms R, 
Y, QS, RS, YS, ZS out of print; manual ('40) ; gen- 
eral Cooperative manual ('51) ; norms [’40] ; $2.25 per 
25; 50€ per specimen set, postpaid; separate answer 
sheets may be used; 90€ per 25 IBM answer sheets; 
30€ per set of stencils for machine scoring of answer 
sheets; cash orders postpaid; 30(35) minutes; Fred- 
erick B. Davis, F. S. Beers (Q), D. G. Paterson (Q), 
Mary Willis (Q), and Charlotte Croon Davis (9); 
Cooperative Test Division, Educational Testing Serv- 
ice. 
REFERENCES 


1, Cooperative Test Service, The Cooperative Vocabulary 
Test: Information Concerning Its Construction, Interpreta- 
tion and Use, New York: Cooperative Test Service, 1940. Pp. 


2. Paper. 
2. Simpson, R. С. “The Vocabulary Sections of the Co. 


operative English Tests at the Higher Levels of Difficulty.” 
J Ed Psychol 34:142-51 Mr '43. * (PA 18:311 

3. Davis, FREDERICK B. «fhe Interpretation of Frequency 
Ratings Obtained From ‘The Teachers Word Book.’” J Ed 
Psychol 35:169-74 Mr '44. * (PA 18:2957) 

4. Моск, Ѕлмғовр J. “Cooperative Vocabulary Test, Form 
R, CI604A, Cl6osA," рр. 53-6. In Printed Classification Tests. 
Edited by J. Р. Guilford with the assistance of John I. Lacey. 
Army Air Forces Aviation Psychology Program Research Re- 
Orts, REDE No. s. Washington, SDC.: U.S. Government 

rinting Office, 1947. Pp. xi, 919. * (РА 22:4145) 


For reviews by Edgar Dale and Henry D. 
Rinsland of Forms Q, R, QS, and RS, see 
3:160. 


[214] 

*General Vocabulary: Iowa: Tests of Educa- 
tional Development, Test 8. Grades 9-13; 1942-51 
(first published as a separate in 1951); for complete 
battery, see 17; Form Y-2 ('si—same as 1949 edi- 
tion) ; manual ('s1); general manual ('51) ; $3.75 рег 
25; separate answer pads or answer sheets must 
used; $1.95 per 25 answer pads; $3 per 100 IBM an- 
swer sheets; 506 per scoring stencil; $2.50 per 25 first 
semester (/46) or second semester ('48) profiles for 
any one of grades 9-12; $1 per 25 self-interpreting 
Profiles for students ('51); 256 per school summary 
report (744); 4€ per pupil score sheet (/48); 50¢ per 
Specimen set; cash orders postpaid; 22(27) minutes; 
edited by E, F. Lindquist; К. W. Vaughn; Science Re- 
Search Associates, Inc. * 


For a review by Eric F. Gardner of the com- 
plete battery, see 17; for reviews by Henry 
Chauncey, Gustav J. Froehlich, and Lavone A. 
Hanna of Forms X-1 and Y-r of the complete 
battery, see 3:12. 


[215] 
*Holborn Vocabulary Test for Young Children. 
Ages 3.5-8.5; 1044-40; reprinted in 1949 from The 
anguage and Mental Development of Children (see 
I below) ; individual; no data on reliability and valid- 
ity; 1 form; 1s. per examiner’s manual, postage extra; 
no other materials needed; (40-50) minutes; А 
Watts; George G. Harrap & Co. Ltd. * 
REFERENCES 
age and Mental Development of 


1. Warts, А, Е. Th 
Children, pp, Атм Le oett George G. Harrap & Co. 
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еН; Фр, don Boston) Mass.: D.C. Heath & Co., 
C. M. Fremine, Reader in Education, Uni- 
versity of London, London, England. 

This test can best be evaluated if one thinks 
of it as an extract from the author’s The Lan- 
guage and Mental Development of Children 
(т) and realises that its separate publication 
seems to have been intended to stimulate the 
interest of teachers in the study of pupils rather 
than to provide psychologists with a measuring 
instrument whose reliability has been fully es- 
tablished by statistical devices. On this account, 
it is the more to be regretted that the test is 
not prefaced by some discussion of the problems 
involved in any estimatign of vocabulary and 
by an exposition of the importance for a teacher 
to understand the range of word and phrase 
comprehension expected from pupils ages 3-8. 
The facts should have been mentioned that the 
test was devised in the light of evidence obtained 
from both English and American researches and 
that the 100 words to be supplied as answers are 
fairly evenly spread through a list of the 6,000 
most common English words (with specific 
reference to Thorndike’s Teachers Word 
Book). Information might also have been given 
as to the number of pupils in each age group in 
the sample on which the vocabulary growth 
curve given in the test booklet was based. 

The test is, however, an interesting one. The 
contents are well balanced; they include simple 
descriptions of face and features and actions 
with hands and fingers, household articles, 
sounds, fabrics, information as to meals, quan- 
tities, occupations, the street, and children’s 

es. While it cannot yet be recommended as 
a fully standardised test, it does provide enter- 
prising teachers with a somewhat formalised 
means of commencing the study of their pupils. 


[216] 
*Michigan Vocabulary Profile Test. Grades 9-16 
and adults; 1937-49; 9 scores: human relations, com- 
merce, government, physical sciences, biological sci- 
ences, mathematics, fine arts, sports, total ; IBM; 
Forms AM (’49—same as test copyrighted in 1939 ex- 
cept for norms), BM ('49—same as test copyrighted 
in 1939 except for norms); revised manual (49) ; 
$2.25 per 25; 35¢ per specimen set, postpaid; separate 
answer sheets may be used; $1 per 25 IBM answer 
sheets; 40€ per set of stencils for machine scoring of 
answer sheets; postage extra; nontimed (50-60) min- 
utes; Edward B. Greene; World Book Co. * 
REFERENCES 
1-2, See 40:1320. 


8. 1:166. , 
7 oer Wisor J. “The Relationship Between Read- 
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ing Efficienc: 


and Academic Success in Selected University 
Curricula,” ) 


Ed Psychol 35:17-26 Ja '44. * (PA 18:2581 
10, HALL, Улам E., AND VEM ds Р. “Ап 
an ETE Approach to the (суу d o rading Skills." J Ed 
усі 1429-42 О '45. 20:1680 4 
eee AE Балтан са Боуи “A Test Battery for Selecting 
Technical Magazine Editors." Personnel Psychol 2:75-91 sp 
"49. * (PA 23:509 д 
12. EpMonson, Lawrence Davis. Comparative MT of 
a Test Battery Used for the Prediction of Scholastic Success 
at the University of Missouri. Doctor's thesis, University of 


Missouri (Columbia, Mo.) 1949. Abstract: Microfilm Ab- 
stracts 9:64-6 no 3 '5o. IG 24:4846, title only) А 
13. Super, Юомлір E. Appraising Vocational Fitness Ву 


Means of Psychological Tests, pp. 474-6. New York: Harper 
& Brothers, 1949. Pp. xxiii, 727. $5.00. * (РА 24:2130) 

14. GREENE, E IIS B. Michigan Wocatiilaey Profile 
Test, After Ten Years," Ed & Psychol Meas 11:208-11 su '51. 
* (РА 26:2749) 

15. LEVIN, S RICE, AND Stacey, CHALMERS L, "Awareness 
of Vocabulary Size: Its Relation to Class Standing and Sex 
Differences." J Ed Psychol 42:174-84 Mr ’51. * (PA 25:7652) 
Davin SEGEL, Specialist in Tests and Measure- 
ments, Office of Education, Federal Security 
Agency, Washington, D.C. 

This test is designed primarily to differentiate 
vocabulary ability as among eight different 
fields : human relations, commerce, government, 
physical sciences, biological sciences, mathema- 
tics, fine arts, and sports. This edition (1949) 
of the test is the same as the earlier edition but 
the manual has been revised to include addi- 
tional norms and new data bearing on the va- 
lidity of the test. The author sets up three prob- 
lems to the solution of which he thinks the test 
results contribute specifically. This reviewer will 
consider the test in relation to these three prob- 
lems. 

The first problem to which the author be- 
lieves the test contributes a solution is that of 
the growth and retention of specific informa- 
tion connected with specific training. As evi- 
dence for this, the author quotes a study of test 
scores for beginning students and for graduates 
of a nurses training course which shows that the 
vocabulary in science and especially for biologi- 
cal science is increased substantially by such a 
course. That the vocabulary in the various areas 
of the test is increased with training in related 
subject areas seems a reasonable result, and the 
test results may well be used in studies of the 
learning of individuals in various areas. 

The second problem claimed to be solved by 
this test is the extent to which technical knowl- 
edge is related to success in a particular field. 
For this the manual cites norms showing that 
the persons in certain occupational areas have 
substantially higher norms in the vocabularies 
related to these occupations. More definite data 
showing the relation of certain parts of the vo- 
cabulary test to specific occupational areas or 
marks in certain subjects should be brought to- 
gether to prove the point. 
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Тһе third problem to which the author be- 
lieves the test contributes is the prediction of 
success in a particular educational field. The 
normative data given in the manual do not have 
a bearing on the problem since the presence of a 
higher vocabulary score in an area related to 
that in which the pupil is specializing or in an 
occupation in which he is engaged is not a pre- 
dictive situation. The higher vocabulary score 
in an area can be explained by the reasonable 
assumption that the student learned the vocabu- 
lary on the job. In fact, the author gives evi- 
dence to this effect in the manual when he shows 
that the biological vocabulary score increases 
substantially when trainees are passing through 
the nurses training course. 

As indirect evidence on this point, the inter- 
correlations among the subtests may be consid- 
ered. These intercorrelations are all below .55 
and several are zero or practically zero, The 
author says in this connection, “It is evident that 
these figures indicate the presence of a number 
of fairly well isolated factors." These intercor- 
relations were obtained from students (sopho- 
mores in college) who were already beginning 
to specialize in subject areas. The point that the 
lack of correlation among the vocabulary score 
areas shows well isolated factors seems not to 


be well taken since the differences are those 


which are in large part attained through differ- 
ences in learnings in the subject areas them- 
selves. E 
The Michigan Vocabulary Profile Test is a 
well constructed test of vocabulary or of one type 
of verbal ability. It may be used, therefore, with 
assurance in aiding the prediction of college suc- 
cess in general and for educational experiments 
in which a vocabulary score is of value. Its use 
as a differential instrument—to predict success 
in various subject fields and occupations— 
should be on an experimental basis. Not enough 
data have been gathered and analyzed as yet {0 
establish its validity in this particular respect. 


For a review by Joseph E. King, see 3:166; 
for a review by Herbert A. Landry and an ex- 
cerpt from a review, see 40:1320; for reviews 
by Richard Ledgerwood, John M. Stalnaker, 
M. R. Trabue, and Arthur E. Traxler oj а" 
earlier edition, see 38: 1171. 


[217] К 
XTest of Literary Essentials. Grades 10-12; 193; 
I form; no data on reliability and validity; no та oi: 
$1.75 per 25, postpaid; specimen set not available; 


. 
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timed (35-45) minutes; Elmer R. Smith; Т 1 
Smith & Co. * s urner E 


[218] 
*Word Dexterity 'Test. Grades 7-16; 1942-50; 
IBM; Form A (42) ; mimeographed manual ('50)— 
reproduced in part from 3:170(2) ; $2 per 25; 100 or 
more, 616€ each; 256 per specimen set; postpaid; sep- 
i 
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arate answer sheets (IBM Form ITS 1100 A 444) 
may be used; IBM answer sheets must be purchased 
directly from the International Business Machines Cor- 
poration; nontimed (40-45) minutes; Shailer Peter- 
m the Author, 222 East Superior St, Chicago 11, 
REFERENCES 
1-2. See 3:170. 


FINE ARTS 


Reviews BY William S. Larson, John McLeish, William B. Michael, Harold A. Schultz, Herbert 
D. Wing, and Edwin Ziegfeld. 


ART 


[219] 
*Graduate Record Examinations Advanced Fine 
Arts Test. Senior year college through graduate 
school; 1941—51; available only in the Institutional 
Testing Program of Graduate Record Examinations 
(see 527); 105(125) minutes; Educational Testing 


Service. * 


[220] 

*Graves Design Judgment Test. Grades 7-16 and 
adults; 1948; IBM; 1 form; [revised] manual (48); 
1-0 copies, $1.75 each; 10-99, $1.50 each; separate an- 
swer sheets must be used; $1.85 per 50 IBM answer 
sheets; $1.85 per specimen set; postpaid; nontimed 
(20-30) „minutes; Maitland Graves; Psychological 
Corporation. * 


REFERENCES 


1. Graves, MAITLAND. Chap. 6, “Unity,” 


. 6. In hi 
The Art of Color and Design. New York: aa "Book 
о., Inc., 1941. Pp. xvi, 


292, * 
2. Graves, MAITLAND. The Art of Color and Design, Second 


Edition, pp. 66-79, 164-9. New York: McGraw-Hill Book Co., 
nc., 1951. Pp, xvi, 439. * 


Wittam B. Micwaet, Director, The Testing 
Bureau, University of Southern California, Los 
Angeles, California. 

Devised to yield a measure of those aspects 
of aptitude thought to be important in the ap- 
preciation or production of art structure, the 
Graves Design Judgment Test represents a con- 
scientious and noteworthy achievement in test 
construction within a field of extraordinary com- 
Plexity and ambiguity. In the construction of 
the items an attempt has been made to evaluate 
the extent to which the examinee is able to per- 
ceive and react to those basic principles of 
aesthetic order described in the manual as being 
unity, dominance, variety, balance, continuity, 
symmetry, proportion, and rhythm. Obviously, 
it is in the meanings of such concepts and in the 
subtle dynamic interrelationships of their asso- 
ciated affective values within the context of a 


given design that the expert in test construction 
meets perhaps his greatest challenge in the de- 
velopment of valid items, for he cannot describe 
adequately the numerous differential affective re- 
sponses a given design may elicit among any 
group of individuals. In short, because it is 
difficult to obtain agreement among experts as 
to the meanings and relative effects produced 
by the so called basic principles of aesthetic 
judgment, the realization of a substantial degree 
of validity is almost an unattainable objective. 
Thus, the major weakness of this test, as well 
as of others purporting to measure aesthetic 
judgment, is the theoretical one of establishing 
more or less universal criteria (principles) 
against which the test might be validated. 
Nevertheless, in the construction of the test 
the author has displayed high standards of 
workmanship in attempting to perfect an empiri- 
cally valid instrument. First following his the- 
oretical assumptions, Graves made a conscien- 
tious attempt to incorporate the eight basic 
principles enumerated above within one de- 
sign (keyed “right’””) and to introduce violations 
of one or more principles in the one or two other 
designs making up the item. Within each de- 
sign, abstract or nonobjective, geometric ele- 
ments were employed to eliminate the influence 
of ideas, prejudices, or specific meanings asso- 
ciated with works of visual art, In avoiding 
representational art, Graves has probably suc- 
ceeded fairly well in divorcing form from con- 
tent and in minimizing the importance of cul- 
tural experience that would permeate the repro- 
ductions of familiar settings such as landscapes, 
utilitarian commodities, persons, and the like. 
Of the 150 items prepared for experimental 
tryout, 90 were retained that satisfied all three 
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E к-н, agreement among 
of the following criteria: (a) 221 specializa- 


teachers of art (whose educ# ] 
tion had not been visual art) 25 to which of the 
two or three designs in each i£? was best, (b) 
"greater preference for the design by art stu- 
dents than by non-art students,” and (с) 
“greater preference for the de!8" by those who 
achieved high scores on the entire test than by 
those receiving low scores (internal consist- 
ency).” As mentioned in the manual no data 
have been obtained regarding the degree to 
which the test scores can predict success in vo- 
cational or educational activities related to the 
purpose of the test. Evidence 15 presented re- 
garding performance of yariouS groups which 
on a priori grounds might be expected to differ. 
Marked separations in scores Were obtained 

. between art students and ponart students at both 
the college and high school levels- Art students 
consisted of subgroups of individuals majoring 
in general art, illustration. costume design, 
crafts, architecture, and art education. Nonart 


students consisted of studet? Е Jem 
: ; x à tal tec gy, 
ricula in engineering, den joi. 


conventional high schoo] 5% Ё 

For each of the subgrouP? еш e 
tween 36 and 246 students) * Dueh 5 
norms is presented. Sigsific™ ves dos 
tween the means of art and piri Eur түм 
apparent, but no statistic EP th Hidde ub 


differences. rain’ reliability of the test 
s n cin anual for each of the 14 sub- 
is presented in е the homogeneity of the items 
groups. In view um а time permitted for com- 
and of the bs det the odd-even technique 
pletion e t length by the Spearman-Brown 
corrected for аро yed appropriately yielding 
formula was iid between .8т and .93. Stand- 
coefficients rangi" cement are also presented 


a 
ard епо oa groups: The median reliability of 
ш ir e to be quite satisfactory for a group 
.86 apP 


test of e commendable features may be 
perd directions are simple and clear. Scor- 
noted. | pe achieved rapidly by either manual 
ing gare techniques. Although the time limit 
or po it is found that most subjects complete 
» EUR items in only 20 to 30 minutes. 

t “the format of the test booklet is particularly 
clear and delightful to the eye. Since each set 


designs for a given item is beautifully repro- 


duced on a single page, taking the test should 
prove to be both a pleasant and novel experience, 

Two or three sources of possible weakness of 
this test should perhaps be cited. First, in meas- 
ures of aesthetic judgment, the problem may be 
raised as to whether the recorded responses 
correspond to those designs which actually were 
most appealing to the subject or whether the 
responses signify what the subject thinks would 
be the design preferred by the experts—that is, 
the response which would be counted as correct 
in the key. The reviewer asked seven subjects, 
all of whom had been exposed to an introduction 
to aesthetics in courses in general psychology, 
to complete the test under two sets of directions 
emphasizing the contrasting mental sets just 
cited. Without exception, when the examinee 
attempted to anticipate what the correct answer 
would be, he scored several points higher than 
when he indicated his own personal preference. 
Although the sample is too small to permit the 
drawing of any conclusions, the results are sug- 
gestive of the fact that valid measures may be 
difficult to realize in the instance of sophisticated 
subjects. Certainly further research is needed 
to clarify the issue. 

Second, there appeared to be a tendency for 
the principles of variety and dominance to out- 
weigh in importance several of the other princi- 
ples, especially symmetry. While taking the test, 
the reviewer suspected that such might be the 
case and answered items accordingly. When he 
employed other criteria and recorded his actual 
preferences his raw score dropped 23 points. 
Certain themes were also repeated in the de- 
signs which might lead to existence of a high 
degree of interdependence among many of the 
items and possibly to the generation of syste- 
matic bias in the scores. А 

Finally, does a test of this type measure арі" 
tude components for the production of art struc- 
ture as adequately as the components of ap- 
preciation? Although the manual makes no dis- 
tinction between the two components, it would 
appear perhaps that some type of performance 
test might be required to duplicate the compo- 
nent of production. й 

Despite the existence of inherent difficulties 
inyolved in the construction of a test of aesthetic 
aptitude, Graves' test does seem to represent ап 
advance in measurement in the fine arts. In addi- 
tion to such points of strength as excellent for- 
mat, clarity of directions, ease of scoring, presen" 
tation of fairly adequate norms, and satisfactory 
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reliability, considerable care was exercised in 
the construction and empirical validation of the 
items. On the negative side is the lack of follow- 
up data to indicate the validity of the test in 
the prediction of success in vocational and edu- 
cational activities in which aesthetic judgment 
is thought to be important. Certain theoretical 
difficulties have been cited relating to establish- 
ment of universal criteria of judgment, the ex- 
istence of different mental sets of subjects in 
their responses to items, the distinction between 
productive and appreciative sides of aesthetic 
judgment, and the relative emphasis to be placed 
on various basic principles of aesthetic order in 
the construction of items. Only further research 
can aid in overcoming these and other difficulties 
involved in the measurement of aesthetic apti- 
tude. 


Epwin ZieGrELp, Professor of Fine Arts and 
Head of the Department of Fine and Industrial 
Arts, Teachers College, Columbia University, 
New York, New York. 

Devised “to measure certain components of 
aptitude for the appreciation or production of 
art structure....by evaluating the degree to which 
a subject perceives and responds to the basic 
principles of aesthetic order—unity, dominance, 
variety, balance continuity, symmetry, propor- 
tion, and rhythm,” this test consists of 90 items, 
each made up of two or three plates from which 
the subject selects the one he considers the better 
or the best. The plates in each item consist of 
nonobjective, two-dimensional or three-dimen- 
sional designs, one of which is organized in ac- 
cordance with the fundamental principles of de- 
sign on which the test is based, the other one 
or two violating these principles. The correct 
response in each item was established on the 
basis of agreement among art teachers as to the 
preferred design, greater preference for the de- 
sign by art students than by nonart students, and 
the internal consistency of the item. The manual 
gives data from 14 groups to which the test was 
administered and the reliability varies from .81 
to .93 with a median of .86. The norms are based 
on students from two high schools and three post- 
high school institutions. Evidence presented in 
the manual indicates that the test discriminates 
sharply between art and nonart groups at both 
the high school and college levels. 

In comparison with other tests with the same 
or similar objectives, the innovation of the 
Graves Design Judgment Test is the nonobjec- 
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tive character of the items. By the removal of 
subject matter from the choices, the author 
hoped to secure responses that have a purely 
aesthetic basis. 

No published studies on this test have ap- 
peared. Little data are available on the effective- 
ness of the test by itself and none on its relation 
to other tests. Based upon the evidence presented 
in the manual, the test measures design judg- 
ment well, the “component” on which the test is 
based. But it is not, however, clear what the role 
of design judgment is in the total complex of 
traits involved in “the appreciation or produc- 
tion of art structure.” By removing all repre- 
sentational features from the items, the author 
has ruled out most or all of the associational 
factors which might affect choice. Feelings and 
emotions are basic to art and the use which an 
artist makes of experience is one essential factor 
in successful art production. By excluding one 
of the important factors in artistic success the 
test may have been improved but at the same 
time the use of it as a predictive instrument has 
thereby been limited. Until evidence is presented 
on the validity of the test, it is recommended that 
results be used with caution. 


J Consult Psychol 13:66 Е ’49. Laurance F. 
Shaffer. The counselor’s limited resources for 
measuring art aptitude are augmented by this 
test, which uses a new approach. Each plate has 
two or three designs of entirely novel and ab- 
stract character, so that judgment is relatively 
uninfluenced by familiarity or by subject-matter 
association. * Validity is discussed in terms of 
the capacity of the test to separate groups such 
as students majoring in art and nonart curricula. 
This validation does not distinguish between 
aptitude and achievement, and further validation 
studies involving prediction are proposed. * 

[221] 
*Horn Art Aptitude Inventory. Grades 12-16 and 
adults; 1939-51; 2 Scores: scribbling and doodling, 
ty tm Peli таша И ыр 
DUM C "eia ir Stoelting Co. $^ Formerly 
ublished by Office of Educational esearch, Rochester 
Institute of Technology.) 
For a review by Edwin Ziegfeld, see 3:171. 


[222] 


nontimed (180) minutes; Alma Jordan Knauber ; the 


REFERENCES 
1-4. See 40:1323. 
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EDWIN ZIEGFELD, Professor of Fine Arts and 
Head oj the Department of Fine and Industrial 
Arts, Teachers College, Columbia University, 
New York, New York. 

This test was devised as a measure of art 
ability for use from junior high through college. 
It is long, requiring about three hours for ad- 
ministration although no time limit is set. A 
considerable variety of responses is called for 
such as: reproducing a design from memory, 
drawing a figure of Santa Claus, adapting or 
modifying given designs to other conditions, 
drawing objects in perspective, creating pictorial 
compositions making use of designated elements 
and/or themes, checking drawings for accuracy 
in perspective and shadows, and working out 
light and dark arrangements. Scoring on each 
item (with one exception) is on a 10-point scale, 
and the key presents examples on three levels 
of excellence to assist in assigning valid scores. 

This test is of interest in the range and di- 
versity of traits which it measures for it is ap- 
parent that art ability is highly complex. The 
author, however, gives no clues as to why she 
chose these particular traits for measurement 
or why she considers that they, in their totality, 
measure art ability. The validity data that are 
provided are from groups of art majors and 
nonart majors in college and the considerable 
differences between the scores of the groups 
might be due as much to training as to ability. 
Little definite information is given on the source 
of the relatively small groups of students on 
whom the validity data are reported. If drawn 
from the classes taught by the author or by her 
associates, as appears to be the case, it is quite 
understandable that the art majors would score 
high in a test embodying the author's criteria 
of ability or achievement. Evidently no studies 
have been reported by other researchers to in- 
vestigate further the traits which this test meas- 
ures, the success with which it measures them, 
or its relation to other art tests. Further infor- 
mation on what the test actually measures and 
data from larger and more diversified groups 
would seem to be necessary before results could 
be used or interpreted with much assurance. 


For a review by Norman С. Meier, see 40: 
1323. 


[223] 
Knauber Art Vocabulary Test. Grades 7-16; 1932- 
35; 1 form, '32; manual Сав); 9f per test, postage 
extra; 30€ per manual; 350 per specimen set; post- 


paid; nontimed (40) minutes; Alma Jordan Knauber; 


the Author, 6988 Warder Drive, Cincinnati 24, Ohio, * 
REFERENCES 
1-4. See CENE «Ж 
Banners ARRY O. An Examination of Certain Stand. 


ardised Art Tests to Determine Their Relation to Classroom 


Achievement and to оре usta thesis, University of 


Toronto (Toronto, Canada), 1948. 

Barrett, Harry О. “An examination of Certain Stand- 

ardized Art Tests to Determine Their Relation to Classroom 
Achievement and to Intelligence.” J Ed Res 42:398-400 Ja 
'49. * (PA 23:3894) 
Epwin ZIEGFELD, Professor of Fine Arts and 
Head of the Department of Fine and Industrial 
Arts, Teachers College, Columbia University, 
New York, New York. 

This test, designed to measure knowledge of 
art vocabulary, consists of 100 multiple choice 
items each. concerned with the definition of an 
art term. Norms are provided for each year from 
the seventh grade through the senior year in 
college. 

Technically, the test has a number of limita- 
tions. It is doubtful whether the test measures 
“knowledge of art” as the author claims. Its 
title seems a more accurate description of what 
it measures. However, no information is given 
on the basis of selection of the terms whose defi- 
nitions make up the test. The two coefficients of 
reliability reported by the author are .67 and .78, 
both too low to make the test generally useful. 
The manual, furthermore, supplies no informa- 
tion on the source or number of the students 
used in establishing norms, and the norms, them- 
selves, with their great variation of increase 
from year to year (one point between grades 10 
and тт, ten points between the freshman and 
sophomore college years) would indicate that 
the test is not equally well standardized at each 
grade level. No measures of variability are given. 
The validity of the test has been established only 
by a comparison of scores of art majors and non- 
art majors, but no data are provided on the ex- 
tent of these differences or on their significance. 
Although the instructions state that one of the 
multiple responses in each item is “correct,” the 
desired response in many items is in reality “the 
most correct.” Scoring is awkward and labori- 
ous. 

It is felt that because of the several limitations 
of the test in construction, reliability and valid- 
ity, that its usefulness is questionable. 


For reviews by Ray Faulkner and Joseph Ё. 
Moore, see 40:1324. 


[224] 
The Meier Art Tests: I, Art Judgment. ou 
7-12; 1929-42; a revision of The Meier-Seashore 
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Judgment Test (see 40:1326); IBM; 1 form, '4o; 
manual (42) ; $1 per test; 5-24, 90€ each; 25 or more, 
856 each; separate answer sheets must be used; 4€ per 
IBM answer sheet; 30€ per machine scoring stencil ; 
postage extra; $1.25 per specimen set, postpaid; non- 
timed (45-60) minutes; Norman Charles Meier; Bu- 
reau of Educational Research and Service, State Uni- 
versity of Iowa. * 


REFERENCES 


1-15, See 40:1326. 

20-195 See 3:172. - е i 

20. WENTWORTH, HeLen C. Comparative Achievement of 
College Students as Measured A Three Art Appreciation Tests. 
Master's thesis, University of Minnesota (Minneapolis, Minn.), 
1935. 

21. Morrow, Rozert S. “An Analysis of the Relations 
Among Tests of Musical, Artistic, and Mechanical Abilities.” 
J Psychol $:253-63 '38. * (PA 12:5025), 
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J Ed Res 43:391-3 
26. Ркотнко, E. 


Hanorp A. Scuuttz, Professor of Art Educa- 
tion, University of Illinois, Urbana, Illinois. 

This test, a revision of the Meier-Seashore 
Art Judgment Test first published in 1929, con- 
tains тоо plates carefully selected on the basis of 
experience and analysis from the 125 used in the 
earlier form. According to the author the revi- 
sion has resulted in a significant improvement by 
increasing high scores and lowering poor ones. 
The present test takes less time to administer. 

Each of the тоо plates consists of two pictures 
in black and white exactly alike except that a 
small portion has been altered in one of them 
to bring about a change in the design or compo- 
sition. The subject taking the test is asked to 
select the better of the two. The correct solutions 
were determined by the agreement of experts in 
the art field, and thus the subject attempts to 
match his judgment against that of the profes- 
sional, Reliability of the test is from .70 to .84. 
Norms are provided in percentiles and quartiles 
for the junior high, senior high, and college- 
adult levels. 

The test was designed to measure aesthetic 
judgment in terms of the capacity to sense good 
organization in a work of art. An examination 
of the test shows that most of the items used are 
derived from paintings and drawings. The as- 
sumption is that aesthetic judgment can be based 


primarily upon responses to two-dimensional 
expression. No opportunity is provided to re- 
spond to design as exemplified in such art forms 
as sculpture, architecture, furniture, and various 
types of industrial design. The test items deal, 
almost exclusively, with the linear and pattern 
qualities in composition. Although this is one 
of the significant traits in artistic aptitude, aes- 
thetic judgment involves sensitivity to a variety 
of interlinking factors such as color, texture, 
space, form, and subject matter. 

The manual states that the material used in 
the test was adapted from the works of old mas- 
ters, contemporary artists, block prints, and sim- 
ilar sources. Reputability of the work was one 
of the requirements in the selection. Many of the 
items appear to be dated, and individuals using 
the test will find very few examples, if any, that 
can be classed as contemporary expression. In 
the light of the development of artistic thought 
in recent years, one may question the reputation 
of some of the examples. The selection of the 
better of two drawings is made difficult when 
both appear to be weak in artistic qualities. 

There are several instances where the altered 
portions do not change the design of the picture 
sufficiently. In these cases the choice has little 
meaning in terms of aesthetic judgment. Some of 
the illustrations are poorly drawn and are repro- 
duced on too small a scale. Thus, responses may 
be affected by an inability to see detail, by the 
rejection of an item because of facial expression, 
and by lack of a logical treatment of action. 

The test remains as the most satisfactory of 
all the art tests devised. It is statistically relia- 
ble and easy to administer and score. The instru- 
ment does well as a measure of sensitivity to 
design, and the results may be used by counsel- 
ors as one means of discovering talent. The man- 
ual clearly indicates that traits other than aes- 
thetic judgment are needed by individuals who 
intend to find a successful career in art. 

It is possible that a basic restudy of the test 
items would result in an improved new edition. 
Such a revision should give consideration to the 
use of color, the elimination of dated subject 
matter, and the use of contemporary art expres- 
sion including nonobjective painting. The types 
of art-selected should include a wider variety of 
art forms. A reexamination of the selections 
should result in changes which include some of 
the recent and accepted developments in design 
and composition. The value of the test book 
could be enhanced by using better drawings, en- 
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larging the reproductions, and improving the 
quality of the printing. 


For a review by Edwin Ziegfeld, see 3:172; 
for reviews by Paul R. Farnsworth and Aulus 
Ward Saunders of the original edition, see 40: 
1326. 


MUSIC 


[225] 
*Conrad Instrument-Talent Test. Ages 7 and 
over; 1041; 6 scores: pitch, tempo, rhythm, harmony, 
tone recognition, total; 1 form; no data on reliability 
and validity; no description of normative population ; 
$3 per set of test booklet and тоо grading sheets; $1 
per I00 grading sheets; piano and electric metronome 
essential for administration; postpaid; Jacques W. 
Conrad; Mills Music, Inc. * 
[226] 

*Diagnostic Tests of Achievement in Music. 
Grades 4-12; 1950; 11 scores: diatonic syllable names, 
chromatic syllable names, number names, time signa- 
tures, major and minor keys, note and rest values, let- 
ter names, signs and symbols, key names, song recog- 
nition, total; IBM; Forms A, B; no norms—authors 
recommend the use of local norms; $2 per 25 test- 
answer sheets; 60€ per set of either machine or hand 
scoring stencils; postage extra; 35€ per specimen set, 
postpaid; nontimed (20-60) minutes; M. Lela Kotick 
and T. L. Torgerson; California Test Bureau. * 
М/пллАм S. Larson, Chairman, Music Educa- 
tion Department, Eastman School of Music, 
University of Rochester, Rochester, New York. 

This test is very impressively presented and 
can be easily administered. The pupils mark 
their answers on a 4-page IBM answer sheet. 
Nine of the то parts are printed on the answer 
sheet; Part то, Song Recognition, is printed on 
a separate sheet. 

The authors suggest that schools which differ 
in their curricular offerings or in their instruc- 
tional emphasis may omit tests not appropriate 
for them. No norms are presented. The authors 
recommend that each part score be converted: 
into percentage of the possible score. In effect, 
the authors do not present a standardized test of 
achievement in music, but rather they offer ten 
separate tests. Each teacher who uses the tests 
must assume the responsibility of deciding what 
level of excellence shall be considered adequate. 
The authors suggest that such a standard may 
be set between 70 and тоо per cent but recom- 
mend that the standard should be at least 90 per 
cent if the test is in agreement with the curricu- 
lum of the school. However, the statistics given 
in the manual do not correspond to the percent- 
ages recommended. In Table I, the mean, stand- 


ard deviation, and reliability are given for each 
of the ten tests. These statistics are based upon 
179 to 259 children in grades 4-10; no indica- 
tion is given as to the number from each grade, 
The mean scores are far from the 70 to 100 per 
cent standard suggested by the authors; in fact, 
in some of the subtests the means are but little 
higher than would be obtained by guessing, For 
example, in Test 2, Chromatic Syllable Names 
(20 examples), the mean score is 4.81, or 24 per 
cent ; the chance score is 5 or 20 per cent. Simi- 
larly, in Test 3, Number Names, the mean score 
for 234 cases from grades 4—10 is given as 5.68 
or 28 per cent, only eight percentage points 
above the expected chance score of 20 per cent. 
Also, in Test 5, Major and Minor Keys, the 
mean score for 205 cases from grades 4-10 is 
5.59 or 56 per cent ; the chance score on this test 
is 50 per cent. 

The means given in Table II are not much 
more impressive. The mean, standard deviation, 
and reliability coefficient are given for the total 
test for each of the grades 4-10. The number of 
students used in each grade is very small, rang- 
ing from 28 to 46, the total for the seven grades 
being 179. The mean scores again are low, vary- 
ing from 29 per cent of the possible score in 
grade 4 to 53 per cent in grade 8. These com- 
parisons with chance scores are possibly subject 
to:some modification because some students may 
not have completed all items. However, the au- 
thors state that the suggested time limits are 
approximate only, that 9o per cent of the ри- 
pils should complete all items, and that, if the 
test is to be used to identify difficulties, all pupils 
should complete all items. 

These details show the apparent weakness of 
this test for practical use. Lack of standardiza- 
tion for each grade is a particularly grave omis- 
sion. If, as the authors state, the test content i5 
in agreement with the music curriculum recom- 
mended by music authorities and contained in 
commonly used music series, there should be 
little or no concession to diversity of music cur- 
ricula or differing emphasis on course content. 

Much of the material of this test can also be 
questioned as a true measure of musical achieve- 
ment, as most music educators understand the 
term. The Kotick-Torgerson test has mainly to 
do with the rudiments of music, as have a num- 
ber of similar so-called achievement tests in 
music published during the last several decades. 
With the exception of Part 10, Song Recogn- 
tion, which makes demands on musicality, paf- 


341_] 


ticularly through musical imagery, the test con- 
tains mostly factual material. Ultimately, 
however, musical achievement rests not on a 
mastery of rudiments of music of a factual nature 
but on musical talent, in which students differ 
greatly. And there is little relationship between 
musical talent and ability to learn facts about 
the mechanical aspects of music reading as found 
in the content of the first nine parts of this test. 
Music educators have come to recognize more 
and more the significance of this basic and im- 
portant point, with the result that less attention 
is being given to drill on the mechanical aspects 
of music. Rather, more emphasis is being placed 
on giving the child an aesthetic experience in 
music, particularly through musical expression 
in one form or another. It is questionable, there- 
fore, that any test which gives a minor place to 
the more vital aspects of musical achievement, 
such as musical performance and appreciation 
should be called a test of musical achievement, 
unless the title is qualified by the term “rudi- 
ments of music,” or by some similar term, so 
as to clarify specifically its limited purpose and 
function. 


Hersert D. Wine, Principal, City of Sheffield 
Training College for Teachers, Sheffield, Eng- 
land. 

These are pencil and paper tests taking about 
an hour to administer. There are IO sections: 
(a) tonic sol-fa names from staff notation; (5) 
tonic sol-fa names from staff notation but with 
chromatic intervals; (c) number names (tonic 
as I, mediant as 3, dominant as 5, etc.) ; (d) 
beats corresponding to time signatures; (e) 
stating whether a short form of notation is in the 
major or minor key; (f) stating the number 
of beats corresponding either to a written note 
or a rest; (g) giving the letter names of written 
notes; (h) recognition of the common signs 
and symbols used in notation; (7) recognition of 
key signatures; and (j) recognition of common 
tunes from notation. 

The instructions are fairly long; for Test 1A 
they run approximately 400 words and, in addi- 
tion, require a certain amount of teaching. They 
are, therefore, not fully standardised but de- 
pend for their efficiency on the teaching ability 
of the test administrator. For English children, 
at least, the procedure used for writing down 
the answers would require considerable expla- 
nation in order to make sure that the children 
were carrying out the instructions correctly. 
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No norms are given, but the results are left to 
be evaluated as a result of local experience. This 
must detract from the value of the test, but it 
may be a necessary procedure in the test in its 
present form, as schools vary considerably in the 
amount of teaching given in the particular as- 
pects of music which are being tested. 

The tests cover a good sample of the various 
abilities in the theory of music, and the results, 
after experience in the evaluation has been 
gained, should prove helpful to music teachers. 

'The grading in difficulty of the items needed 
more attention; for example, Series A, Item 2, 
Test 5 is by no means easy to solve. It is unfor- 
tunate to have a very difficult item so early in a 
test where a candidate may waste considerable 
time over it instead of getting on with those he 
can do. The item mentioned may have been made 
worse by a possible misprint. 

Test 7 is rather homogeneous in difficulty ; 
some examples in keys other than C and of chro- 
matic notes might have been added with ad- 
vantage. 

The use of the recognition of songs in Test 10 
brings certain difficulties with it, for a selection 
which is to be used over a wide sample of people 
must be limited to a comparatively few popular 
songs ; those who have had a good musical edu- 
cation and are familiar with the classical songs 
and symphonies cannot then do themselves jus- 
tice. 

If it were desired to use the test in England, 
Test 3 would be better omitted, Test 10 would 
need a new list of songs appropriate to English 
children, and other difficulties would be over- 
come by asking the children to ignore the an- 
swering procedure on the test sheet and to write 
out their answers in full, possibly on a separate 
sheet, as they would for a normal examination 
in music. 

[227] 
*Graduate Record Examinations Advanced Mu- 
sic Test. Senior year college through graduate school ; 
1951; available only in the Institutional Testing Pro- 
gram of Graduate Record Examinations (see 527); 
105(125) minutes; prepared by the Advanced Music 
Test Committee appointed by Educational Testing 
Service: G. W. Woodworth (Chairman), G. S. Dick- 
inson, and Stephen D. Tuttle; Educational Testing 
Service. * 

[228] 
*Musical Aptitude Test: Series A. Grades 4-10; 
1950; 4 scores: rhythm, pitch, melody, total; IBM; 
1 form; $3 per set of testing materials needed by ex- 
aminer; separate answer sheets must be used; 4€ per 
IBM answer sheet; piano essential for administration ; 


postage extra; 40(50) minutes; Harvey S. Whistler 
and Louis P. Thorpe; California Test Bureau. * 
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WiLLIAM S. Larson, Chairman, Music Educa- 
tion Department, Eastman School of Music, 
University of Rochester, Rochester, New York. 

The authors of this musical aptitude test stress 
the point that in testing for pitch, rhythm, and 
melody they use musical examples rather than 
mechanical devices of a nonmusical nature which 
were common in earlier musical aptitude tests. 
'The examiner is instructed to play the musical 
examples on the piano at a moderate speed. 

"Тһе test has five parts. In the first part, 
Rhythm Recognition (ro items), each problem 
has two rhythms, each two measures in length. 
The student is to recognize whether the second 
rhythm is the same as or different from the first. 
In Part 2, Pitch Recognition (10 items), a tone 
is first played at an established pitch ; in the mel- 
ody of four measures of 13 quarter notes which 
follows, the student is to determine the number 
of times the established pitch occurs. In Part 3, 
Melody Recognition (25 items), two melodies 
of two measures in length comprise each prob- 
lem. If the melody is the same in the second play- 
ing as in the first playing, the student answers 
S; if different, D. In Part 4, Pitch Discrimina- 
tion (15 items), two chords are played. If the 
second chord is the same as the first, the student 
responds with an S; if higher, with an H; if 
lower, with an L. In Part 5, Advanced Rhythm 
Recognition, the problems are essentially the 
same as those in Part 1, except that they are 
more difficult. The student again is instructed to 
answer whether the repeated pattern is the same 
as the first or different from it. 

Reported reliabilities are high. The “face” va- 
lidity of the test rests on the fact that the test 
contains examples of rhythm, pitch, and melody, 
these being considered as basic elements of mu- 
sical aptitude. Statistical validity was investi- 
gated by correlating against part and total scores 
teachers' estimates of instrumental talent and of 
vocal talent; and such factors as whether the 
student at the time of testing had played an in- 
strument at least a year, or whether he was a 
member of a band, orchestra, chorus, choir, or 
glee club. The reported correlations, obtained 
by the use of the Otis unlike signs method, range 
from .21 (total score with membership in 
chorus, choir, or glee club) to .78 (total score 
with teachers' estimate of vocal talent) when 
corrected for attenuation. Intercorrelation data 
for part and total scores are also reported. In the 
past, teachers’ estimates of musical talent have 
not proved to be a highly satisfactory criterion 
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of validity. And using such experiences in music 
as a criterion seems questionable because they 
could well be basically more closely related to 
achievement than to aptitude. With such limited 
material, the section on validity does not seem 
very adequate or convincing. 

Directions for administering and scoring the 
test are very explicit and thorough. The test can 
be either hand or machine scored. For purposes 
of scoring and preparing a diagnostic profile, the 
two sections for pitch are combined, as are the 
two for rhythm, thus making a total of three sub- 
test scores, viz., pitch, rhythm, and melody. Ta- 
bles of norms in deciles for the three subtests and 
the total test are given for each of grades 4-8 
and for combined grades 9-10. 

It would seem that a standardized, recorded 
version of this test might prove to be better, 
because such would obviate the possible effect 
of differences in uniformity and accuracy on the 
part of different examiners who must play the 
musical examples on the piano. 


[229] 

Seashore Measures of Musical Talents, Revised 
Edition. Grades 5-8 and adults; 1919-39; 6 scores: 
pitch, loudness, time, timbre, rhythm, tonal memory; 
IBM; Series А (^39) ; Series B discontinued; man 

(739); $12 per set of three 12-inch records (78 rpm), 
manual, and 100 record blanks; $1 per тоо record 
blanks; $2.10 per 50 IBM answer sheets; 25 per sten- 
cil for scoring answer sheets; 60€ per manual; post- 
paid; (60-70) minutes; Carl E. Seashore, Don Lewis, 
and Joseph G. Saetveit; Psychological Corporation. 
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Jonn. McLzrsn, Lecturer on Psychology, De- 
partment of Adult Education and Extra-Mural 
Studies, The University of Leeds, Leeds, Eng- 
land. 

The Seashore Measures have been subjected 
to a cross fire of criticism from both sides of the 
Atlantic for many years on the grounds that they 
are based on fallacious principles of elementar- 
ism, that they are musically meaningless and in- 
valid, Such criticism has proceeded mainly from 
those who hold what Seashore has called the 
“omnibus” theory of musical talent, as opposed 
to the idea of specific musical talents to which 
he himself adhered. 

Although Seashore himself, in his published 
work, seemed to deprecate the use of factorial 
procedures in the validation of his measures, it 
seemed worthwhile to adopt the methodology of 
one school of critics and to investigate to what 
extent, granting the assumptions of the critics 
about the nature of. musical ability, these criti- 
cisms were justified. For the purposes of this 
demonstration the original measures were used 
(114) although similar results are expected from 
the use of the revised tests (86). The paradoxical 
result of an empirical study of the Seashore 
Measures (using internal criteria of validity in 
the form of factorial and regression analysis, and 
external criteria in the form of other standard- 
ised tests of musical appreciation and ability, 
questionnaires, and nonmusical tests) was that 
neither Seashore’s views as to the specific na- 
ture of musical abilities, nor his critics’ views as 
to the atomistic and irrelevant character of his 
tests could be upheld. The curious position arose 
that it can be demonstrated that the battery of 
tests devised by Wing, who is of all Seashore’s 
critics probably the best grounded in actual em- 
pirical experience of musical tests, is for all 


practical purposes identical, so far as the factor 
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composition is concerned, with the Seashore 
tests. 

The 1939 revision meets most of the criticisms 
with regard to the actual technical problems of 
test construction, except that the test of timbre 
still fails to fill the gap for which the original 
consonance test was devised. In this respect the 
criticism of atomism would seem to be justified, 
at least to the extent that the perception and ap- 
preciation of harmony is perhaps best measured 
in a dynamic, musical context. The experimental 
demonstration that consonance-dissonance is a 
matter of context (174) and acculturation ren- 
ders the Seashore approach to timbre a little out 
of date. 

The main question regarding the Seashore 
Measures still remains: there is still the pressing 
need for a definitive study of validity of these 
revised measures, and for a comparison of the 
different tests of musical ability using the same 
population of subjects. A repetition of Stanton's 
investigation (17) of the validity of the original 
tests, but using all the resources of modern sta- 
tistical procedures to elucidate the interrela- 
tionships of significant variables is urgently 
needed. The tests as now recorded have reached 
probably the peak of technical excellence so far 
as presentation of the stimuli are concerned ; it 
is rather shocking to find that so little is known 
about the validity of the tests after so many 
years of investigation. 


Herzert D. Wine, Principal, City of Sheffield 
Training College for Teachers, Sheffield, Eng- 
land. 

DESCRIPTION. There are six tests, each of 
which occupies one side of a 12” record (78 
rpm) ; the whole series takes about an hour. The 
tests are as follows: Pitch. Two notes (pure 
tones) are sounded consecutively. The candi- 
date is asked to say whether the second is higher 
or lower than the first. Loudness. The same note 
is sounded twice. The candidate is asked to say 
whether the second is stronger or weaker than 
the first. Rhythm. Two tapped time patterns are 
sounded. The candidate is asked to state whether 
they are the same or different. Time. The same 
note is sounded twice. The candidate is asked to 
state whether the second is longer or shorter 
than the first. Timbre. Two notes are sounded 
consecutively. The candidate is asked to say 
whether they are the same or different in qual- 
ity. Memory. A number of consecutive notes 
which form no particular melodic line are 
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sounded : then follows a second playing in which 
one note is changed. The candidate is asked to 
give the number of the altered note. 

APPLICATION. The instructions are short, clear, 
and, except for the practice items, standardised. 
The answers are entered on a simple form. The 
candidates have no difficulty in following what 
is required to be done. The marking is straight- 
forward. Practice sections are taken off the 
actual test record. This procedure has the disad- 
vantage that if the examples are taken at the be- 
ginning of the record, they may help the candi- 
date to do the first few items, while if they are 
taken from a place towards the middle, there is 
some possibility of damage to the record. 

EVALUATION. As is usual with group tests, it 
is considered that a high score is a safer indica- 
tion of high ability than is a low one of low abil- 
ity, as disturbing factors may cause a candidate 
to do less well than he ought. The norms are 
not given for the total marks. The marks for 
each test are evaluated separately by converting 
into a simple scale of 1 to 10, which corresponds 
to steps of tens in percentiles. 

GENERAL COMMENTS. The reliabilities given 
for the individual tests vary between .62 and 
‚79. Validity coefficients are not given. In justice 
to the tests, it is only fair to say that validity co- 
efficients in music are notoriously difficult to 
obtain. This is for a variety of reasons, amongst 
which may be mentioned : music teachers rarely 
keep records of marks of their students as is 
done in other subjects—indeed such marks are 
very difficult to award; secondly, on the rare 
occasions when such mark lists are obtained, 
they usually refer to a small highly selected 
group of music students aud are, therefore, al- 
most certain to give a much lower coefficient 
than that for a truly representative sample of 
people. 

In some recent testing of 150 students with 
these tests, the ranking for 13 (those taking ad- 
vanced music) was obtained from the music lec- 
turers. A validity coefficient of .4 was obtained 
with the total of the Seashore tests. It should 
be borne in mind that the reliability of the tu- 
tors' judgments is not high and that the group 
is so highly selected that even this low figure can 
be taken as an indication that the test has some 
value. In general, results obtained in testing 
were certainly more satisfactory than results I 
obtained with the earlier form of the tests. 

Та finding this validity coefficient, the results 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 344 


from the total series of tests were correlated 
with the teacher's judgment. In my opinion, the 
publishers of the test would do well to give stand- 
ardised figures for the totals, and users of the 
test should place more emphasis on results from 
the total series rather than on those from the 
single tests. This is because the reliability of the 
single tests must be so much less than that of 
the total series. To take a specific example, the 
rhythm test has 30 items, with a choice of two al- 
ternatives for the answer; the candidate is re- 
quired to guess when not sure, and for the av- 
erage person there will be many occasions when 
he will guess. The possibility of gaining or los- 
ing an extra two marks over a candidate's true 
figure because of good or bad guessing is quite 
high. A chance mark of +2 would shift an aver- 
age candidate into the top third, and a chance 
mark of —2 would shift the same candidate into 
the bottom third. 

Musicians, in general, do not take kindly tó 
the tests, since they feel that the relationship 
with music is not obvious. Whether the criticism 
is justified or not, it does affect the use of the 
tests by teachers of music, who regard the tests 
with a certain amount of suspicion which it is 
difficult to remove. 


For reviews by Paul R. Farnsworth, William 
S. Larson, and James L. Mursell and a review 
of the original 1919 edition by William S. Lar- 
son, see 3:177. See also 40:1338. 
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*Wing Standardized Tests of Musical Intelli- 
gence: A Test of Musical Ability on 10 Records. 
Ages 10 and over ; 1948; 8 scores : chord analysis, pitch 
change, memory, rhythmic accent, harmony, intensity, 
phrasing, total; orm; no norms for part scores; 
manual (see 4 below) ; 147s. ($20) per set of 10 rec- 
ords (standard 78 rpm), manual, scoring key, and 50 
answer sheets; 15s. ($2) per single record; 125. 6d. 
($2) per 100 answer sheets; postpaid; (60) minutes; 
Н. D. Wing; Cecilia Wing, Sheffield City Training 
College, Sheffield, England, * 
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Јонх McLetsu, Lecturer on Psychology, De- 
partment of Adult Education and Extra-Mural 
Studies, The University of Leeds, Leeds, Eng- 
land. 

These gramophone tests of “musical intelli- 
gence” represent many years of intensive and 
valuable work, involving the empirical compari- 
son of all the feasible types of musical test re- 
corded in the literature or used by musicians on 
their pupils (z, 2, 4). The underlying theory 
takes issue with Seashore, whose tests are re- 
garded as irrelevant, atomistic and musically 
meaningless. A much closer imbrication of test 
procedure with general musical practice is ob- 
tained. As one result, the advantage is gained 
that these tests are much more acceptable to 
music teachers or to educators generally and are 
more interesting to potential musicians than are 
those of Seashore. The “omnibus” theory of 
musical ability (Seashore) on which the tests 
are based is supported by the evidence of factor 
analyses (3, 5), so that the method of obtaining 
а total score for each subject for “musical ability” 
instead of half а dozen scores for specific talents 
appears to be justified. 

Following the lead of Adler,’ the attempt is 
made to test a sufficiently wide sample of musi- 
cal talents, each in an integral musical setting, 
piano music being used in each of the seven tests. 
This use of the same type of stimulus probably 
tends to maximise the generality of the test bat- 
tery for reasons unconnected with the general 
nature of the hypothetical musical ability factor. 
The technique of using English folk melodies 
along with less familiar pieces as test content 
may have a similar tendency, although Wing 
maintains, on the basis of preliminary experi- 
ment, that the known melodies are no easier 
than the unknown (4, p. 54). 

The recording fails to come up to the high 
standard of technical excellence we have grown 
to expect of such tests. The smallness of the po- 
tential market and the expense of recording has 
led to an economy which fails to do justice to 
the tests, A certain lack of crispness in the pres- 
entation of the instructions and material im- 
poses an unnecessary strain on the listener. The 
tests, in general, require rather more concentra- 
tion and cause more fatigue than other recorded 
tests, the strain being comparable to that found 
пае Mortimer Петре, remet. Archives oi Porch ogy, 


o. 110, New York: Archives of Psychology, Columbia 
versity, 1929. Pp. 102. * (PA 14:285) 


in listening attentively to “real” music for about 
an hour (the approximate duration of the tests). 
The records are slightly smaller than other music 
tests (10 inches diameter, 19 sides), so that even 
with a practiced experimenter a disproportion- 
ate amount of testing time is wasted in changing 
records. 

The reliability of the battery as a whole is 
satisfactory; Wing reports .9r. The present 
writer, in an unpublished study with a group of 
100 adults, found a reliability of .9o (split half, 
Spearman-Brown method). The individual 
tests, however, do not show high reliability, 
ranging from .65 to .86, which is lower than those 
for the Seashore tests with the same group. This 
low reliability of the individual tests is not per- 
haps a very serious criticism, since Wing is 
interested in the total score rather than in the 
specific tests. It does seem to indicate the de- 
sirability of further item analyses, or perhaps 
of a general lengthening of the tests, since the 
low reliabilities set a limit on the interpretation 
of factors extracted from this battery. 

In the case of the 100 subjects previously re- 
ferred to, it was surprising to find that the same 
unitary ability is being measured by the Wing 
as by the Seashore tests. This factor, which ac- 
counts for between 30 and 40 per cent of the 
total test variances, is readily identified as the 
cognitive aspect of musical ability (using ex- 
ternal criteria of musical and nonmusical labora- 
tory tests and questionnaire). Although cogni- 
tive, however, it is not influenced to any con- 
siderable degree by test intelligence. Pitch 
change and musical memory, contrary to the 
published finding of Wing (3), were in this 
group, the important constituents of musical 
ability as defined factorially. This emphasises 
the degree of marked similarity between the 
Wing and Seashore batteries as far as validity 
is concerned, since exactly the same result was 
obtained with the latter as with the former. 

Although it has been demonstrated that the 
tests are not affected by musical training or op- 
portunity for hearing music, or even by practice 
on the tests themselves, a second factor which 
accounts for about ten per cent of the total test 
variance has been identified as an interest atten- 
tion factor associated with musical training and 
with age. 

The Wing battery is probably most suitable 
for surveys of populations which are known to 
be better than average in the level of musical 
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talent (e.g., students in a music school) or for 
assessing the extent of potential ability in a sub- 
ject known to be musically gifted. They do not 
seem so well adapted as the Seashore records 
for more general dragnet surveys, since they are 
probably less interesting to the person not highly 
motivated and to the musically unsophisticated. 
They lack the interest of the Seashore tests in 
the variety of stimulus and the simplification of 
the musical situation found in the latter tests. 
But in their circumventing of the prejudices of 
many professional music teachers, and their 
primary concern with the total musical situation, 
the Wing tests are a valuable addition to the 
armoury of the tester and open up many new 
research vistas. 


For related reviews, see 231. 
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*[Re Wing Standardized ДМ) of Musical In- 
telligence.] Wiwc, НЕввЕвт. Tests of Musical 
Ability and Appreciation: An Investigation Into 
the Measurement, Distribution, and Develop- 
ment of Musical Capacity. British Journal of Psy- 
chology Monograph Supplements, Vol. 8, No. 27. Lon- 
don: Cambridge University Press, 1948. Pp. vii, 88. 
Paper. 8s. 6d. * 

A.M.A. 43:220 Je-J1 48. * a most thorough 
and well documented report of research * the 
work seems to be in the right direction. It is of 
interest to note that very little difference was 
found in testing people of different races, which 
helps to dispel the illusion that some races are 
more musical than others. Of course what is 
vital is the training and approach. It is a pity 
that the test on correct tempi had to be discarded 
owing to length, and not a shorter one devised 
instead. 

ACER Test News (13):7 Jl ’50. D, S. 
A most satisfying feature in the development of 
this series of tests is the author's direct approach 
to the problem of assessing musical ability. His 
preliminary choice of test material was based on 
tests in general use among music teachers and 
examining bodies. Estimates of the musical 
ability of children and adults are far more likely 
to be obtained with material of this nature than 
with tests of the sensory-perception type, which 
seem to have neither face validity nor real va- 
lidity for this purpose. The test battery finally 
evolved by Wing is a sufficient justification of 
his attempt to measure the more intangible area 
of appreciation also, despite the fact that he has 
been seriously criticized for such an intrusion 
into the field of musical aesthetics. The musical 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 346 


quality of the passages used makes the taking of 
the tests quite a pleasant experience. A short 
practice test is followed by tests of chord analy- 
sis, pitch change, memory, rhythmic accent, har- 
mony, intensity and phrasing, all given in a 
musical fashion from a set of gramophone rec- 
ords and quite suitable for group testing, Al- 
though each item is well spaced in time and 
clearly announced, good testing conditions and 
accurate operation of the gramophone appear to 
be essential. The tests can also be given from the 
piano, but this requires some adjustment of the 
gramophone norms to meet the changed condi- 
tions. The author has been painstaking and thor- 
ough in the construction of the tests. Item analy- 
ses were based on the responses of a large 
number of children, and the standardization was 
carried out on adequate numbers at most age 
levels from 8 years to adult. A retest reliability 
coefficient of .91 is very satisfactory considering 
the nature of the material. There is no practice 
effect over twelve months, and a negligible 
amount over a week, provided a second form of 
the memory test is used on the second occasion. 
Validity coefficients, although high, need to be 
established for a greater number of cases, as 
also do the results of factor studies. The calcula- 
tion of musical quotients seems to be an unneces- 
sary refinement, as the presentation of norms as 
a broad classification into five levels for each 
age should be quite sufficient for most purposes. 
Furthermore, the test as a whole has an un- 
necessarily high ceiling, even allowing that re- 
liability is increased and guessing score is re- 
duced with the greater number of items. Sub- 
sidiary aspects arising out of the development 
of the tests such as the effect of general intelli- 
gence, interest, and environment on musica 
ability, have been thoroughly investigated. The 
inescapable conclusion that there is much waste 
of effort in the teaching of instrumental music 
because it is being expended on unmusical in- 
stead of musical children, is a cogent argument 
in favour of the selection of children for special 
musical training, for which the present series 
of tests is admirably suited. The tests undoubt- 
edly deserve a wide use by musicians, and should 
be a valuable supplement to the work of music 
examiners, whose years of experience do not 
altogether absolve them from individual “fads 
and fancies” and whose estimates may be in- 
fluenced to an unknown extent by external con- 
ditions. The monograph would also repay care 
ful study by teachers engaged in planning ОГ 
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in revising music courses in school systems. 

Brit J Ed Psychol 19:226-7 N "49. J. M. * 
differs from the many and varied contributions 
hitherto made in this field of investigation in its 
exhaustive and critical analysis of previous rele- 
vant researches and in its attempt to provide 
which tests cover the whole area of musician- 
ship. References, sometimes frequent and de- 
tailed, are made to a hundred and seventy-two 
books and papers, included in the bibliography. 
* 'The inclusion of only one rhythm test, a test 
offering for preference rhythmically accented 
апа rhythmically distorted items, is justified on 
the ground that such tests have a comparatively 
weak association with general musical ability. 
This view seems inconsistent with another of 
Dr. Wing's principles of selection, that “A test 
might be aimed at assessing an ability considered 
desirable оп general musical grounds, and yet 
might not correlate very well with the total bat- 
tery because it is dealing with some capacity 
which is more or less independent of the general 
factor." If the necessarily restricted purpose of 
the tests is accepted, however, it seems reason- 
able to accept Dr. Wing's claim that "no vital 
test is missing from the short series." 

Brit J Psychol, Gen Sect 39:112-3 D ^48. 
C. B(urt). Dr. Wing's chief aim in his various 
researches has been to construct satisfactory 
tests for musical ability and appreciation. But 
he has also used his tests to throw light on vari- 
ous subsidiary problems—the general nature of 
musical ability, its distribution in the general 
population, its development with age, and the 
effect of environmental conditions, particularly 
at school and at home. * He begins with a brief 
critical account of earlier efforts, both in this 
country and abroad. * the procedures drawn up 
by psychologists—for example, Seashore's tests 
for measuring "musical talent"—were based 
principally on qualities which musicians would 
regard as of little importance, and ignored many 
of the qualities which musicians would regard as 
highly desirable. What Dr. Wing calls “һе 
atomistic type of psychological test" was in the 
main concerned with the more elementary as- 
pects of musical hearing, such as the capacity to 
discriminate pitch or loudness, which could 
readily be measured by laboratory methods ; and 
it assumed, without any adequate evidence, that 
these simple devices would be diagnostic of a 
general musical capacity. Before any satisfac- 
tory scheme of tests could be attempted, there- 
fore, it was essential first of all to discover 
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whether in point of fact there was any general 
capacity underlying the different aspects of 
musical appreciation or musical training. Here 
the natural approach was to carry out a factorial 
analysis of the apparent manifestations of musi- 
cal skill; and, as a matter of fact, a group factor 
for music proved to be one of the earliest and 
easiest to demonstrate. Although the existence 
of such a factor was at first contested (more par- 
ticularly by Spearman in his Abilities of Men, 
р. 340), nevertheless later researches, among 
which those of Dr. Wing himself have been the 
most convincing, have fully confirmed its pres- 
ence. It is, as Dr. Wing points out, a complex 
and by no means an elementary capacity ; never- 
theless, it appears to rest largely on innate en- 
dowment, and may manifest itself at an ex- 
tremely early age. The next step was to discover 
what kinds of mental process showed the high- 
est correlations with this general capacity and 
with independent or external assessments, Here 
again a factorial approach furnished the most 
conclusive evidence. Nearly all the factorial in- 
vestigations agreed in indicating that, as in test- 
ing general intelligence, so in testing musical 
ability, the more complex processes would gen- 
erally provide better tests than the more ele- 
mentary. The low correlations reported by the 
earlier investigators were due to the fact that 
most of them had relied solely on elementary 
tests of a laboratory type. Dr. Wing quotes strik- 
ing examples to show how boys with excellent 
musical ability may at times obtain exceedingly 
low scores with the Seashore tests, while others 
who were excellent musicians might be “almost, 
bottom in the Seashore pitch test.” On the basis 
of his preliminary review he lays down eleven 
essential requirements, which a “scientific set of 
psychological tests acceptable both to psycholo- 
gists and to musicians and teachers of music” 
should satisfy. His own series comprises 26 tests 
of “musical intelligence"—a phrase which he 
adopts to include both musical “ability” and 
musical “appreciation” in the stricter sense of 
those terms. The majority are given by specially 
prepared gramophone records ; others by means 
of the piano only ; and the whole concludes with 
a questionnaire on general musical interests, 
opportunities for hearing music at home, musi- 
cal training, and the like. * Dr. Wing’s mono- 
graph will prove highly suggestive both to the 
theoretical psychologist and to the practical in- 
vestigator. One point he has rightly stressed 
throughout his writings, namely, that here, as 
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in so many branches of applied psychology, there 
isa pressing need for close co-operation between 
the psychological expert on the one hand and 
the subject-expert on the other. Unlike most 
other investigators, Dr. Wing is himself at once 
a musician and a psychologist; and, moreover, 
he has successfully secured the co-operation of 
specialists on each aspect of his work. As a re- 
sult the tests which he has selected and stand- 
ardized so carefully will be of the greatest assist- 
ance to those concerned with educational and 
vocational guidance and especially to teachers 
and examiners in the field of music itself. 
Occupational Psychol 22:162 Jl "48. P. E. 
V (ernon). In the 1941 British Journal of Psy- 
chology, Dr. Wing published a short account of 
the series of tests he had developed for meas- 
uring musical ability, and of his method of ex- 
pressing the results in terms of Musical Ages 
(analogous to Mental Ages), together with a 
factorial analysis. Psychologists realized that 
here at last was a reliable battery which really 
involved musical perception and appreciation, 
instead of sensory capacities. But they have had 
to wait till after the war for a full description of 
the 26 tests from which the final set was selected, 
and of the investigations involving nearly five 
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thousand children and adults. Particularly in- 
teresting is the way in which the initial ideas 
for the tests were derived from the unstandard- 
ized aural tests which music teachers themselves 
use. This assures what previous batteries have 
usually lacked—acceptability to musicians. At 
the same time the factor analysis and other cor- 
relational data will assure psychologists of the 
representativeness and validity of the set of 7 
tests, Reliability coefficients for this one-hour 
battery average .95 over I year, and .86 over 
2 to 4 years. Perhaps the most striking evidence 
of validity is a study of 333 boys aged 14 to' 
16 who had at some time studied, or who still 
were studying, musical instruments. When 
classified into Above Average, Average and Be- 
low Average groups on the tests, it was found 
that 40 per cent of the Below Average group 
had given up studying, and 27 per cent of the 
Average group, but only 2 per cent of the Above 
Average group had lapsed. The monograph also 
contains a useful review of tests devised by 
Continental and American as well as British au- 
thors, together with some interesting evidence 
on the extent to which musical ability is affected 
by music in the home and other environmental 
influences. * 


FOREIGN LANGUAGES 


Reviews sy Nelson Brooks, Donald С. Burns, Harold B. Dunkel, Konrad Gries, Elton Hocking, 
Walter V. Kaulfers, W. C. Kvaraceus, William B. Michael, Herbert Schueler, James B. Tharp, 
Mary E. Turnbull, and Clarence E. Turner. 
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Foreign Language Prognosis Test. Grades 8-9; 
1930; 2 forms (listed below) ; author recommends use 
of both forms; norms ['30]; $9.90 per 100; 356 per 
Specimen set; postpaid; 44(60) minutes; Percival M. 
Symonds; Bureau of Publications, Teachers College, 
Columbia University. * 
4) FORM A. 5 scores: English inflection, word transla- 
tion (English to Esperanto), sentence translation (Es- 
peranto to English), related words, total. 
» нам B. 5 scores: word translation (Esperanto to 

nglish), 
(English to Esperanto), formation of parts of speech 
in English, total. 

REFERENCES 
1-6. See 40:1340. 


WirLriAM B. MICHAEL, Director, The Testing 


Bureau, University of Southern California, Los 
Angeles, California. 


artificial language, sentence translation | 


Designed to predict the degree of success that 
pupils in the eighth or ninth grade will realize 
in the study of a foreign language, this test con- 
sists of two forms of somewhat different content. 
The manual states that either form may be used 
successfully but recommends that both forms be 
employed whenever feasible. : 

Although the test appears to possess consid- 
erable face validity, the rather confident state- 
ments expressed in the manual regarding 1ts 
guidance value in the selection of students who 
should pursue foreign language courses and in 
the formation of groups of homogeneous ability 
do not seem to be entirely justified with respect 
to the evidence presented concerning its relia- 
bility and validity. Estimates of the coefficient 
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of reliability ranging between .73 and .78 are 
expressed in terms of the correlation between 
total scores on Form А and Form B (which do 
not begin to approximate equivalent forms) for 
four schools concerning which no explanation 
is given regarding size, location, or scope of cur- 
riculum. No reliability coefficients are reported 
for part scores, although it may be expected that 
they would be rather low in view of the rela- 
tively small numbers of items in several of the 
parts. 

As to validity, average coefficients of .60 and 
.61 are reported between total scores on one or 
the other of the two forms and achievement test 
scores of classes studying French, Spanish, or 
Latin. For four schools the coefficient was .71 
when both forms were used. Since descriptive 
information regarding size of samples, type or 
name of achievement test, and grade placement 
of pupils is lacking, it is difficult to evaluate the 
genuine worth of these coefficients. No inter- 
correlations among part scores are presented. No 
mention is made as to whether an item analysis 
was undertaken. In fairness, it should be men- 
tioned that two references are listed in a foot- 
note that yield further information regarding 
previous work in validating the test. Neverthe- 
less, more data should be available in the manual 
than has been presented. 

Although the manual states that no norms are 
available, a supplementary sheet is included on 
which the decile equivalents for part scores and 
for total scores are furnished for 261 and 120 
pupils who took Form A and Form B, respec- 
tively. The groups are not identified except that 
they were tested in September and that they 
intended to take Latin. It is interesting to note 
that, as would be expected, the range of scores 
is exceedingly narrow on the subtests. Similar 
normative data are presented for 45 pupils in- 
tending to take Spanish ; again there is no further 
identification of the group. 

In addition, one other potential point of weak- 
ness should be cited. In view of the extremely 
short time limits of many parts, there is a sub- 
stantial risk that sources of extraneous variance 
in test scores may be introduced. Since the com- 
pletion of a large number of items in a short 
time would be likely if one had a good rote 
memory, many examinees high in other abili- 
ties related to success in language might be un- 
fairly penalized. For example, the individual 
who possesses the superior reasoning and verbal 
comprehension abilities required in mastering 
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the complex interrelationships of language struc- 
ture and meaning may not have sufficient time 
to put his ability into practice and to demon- 
strate his potential superiority over the less ca- 
pable individual whose highly developed memory 
places him at an advantage in answering many 
of the relatively low difficulty level items. 

Although this test might prove to be a worth- 
while prognostic instrument, it is difficult for 
one to recommend its widespread use in light of 
the meager data presented in the manual соп-, 
cerning reliability, validity, and norms. It would 
appear that one could make at least as depend- 
able a prediction of success in foreign language 
study from several of the well known standard- 
ized tests that yield measures of verbal intelli- 
gence as one could from this test. 


For a review by Walter V. Kaulfers, see 40: 
1340. 


ENGLISH 
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xEnglish Examination for Foreign Students: 
Including a Test of Non-Verbal Reasoning. Ap- 
plicants from non-English language countries for ad- 
mission to American colleges; 1947; 7 scores (listed 
below) ; 3 parts; $3.50 per set, 10-19 sets of test and 
answer sheets; $3 per set, 20-99; $2.50 per set, 100- 
299; $2 per set, 300 or more; one set of phonograph 
records (78 rpm) provided with orders of 10-50 tests ; 
two sets with orders of 51-100; two sets, plus one set 
for each multiple of 50 beyond roo, with orders of 
more than 100 tests; booklet of sample questions free; 
replacement answer sheets not supplied; postpaid; 270 
(305) minutes in two sessions; Educational Testing 
Service. * 3 A 
a) BOOK I. 4 scores: reading comprehension, aural 
comprehension, pronunciation, total; separate answer 
sheets must be used; 140(155) minutes. 
b) воок 2, ENGLISH COMPOSITION. Forms A, B; no 
norms; 40(45) minutes. — . 
c) воок 3. 2 scores: scientific vocabulary, nonverbal 
reasoning; separate answer sheets must be used; 90 


(105) minutes. 


[234] 
*English Language Test for Foreign Students. 
Applicants from non-English language countries for 
admission into American colleges; 1951; Form A; 25¢ 
per test; separate answer sheets must be used ; sé per 
answer sheet; $1 per set of scoring stencils; $1.30 per 
specimen set; postage extra; nontimed ) min- 


` utes; Robert Lado; George Wahr Publishing Co. * 


REFERENCES 
Lavo, Ronert. “Phonemics and Pronunciation Tests.” 


1. 
dL Ј 35:531-42 N ’s1. * 

Med TADS: [REUS TA Practical English Language Test for 

Foreign Students." News B Inst Int d 26:16-7 Ар 1 'sr. 


CLARENCE E. Turner, Professor of Romance 
Languages, Rutgers University, New Bruns- 
wick, New Jersey. 
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Much careful linguistic analysis and testing 
ingenuity have gone into the development of 
this test. It promises real usefulness in the ad- 
vance evaluation of a foreign student's linguis- 
tic fitness to pursue college or university study 
in the United States. Its practical advantages 
are numerous and important, for it can be ad- 
ministered and scored by personnel without 
special training, if need be in a testing center 
abroad; it is simple, clear, and objective; and it 
requires no special equipment. 

The mastery of an acquired language is seldom 
easy. English, for all its comparatively simple 
inflections, is a peculiarly difficult tongue for 
the foreigner. For that matter, what native 
speaker of it never mispronounces a word or 
flies to the dictionary to fill a gap in his vocabu- 
lary? The specific difficulties of English for most 
foreigners lie in the vast extent of its ever grow- 
ing vocabulary, in the problems relative to the 
production and aural comprehension of its 
sounds, and in its oddly capricious relation- 
ship of pronunciation to spelling. (“What, are 
there no rules!” was the perceptive remark of 
a highly literate Italian gentleman on hearing 
English read aloud for the first time.) 

The test at hand meets head on these very 
difficulties. All its 134 items test vocabulary to 
some extent, although careful control keeps this 
from becoming the major factor in Parts I and 
П. Pronunciation is quite rightly awarded the 
largest number of items (54). This part of the 
test is the most novel and has been discussed 
elsewhere (т) by the author. The student’s oral- 
aural control of English is tested through his 
ability to distinguish and match (however 
spelled) English phonemes in situations like 
those where they are typically distinctive in 
English. (For example, sink becomes a differ- 
ent word, or no word at all, if we remove the 
[s]-phoneme or replace it with various others, 
so [s] is distinctive in this situation.) Each item 
has only three parts, yet offers five possible an- 
swers, since all three may contain matching 
phonemes, or none, or any combination of two. 
A random example, Item 34, will illustrate the 


principle. Many languages lack the [1] -phoneme,. 


as in the English word bit, yet this sound is dis- 
tinctive in this situation, since bet, bat, bait, and 
so on have meanings different from bit. The stu- 
dent is asked to match identical sounds in: (1) 
We had to b--ld a fence. (2) It keeps the 
r--ndeer out. (3) Last year they k-cked and 
broke many little trees. To give the correct 


answer, I and 3, the student must pronounce 
correctly to himself the three key words, avoid 
the distractor offered by the queer spelling ui 
in build, and distinguish the [1] of kicked and 
build from the [e] of reindeer, which is the 
sound most likely to be confused with [1] by 
most foreigners. Specific traps are laid for na- 
tives of various languages: the unwary Oriental 
may match —ibrary with —ead books; the Span- 
ish speaker may match Ri-er and La-or; the 
Italian will tend to fail to match La-or with 
ra—- —its, and so on. The reduction of unstressed 
vowels to [1] and [9], a process so characteristic 
of English, receives due attention. To deal cor- 
rectly with the groups p-—ple, t- —m won, to 
s-— the game and -imple problem, —ure, ea-y 
the student must have grasped the point that in 
English the same sound may be spelled in many 
ways, and the same spelling pronounced in many 
ways. 

It is not necessary to discuss Part I (Struc- 
ture) in detail. Here as in the more novel Part 
II, the items have been chosen by sound criteria. 
Can the student sense interrogation from word 
order? Can he trace pronouns to their antece- 
dents and subjects to their verbs? Can he tell to 
what action did so refers? Here, as consistently 
throughout the test, the emphasis is upon those 
features of English which typically have the 
function of determining meaning. % 

Part III is more debatable. The sampling 
problem in measuring the English vocabulary 
is a staggering one. One clear and commendable 
trend in the present test may be mentioned. The 
author has included no fewer than то items (out 
of 40) which involve “two-word verbs," those 
tricky combinations so puzzling even to the 
foreigner who knows their elements, as in ran 
out of wood, got over his headaches. In other 
words, idiom is recognized as an aspect of vo- 
cabulary, a point that teachers of language often 
fail to discern. 

In so carefully constructed a test, one feels 
that there must be a reason even for practices 
that appear illogical. There is a puzzling ОП 
in Part III. The majority of the items involve 
detecting one or more words which may be di- 
rectly substituted for an underscored word, as 
started in “My classes began yesterday.” One 
is therefore ill-prepared to match “My watch 
has a luminous dial" with shines in the dark, 
where the relationship, though real, is entirely 
different. Surely, too, some logical Frenchman 
will fail to equate dog with an animal. The eX 
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amples preceding the test items have, of course, 
prepared the student for some peculiar match- 
ings, for in Example S we are told that with 
wings, usually flies and with feathers “ате cor- 
rect about bird" in the sentence, -“That is а 
beautiful bird.” This would be difficult to formu- 
late either in English or in logic. If anything is 
gained by this sort of mental gear shifting, let 
us tolerate it, for the test as a whole is an un- 
usually sound and practical one. 
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xTest of Aural Comprehension in English as a 
Foreign Language. Applicants from non-English 
language countries for admission into American col- 
leges; 1946; Forms A, B, C; 10¢ per test; separate 
answer sheets must be used; 2¢ per answer sheet; 25¢ 
for examiner's materials; 50¢ per scoring stencil; $1 
per specimen set; postpaid; nontimed (30-45) min- 
utes; Robert Lado; English Language Institute, Uni- 
versity of Michigan. * 
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X*Cohen French Test. 1-3 years secondary school; 
1945-49; 4 parts; Forms А [45], B [45]; manual 
[49] ; 75. 9d. per 10; 2s. per 10 of any one part; 6d. per 
scoring key; 2s. 6d. per manual; 3s. 10d. per specimen 
set; cash orders postpaid within Australia; S. W. 
Cohen; Australian Council for Educational Research. * 
а) PART I, VOCABULARY. 20(30) minutes. 
b) PART 2, SILENT READING. 25(35) minutes. 
C) PART 3, GRAMMAR. 25(35) minutes. y 
d) PART 4, AURAL COMPREHENSION. (25) minutes. 
Mary E. Товмвоіл, Formerly Head of 
Test Production, Educational Testing Service, 
Princeton, New Jersey. 

Part 1, Vocabulary, gives 75 short French 
sentences, a portion of each of which appears 
in boldface; five suggested English translations 
for the boldface expressions follow each sen- 
tence. A good sampling of easy vocabulary in- 
cludes several parts of speech, although nouns 
and verbs are most numerous. The part would 
be more effective if it were well edited to re- 
move the careless errors, repetitions, and ex- 
traneous clues to answers with which it is re- 
plete. For example, in Form A, Item 46 asks 
for the English equivalent of “craindre” and 

P › 
Item 54 for the translation of “crainte.” “Early 
appears as a choice in Items 6 and 8 of Form A 
and there are several further instances of this 
same fault. In other items (e.g., Items 20 and 40 
of Form A) the fact that the key word is a noun 
is clearly indicated by the context (it follows 
"un"); yet some of the choices given are not 
nouns and so can be eliminated on the basis of 
this extraneous clue. The same carelessness has 
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crept into the proofreading : some boldface words 
are not translated and some choices are mis- 
numbered. 

Part 2, Silent Reading, consists of eight short 
paragraphs with four questions on each. The 
paragraphs, in general, appear in order of dif- 
ficulty (except G and H of Form A) and they 
present rather interesting and complete situa- 
tions. More questions and more searching ones 
could have been provided for some of the meat- 
ier paragraphs, and the difficulty of the ques- 
tions and the passages eould have been better 
matched. This part also shows a distressing lack 
of proofreading (e.g. Item 5 of Form В men- 
tions “women” when “woman” was intended). 

Part 3, Grammar, covers many of the basic 
points in French grammar by suggesting five 
French expressions or endings to complete each 
of the given French translations of 65 English 
phrases or sentences. This part occasionally be- 
comes a vocabulary test (cf. Item 51 of Form 
A and Item 35 of Form B) and it concentrates 
too much on the same words, such as "finir," 
"venir," "nager," and “lire.” On the other hand, 
some items, like Item 23 of Form A, test several 
points at once. There is great diversity in the 
difficulty of consecutive items. Misprints again 
occur throughout and unfortunately there is an 
error (duplication of choices) even in sample 
question 5. 

Part 4, Aural Comprehension, consists of 50 
items, each of which has five English words 
from which the correct answer can be selected. 
Here again a lack of variety lessens the potential 
usefulness of the test (Items 4 and 20, 7 and 
29 of Form A, and Items 11 and 24 of Form 
B). This part is also weakened by poor choices : 
for example, Items 5, I5, and 16 of Form A 
each have one clue word which, if understood 
by the examinee, makes only one given choice 
possible; several items require the comprehen- 
sion of only one word of the question (usually 
near the beginning of the question) to limit the 
possible choices to one or two (cf. Form B, 
Items 19 and 32). Again misspellings occur in 
the French questions (Item 33 of Form A and 
Item 5 of Form B). 

Space for the answers is provided along the 
right margin of each test page; for each form 
there are neatly printed and easy to use answer 
keys. The directions for all parts seem clear and 
include useful examples. Adequate time limits 
are allowed for each part. . 

The manual gives short descriptions of the 
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parts, suggests possible uses, and provides tables 
of norms and a concise statistical appendix. 
Separate scale scores are provided for pupils in 
each year and in each of the six Australian 
states. The scale scores are obtained by reading 
from a norms table (the reverse of usual prac- 
tice in this country) which is identical for Forms 
A and B. This identity implies the direct equiva- 
lence of raw scores on the two forms, but no evi- 
dence of such equivalence (which seems too 
good to be true) is presented. There are no data 
concerning validity. Reliability coefficients for 
the several parts, determined for samples of 
more than 100 cases, range in the .70’s and .80’s. 

The Cohen French Test is an interesting at- 
tempt to test four of the basic elements of achieve- 
ment in elementary French. It could be useful 
to both teachers and pupils in classroom situa- 
tions where an easy, unspeeded measure is de- 
sired. It would be much improved by careful 
editing which would eliminate repetitions and 
the numerous proofreading errors. 
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*College Entrance Examination Board Achieve- 
ment Test in French Reading. Candidates for col- 
lege entrance; 1901-51; available only in College En- 
trance Examination Board Admissions Testing Pro- 
gram (see 526) ; 60(70) minutes; prepared by College 
Entrance Examination Board Committee of min- 
ers in French Reading in cooperation with the Staff of 
Educational Testing Service: r951-52 membership: 
H. Linn Edsall (Chairman), Anne Marie de Com- 
maille, Henry Dupont, Donald M. Frame, and Eleanor 
L. Michel; program administered by Educational Test- 
ing pu for the College Entrance Examination 
oard. 
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[Review of Forms VAC2, WAC2, XAC2, and 
YAC.] 

With a consistent reliability of .97 and con- 
tent graded on the basis of frequency counts of 
French vocabulary, idioms, and syntax, these 
strictly objective, machine scorable, one-hour 
multiple choice tests are probably as scientifically 
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efficient per unit of testing time as any stand- 
ardized tests in any field of subject matter. A 
careful examination of the methods used in 
validating and scaling the individual test items 
shows strict conformity to the best in current 
statistical theory and practice. Similarly, in ty- 
pography and use of standard idiomatic French, 
the tests are technically about all that can be 
desired. 

The several types of five-item multiple choice 
statements employed in the different parts of 
the examination are generally well suited to 
the purpose, with the possible exception of the 
stress that the subtests place on the recognition 
of grammatical forms in context. A strategic 
phrasing of items here should eliminate the 
necessity of supplying English translations in a 
test of reading ability. Moreover, command of 
word order, which is often as important as 
grammatical form per se, is not measured by 
this syntax test, even though simple devices for 
measuring it objectively have been in use for 
several years. Fortunately, all these matters are 
of relatively minor importance. # 

Despite their generally excellent qualities, 
however, the examinations are somewhat con- 
fused as to their ultimate purpose. First, none 
of the “French Reading” examinations is strictly 
a reading test. All contain substantial sections 
devoted exclusively to vocabulary and also a 
separate section on the ability to select the most 
appropriate grammatical form from five tempt- 
ing choices. Since vocabulary is inseparably re- 
lated to reading comprehension as well as to 
usage, its separate measurement in a test to 
which reading comprehension scores are added 
to vocabulary scores means measuring the same 
ability at least twice—once directly and again 
indirectly—with a consequent confusion of the 
true meaning of the total score. It is doubtful if 
a separate section on vocabulary or recognition 
of grammatical forms as such is ever justifie 
on an achievement test. The function of vocabu- 
lary tests and special form recognition tests 15 
primarily diagnostic; for example, they 4! 
teachers in locating specific reasons for а pat 
ticular student’s inability to use a language " 
synthesis as a means of communication in reat- 
ing, writing, or speaking. Any process of adding 
scores obtained from subtests of closely inter- 
related or overlapping abilities leads to 4 pi 
score whose significance can be established only 
by arbitrary academic fiat, with a gradual wid- 
ening rather than narrowing of the schism be- 
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tween the real world and a strictly intramural 
school world. If pressures from college French 
departments do not permit the dropping of the 
separate vocabulary and grammar sections, the 
name of the examination should at least be 
changed to suit its present content. The term 
“Reading Test" is definitely a misnomer. 

Secondly, the French reading tests are too 
narrow in scope to be considered achievement 
tests for high school students of French. A can- 
didate who, through special effort on the part 
of himself and of a skillful teacher, has attained 
a commendable ability to understand spoken 
French and some ability to speak it would not 
receive even one point of credit on the entire 
examination. Like most language examinations, 
it does not distinguish between normal people 
and literate deaf-mutes. Moreover, a student who 
has attained some knowledge of French life 
and culture in keeping with the widely accepted 
social-cultural objectives of high school teaching 
would have no opportunity at all to display his 
achievement. 

The undeniable fact that courses of study vary 
too much from one high school to another to 
permit the construction of an achievement ex- 
amination that will be equally fair to all makes 
the term “achievement” test a misnomer. Both 
the specialized nature of the examination and 
the use to which it is put indicate that it is pri- 
marily a college placement examination to assist 
institutions of higher learning in placing people 
in courses for which the examination shows them 
to be best fitted in terms of such local offerings 
as are available. Even here, the examination 
would, by virtue of its content, limit its present 
placement value to reading courses stressing 
belles lettres and conducted primarily in English. 
With minor exceptions, the items test only 
French of the narrative or descriptive type, OF 
conversational passages of the kind found in 
novels, Examples of expository writing, edi- 
torial writing, news reporting, etc., are tested 
indirectly, if at all. Moreover, since the exami- 
nation does not include a measure of ability to 
understand spoken French, it does not guarantee 
satisfactory placement in courses featuring lec- 
tures or discussions in the foreign language. At 
best, it provides only a convenient indication 
of a student’s ability to read the materials as- 
signed for silent reading in literature courses 
where these are graded in difficulty. 

Within the limitations noted above, however, 
the tests are as objective, reliable, and scien- 
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tifically valid as modern statistical theory and 
practice allow. By converting raw scores into 
College Entrance Examination Board standard 
ratings, student performance is equated to rule 
out the possible “halo” effect of having taken 
the test with a group of relatively mediocre 
candidates or, conversely, the penalty of having 
had to compete against relatively superior can- 
didates. The new standard ratings also enable 
college admissions officers to compare scores 
directly, language to language. The average test 
score of a normal, representative group who 
have studied a language for two years, for ex- 
ample, is now 460 in German, Spanish, French, 
and Latin. For three years, it is 520, and for 
four years, 580. Each year of training beyond 
two thus yields, on the average, a 60-point rise 
in total score. The convenience of this arrange- 
ment cannot be overestimated in evaluating stu- 
dent performance for placement purposes, even 
though the ultimate validity of the examina- 
tions, either as measures of achievement or as 
placement tests, is limited by the considerations 
cited in preceding paragraphs. 

In summary, the College Board French read- 
ing tests are superior to most standardized for- 
eign language tests in scientific excellence as 
regards test construction and authoritative lin- 
guistic content. Their limitations arise from con- 
fusion of achievement testing and placement 
testing, both of which cannot be served effec- 
tively at the same time in present circumstances 
of variations in high school and in college 
courses of study. For this reason, the College 
Board should concentrate its time, efforts, and 
resources on what it can hope to do with some 
promise of success and give up what it can only 
pretend to do with respect to the evaluation 
of work in high school. Specifically, this means 
abandoning the achievement testing objective (as 
well as the use of the term achievement in con- 
nection with its tests) and concentrating exclu- 
sively on the improvement of the college place- 
ment and guidance function of the examinations. 
Such a revision might well include an objective 
test of ability to understand spoken French as 
an aid in guiding students with respect to enroll- 
ment in intermediate and advanced courses 
where lectures and discussions are likely to be 
in the foreign language. Recent improvements 
in the way of recording machines, nonbreakable 
long playing records, and portable record play- 
ers with tone as well as volume controls (and 
multiple loud speakers to compensate for de- 
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fective acoustics) remove nearly all the hazards 
that once beset the administration of a brief but 
thoroughly standardized, objective examination 
of aural comprehension. 

A report to the college or university in the 
form of a profile chart showing a particular 
student's relative standing (e.g. percentile 
rank) on tests of ability to distinguish between 
correct and incorrect French usage, to under- 
stand spoken French, and to comprehend the 
printed page would be a more valid and discrim- 
inating indication of a student's fitness for work 
in a particular course than the present omnibus 
total score. Although a separate test of vocabu- 
lary may be included as a concession to institu- 
tions desiring a separate score for diagnostic or 
special guidance purposes, it should be evident 
that a good test of reading comprehension and 
aural comprehension will inevitably co-measure 
vocabulary in about the only context, apart from 
ability to speak the language, where its existence 
has any practical importance. 

Inasmuch as implementation of the foregoing 
recommendations implies a change in the policy 
of the College Entrance Examination Board be- 
yond the control of its able staff of testing ex- 
perts, it is to be hoped that its officers will re- 
consider the functions of its language examina- 
tions. If a change in policy is not feasible in the 
immediate future, the present examination 
should be renamed College Entrance Examina- 
tion in French, since it obviously leaves out too 
many worthwhile "achievements" to be an 
achievement test and is far too much interested 
in pure vocabulary and grammatical form to be 
just a "French Reading Test." With appropri- 
ate changes, as. indicated, to conform to funda- 
mental changes in College Board policy, the 
examinations might appropriately be called Col- 
lege Placement Examination in French with less 
invitation to criticism and considerable benefit 
to institutions of higher learning as a solution to 
the transfer problem in college foreign language. 


E [238] 

*Cooperative French Test: Lower and Higher 
Levels. 3-6 semesters high school or 1-2 semesters 
college, more than 2 years high school or more than 
I year college; 1942-51; 4 scores: comprehension, 
grammar, civilization, total; IBM; 2 levels; Forms S 
(Lower Level, '49—same as test copyrighted in 1942; 
Higher Level, '42), X ('49 or ’50—same as test copy- 
righted in 1947) ; no data on validity; no specific man- 
ual; no college norms; general Cooperative manual 
C51); high school norms ['50]; $2.50 per 25 of any 
one level; 50 per specimen set of any one level, post- 
e separate answer sheets may be used; per 25 
BM answer sheets; 30€ per set of stencils for scoring 
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answer sheets; cash orders postpaid; 80(85) minutes; 
Geraldine Spaulding, Laura Towne, and Sarah Wolí- 
son Lorge; Cooperative Test Division, Educational 
Testing Service. * 
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C rative Tests in French, Latin, and Secondary School 
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9-86. (РА 2 309 In 1948 Achievement Testing Program m 
TES edant “chooks and Supplementary Studies. Educational 
Records Bulletin, No. so, New York: Educational Records 
Bureau, July 1948. Pp. xiii, 86, Paper, lithotyped. * 


Erron Hocxiwo, Professor of Modern Lan- 
guages and Head of the Department, Purdue 
University, Lafayette, Indiana. 

Unlike the earlier Elementary and Advanced 
Forms of this test, which required only 40 min- 
utes of working time, the Lower and Higher 
Levels require 80 minutes of working time dis- 
tributed among three parts as follows: I, Com- 
prehension : Section 1 (50 vocabulary items with 
choices in English, and то all-French sentence 
items), 15 minutes, Section 2 (eight to ten pas- 
sages of French prose, usually narrative, on 
which are based 45 all-French items), 25 min- 
utes; II, Grammar (50 items combining Eng- 
lish and French), 25 minutes ; and IIT, Civiliza- 
tion (50 questions of fact, entirely in English), 
IS minutes. The Comprehension sections for 
Form S are available separately as the 40-minute 
Cooperative French Comprehension Test, 
Lower and Higher Levels, but the cost is ex- 
cessive: $2.25 for 25 booklets and $.80 for 25 
answer sheets, as compared with $2.50 and $.90 
for the 80-minute test. 

The advantage of the newer test, in both forms 
and at both levels, is clearly in the expanded 
treatment of Comprehension (Part I) : the num- 
ber of separate vocabulary items is increased 
somewhat ; the ten sentence items are new; and 
the eight to ten prose passages offer considerably 
more reading material than is found in the older 
forms. The vocabulary words are well chosen in 
terms of both range and level of difficulty, and 
the reading passages are carefully graduated in 
difficulty. The content of the reading passages 7 
the Lower Level tends to be over-sentimental, 
but to criticize them for this would be captious. 
The items throughout this part of the test reflect 
considerable professional skill in their selection 
and construction. 

The items in Part II, Grammar, suffer from 
a fault that has long been chronic with the gram- 
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mar items of the Cooperative French Test 
(Form R excepted). A large proportion of these 
items require only the selection of words to fill 
in blanks in French translations of English sen- 
tences, and some require only the selection of 
word endings, as, for example, in this item 
(Form X, Lower Level, Item 30) : 
There were two blue hats in the shop window. 


Il y avait deux chapeaux bleu- ( 
dans la vitrine, 


30-1 -s 
30-2  -x D 
30-3  -es 
30-4 -se 


No ending needed 


Such butchery seems unjustified. No space is 
saved on the page; there is always space availa- 
ble for complete words, and usually for complete 
sentences. For the grammar items a much larger 
percentage of complete sentence choices should 
be used if the test is to be fair to students who 
learn by auditory and visual imagery rather than 
by matching French words with English words. 
In this part of the test also, there are dupli- 
cations of constructions, and sometimes even of 
the same word in the same construction (Form 
X, Lower Level, Items 3 and 8, 9 and 34, 10 and 
31). However, since the constructions which 
are duplicated are those generally stressed in 
conventional grammar books, the student's score 
here should be a good index of his ability to 
translate textbook "sentences" into French. 
Part III, Civilization, is an innovation in the 
Lower and Higher Levels. In 15 minutes the 
student is to answer 5o items covering several 
different aspects of French culture. This attempt 
to test the student's knowledge of French cul- 
ture is highly questionable. There is no quarrel 
with the concept of culture as an important aim 
of the course, and there is no doubt that a mature 
and cultivated person would know the answers, 
incidentally, as a by-product of his general cul- 
tural background and reading. However, un- 
cultivated young students do not know the an- 
swers incidentally or any other way. When once 
the nature of this part of the test becomes known 
locally (one administration is enough, for the 
pattern is painfully obvious), students will feel 
impelled to cram the answers by conning guide- 
books and outline manuals. They can easily do 
so, for the repertory of conventional landmarks 
in history, geography, and the arts is limited. The 
situation here is very different from that of 
vocabulary, for instance, where there are thou- 
sands of valid items to be sampled, and where 


recognition of each item is an end in itself. But 
civilization is not quantitative, nor is it assessed 
by identifying Hugo as a writer rather than a 
painter or a musician, or Debussy as a com- 
poser rather than a painter. No, this part of 
the test is not valid; and, much worse, the in- 
clusion of it may encourage students to mistake 
the guidebook for the tour, the menu for the 
meal, the cliché for the experience. Civilization 
is traduced in the name of civilization. 

Part III of this test is thus self-defeating. As 
for Part II, it has long been known that the 
scores on grammar tests like this one do not 
correlate highly with functional skills in a for- 
eign language; hence, this part is not likely to 
prove particularly useful. On the other hand, 
Part I seems to be the most thorough and reli- 
able test of reading ability in French yet,to be 
published. This part is highly recommended. 
Teachers who are primarily interested in meas- 
uring skill in reading may prefer to use the Com- 
prehension test alone rather than the complete 
test. 


For reviews by John Н. Meyer and Roland 
Vinette of Form S, see 3:182; for reviews by 
Joseph F. Jackson and Clarence E. Turner of 
Part 1, Form S, see 3:180. 
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XFrench I and II: Achievement Examinations 
for Secondary Schools. 1 or 2 years high school; 
1951; orally administered in part; 1 form; no data on 
reliability and validity; no manual; Minnesota norms 
(median and quartile deviation) available; similar 
norms for other regions by special arrangement with 
publisher; 7¢ per test, postage extra; 60(65) minutes; 
edited by Walter W. Cook; prepared by a curriculum 
committee of high school teachers for use in the Min- 
nesota State Board Achievement Examinations Pro- 
gram; Educational Test Bureau, Educational Pub- 
lishers, Inc. * 


Erron Hocxrne, Professor of Modern Lan- 
guages and Head о} the Department, Purdue 
University, Lafayette, Indiana. 

This test is not recommended for use in any 
circumstances, It is faulty in construction, in 
French, in English, and even in layout, typogra- 
phy and spelling. 

The heading calls for the names of both par- 
ents to be written in a space of two inches ; asks 
the pupil to fill in a blank cryptically headed 
“time used"; admonishes him to “read direc- 
tions carefully” (there are almost none) ; says 
nothing about guessing or the formula for scor- 
ing. There is a plethora of headitigs and sub- 
headings. Thus, each individual French passage 
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of the Reading Examination has a subheading, 
as for example, Article No. I, Section I of 
Section One. Each section is numbered; it is 
also called “examination” and “test.” The ner- 
vous child who sees on page one “You have 
fifteen (15) minutes for this test" may well be 
reduced to panic. 

The Reading Comprehension part (15 min- 
utes) of the test involves four French passages 
(averaging ten lines) on which are based 15 
items. Each item is an incomplete French state- 
ment followed by three completions in English. 
This dubious practice is complicated by the 
faulty French of the passages, the awkward or 
incorrect wording of the responses, the patent 
absurdity of some responses, and the “old chest- 
nuts” used, doubtless familiar to some students 
but not to all. 

The Vocabulary part (10 minutes) has 40 
items, each a French sentence with a word or 
phrase underlined. Each is followed by five 
choices in English. The technique is faulty : dis- 
tractors are too close (Items 42, 44, 52) ; some 
responses are disqualified because they are ab- 
surd (Item 54), or use the wrong tense (Item 
36). Also, the key is wrong for Item 53, and 
there are mistakes of French in Items 17, 23, 
42, 46, and 51. 

Section III, Aural Examination (20 min- 
utes), involves ten definitions (choice of three 
each) and three "anecdotes" totalling 34 lines 
of French. These are to be read aloud, once 
only, by the teacher. A very slow oral reading of 
the entire section consumes only five minutes, 
leaving 15 minutes for the student to choose 
his 22 responses, each from a choice of three. 
This is an absurdly long time. 

There are no instructions about the speed of 
reading or the length of pauses during which the 
student responds. With these left to the whim 
of each teacher, the scores will vary accordingly. 
This aural examination should have been re- 
corded on a disc, like the University of Chicago 
examination upon which it is obviously pat- 
terned although there is no acknowledgment. 
The latter, however, used five times as many 
items and a skill in construction that is not even 
approximated here. Twenty-two items are in- 
adequate. Of the ten definitions, five are taken 
verbatim from the University of Chicago tests; 
two of the remainder are incoherent, even in 
print. 

Two of the three “anecdotes” are old favor- 
ites (“Why don’t they eat cake?” and the work 


of the Curies), and the third is exposition, All 
involve elementary blunders in French. The sug- 
gested responses are sometimes irrelevant or 
absurd, and frequently couched in tortuous Eng- 
lish. The author has a weakness for redundant 
phrases and adverbs. 

А technical blunder which can be of capital 
importance is the fact that the examiner reads 
the Aural Examination from a separate sheet 
on the back of which is printed the scoring key 
for the whole test. Presumably he will stand for 
20 minutes holding the key up before the eyes 
of the students. If he stands close, as he should 
for an aural examination, the students near him 
should make perfect scores. 

In Section Four (Grammar, 25 items, 10 min- 
utes), each item is an English sentence with an 
incomplete French translation followed by five 
suggested completions. Twenty-five items are 
too few ; there is no settled policy on distractors 
(they are sometimes in good French, and some- 


` times not) ; Items 81 and 94 require the identi- 


cal word; Items 95 and ror both supply two 
correct responses, although the key accepts only 
one ; Item 85 supplies no correct response; there 
are misprints in Items 88, 95, and 100; the page 
is crowded, although the following page is 
blank. 

In sum, this test is the product of persons who 
have little competence in test construction, in 
French, or even in English. It will not bear com- 
parison with the work of experts, such as the 
Cooperative tests. 
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XFrench Recognition Vocabulary Test: State 

High School Tests for Indiana. High school ; 1948; 

Forms K, L; mimeographed ; no data on reliability an 

validity; no manual; no norms; 4166 per test; 15¢ per 

specimen set; postpaid; nontimed (30-40) minutes; 

E. R. Ryden; State High School Testing Service for 
Indiana, Purdue University. * 

REFERENCES 
1. RvpeN, Ernar КироьРн, Vocabulary as an Index to 


Learning in a Second Language. Doctor's thesis, Northwestert 


[iced Куда ПЫ. air, eke an ОР С 
Crarence E. Turner, Professor of Romance 
Languages, Rutgers University, New Bruns- 
wick, New Jersey. - 
It is obviously unfair to neglect the question 
of how well the authors have accomplished what 
they undertook in order to examine the value 
of the undertaking. In the case of any passive 
recognition test, it is really the latter question 
that haunts one. In the tragic confusion that 
exists among modern language teachers as tO 
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what they ought to be trying to achieve, it seems 
safe to the reviewer to assert that passive recog- 
nition of the correct English equivalents of 
foreign words constitutes one of the least sig- 
nificant of desirable objectives. This is the skill 
most likely to be concentrated on by poor 
teachers and poor students. It is one of the few 
skills which may to some extent be taken for 
granted. It is the thing which the entering col- 
lege freshman who is going to fail French can 
do nothing else but. 

Granted that language skills are intercorre- 
lated. If we had no other way of measuring di- 
rectly the skills we consider most important, a 
test of this sort would throw some light on a 
student’s showing relative to that of his fellows. 
So would a good test of English grammar, or 


one of English vocabulary. Indeed, tests of the 


present type are to a considerable extent really 
tests of English vocabulary. To realize this, one 
has only to imagine the test being given to a 
Frenchman. What are we to say of a type of 
test which a literate Frenchman could not pass 
unless he knew English? But, it will be pro- 
tested, our school children are not French! True, 
but they are being sadly cheated if they are not 
being led toward the possibility of doing the 
same things with their French that the French 
do. If such teaching is being done, tests like the 
one under review will no longer reflect the 
curriculum. 

When we examine the fairer question of how 
good the test is of its type, we find merits and 
defects which are common in the field. The pro- 
portion of items too easy to have discrimina- 
tory power seems fairly high—animal, nation, 
observer (the really interesting question about 
the last word is whether the student knows how 
to pronounce it). Too often the distractors are 
obviously absurd words thrown in at random. 
Has anyone ever really thought that nation 
means "ragamuffin"? While the correct answer 
is not too often an evident English cognate, it 
is very frequently the synonym of one (spécial, 
“particular”). The authors have wisely included 
items of more than one word, e.g., il fait beau, 
the translation of which as “it is beautiful” is 
not unequivocally the best. 


[241] 
*Graduate Record Examinations Advanced 
French Test. Senior year college through graduate 
school and candidates for graduate school; 1939-51; 
available only in Graduate Record Examinations Mor 
grams (see 527); 105(125) minutes; Educationa 
Testing Service. * 
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*A Standardized French Grammar Test. Ages 
11-17 with 1—5 years of French; 1951; Forms A psi], 
В [’51]; manual [’51]; 1s. 9d. per 12; 2d. per single 
copy; 15. 6d. per manual; Is. 100. per specimen set; 
postage extra; 35(40) minutes; T. S. Percival; Uni- 
versity of London Press Ltd. * , 
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1. Percivar, T. S. Achievements Tests in French Grammar 
and Vocabulary. Master’s thesis, University of Durham (Dur- 
ham, England, $ Ь195031. Abstract: Brit J Ed Psychol 21: 
156-8 Je 's1. * (PA 26:1025, title only) 

Netson Вкоокѕ, Westover School, Middlebury, 
Connecticut. 

A few points of difference between the English 
and the American languages catch the eye on 
the first page of this test prepared under the 
auspices of Durham University in England. In 
the U.S., the student does not "turn over," but 
turns the page; he does not “work” a test, but 
takes it; he is in a "form" only in certain inde- 
pendent schools; he is much more likely to be 
classified by grade or year. The student will find 
himself very much at home, however, in the 
problems presented in the test items, for the 
English attack upon the structural patterns of 
French seems to be very similar to our own. The 
test is, on the whole, very good and has clearly 
been prepared with the greatest care ; such nega- 
tive features as it has are connected with de- 
batable points of procedure, a few details of 
fact, and a degree of inexperience. 

This reviewer is concerned about the effect 
of interrupting the flow of a sentence by the 
interpolation of a word in the foreign language 
as: “An old (vieux) friend. Un ( ) ami." 
What is convenient and economical from the 
point of view of printing may well be psycho- 
logically harmful from the point of view of 
equating the two languages. Is not the true lin- 
guistic relationship more naturally expressed by 
a form like, *An old friend. (vieux) Un: 
ami.”? Printed thus, the coaching can be done 
from the sidelines without getting in the run- 
ner's way. This detail might be considered minor 
if it were not for the fact that this deliberate and 
unnatural mixing of the two languages is stand- 
ard practice in grammars and exercise books on 
both sides of the Atlantic. The instructions on 
page т of the test are not quite accurate. The 
“phrases” are sometimes dependent clauses 
printed as sentences, a practice that should not 
be countenanced by any teacher of grammar. 
The “incomplete French translation” in a num- 
ber of cases is reduced to a pair of parentheses. 
Such matters might seem trivial in another in- 
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stance, but in a concentrated sampling exercise 
of this kind, there is no detail that does not in- 
fluence the final result in some way. 

The author has given careful thought to the 
principle of testing one thing at a time, and the 
test is very satisfactory on that score. Necessary 
vocabulary help is provided, verbs are given in 
the infinitive and are even conjugated when the 
emphasis is on word order rather than on the 
verb form. In some cases, the author seems to 
have gone almost too far in this direction. In 
the items, “A yellow (jaune) chair" and “Une 
chaise ( ),” whatever problem there was 
seems to have been reduced to the vanishing 
point. 

While it is well to have at the beginning of 
any test a few extremely easy items in order to 
complete the gamut of difficulty, by the time 15 
per cent of the test has been done, the student 
should be meeting problems with a definite chal- 
lenge. In Form A, Item 15, the student is asked 
to write “nous sommes" for “we are"; in Form 
B, Item 14, he is to write “nous avons" for “we 
have.” The ability to do this is not an indication 
that one has traveled very far along the road to 
the Eiffel Tower. Furthermore, in Item 35 of 
Form A, the student is asked to deal with, “1 
am preparing myself. Je ( ) prépare." If 
such a problem has proved so difficult that it 
merits this position in the test, then there is 
something grossly wrong in the instruction the 
students are given. The proper relationship be- 
tween teaching and testing needs far more analy- 
sis than it has so far received, but it seems clear 
that fixed response tests, rightly used, can be 
very helpful in indicating what can be taught 
and what should be emphasized. It seems to this 
reviewer that there are too many items in the 
two forms of this test that are scarcely entitled 
to a place in the measurement of students be- 
tween the ages of 11 and 17, with up to five 
years of study of French. 

One may take issue with the answers pro- 
vided in at least two instances. In Form A, Item 
50, not only “de” but also “des” must be ac- 
cepted; in Item 98, “soit venu” may be used 
for “fat venu." We must not forget that the 
French themselves are the loudest champions 
of the “tolérances.” 

Finally, to use these tests “repeatedly at in- 
tervals of a few months” would seem to be a 
dubious procedure. Surely some students, and 
particularly the bright ones, are going to re- 
member certain exasperating little items, look 
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them up, and jolly well get them right the next 
time. The true significance of such items as 
samples of the areas they represent will thus be 
lost. 

In spite of these comments, the test is a good 
one; it demands that the student write his an- 
swers in the form of French words and that he 
write them correctly to receive credit. It will 
measure what is a major concern of all teachers 
of French a great part of the time. 


Donatp С. Burns, Lecturer in Education, Uni- 
versity of Leeds, Leeds, England. 

The principal purpose of this test is to facili- 
tate a rapid grading of the pupils on the basis 
of their achievement in French grammar. The 
reliability of the test has been shown to be satis- 


factorily high at all levels within the grammar 


school. The procedure which has been followed 
in selecting and validating test items as well as 
the data obtained by comparing these tests with 
other criteria is a sufficient indication of the 
validity with which they will achieve this. Their 
chief merit is perhaps that they enable the 
teacher to assess pupils over a particularly 
wide range of achievement ; and they will there- 
fore be of special service in helping to allocate 
not only pupils transferred from one school to 
another but also those who have done three or 
four years' work in the language and for whom 
some independent but reliable measure of 
achievement is desirable. 


[243] 
ЖА Standardized French Vocabulary Test. Ages 
11-17 with 1-5 years of French; 1951; Forms A [51] 
B ['51]; manual ['51]; rs. od. per 12; 2d. per single 
copy; Is. 6d. per manual; Is. 100. per specimen Set; 
postage extra; 35(40) minutes; T. S. Percival; Uni- 
versity of London Press Ltd. * 


REFERENCES .— л 
and рли TuS. АМ Teats, "риш (Dar. 
ham, England), [19050?]. Abstract: Brit J Ed Psychol 21: 
156-8 Je '51. * (PA 26:1085, title only) 

Netson Brooxs, Westover School, Mi 
bury, Connecticut. 

There are many things to be kep 1 
the preparation of a vocabulary measuring (© 
vice in which a French word is followed by #Y¢ 
English words; some of these things have 65 
caped the author of this test. In general, it wil 
be easier for the student to write his answer 27 
for the instructor to read it if the answer paren” 
theses are placed in the right hand margin 9 
the page; in this test the answer parentheses 
(referred to erroneously in the instructions 25 
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t in mind in 


359 ] TESTS & REVIEWS: FOREIGN LANGUAGES—FRENCH 


M8 


“brackets” ) are printed at the left of the page, 
just after the item numbers. АП the distractors 
(the wrong answers) should be words that are 
in the working vocabulary of the student, but 
such words as “osier, aloes, portmanteau, alu- 
minium, tyre, ewer, prequisite, reconnoitre, 
assises" are certainly not sufficiently available 
to the American student to play their proper 
role here. Every distractor should have some 
plausible connection with the test word if it is 
to do its work, and many of these wrong answers 
do not seem likely to lead very far astray. No 
student with a measurable knowledge of French 
is very likely to think that “là” is the equivalent 
of "towards," or that “mur” is translated 
"ready," or that “bouche” is "ankle." The 
greater the degree of likelihood in every distrac- 
tor, provided it is clearly wrong, the more exact 
will the measurement be. 

For best results, the right answer should fall in 
all five positions an approximately equal number 
of times. In Form A, the right answer occurs 10 
times in the first position, but 29 times in the 
fifth position. There should be, obviously, only 
one right word in each item, but in a number of 
cases here there are two correct answers. Taking 
the Concise Oxford French Dictionary as arbi- 
ter, Item 32 of Form A, "champ," may be an- 
swered either “country” or “field”; in Item 94, 
Form A, "avis" may be answered either “ad- 
vice" or “opinion”; in Form B, Item 8, "terre" 
may be answered either "earth" or “world.” 
Furthermore, in Form A, Item 41, the transla- 
tions of “vivre,” “live” and “survive,” are too 
nearly alike for one to be right and the other 
wrong. Item 9, Form A, “maintenant: main- 
tain, now” is hardly acceptable either, for after 
all, the test word is also a form of the verb 
“maintenir,” which might in certain contexts 
be rendered “maintain.” In Form B, Item 30, 
the test word “chapeau” is an English word 
(given in Webster's Collegiate Dictionary) and 
as such hardly merits a place in this test. In 
Form A, Item 25, the words Negro and Negress 
are printed without the capital letter; the word 
Jewish occurs in the item immediately following. 

The test contains too many repetitions of 
words in the possible answers (ideally, there 
should be none at all) : “yesterday” occurs five 
tines and Sathana Сне n ee 
“whilst” occurs four times, once as a right an- 
swer and three times as a wrong answer. It 
should be noted that “whilst,” though commonly 
accepted on equal terms with “while” in Eng- 


land, is no longer usual in the United States. 

This reviewer is concerned about the state- 
ment in the directions that “the meaning of 
‘plume’ is pen.” The meaning of “plume” is not 
“реп” but an instrument one writes with. “Pen” 
is the English equivalent of “plume”; both are 
verbal symbols referring to the same object. All 
language teachers are aware of this distinction, 
but it will be of great assistance to their pupils 
if such relationships are kept clear from the 
start. The Concise Oxford French Dictionary 
observes this distinction most carefully in its 
introduction, using “equivalent,” “translation,” 
and “rendering” but never “meaning” unless the 
symbol-thing relationship is intended. 

A further device that does not seem to be 
quite cricket is the use of three or even four 
parts of speech in the same item. When dealing 
with isolated words, as in this test, one must 
remember that in real life there are no isolated 
words. To obtain best results, the test maker 
should contemplate his test words in context 
and choose distractors among the possible trans- 
lations that suggest themselves when there is 
relationship with other words in a phrase or 
sentence. All class room procedure is far enough 
removed from life as it is; in these highly selec- 
tive tests, it is of first importance that the great- 
est possible degree of naturalness be maintained. 

This long list of criticisms should not obscure 
the fact that the test, in both its forms, contains 
a large majority of items of first quality. As it 
stands, it is a fairly good test, and with the se- 
vere editing that is obligatory in all measures of 
this kind, it can become an excellent one. 


Donatp G. Burns, Lecturer in Education, Uni- 
versity of Leeds, Leeds, England. 

The chief merit of these tests is that they 
offer a continuous scale against which to measure 
pupils’ achievement in vocabulary at any stage 
during the first five years of study. The norms 
given in the manual are admittedly based on 
experience in grammar schools in Newcastle 
and Northumberland, but the tests should pro- 
vide valuable information to teachers working 
under similar conditions in any part of the 
country. 

Coefficients of reliability (between two forms 
of the test) range from .77 (for a first year group 
of 339 cases) to .90 (307 cases in the fourth 
year) and .89 (142 cases in the fifth year). 
Many investigators would find that these are 
not large enough to indicate a high degree of 


244 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 36 
БА 


reliability, and it might Ье an advantage there- 
fore if the manual included some indication of 
the internal consistency of the tests such as 
might be obtained by using a split half relia- 
bility coefficient. 

Considerable care has been taken in the con- 
struction of these tests to select material which 
should make them suitable for application in 
English schools, and from this point of view their 
validity must be considered high. A correlation 
of .90 was obtained between this test and the 
vocabulary section of the Cohen French Test. 

These are the first French vocabulary tests 
to be published with norms based on perform- 
ance in English Schools for many years. The 
range of achievement covered by each form of 
the test is wide enough to provide the teacher 
with a ready and reliable means of comparing 
groups of pupils at different levels within the 
school, and the tests should-be particularly use- 
ful in assessing the knowledge of pupils who are 
transferred or admitted to a school after the 
normal time of entry. They represent a valuable 
addition to the material available for the meas- 
urement of achievement in the schools. 


GERMAN 


[244] 
*College Entrance Examination Board Achieve- 
ment Test in German Reading. Candidates for col- 
lege entrance; 1001—51; available only in College En- 
trance Examination Board Admissions Testing Pro- 
gram (see 526) ; 60(70) minutes; prepared by College 
Entrance Examination Board Committee of Examiners 
in German Reading in cooperation with the Staff of 
Educational Testing Service: 1951 membership: Rob- 
ert W. Kesler (Chairman), Paul G. Graham, W. G. 
Hollman, Maxim Newmark, and Arthur J. Wat- 
zinger; program administered by Educational Testing 
Service for the College Entrance Examination Board. * 


REFERENCES 


877) 
“Some RUE ue on the College 
ue КУЛУНУ Pis Ed & Psychol Meas 8:593-602 w 
45. 24:145 
з. Tucker, LEDYARD R. “Interpreting Scores on the Foreign- 
Language Tests." Col Bd R 1:47-9 sp "48. * 


HERBERT SCHUELER, Associate Professor of 
Education, and Director of the School of Gen- 
eral Studies, Queens College, Flushing, New 
York. [Review of Forms ZAC, YAC, XAC, 
WAC2, and VAC2.] 

These examinations of the College Entrance 
Examination Board are not available for use 
outside of the official testing program of the 
Board. Their use is, therefore, largely confined 
to the area of admissions. The forms under re- 


view make use of multiple choice techniques of 
testing and are essentially measures bf recogni- 
tional levels of language skill. The various parts 
of these examinations follow familiar language 
testing patterns. Each form contains the now 
inevitable vocabulary section which makes use 
of the four-alternative English translations of 
a single German word. In addition, there is a 
section in which the student is asked to deter- 
mine which two of four German words or ex- 
pressions are synonyms. Elements of language 
structure are tested in a third section in which 
the student is asked to choose one of four alterna- 
tive translations of English words or phrases 
given within a German sentence. Another sec- 
tion in the forms beginning with W АС-2 (1048) 
is a short situational group in which the student 
is asked to choose one among four remarks 
which a person would be most likely to make in 
a number of situations. The device seems quite 
promising, although the test constructors seemed 
to have found it difficult to construct items that 
are sufficiently challenging to yield a wide spread 
of scores. It is, however, the one section in 
these tests which departs from conventional test 
patterns and should be developed further in fu- 
ture tests. Finally, each of the forms has a read- 
ing comprehension section consisting of four 
passages of approximately equal length followed 
by several multiple choice comprehension items 
in German. 

In general, these tests are carefully con- 
structed. The Board's process of conscientious 
evaluation, involving repeated review and pre- 
testing before a final form is set, assures high 
reliability. The demonstrated reliability of these 
tests was from .96 to .97, surely an eminently 
praiseworthy record of internal consistency. 
Considering the population served by these tests 
(they are most commonly taken by students with 
а background of two years of secondary school 
German), they are quite difficult. As a result, 
they fail to discriminate as much as could be 
wished among those falling at and below the 
median scores. On the other hand, the scores 
among the higher levels are spread out quite 
well. From an inspection of the items by this 
reviewer, it would seem that more items of low 
and middle difficulty should be introduced, es- 
pecially in the final section of the tests—the 
passages for reading comprehension. All Wei 
passages in each test are of approximately equa 
length. The comprehension of each is tested bY 
five multiple choice questions. The student not 
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familiar with the meaning area of any опе of 
these passages is, therefore, at a considerable 
disadvantage, since so many questions depend 
on each. The introduction of more passages of 
shorter length with fewer questions depending 
on them would help in widening the range of 
difficulty and in giving more students of average 
and low average ability the opportunity to com- 
plete some items of this reading comprehension 
section. 

The technique of recording scores of these 
examinations has been so refined that the inter- 
pretation is made extremely simple for admis- 
sions officers. The raw scores are converted into 
scaled scores in such a manner that the average 
test score of the normal representative group 
who have studied German for two years is 460, 
for three years 520, and for four years 580. 
(These figures apply to French, Spanish, and 
Latin as well, making it possible to compare 
students in different languages directly.) While 
this form of score reporting is an invaluable aid 
to the admissions officer, it means much less 
to the language department once the student 
has been admitted. The need for diagnosis of 
relative strengths and weaknesses in structure 
or vocabulary range and paragraph comprehen- 
sion, etc. is clear to the language chairman in- 
terested in homogeneous sections of language 
classes and to the instructor interested in mak- 
ing plans for the class he is to teach. It is 
unfortunate that scores on the separate parts can- 
not be made available. The scores on the transla- 
tion section and on the paragraph comprehen- 
sion section are measures of sufficiently discrete 
elements of language competence to be ex- 
tremely useful to the professionally enlightened 
instructor. Under the circumstances, however, 
an institution desiring this type of analytical 
score must administer another examination to 
those students already admitted. For this pur- 
pose, the later forms of the Cooperative German 
tests (reviewed in this volume) are among the 
most useful. 


[245] 
*Cooperative German Test: Advanced Form. 
More than 2 years high school or 1 year college; 1937- 
51; 4 scores: reading, vocabulary, grammar, total; 
IBM; Forms O (’50—same as test copyrighted in 
1938), Р ("si—same as test copyrighted in 1939), Q 
(’s0—same as test copyrighted in 1940) ; Form N out 
of print; no data on validity; no specific manual; 
norms ('38); general Cooperative manual C51); 
$2.50 per 25; 25€ per specimen set; postpaid; separate 
answer sheets may be used; 80€ per 25 IBM answer 
sheets; 156 per stencil for hand or machine scoring of 
answer sheets; cash orders postpaid; 40(50) minutes ; 
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E. Herman Hespelt, Miriam Van Dyck Hespelt, and 
Geraldine Spaulding ; Cooperative Test Division, Edu- 
cational Testing Service. * 


REFERENCES 

1-4. See 40:1360. 

5. Gazpe, WiLLIAM Е. [Editorial Note on the Cooperative 
German Test.] German О 14:238-9 N '41; 15:37 Ja '42. * 

6. НЕ5РЕ Т, Miriam | VAN German 
Tests." German Q 15:169-71 My '42. 

7. Ѕснлеғғев, Корор F. “What ‘Kind of Tests for Oral- 
Aural Courses?" German О 21:94-101 Mr "48. * 


Dyck. “Cooperative 


HERBERT SCHUELER, Associate Professor of 
Education, and Director of the School of Gen- 
eral Studies, Queens College, Flushing, New 
York. 

Each form of this test consists of three parts 
and can be administered within a single aca- 
demic hour. Part I is basically a test of reading 
comprehension utilizing the five-alternative com- 
pletion type item. Both stem and alternatives 
are in German. So cleverly are these items con- 
ceived that the usual objection to the all for- 
eign language comprehension test—that it is 
too often possible to match correct alternative 
to stem without necessarily comprehending the 
meaning—has been successfully overcome. The 
authors are wise in avoiding long paragraphs 
with several questions based on each, By con- 
centrating on relatively short items—many con- 
sist of one sentence—they have made it possible 
to test a much wider range of vocabulary with- 
out unduly penalizing the student who just 
happens not to know the vocabulary relating to 
a particular subject. 

Part II consists of single word vocabulary 
matching items. The stem words and the alterna- 
tives are, in the main, well chosen, both as to 
frequency range and as to distribution among 
parts of speech. However, in view of the grow- 
ing recognition among language teachers and 
linguists that the primary unit of language is 
not necessarily the single word but the con- 
textual phrase, clause, or sentence, as the case 
may be, the single word vocabulary test stem 
seems somewhat archaic. A more satisfactory 
type of vocabulary item is that used in the for- 
eign language tests developed for the United 
States Armed Forces Institute, where the word 
or phrase to be tested is presented in a sentence 
followed by English translation alternatives. 

Part III consists of two subparts. The first 
of these presents English sentences followed by 
translations with an inserted blank which the 
student completes from five given alternatives. 
Such items of structure as adjective endings, 
verb endings, verb auxiliaries, and idiomatic 
expressions are tested in this section. The sec- 
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ond subpart consists again of English sentences, 
followed, however, by complete alternative 
translations. This section tests such items as 
word order, tense sequence, and idioms. This 
reviewer has strong reservations as to the need 
and propriety of the English translations in this 
section of the test. In the overwhelming majority 
of items, they fulfill no legitimate function ; the 
alternatives are in no way dependent on the 
translation. For example, consider the follow- 
ing item from Form О: 
18. It e very cold in the cottage, so they built 

E es sehr kalt im Háuschen, 

18-1 darum sie haben gemacht Feuer. 

18-2 darum sie Feuer haben gemacht. 

18-3 darum sie Feuer gemacht haben. 

18-4 darum sie haben Feuer gemacht. 

18-5 darum haben sie Feuer gemacht. 
The correct answer is the only possible one re- 
gardless of meaning, because it is the only one 
with the correct word order. The translation, 
therefore, fulfills no legitimate function. In 
Form O, 30 of the 40 items in this section are 
in no way dependent upon the English transla- 
tion; for most of the rest of the items, if one or 
two alternatives were changed, the translation 
could be omitted without disturbing the item 
validity. Those few items in which the English 
translation is an integral part of the item—as, 
for example, in the case of idiomatic transla- 
tions involving model auxiliaries—might well 
be dispensed with in this section and different 
ways found to test them in other sections of the 
test. These translations add to the reading time 
of each item and introduce English as an un- 
necessary distractor. It is unfortunate that the 
authors could not have seen fit at last to produce 
an all foreign language test, especially since it 
would have involved no fundamental change in 
the test. 

These shortcomings do not, however, sig- 
nificantly impair the usefulness of this test. It 
is carefully constructed, has been proved in the 
field, and shows the evidence of steady improve- 
ment from form to successive form. 


For a review by Harold B. Dunkel of Ad- 
vanced Forms O, P, and Q, see 3:190; for a 
review by C. H. Handschin of Advanced Form 
Р, see 40:1360; for a review by Curtis C. D. 
Vail of Revised Series, Advanced Form N, see 
38:1000. 


[246] 
*German I and II: Achievement Examinations 
for Secondary Schools. т or 2 years high school; 
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1951; 1 form; no data on reliability and validity; no 
manual; Minnesota norms (median and quartile devia- 
tion) available; similar norms for other regions by 
special arrangement with publisher; 7¢ per test, post- 
age extra; 60(65) minutes; edited by Walter W, 
Cook; prepared by a curriculum committee of high 
school teachers for use in the Minnesota State Board 
Achievement Examinations Program; Educational 
Test Bureau, Educational Publishers, Inc. * 


[ 247] 
*Graduate Record Examinations Advanced Ger- 
man Test. Senior year college through graduate 
school and candidates for graduate school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 105(125) minutes; Educational Test- 
ing Service. * 


GREEK 


[248] 
*College Entrance Examination Board Achieve- 
ment Test in Greek Reading. Candidates for col- 
lege entrance; 1001—51; available only in the March 
series of College Entrance Examination Board Admis- 
sions Testing Program (see 526); 60(70) minutes; 
prepared by College Entrance Examination Board 
Committee of Examiners in Greek Reading in cooper- 
ation with the Staff of Educational Testing Service: 
1951 membership: Henry Phillips (Chairman), Mal- 
colm MacLaren, William H. Marnell, and James A. 
Notopoulos; program administered by Educational 
bec for the College Entrance Examination 
oard. 


ITALIAN 


[249] 

*College Entrance Examination Board Achieve- 
ment Test in Italian Reading. Candidates for col- 
lege entrance; 1924-51; available only in the Mar 

series of College Entrance Examination Board Admis- 
sions Testing Program (see 526); 60(70) minutes; 
prepared by College Entrance Examination Boar 
Committee of Examiners in Italian Reading in cooper- 
ation with the Staff of Educational Testing Service: 
1951 membership: Maria Piccirilli (Chairman), Fran- 
cis Baccari, Joseph De Simone, Camillo P. Merlino, 
and Louis Sorieri; program administered by Educa- 
tional Testing Service for the College Entrance Exam- 
ination Board, * 


LATIN 


[250] s 
*College Entrance Examination Board Achieve. 
ment Test in Latin Reading. Candidates for college 
entrance; 1901-51; available only in College Entrance 
Examination Board Admissions Testing Program Meg 
£9; 60(70) minutes; prepared by College pana n 
xamination Board Committee of Examiners in 1a 1 
Reading in cooperation with the Staff of Educa 
Testing Service: 1951 membership: Charles T. mu 
hy (Chairman), George Land, Dorothy M. Robath2 
TL W. Spaeth, Jr., and J. A. Thayer; program # 
ministered by Educational Testing Service for 
College Entrance Examination Board. * 
REFERENCES saat 
1. Tucker, Lrpvamp В. “Interpreting Scores on the Foreig” 
Language Tests.” Col Bd R 1:47-9 sp '48. * 


363 | TESTS & REVIEWS: FOREIGN LANGUAGES—LATIN 


sins he Rey aem Sram Дони Wo Jan The’ Сойер: 
Н са ia Datus ence! аан. Cm; 
Навор B. DunKEL, Examiner in the Uni- 
versity Examiner's Office, and Associate Pro- 
fessor of Education, The University of Chicago, 
Chicago, Illinois. [Review of Forms VAC2, 
WAC2, ХАС, YAC, and ZAC.] 

Since all five forms reviewed follow the same 
general plan of organization and are similar in 
detail, they can be reviewed simultaneously. 
Each form contains three parts. In Part I the 
student selects from among four Latin words 
offered those two which are synonymous or 
related in some fashion. Part П is a set of multi- 
ple choice items requiring translation from Eng- 
lish to Latin. In the earlier forms, often only a 
single English word is to be translated; later 
forms make more use of short phrases. Part III 
contains four short Latin passages (two of prose 
and two of poetry) on each of which three dif- 
ferent exercises are based: (a) multiple choice 
items requiring the translation into English of 
selected words or phrases in the passage, (b) 
multiple choice completion-items in simple 
Latin, rephrasing parts of the passage or making 
comments about it, and (c) true-false statements 
in English about the passage in general or im- 
portant elements in it. Each of these subparts 
for each passage contains 3 to 8 items. Only the 
total score is reported in the form of the usual 
CEEB standard scores, so corrected, however, 
in these and the other language tests that the 
standard scores are comparable from language 
to language and are interpretable in relation to 
the number of years’ experience (two, three, or 
four) the student has had with the language. 

The tests are meticulously prepared and the 
results analyzed with equal care. For those forms 
old enough for data to be available, the reliabili- 
ties reported are in the low 9075. Though these 
coefficients are slightly lower than those of the 
CEEB tests in other foreign languages, they are 
certainly adequate. And recent forms of the 
Latin test closely approximate CEEB's more 
than adequate standards for level of difficulty, 
length, and the like. Within the framework set, 
these tests are well executed, and it would be 
difficult even to quibble. The objective forms 
are probably as valid measures as the older 
essay forms they replaced a few years ago, and 
certainly more reliable ones. И 

It seems to me proper, however, to question 
this general framework even though it is easy 
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to see how the Latin test committee and the 
Board are led to it. One debatable issue. is the 
practice of using a single total score, with the 
items originally. selected from the tryout forms 
on the basis of high correlation between the in- 
dividual items and the total. Admissions officers 
in the colleges involved undoubtedly prefer to 
have to work with a single score ; and the present 
total is a reliable measure of whatever it is it 
measures. Another value of the present pro- 
cedure for public relations is that teachers whose 
students take the test will have little ground 


‘for complaint that any of their language drill 


goes unheeded, Translation (English-Latin and 
Latin-English), comprehension (expressed in 
Latin and expressed in English), and vocabu- 
lary command measured within Latin all get 
added into the total score. Others of us, how- 
ever, would prefer to see more than one score 
or else a total which was somewhat less of a 
fruit salad; we would like to have part scores 
with low mutual intercorrelations and hence 
some diagnosis of the different skills involved 
(if they are different), or else a total score 
which would be less of a hash of these different 
tasks which are distinctions without a differ- 
ence anyway since only those items are kept 
which show high correlation with the total score. 

Another question might be raised concerning 
the objectives covered. One of the test com- 
mittees has pointed out quite properly that some 
of the presumed outcomes of Latin instruction 
are measured elsewhere in the CEEB battery 
(e.g. the increased command of English should 
be reflected in higher scores on the Scholastic 
Aptitude Test) and need not be treated in the 
Latin test also. But the reviewer doubts whether 
this is true of some of the other objectives: un- 
derstanding of a foreign culture, historical per- 
spective, increased sympathy with a foreign 
people, and the like. As the battle continues over 
the place of Latin in the curriculum, one hears 
much about these outcomes as constituting a 
major justification for the study of the Latin 
language ; yet these important and carefully pre- 
pared tests pay no attention to these objectives. 
Students, as well as parents, school administra- 
tors, and this reviewer, are inclined to suspect 
that outcomes which are not measured on such 
tests are not really considered by teachers to 
be worth bothering with or at least do not get 
much attention paid them. These comments may, 
of course, be more properly considered criti- 
cisms of the talk about Latin rather than of these 
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Latin tests. The schools concerned with CEEB 
tests might well ponder the issue and then 
change either the usual talk about Latin or the 
tests of Latin. 

In summary, for those who accept the general 
principles on which these tests are based, they 
are admirable examinations and among the best, 
if not the best, in print today. This is not to 
say that the reviewer would not like to tinker 
with the principles. 


[251] 


*Cooperative Latin Test: Lower and Higher | 


Levels. 3-6 semesters high school or 1-2 semesters 
college, more than 2 years high school or more than 
I year college; 1942-51; 4 scores: comprehension, 
grammar, civilization, total; IBM; 2 levels; Forms S 
(Lower Level, '51—same as test copyrighted in 1942; 
Higher Level, '42), Y (Lower Level, T or ’5I—same 
as test copyrighted in 1947; Higher Level, '47); no 
data on validity ; no specific manual; no college norms; 
general Cooperative manual (’51) ; high school norms 
[50] ; $2.50 per 25 of any one level; 50 per specimen 
set of any one level, postpaid; separate answer sheets 
may be used; 90€ per 25 IBM answer sheets; 30€ per 
set of stencils for scoring answer sheets; cash orders 
postpaid ; 80(85) minutes; Harold V. King and Ger- 
aldine Spauldidg (S); Cooperative Test Division, 
Educational Testing Service. * 
REFERENCES 


1. TRAXLER, ARTHUR E. “Some Data on the Results of the 
Cooperative Tests in French, Latin, and Secondary School 
Mathematics, Form S," pp. 55-9. In 1942 Achievement Testing 
Program in Independent Schools and Supplementary Studies. 
Educational Records Bulletin, No. 36. New York: Educational 

ecords Bureau, June 1942, Pp: xiii, so. Paper, lithotyped. * 

2. TRAXLER, ARTHUR “The Relation of Vocabulary and 
Grammar to Reading Achievement in Latin, French, and Span- 
ish,” pp. 61-5. In 1947 Achievement Testing Program in Inde- 
pendent Schools and Supplementary Studies, Educational Rec- 


ords Bulletin, No. 48. New York: Educational Records Bureau, 


June 1947. Pp. xii, 66. Paper, lithotyped. * (PA 22:449) 

3. Jacons, Ковевт. “An Evaluation of the 80-Minute Forms 
of the Cooperative Tests in French, Latin, and Spanish," pp. 
79-86. (PA 23 :399) In 1948 Achievement Testing Program in 
Independent Schools and Supplementary Studies. Educational 
Records Bulletin, No. 50. New York: Educational Records Bur- 
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КохкАр Gnrzs, Assistant Professor of Classical 
Languages and Chairman of the Department, 
Queens College, Flushing, New York. [Review 
of Forms S and Y.] 

One may safely agree with the previous re- 
viewer that these tests “probably deserve rank 
as the best tests dealing with the phases of 
Latin which they cover." 

Forms S and Y are the latest in a long series 
of tests based upon much experimentation, care- 
ful evaluation, and the most approved testing 
procedure. All items are of the. multiple choice 
type, requiring either the writing of a number 
or, if the test is to be machine scored, the black- 
ing of a vertical space. A sheet of "Percentile 
Ranks for Secondary School Students" gives 
percentile tables for part and total scores based 
on the results of varying numbers of students, 


ranging from over two thousand third semester 
students in 50 different schools to 399 sixth 
semester students in eleven different schools, 
Provided with detailed directions, attractively 
and effectively printed, easily and scientifically 
scored, these tests are the professional product 
one would expect from the Cooperative Test 
Service. That there are defects and deficiencies 
lies in the nature of the world: errare humanum 
est. 

The forms under consideration are all 80- 
minute tests. The disadvantage of so long a 
test is outweighed by the range and depth of 
the items, and by the fairness effected through 
the large number of items in each area tested. 
All four forms have three parts : comprehension 
(combining the vocabulary and reading of the 
older forms), grammar, and civilization (an in- 
novation not appearing in previous forms). - 

In Part I there are 60 well chosen vocabulary 
items, as foolproof as such isolated matchings 
can be, and 40 comprehension items, based 
largely on Latin passages of some length and 
requiring in their answering both linguistic 
knowledge and considerable thought. The first 
IO comprehension items are incomplete Latin 
sentences, the correct Latin completion of which 
is to be chosen from five possibilities listed. The 
remaining 30 items are based on five to seven 
excellent passages that are well graded, in- 
teresting, real units of thought adapted from a 
variety of sources and topics: anecdotes, my- 
thology, Nepos, Caesar (Lower Level Y is un- 
fortunately almost entirely taken from this 
author) for the Lower Level and Nepos, Cicero, 
Livy, Vergil, and even Lucretius for the Higher 
Level. The only general objection to this section 
is that the 30 minutes allotted to it would seem 
to be inadequate; 45 minutes would seem a 
minimum for the often thought-provoking ques- 
tions, which require a real understanding of the 
Latin they refer to. Specifically, in Lower Level 
S the answer to Item 6 could be 3 or 4, and that 
to Item 18 could be 2 or 5; in Higher Level Y 
the answer to Item 28 could be 2 or 5; and in 
the Livian passage in Higher Level S, page 7 
column 2, praestarent is an unfortunate replace- 
ment for Livy's praesto essent. 

Part II consists of 40 (Lower Level) or 45 
(Higher Level) items designed to show knowl- 
edge of forms and syntax. In each item a sensible 
English sentence is followed by an incomplete 
Latin version, the test being to choose the cor- 
rect complement from a list of five suggestions. 
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Form S on both levels varies this procedure by 
reserving the first 10 items for a word comple- 
tion test, in which correct endings are to be 
supplied from a long list of endings. As well 
suited for their purpose as these sentences are, 
they show too little differentiation in difficulty 
as between the two levels; also, 15 rather than 
20 minutes seem an ample time allotment. It is 
too bad that the authors have seen fit to in- 
clude some impossible forms as suggested an- 
swers : geruit, militorum, eiam, and saxe. Lastly, 
note that in Item 19 of Higher Level Y the word 
suggested as equivalent to "another" is not 
alius but alter. 

Part III is made up of 60 items on civiliza- 
tion, in its broadest sense: it covers Latin ab- 
breviations, derivation of English words, “ап- 
tiquities,” history, mythology, government, re- 
ligion, later influence, etc., as well as specific 
questions on the triumvirate of the American 
high school curriculum—Caesar, Cicero, and 
Vergil. The general effect is one of patchiness 
and lack of direction (perhaps unavoidable in 
view of the broad field to be covered) ; most of 
the questions, however, appear to be valid in 
themselves, although here and there a question 
is overly obvious (e.g. about Dido's fate) or 
of no great significance (e.g. on the locale of 
the Second Catilinarian). Again, there could be 
more differentiation between the two levels, and 
the time allotment seems too generous for pure 
recall work. 

In conclusion, some corrections should be 
cited. For all of Lower Level S and for all but 
the last two comprehension passages of Lower 
Level Y the length of the Latin vowels is indi- 
cated. Note these misprints: S I, 1, Item 48, 
dedé for dédo; Y I, 1, Item 52, cedó for сёйб; 
page 6, column 2, line 9 of the passage propésitis 
for propositis. Other misprints in the Latin are 
these: Lower Level S, page 6, column 2, line 
9 of the passage, classi for the normal classe; 
Higher Level Y, page 6, column т, line 5 of the 
passage, eum in Bithynia regis hospitio esse, 
where read Bithyniae. 


For a review by С. W. Odell of Form S, see 
3:205; for a review by Hazel M. Toliver of Part 
1, Form S, see 3:203. 


[252] i 
*First Year Latin Test: State High School 
Tests for Indiana. 1, 2 semesters high school ; 1934- 
49; 2 levels; mimeographed ; no data on reliability and 
validity: no manual; norms [49]; 430% per test; 156 
Der specimen set; postpaid; 40(45) minutes; Inez 
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Painter; State High School Testing Service for Indi- 
ana, Purdue University. * 
a) FIRST SEMESTER. Form А ['48]. 
b) sEcoND SEMESTER. Form N ['48]. 
[ 253] 
Godsey Latin Composition Test. High school; 
1926; 2 scores: sentences, rules; Forms A, B; no data 
on reliability; $1.30 per 25, postage extra; specimen 
set not available; 30(35) minutes; Edith R. Godsey ; 
World Book Co. * 
REFERENCES 

т. AMERICAN  CLassicAL LEAGUE, Apvisory COMMITTEE, 
The Classical Investigation: Part One, General Report: A 
Summary of Results With Recommendations for the Organiza- 
tion of the Course in Secondary Latin and for Improvement in 
Methods of Teaching. Princeton, N.J.: Princeton University 
Press, 1924. Pp. vi, 305. 

2. BRUECKNER, Leo ў. “Тһе Status of Certain Basic Latin 
Skills." J Ed Res 9:390-402 My '24. * 
КохкАр Gries, Assistant Professor of Classi- 
cal Languages and Chairman of the Department, 
Queens College, Flushing, New York. 

Potential users of this test should not be de- 
ceived. This is not a Latin composition test, nor 
does it fulfill its stated aim of measuring “the 
accuracy of Latin composition work, at the same 
time showing the correlation of sentence-writing 
skill to knowledge of Latin grammar” ; for the 
student is nowhere required either to recreate 
Latin forms nor to assemble such forms inde- 
pendently into a Latin sentence, be it as transla- 
tion or as original thought. Instead, he is con- 
fronted in each form with three sets of eleven 
English sentences, each followed by a Latin 
translation, one element of which is given in 
four different forms, of which only one is cor- 
rect. This he is to encircle. Next to each sen- 
tence are four numbers referring to four of some 
16 succinctly stated rules printed at the bottom 
of the page. His second job is to encircle the 
number of that rule which applies to the en- 
circled Latin form. The abilities tested, there- 
fore, are (a) the student’s ability to eliminate 
impossible forms (e.g., pdcam and profictscerés), 
the occasional presence of which is perhaps the 
most unfortunate aspect of the test; (b) his 
ability to recognize forms (e.g., in “These are 
the books which you lost. Hi sunt libri (qui, 
quae, quem, quos) Gmisisti,” he must recall that 
libri is masculine and quds accusative plural mas- 
culine) ; (c) his understanding of grammatical 
concepts (e.g., that of the indirect object if he 
is to choose tibi as the translation for “you” in 
“I granted you this favor") ; and (d) his ability 
to select that rule which supports his choice of 
the correct Latin. 

This is a good test of the areas covered. The 
sentences, based mainly on Caesar, but graded 
in difficulty from simple agreement to the pas- 
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sive periphrastic, make good sense. Except for 
the impossible forms mentioned above, the items 
of choice are well calculated to reveal the stu- 
dent's mastery of morphology and syntax; both 
sentences and rules (there are about 35 of the 
latter, when repetitions among the three sets 
are subtracted) are abundant enough to give 
fair coverage; and the rules are, for the most 
part, both well chosen, being all of functional 
importance, and well phrased. The accuracy of 
Rule 5, Part II, Form B, is questionable: “A 
cum-temporal clause has its verb in the....sub- 
junctive,” time as such being indicated by the 
indicative; as is also the advisability of includ- 
ing the rather technical “clause of volition.” 

Other virtues of this test are its brevity, its 
broad applicability (second semester through 
fourth year), its ease of administration and scor- 
ing (both the key and the manual are excel- 
lent), and the large number of cases upon which 
its norms were based. As the test was devised 
and administered in connection with the Classi- 
cal Investigation of 1921-1923, over 20,000 pa- 
pers were available for the determination of 
norms. 


[ 254] 
Kansas Second Year Latin Test. 1, 2 semesters 
high school; 1935-36; 2 levels; manual (’36) ; 90% 
25 of any one level; per specimen set; postpaid; 
40(45) minutes; W. L. Holtz and H. E. Schrammel ; 
Bureau of Educational Measurements, Kansas State 
Teachers College of Emporia. * 
а) TEST п. First semester; Forms A (’35), B (36). 
b) TEST п. Second semester; Forms C (^35), D (^36). 


W. C. Kvaraceus, Professor of Educational 
Measurement, Boston University, Boston, Mas- 
sachusetts. 

These tests were constructed for use with 
second year Latin students in high schools. 
Forms A and B are available for use at the end of 
the first term; Forms C and D are to be used at 
the end of the second term. Forms A and B 
contain 65 items each and Forms C and D are 
made of 70 items each. Each of these forms con- 
tains four parts measuring proficiencies in the 
following areas : ability to determine the thought 
content from a Latin paragraph, knowledge of 
Latin forms and syntax, and vocabulary control. 
If the teaching of Latin still avows any objec- 
tives other than the knowledge of Latin gram- 
mar and the ability to translate a Latin passage, 
they are not visible in this test. 

These tests have been poorly constructed and 
have been carelessly edited. Some of the items 
are lacking in objectivity and a number of seri- 


ous errors are noted. To name a few examples: 
in Form A, Amezonum should read A mazonum, 
the acknowledgment should read Fabulae for 
Faulae, and the answer for Item 36 calls for a 
cum descriptive clause when temporal is more 
nearly correct; in Form B, Item 15 is ambigu- 
ous and unrelated to the specific passage, in 
Item 28 genere is better described as an ablative 
of means, in Item 32 there are two defensible 
answers, in Item 51 the answer calls for isset 
but could more probably be erit, in Item 53 the 
answer could also be erint (pl.), in Item 59 
the answer could also be at or ant, the answer 
sheet has a misprint for Item 60 and should ap- 
pear as isset; in Form C the translation text has 
been rewritten from indirect discourse to direct 
discourse and utilizes punctuation that is very 
misleading. In line 9 of the Latin paragraph 
mihi should be separated from deteriora; Items 
21 and 24 could be keyed otherwise than false, 
in Item 61 siege not seige; in Form D, Items 22 
and 23 could be scored plus, and the answer 
space under Item 46 is misleading. These errors 
and ambiguities are inexcusable. 

Two pages of mimeographed statements con- 
stitute the manual. Percentile norms are avail- 
able based on 805 cases for Form A, 1,001 cases 
for Form B, and 2,411 cases for Forms C and 
D. The inference is made that the norm sample 
was obtained on a nation-wide basis, but no 
information is given to substantiate the claim. 
In view of the age of the norms and the question 
of the determination of the norm sample, present- 
day use of these distributions for comparative 
purposes is seriously questioned, especially when 
one considers the marked falling off in Latin 
elections within the past years and the many 
selectivity factors operating in the election of 
this subject today. Any norms that are offered 
should be accompanied by specific descriptions 
of the grade and ability of the norm group. 

These tests do not do a very discriminating 
job in showing up differences in achievement. 
The reported Q values range from 4.5 {0 
points of raw score with probable errors E 
measurement ranging from 2.46 to 3.27. At t 
same time there is a very heavy dependence ra 
true-false items throughout the forms, but n 
provisions are made for corrections for Lisci 

These one-sheet, poorly printed tests, vinta 
1935-36, should either be completely revise 
and new norms established or they ee à 
withdrawn from the market. A Latin teachét 
who has had a basic course in educational meas 
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urement should be able to serve his own evalua- 
tion needs more adequately by constructing his 
own objective tests for local use. If the teacher 
wants to obtain comparative test data, he will 
find a better tool in the Cooperative Latin Test. 


For a review by W. L. Carr, see 40:1369. 


[255] 
*Latin I and II: Achievement Examinations for 
Secondary Schools. 1 or 2 years high school; 1951; 
1 form; no data on reliability and validity ; no manual ; 
Minnesota norms (median and quartile deviation) 
available; similar norms for other regions by special 
arrangement with publisher; 7€ per test, postage ex- 
tra; 60(65) minutes; edited by Walter W. Cook; pre- 
pared by a curriculum committee of high school teach- 
ers for use in the Minnesota State Board Achievement 
Examinations Program; Educational Test Bureau, 
Educational Publishers, Inc. * 


[256] : 

*Latin I and II: Every Pupil Test. 1 or 2 years 
high school; 1929-51; new form usually published each 
April and December; form April 1951; no data on re- 
liability and validity; no manual; norms ('51); 216€ 
per test; 1¢ per answer key; и 0 40(45) min- 
utes; Ohio Scholarship Tests, Ohio State Department 
of Education. * 


[257] 
*Second Year Latin Test: State High School 
Tests for Indiana. 1, 2 semesters high school; 1934- 
49; 2 levels; mimeographed ; no data on reliability and 
validity; no manual; norms [/49] ; 4€ per test; 156 per 
specimen set; postpaid; 40(45) minutes; Josephine 
Lilian Lee; State High School Testing Service for 
Indiana, Purdue University. * 
a) FIRST SEMESTER. Form A ['45]. 
b) SECOND SEMESTER. Form N ['45]. 


[258] х 
White Latin Test. High school; 1924; 3 scores: 
vocabulary, sentences, total; Form B; Form A out of 
print; no data on reliability ; revised manual; $1.80 per 
25, postage extra; 35€ per specimen set, postpaid ; 
35(40) minutes; Dorrance Stinchfield White; World 
Book Co, * 
REFERENCES 

„1, SvMoNps, PercivaL M. Ability Standards for Standard- 
ised Achievement Tests in the High School. New York: Bu- 
reau of Publications, Teachers College, Columbia University, 
1927. Pp. x, 91. * ' 
Koxnap Gris, Assistant Professor of Classical 
Languages and Chairman of the Department, 
Queens College, Flushing, New York. 

The manual indicates that this is “a com- 


prehensive and standard Latin test suitable to . 


measure growth in knowledge of Latin on the 
part of high school and college students through 
four years of Latin." In reality, the knowledge 
of Latin tested is strictly recognitional and is 
confined to vocabulary, sentence structure, and 
translation; one would hope that the term 
"knowledge of Latin" covers a somewhat 
broader field. 

Within its limitations, however, the test is a 


well constructed instrument which will clearly 
show whether a student has mastered a basic 
vocabulary, and whether his study of grammar 
and his readings in Latin have given him the 
ability to see which of several suggested trans- 
lations of a given Latin sentence is the correct 
one. The roo vocabulary items (for each four 
equivalents are proposed) and the 20 sentences 
(for each, three or four translations are offered) 
which each of the forms contains are, in tlie one 
case, abundant enough to provide a fair selec- 
tion, and, in the other, inclusive enough really 
to check the student's mastery of important 
principles. Both parts are graded in difficulty 
and based upon frequency of occurrence “in the 
works of authors that are read for college en- 
trance." This seems to mean mainly Caesar and 
Cicero; Vergil (and verse) is omitted save for 
an occasional vocabulary item. The time allot- 
ments—15 and 20 minutes respectively—are 
adequate, and the scoring can be done "rapidly 
and accurately.” The manual is helpful in its 
instructions on administration, scoring, and in- 
terpretation; it also contains tables of percen- 
tile norms, based on 3,500 cases for the one form 
and 1,650 cases for the other. 

Administratively, then, this is a good test; 
and it undoubtedly measures a certain amount 
and kind of knowledge, Pedagogically, however, 
its value is not so definite. The advisability of 
presenting isolated words and sentences has been 
questioned before by Mental Measurements 
Yearbook reviewers (Atkins, 38: 1065; Dunkel, 
40:1365; Toliver, 3:203); it may be wise to 
question it again. Thus, in vocabulary, “word- 
matching” is possible, and even laudable, up to 
a certain point : lacrima does equal “tear,” surgo 
“rise,” and sic “thus.” But the words that are 
univalent, or whose semantic range is equiva- 
lent in Latin and English, are strictly limitéd in 
number. To equate pario with "win," largior 
with “bribe,” and fides with “protection” is of 
doubtful validity ; and when eripio becomes "'res- 
cue," entirely too much emphasis is laid on the 
single context in which the student is expected 
to have met the word. 

In the case of isolated sentences, there is, of 
course, the advantage of being able to increase 
the difficulty of structure gradually, almost sci- 
entifically, and to cover a large number of gram- 
matical forms and principles efficiently. But how 
much is lost for the student who meets such 
inanities as “Ille locus quem vos castris petebatis 
mihi deligendus est,” “Pauci Athenis multos 
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menses habitare poterant," or Pascere, crudelis, 
nostro dolore corque ferum satia; miserae mihi 
plura supersunt quam tibi felici." (Incidentally, 
the first comma of this last sentence is missing, 
the only misprint noticed, although more commas. 
in the longer sentences would be a legitimate 
aid to the student.) Even the best of the sen- 
tences, those taken or adapted from Latin au- 
thors, have no real life when deprived of their 
rightful context. No one has ever spoken or 
written in isolated sentences, nor does one learn 
Latin to be able to read Latin quotations. The 
self-sufficient paragraph is the normal unit of 
speech, and the most proper unit for testing 
comprehension in a foreign language, Latin or 
any other. 


SPANISH 


[259] 


*College Entrance Examination Board Achieve- 
ment Test in Spanish Reading. Candidates for col- 
lege entrance; r902-51; available only in College En- 
trance Examination Board Admissions Testing Pro- 
gram (see 526) ; 60(70) minutes; prepared by College 
Entrance Examination Board Committee of Examin- 
ers in Spanish Reading in cooperation with the Staff 
of Educational Testing Service: 1051 membership: 
Donald D. Walsh (Chairman), Linton L. Barrett, 
Amelia del Río, Rose E. Martin, and Naomi W. Zie- 
ber; program administered by Educational Testing 
Service for the College Entrance Examination Board. * 
REFERENCES 


1. Dyer, Henry S. “Some Observations on the College 
Рош, Талаш Tests." Ed & Psychol Meas 8:593-602 w "s. 
24:145! 
2. Tucker, Lepyarp R. “Interpreting Scores on the Foreign- 
Language Tests." Col Bd R 1:47-9 sp ^48. * 
3. Warsu, Donatp DrvrwisH. "The College Board Spanish 
Test.” Hispania 34:343-8 N 's1. * 
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*Cooperative Spanish Test: Lower and Higher 
Levels. 3-6 semesters high school or 1-2 semesters col- 
lege, more than 2 years high school or more than 1 year 
college; 1948-51; 4 scores: comprehension, grammar, 
civilization, total; IBM; 2 levels; Form Y (Lower 
Level, '50—same as test copyrighted in 1 ; Higher 
Level, '48) ; no data on validity; no specific manual ; 
no college norms; general Cooperative manual (51); 
high school norms ['50] ; $2.50 per 25 of any one level ; 
50¢ per specimen set of any one level, postpaid; sep- 
arate answer sheets may be used; 906 per 25 IBM an- 
swer sheets; 30€ per set of stencils for scoring answer 


sheets; cash orders postpaid; 80(85) minutes; Wil- ` 


liam H. Shoemaker and Geraldine Spaulding; Cooper- 
ative Test Division, Educational Testing Service. * 


REFERENCES 


т. Jacozs, ROBERT. “An Evaluation of the 8o-Minute Forms 
of the Cooperative "Tests in French, Latin, and Spanish," рр. 
79-86. (P. 24:209) In 1948 Achievement Testing Program in 
Independent ci s and Supplementary Studies. Educational 
Records Bulletin, No. so. New York: Educational Records 
Bureau, July 1948. Pp. xiii, 86. Paper, lithotyped. * 

2. CraPP, Hanorp L. ‘Meditations on a Placement Program 
or When Should a Foreign Language be Studied.” Mod Lang J 
31:203-7 Ap '47. 

з. TRAxLER, ARTHUR E. “The Relation of Vocabulary and 
Grammar to Reading Achievement in Latin, French, abd. Span: 


BAUR траене кра ЫЛ илм | 
Educational Records Bulletin No. 48. New York: Educational 
Records Bureau, June 1947. Pp. xii, 66. Paper, lithotyped. 
James B. ТнАвР, Professor of Foreign Lan- 
guage Education, The Ohio State University, 
Columbus, Ohio. 

The Lower and Higher Levels of this test 
have the same pattern of items and the timing 
for all parts is the same, Throughout both levels 
the five-option multiple choice item is used, with 
provision for marking the appropriate answers 
in the test booklet or on a separate answer sheet. 
Answers in the booklet must be scored by hand 
with a fan key; the separate answer sheets can 
be scored either by machine or by hand with a 
scoring stencil. 

Part I, Comprehension, is organized in two 
sections. Section т (15 minutes) contains 50 
Spanish words, each accompanied by five num- 
bered English meanings. About half of the words 
are nouns. According to ratings suggested by 
Spaulding,’ the first 30 words in the Lower 
Level have an average frequency index of 2.3, 
that is, an average within the second 500-word 
group; and the next 20 words show an average 
index of 3.3, or an average within the third 500- 
word group. In the Higher Level this section is 
more difficult, the average frequency indices 
being 3.8, 4.1, 6.3, 5.9, and 8.4 for the 50 words 
taken in groups of ten. 

The last 10 items of Section 1 represent 4 
welcome innovation in vocabulary measurement. 
For these items an idiom is used in a Spanish 
sentence which is followed by five Spanish sen- 
tences, only one of which paraphrases the mean- 
ing of the test sentence. Rating the idioms by - 
а self-made system similar to that of Spauld- - 
ing, this reviewer found an average frequency - 


index of 3.3 for the ten idioms in the Lower 
Level and an average frequency index of 60 . 


for those in the Higher Level. Neither level 
contains a word that is not in the Buchanan 
Word-list or an idiom that is not in the Keniston 
Idiom List. NY: А) 

Section 2 (25 minutes) consists of sever? 
Spanish paragraphs, each followed by three ii 
five paraphrasing sentences with five possible 
endings, only one of which conforms to the d 
in the paragraph. The reviewer applied diam Й 
ing's Formula І to the ten paragraphs in gr 
Lower Level and obtained difficulty ratings 0 
54, 69, 79, 45, 92, 50, бо, 82, 53, and 70. Base 
on word frequency and sentence length alone, 


LEUR d- 
1 Spaulding, Seth, “Two Formulas for Estimating 589 ре 
ing Difficulty of Spanish." Ed Res В 30:117-24 My 


| 


369 ] 


TESTS & REVIEWS: FOREIGN LANGUAGES—SPANISH 


* 


262 


these ratings indicate relatively easy prose. The 
reviewer analyzed only the first three and last 
three of the nine paragraphs in Section 2 of 
the Higher Level, obtaining for the six para- 
graphs difficulty ratings of 90, 70, and 80 (first 
three), and 118, 85, and 130 (last three). These 
data indicate the generally higher difficulty of 
the paragraphs of this level. 

It must be remembered that the Spaulding 
formulas of necessity ignore idiomatic and syn- 
tactical complexities and the nature of the story 
content. With this in mind, the reviewer is of 
the opinion that, in spite of the apparently un- 
steady pattern of difficulty shown by the para- 
graph ratings for both the Lower and Higher 
Levels, there is probably little doubt that the 
paragraphs are arranged in order of ascending 
difficulty, as judged by actual performance of 
students who took the trial administration of 
the test. 

Part II, Grammar (25 minutes), consists of 
a series of English sentences which set the gram- 
matical problem, each followed by a partial 
Spanish translation and five numbered expres- 
sions, only one of which correctly completes the 
Spanish translation. The Lower Level has 40 of 
these completions, and the Higher Level 39. The 
second part of this section employs a selection- 
type response by which knowledge of correct 
word order and position in a sentence may be 
measured. An English sentence is followed by 
five translations in Spanish, only one of which 
is correct. There are 10 of these in the Lower 
Level and 11 in the Higher Level. 

It is test items of this sort that some teachers 
object to on the complaint that the pupil must 
look at incorrect language and may, as a conse- 
quence, suffer harm to his language skill. Every 
classroom has similar errors cropping up in daily 
recitations, and the errors are successfully used 
for teaching jurposes. It should be recognized 
that “proof-reading” is a valid testing technique, 
especially when the pupil is well aware that he 
is taking a test and not encountering model lan- 
guage that he should imitate. j 

Part III, Civilization (15 minutes), is an- 
other welcome innovation in a foreign language 
test. This part is made up of 50 items about 
geography, history, art, literature, and other 
aspects of Hispanic civilization. The statements 
and the five-choice completions are in English, 
except where Spanish places and titles make 
up the responses. The reviewer counted eight 
or ten items in each level that could be called 


of general application to Hispanic life; the re- 
mainder are about equally divided between 
Spain and Latin America. In each level three 
pictures are included for identification, 

Some teachers may raise the question as to 


"whether some of the facts tested here are repre- 


sentative of a knowledge of Hispanic civilization. 
Let him who objects try to compose a better 
test, and at that, to compose two forms of the 
test, one easier than the other. Of course; there 
are some items in the Higher Level that are 
easier than some in the Lower Level, but the 
table of scores shows an overalf difference. A 
Scaled Score of 50 is equivalent to a raw score 
of 24 on the Lower Level, but only 13 on the 
Higher Level. , 

No attempt will be made here to explain the 
system of Scaled Scores, which is described in 
some detail in the manual. Suffice it to say that 
the tables of percentile ranks given in terms of 
Scaled Scores for both levels together furnish 
data for six semesters of secondary-school study. 
No college norms are thus far available. 

Form Y of the Cooperative Spanish Test 
shows evidence of the careful selection of ma- 
terials and skillful execution of test exercises 
that have long characterized the Cooperative 
foreign language tests. The authors of this test 
are to be complimented for presenting the usual 
type of material well and for offering some new 
testing techniques much needed by the Spanish- 
teaching profession. 


[261] 
*First Year Spanish Test: State High School 
Tests for Indiana. 1, 2 semesters high school; 1945; 
2 levels; mimeographed; no data on reliability and 
validity; no manual; norms ['45]; 444€ per test; 156 
per specimen set; postpaid; 40(45) minutes; O. H. 
Patterson; State High School Testing Service for In- 
diana, Purdue University. * | 
a) FIRST SEMESTER. Form A 45]. 
b) SECOND SEMESTER. Form [45]. 


[262] 

*Furness Test of Aural Comprehension in Span- 
ish. 1-3 years high school or 1-2 years college ; 1945- 
51;2 editions; no data on reliability and validity ; $1.50 
per 25; 10€ per 5 ecimen set; postage extra; Edna Lue 
Furness; Banks Upshaw & Co. * 
a) [ORIGINAL EDITION. ] 1945-46; 4 scores: vocabu- 
lary, completion, identification, total ; Forms A ('46— 
same as test copyrighted in 1945), В ('46—same as test 
copyrighted in 1945) ; manual [46] ; tentative norms; 
40(45) minutes. 
b) [RECORDED EDITION.] 1951; 5 scores: vocabulary, 
completion, identification, question-answer, total ; 
Form C; no norms; manual ['51]; 3 es of record- 
ings: “LP” 33% rpm (28-inch records), wire (s-inch 
spool), dual-track tape (5-inch spool) ; $6 per “LP” 
record set; $7.50 per wire recording ; $7.50 per dual- 
track tape recording; 32(37) minutes. 
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For reviews by Frederick B. Agard and Wal- 
ter V. Kaulfers, see 3:213. 


[263] 
*Graduate Record Examinations Advanced 
Spanish Test. Senior year college pues graduate 
school and candidates for graduate school; 1946-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 105(125) minutes; Educational Test- 
ing Service. 

[264] 
*Kansas First Year Spanish Test. First year high 
School or college; 1947; Form A; $1.05 per 25; 20€ 
per specimen set; postpaid; 40(45) minutes; Minnie 
M. Miller; Bureau of Educational Measurements, 
Kansas State Teachers College of Emporia. * 

К [265] 

*Spanish I and II: Achievement Examinations 
for Secondary Schools. 1 or 2 years high school; 
1951; 1 form; no data on reliability and validity; no 
manual; Minnesota norms (median and quartile devia- 
tion) available; similar norms for other regions by 
special arrangement with publisher; 76 per test, post- 
age extra; 60(65) minutes; edited by Walter W. 
Cook; prepared by a curriculum committee of high 
school teachers for use in the Minnesota State Board 
Achievement Examinations Program; Educational 
Test Bureau, Educational Publishers, Inc. * 

[266] 
The Stanford Spanish Tests. 1-4 years high school 
or 1-2 years college; 1927; 3 parts; Forms A, B; 
per 25 of any one part; 25 per specimen set; cash or- 
ders postpaid; Aurelio M. Espinosa and Truman L. 
Kelley; Stanford University Press. * 
а) PART I, GRAMMAR. 20(25) minutes. 
b) PART II, VOCABULARY. 15(20) minutes. 
C) PART III, PARAGRAPM MEANING. 20(25) minutes. 


James B. Tuarp, Professor of Foreign Lan- 
guage Education, The Ohio State University, 
Columbus, Ohio. 

Five parts were originally planned for this 
test; the last two projected parts, Sentence 
Meaning and Pronunciation, were never pub- 
lished. 

Separate norms are presented for high school 
and university classes by years. Within each year 
of study there are norms for a variety of groups, 
according to. the amount of study given to other 
languages. This factor, usually neglected in other 
tests, seems to have considerable importance if 
one or more other languages have been studied 
for more than one year. For example, the total 
of all three norms for first year high school in 
group A (no other language or 1 year Latin or 
French) is 65.2. In group C (3 years Latin or 
French) each norm is 11 or 12 points higher, 
and the total is 103.4. The university group A 
(no other language) total is 90.6; group E (8 
to 10 years of other languages in 4 different 
combinations of French and Latin) has a total 
of 130.2. 


The arrangement of parts, beginning with the 
more difficult grammar and ending with the 
easier reading, may have a psychological value 
that other tests could copy. : 

Тһе vocabulary test uses a matching tech- 
nique: groups of 6 test words (7 in the last 


two groups) printed in a column, each word ` 


followed by a blank; to the right of each group 
appears a column of 12 English words contain- 
ing the correct meanings plus confusion items. 

It must be remembered that these tests came 
out the same year of the publication of Bu- 
chanan's A Graded Spanish Word Book? and 
probably without the advantage of its data. Con- 
sequently it is gratifying to see how each form 
of Part II, Vocabulary, checks against the Bu- 
chanan list. Of the 68 test words, there are 7 
words in each form not found in the 6,702 items 
given by Buchanan. There are 39 words in 
Form A and 41 words in Form B which occur 
within the first 1,500 of the Buchanan list. When 
the remaining, less frequent words are given the 
Frequency Index suggested by Spaulding,’ the 
average index for the 22 words in Form A and 
the 20 words in Form B is 8.17 and 8.44, re- 
spectively. These figures show a progression 
from easy to difficult, so far as the factor of fre- 
quency is concerned, and a close equivalence of 
difficulty between forms. 

When the 10 paragraphs of Part III are ana- 
lyzed by the Spaulding formula, the results are 
not so satisfactory. Although the paragraphs 
grow longer toward the last of the то selections, 
and the sentences are shorter at the beginning, 
simpler in content, and less idiomatic, the sta- 
tistics of difficulty do not show progression to 
greater difficulty at the end or near equivalence 
of difficulty between the forms. According to 
Spaulding's Formula I, paragraphs VI to X 
are rated 77, 62, 58, 155, and 120 in Form A, 
and 45, 78, 55, 72, and 61 respectively in Form 
B. It would be interesting to learn whether Form 
A proves more difficult in the pupil responses to 
these paragraphs, a fact not revealed in the 
norms, which give one figure for both forms. 

Itisin the grammar test, Part I, however, that 
this reviewer finds the most to criticize for lac 
of validity. The 5o items of each form are pre 
sented in three types of test procedures: (4 
Mutations, Form A, 27; Form B, 29. (b) Com- 

i ' i 7 ‚ Pub- 
naana UP he adi баайа бинадан on я 
ern Languages, Vol. 3. Toronto, Canada: University 0 
ronto Press, 1927. Pp. 195. Рај * the Read- 


per. * Pac ЭМ 
. 2 Spaulding, Seth. "Two Formulas for Estimating no" 
ing Difficulty of Spanish.” Ed Res B 30:117-24 Мут > 
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pletion, recall response, Form A, 6; Form B, 
5. (c) Selection, 2-item choice, Form A, 17; 
Form B, 16. In each form nearly one fourth of 
the changes of form of completions in a blank 
depend on the knowledge of a grammatical term, 
such as “past absolute,” “possessive adjective,” 
“reflective pronoun,” etc.: 13 cases in Form A; 
15 in Form B. Does a failure to respond indicate 
lack of knowledge of the language form or of 
the grammatical nomenclature? Scoring direc- 
tions say “The omission of accent marks... 
should not be counted against the student,” but 
should there not be added “unless vital to the 
meaning"? For example, the answer to Item 11 
in Form A is sé, not se; and the answer to Item 
35 in Form B is Les hablé, not Les hable. In 
both forms, Item 12 appears tricky. When asked 


to “Write the correct forms to complete the 
meaning," Los peces viven en (el) agua in Form 
B, the answer el remains the same, unless somé 
culprit thinks he must change it, as nothing has 
been said about some items being correct as they 
stand. But in Form A, for Necesito (un) zapatos, 
the answer on the key sheet is wnos. Suppose a 
pupil writes dos or says "nothing"; is this re- 
sponse wrong or does the scorer accept it in 
defiance of the key? 

Since the Stanford Spanish Tests have con- 
tinued in demand for many years, even as newer 
tests came on the market, there is ample evidence 
that teachers regard the tests highly and find 
themselves well served. The minor faults indi- 
cated above have not marred the solid results 
delivered in a quárter-century of popular usage. 
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Reviews sy Murray Aborn, William D. Altus, 
Balinsky, Charlotte Banks, 


James M. Anderson, Gwen F. Arnold, Benjamin: 
George К. Bennett, Robert G. Bernreuter, E. J. С. Bradford, 


W. D. Commins, William M. Cruickshank, Edward E. Cureton, John T. Dailey, Frederick B. 
Davis, Harold A. Delp, Robert G. Demaree, Jerome E. Doppelt, Raleigh M. Drake, Walter N. 


Durost, W. С. Emmett, George A, Ferguson, John С. Flanagan, Н. 


M. Fowler, Henry E. Garrett, 


William R. Grove, J. P. Guilford, Alice W. Heim, Edwin R. Henry, Carl I. Hovland, Gertrude 
Keir, William S. Kogan, Jane Loevinger, C. M. Louttit, Boyd R. McCandless, Louis L. McQuitty, 
Herschel T. Manuel, James Maxwell, Ivan Norman Mensh, Joseph Newman, Stanley D. Nisbet, 


Е. A. Peel, М. L. Kellmer Pringle, 


Willis C. Schaefer, 


William Schofield, F. J. Schonell, 


William B. Schrader, Gladys C. S: chwesinger, Harold G. Seashore, David Segel, Naomi Stewart, 
Charles A. Stickland, Erwin K. Taylor, Mildred С. Templin, David V. Tiedeman, W. D. Wall, 
F. W. Warburton, George Westby, and Dael Wolfie. 
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[ 267] 
XA.C.E.R. Adult Test (B40). Ages 13 and over; 
1940; 1 form; no data on reliability and validity; no 
description of normative population; 2s. 9d. per 10; 
6d. per manual ; 15. per specimen set; cash orders post- 
paid within Australia; 55 (65) minutes; Australian 
Council for Educational Research, * 


REFERENCES 


1. Соок, Р. Н. “Criteri _ {ог the Selection of Personnel 
Officers.” B Ind Psychol & Personnel Prac 2:28-37 Je '46. * 


(РА 20:4761), 15 › 
4261) “The Prediction of Academic Success." 


2, Houxe, Н. Н, 
Austral J Psychol 1:38-42 Je '49. 
- [268] 
X*A.C.E.R. General Ability Test: Advanced M. 
Ages 13 and over; 1948; 3 scores: linguistic, quanti- 
tative, total; 2 parts; 1 form; no data on reliability 
and validity; no description of normative population ; 
25. 4d. per 10 of any one part; 6d. per set of scoring 
keys; 6d. per manual; 15. 6d. per specimen set; cash 
orders postpaid within Australia; Australian Council 
for Educational Research. * 


а) SECTION L. I score: linguistic; 15 (25) minutes. 
b) sECTION Q. I score: quantitative; 20(30) minutes. 


[269] 

XA.C.E.R. General Test A. Ages 10-14; 1938; 1 
form [38]; no data on reliability and validity; no de- 
scription of normative population; mimeographed 
manual [’38] ; 25. 6d. per 10; 3d. per key; 6d. per man- 
ual; rs, per specimen set; cash proas postpaid within 
Australia; 30(40) minutes; Australian Council for 
Educational Research. * 


For related reviews, see 3:1110 and 40: 
B1005. ' 


8“ E270] 
XA.C.E.R. General Test C. Ages 10-14; 1939; I 
form ['39]; no data on reliability and validity ; no de- 
scription of normative population; mimeographed 
manual [39] ; 25. 6d. per 10; 3d. per key; 6d. per man- 
ual; rs. per specimen set; cash orders post aid within 
Australia; 35(45) minutes; Australian Council for 
Educational Research. * ү, 


[271] 


*A.C.E.R. Junior A. Ages 9-11; 1046; т form; no 
data on reliability and validity; no description of nor- 
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mative population ; 4s. per то; 6d. per Кеу; 6d. per man- 
ual; Is. 5d. per specimen set; cash orders postpaid 
within Australia; 30(40) minutes; Austrailan Coun- 
cil for Educational Research. * 

[272] 
A.C.E.R. Non-Verbal Test. Ages 10-14; 1936; 1 
form; manual ['36] ; 3s. 9d. per 10; 6d. per set of keys; 
2s. per manual; 25. 10d. per specimen set; cash orders 
postpaid within Australia; 24(40-45) minutes; Aus- 
tralian Council for Educational Research. * 


REFERENCES 
1, McIntyre, С. A. The Standardization of Intelligence 


Tests in Australia. Australian Council for Educati Re- 
search, Educational Research Series, No. 54. Melbourne, 
Australia: Melbourne University Press, 1938. Pp. 82. Paper. 
(London: Oxford University Press, 1939.) (New York: G. E. 
dr Mary, AND Davirs, F, R. J. The Standardiza- 
tion of Two Men An Tests. New Zealand Council for Edu- 
cational Research, Educational Research Series, No. 14. Well 
129. (London: Oxford University Press, 1941) (РА 15:2841 
F. J. Ѕснохегл, Professor of Education and 
Head of the Department, University of Queens- 
land, St. Lucia, Brisbane, Queensland. 

This is a nonverbal test, suitable for children 
between the ages of 10 and 14, a group for whom 
a suitable nonverbal group test is very badly 
needed. The test consists of four subtests—simi- 
larities, geometrical relations, analogies, and 
time sequence—with a total of only 56 items. 

Validity coefficients of the test were derived 
from correlations with the Stanford-Binet Scale 
(.63), with the Otis Self-Administering Test of 
Mental Ability (.74), and with the Tasmanian 
Education Department's General Ability Test 
(.68). None of these validity coefficients is really 
sufficiently high to give users very great confi- 
dence in the test. Although research has shown 
that the specific verbal factors in verbal group 
tests and the specific spatial factor in nonverbal 
tests tend to reduce the size of correlation coeffi- 
cients between the two types of test, if both types 
of test purport to give an indication of general 
intellectual ability we are justified in expecting 
higher validity coefficients. 

It may well be that validity has been reduced 
by having only four subtests covering 56 items, 
and it would be interesting to see what validity 
coefficient would result from an expansion to 
subtests covering 100 items. Reliability on a split 
half method using the Spearman-Brown proph- 
ecy formula gave a correlation of .85. 

The test has been exceptionally well stand- 
ardised, and most effective statistical analysis 
has been done to provide users with effective in- 
terpretative instruments. The only point I would 
make here relates to the corrections—of some- 
what doubtful value and validity I feel—that 
haye been worked out for children in three types 
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of schools somewhat arbitrarily divided into. 
class schools with over 200 pupils, B-class wit 
50 to 200, and C-class with under 50 pupils, 
would have thought that each child should 1 
assessed in comparison with members of } 
own age or grade group irrespective of scho 
even although there may be evidence of differe 
levels in schools of different size or type. 
test is excellently provided with good, cle 
drawings, an important element in nonverl 
tests. 
There are interesting interstate norms ai 
comparisons between city and country childrel 
the latter bearing out the now generally accept 
difference between city and rural school chi 
dren; in this case city children are superior 
roughly four points of IQ. L 
In general, this test has such excellent poss 
bilities that one would like to see it extended 
100 items with a possible improvement in valí 
ity so that schools and research departmen 
would then have at their disposal a most valual 
measuring instrument. 


[273] 
*A.C.E.R. Test L. Ages 13 and over; 1943; 1 fori 
a practice exercise must precede administration of te: 
no data on reliability and validity; no description. 
normative population; 3s. 6d. per 10; 6d. per mani 
IId. per specimen set; cash orders postpaid v 
Australia; 50(70) minutes; Australian Council Ё 
Educational Research. * 


[274] 
Academic Aptitude Test: Non-Verbal Intel 
gence: Acorn National Aptitude Tests. Grades. 
16 and adults; 1943-44; 4 scores: spatial relatio 
physical relations, graphic relations, total; 1 form, 4 
no norms for part scores; $2.75 per 25; 256 per ma 
ual ('44) ; 50€ per specimen set; postage extra; 26(4 
minutes; Andrew Kobal, J. Wayne Wrightstone, @ 
Karl К. Kunze; Acorn Publishing Co. * J 


Waırtram B. SCHRADER, Assistant Director, Ё 
partment of Statistical Analysis, Education 
Testing Service, Princeton, New Jersey. — . 
This 57-item test is designed to provide pa 
scores for physical, spatial, and graphic rela! 
as well as a total score for “non-verbal in 
gence.” Each part seems too short to have sat 
factory reliability; the pictorial test of phy 
relations, in particular, includes only 17 it 
of which offer only two choices. No state 
about guessing is provided in the instruct 
If, as is likely, the parts of the test differ in 
torial composition, the total score is diffic 
interpret without evidence on the relative 
tribution of each part. 
Tn spite of its brevity, this test may prove use 
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ful in supplementing other information about a 
person. Such usefulness would depend on. vali- 
dation of a much wider scope than the two studies 
of validity cited in the manual. In the first study, 
it was found that in grades 7-9 “slow” sections 
did less well on the test than students in “пог- 
mal" sections. In the second study, it was found 
that professionals engaged in engineering, archi- 
tecture, and designing did somewhat better than 
professionals in social science fields. No evi- 
dence that sex or age was considered in the latter 
study is given, though both might well be im- 
portant in this type of test. 

The items published were selected from a 
larger group on the basis of test-retest compari- 
sons. Half of the professional-technical norm 
group made four or fewer errors, and one fifth 
of the group made no errors; this provides some 
indirect evidence that the items are not ambigu- 
ous or defective. 

Total score norms based on a total of 4,109 
pupils are provided for grades 7 through 12; 
separate norms for boys and girls are not pro- 
vided. The gain in test score for grades 8 through 
11 amounts to about four raw score points. 

The sections on interpretation and use go con- 
siderably beyond available evidence in interpret- 
ing test performance. In particular, the state- 
ment that a score at or below the 3oth percentile 
“should disqualify a student for technical work” 
is surely unwarranted without an extensive study 
of validity. The manual discusses the prediction 
of success in engineering but does not mention 
the evidence, long available from the studies of 
Thorndike and Thurstone, that mathematical 
ability is of outstanding value in predicting suc- 
cess in engineering school. Instead, much stress 
is placed on "ambition," "diligence," and the 
nonverbal test score. 

The authors of this test have made a contribu- 
tion by developing and publishing a brief test 
embracing the following potentially useful item 
types: figure analogies, size estimation, pictorial 
physical relations, and paper formboard. Until 
further validation data are made available, the 
test is likely to be useful primarily to persons 
equipped to make their own validation studies. 
If used cautiously, however, this test may be 
useful in counseling and selection as a supple- 
ment to other information. 


у TAAS 
*Academic Aptitude Test: Verbal Intelligence: 


Acorn National Aptitude Tests. Grades 7-16 and 
adults; 1943-50; 4 scores: general information, mental 
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alertness, comprehension of relations, total; т form, ’50 
—same as test copyrighted in 1943 except for slight 
changes; $2.50 per 25; 156 per manual ('45); 50€ per 
specimen set; postage extra; 40(45) minutes; Andrew 
Kobal, J. Wayne Wrightstone, and Karl R. Kunze; 
Acorn Publishing Co. * 


WiLLIAM В. SCHRADER, Assistant Director, De- 
partment of Statistical Analysis, Educational 


Testing Service, Princeton, New Jersey. 


According to the authors, this test is primarily 
useful for estimating academic promise of sec- 
ondary school and first year college students and 
for predicting success in business positions which 
require “а high degree of mental alertness and 
ability to handle complex situations intelli- 
gently." The authors envisage a broader range 
of possible usefulness: normative data are given 
for each of the four years of college, for adminis- 
trative and executive employees, and for profes- 
sionals. 

Тһе test is printed in distinctive (Gothic) 
type. Except that the letter “j” looks a bit odd 
in this type when used to designate an answer 
option, the material is quite legible. However, 
the possibilities of print and layout to aid the 
test administrator are not well utilized. Thus, 
some of the “sections,” which are not separately 
timed, are headed in the same type as “tests,” 
which are separately timed. Test III (separately 
timed) does not begin on a new page. Printed 
instructions to keep the students going at the 
end of a page or section are not used. No state- 
ment about guessing appears in the printed (or 
oral) instructions. 

The manual makes it clear that considerable 
work was done in developing the test. The 140 
items in the final test were selected from a total 
of 1,050 items on the basis of “discriminative 
power.” A reliability coefficient of .97 was ob- 
tained by one of the Kuder-Richardson formulas. 
The size and grade range of the group are not 
reported, and no mention is made of the possi- 
bility that the reliability coefficient may have 
been increased by the speededness of the test. 
A validity coefficient of .82, using IQ’s on vari- 
ous tests as a criterion, is reported for a group 
of students in grades 9-12. A validity study 
based on “employees in administrative and exec- 
utive positions” yielded a coefficient of .59 with 
“overall employment rating.” A clear descrip- 
tion of the employee groups and of the criterion 
measures used would make the results more 
useful. 

The section on interpretation of scores is dis- 
tinctly below the standard set by the rest of 


г 
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the manual ; in this section the problem of using 
the test scores in guidance is seriously oversim- 
plified. 

Much stress is laid on the three part scores of 
Information, Mental Alertness, and Compre- 
hension of Relations. Separate norms are given 
for each part for grades 9-12, but no intercorre- 
lations are reported. It seems likely that these 
three parts are too highly similar in the abilities 
measured to have much differential value. (The 
fact that the roth to goth percentile range of the 
total score agrees quite closely with the sum of 
the roth to goth percentile ranges of the part 
scores also indicates high intercorrelations of 
parts.) Lois ЖЕДЕ 

The reported norms give the deciles and the 
1st and goth percentiles. For grades 7-12, norms 
for each grade were based on at least 500 stu- 
dents in both city and rural high schools in. 
several states. The groups for the other norms: 
given are less clearly defined. If the norm group 
for college students is reasonably typical, the test 
is too easy to be efficient for use with college 
students. The 1st percentile for first year college 
students corresponds to a score of 78 of a possi- 
ble 140. For professionals, the rst percentile is 
at ror and the 8oth percentile is at 140. How- 
ever, a reasonably good spread of performance 
is shown by the "executive and administrative 
employees," especially for the lower half of the 
distribution. 

This test is based upon a substantial amount 
of test development. It seems to be best adapted 
for use in high school (especially junior high 
school) and for selection in business positions 
requiring high school training. 


For a review by Marion A. Bills, see 3:215. 


[276] 

*Akron Classification Test. Grades 11-16 and 
adults ; 1950; 1 form; $4 per 50; 35€ per specimen set; 
postpaid ; nontimed 1045) minutes; Robert W. Неп- 
derson and Francis J. Werner; Robert W. Henderson, 
940 Eighth St., N.E., Massillon, Ohio. * 

Erwin К. Tavrom, Director, Personnel Re- 
search Institute, Western Reserve University, 
Cleveland, Ohio. 

This test claims to be a measure of “scholastic 
aptitude.” It was constructed on the basis of 
an item analysis sample of 120 college students 
and against an internal criterion. Evidence for 
its validity is limited to its correlation with other 
tests. Samples in all instances are grossly in- 
adequate. Three sets of norms are presented in 
the manual : “general population" norms based 


on 53 cases, the college norms on 180 upper 
classmen, and "non-college" norms on 76 cases, 

In the opinion of this reviewer, this test is 
faulty in its construction; its standardization is 
grossly inadequate and its validation non extant. 
We can imagine no set of circumstances under 
which its use could possibly be justified. 


[277] 
*American Council on Education Psychological 
Examination for College Freshmen. Grade 13; 
1924750; 3 scores: quantitative, linguistic, total; IBM; 
itions 1946, 1947, 1948, 1949; Editions 1933-45 out 
of print; no data on reliability and validity in manual ; 
manual ('50); $2.50 per 25; separate answer sheets 


must be used; 9o per 25 IBM answer sheets; 256 per 


set of scoring stencils; cash orders postpaid; 50% per 
specimen set, postpaid; 38(65) minutes; 1947 and ear- 
lier editions by L. L. Thurstone and Thelma Gwinn 
Thurstone; later editions prepared by publisher from 
materials developed by the Thurstones; Cooperative 
Test Division, Educational Testing Service. * 
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26:2432 

274. WALLACE, W. L. “The Prediction of Grades in Specific 
College Courses." J Ed Res 44:587 Ap 'sr. * (PA 26:5838) 

275. WHEELER, Lester R., AND WHEELER, Viota D. “The 
Intelligence of Music Students.” J Ed Psychol 42:223-30 Ap 


"sr. * (PA 26:2433) 
276. Woops, Roy C. “How Reliable Are National Norms 
When Applied to Specie Geographical Areas,” pp. 17-25. In 


The Eighth Yearbook of the National Council on Measurements 
Used in Education, 1050—51. Fairmont, W.Va.: the Council, 
Fairmont State College, 1951. Pp. vii, 58. Paper, mimeo- 
graphed. * x 


For reviews by W. D. Commins and J. P. 
Guilford of the 1946 Edition, see 3:217; for re- 
views by Jack W. Dunlap and Robert L. Thorn- 
dike of the 1939 Edition, see 40:1377; for re- 
views by Anne Anastasi and David Segel of 
1937 Edition, see 38:1037. 
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*American Council on Education Psychological 
Examination for High School Students. Grades 
0-12; 1933-52; 3 scores: quantitative, linguistic, total ; 
IBM; Editions 1946, 1947, 1948; Editions 1933-45 out 
of print; manual ['52]; $2.50 per 25; separate answer 
sheets must be used; 90€ per 25 IBM answer sheets; 
25€ per set of scoring stencils; cash orders postpaid ; 
256 per specimen set, postpaid ; 35(65) minutes; 1947 
and earlier editions by L. L. Thurstone and Thelma 
Gwinn Thurstone; 1948 Edition prepared by publisher 
from materials developed by the Thurstones; Coop- 
erative Test Division, Educational Testing Service, * 


Н REFERENCES 

1-2. S 906, t 

-9. See 3:218. 

To MURTYAM, Mirosg. “А Vr of the Re-Test Factor in 
the Minori Sue pude ка ec esting Program." J Ed Res 
ап 2 № '47. 22:2750. 7 

3i TEORANN Dow F. Al rlationships Between the High 
School and College Editions of the American Council on Edu- 
cation Psychological Examination and Their Relative Value in 
Predicting College Achievement." Col & Univ 23:217-33 Ja 
'48. * (PA 22:4631) 


For a review by Carl I. Hovland of the 1946 
Edition, see 3:218; for a review by A. Н. Turney 
of the 1939 Edition, see 40:1378; for a review 
by V. A. C. Henmon of the 1937 Edition, see 
38:1038. 


[279] 
The Army Alpha Examination: First Nebraska 
Revision. Grades 6-16 and adults; 1937; a revision 
of Army Group Examination Alpha (see 281); 4 
scores: verbal character, numerical computation, rela- 
tionships, total; 1 form; no data on reliability; $3.75 
per 25, postage extra; 50¢ per specimen set, postpaid ; 
2H, (5) minutes; J. P. Guilford; Sheridan Supply 
о. 


REFERENCES 


1-4. See 3:220 (61, 66, 69, 73). 
s. GUILFORD, J. P. Psychometric Methods, pp. 491-510. New 
York: McGraw-Hill Book Co., 1936. Pp. xvi, 566. * 


Rozert С. БЕМАВЕЕ, Assistant Professor of 
Psychology; and Louis L. McQuitry, Profes- 
sor of Psychology; University of Illinois, Ur- 
bana, Illinois. — 

DESCRIPTION OF THE TEST. This revision is 
patterned after the original forms of the Army 
Alpha with eight subtests: 1, Directions; 2, 
Arithmetical Problems; 3, Practical Judgment ; 
4, Synonym-Antonym; 5, Disarranged Sen- 
tences; 6, Number Series Completion; 7, Anal- 
ogies; and 8, Information. It is a time-limit 
test with answers recorded in the test booklet. 

PURPOSE OF THE TEST. In view of the college 
sample upon which this revision was based, the 
most appropriate use is in connection with coun- 
seling and guidance activities for persons at the 
college level. However, the publishers state that 
it is appropriate for subjects from the sixth grade 
to the adult level. 

ADMINISTRATION. Each of the subtests is sepa- 
rately timed and administered. Oral instruc- 
tions must be given by the examiner for each 
item of the Directions test. The manual offers 
no suggestions as to the rapidity with which ex- 
aminers should read these instructions, some of 
which are tongue twisters. Examiner differences 
in administering this portion may lower the 
validity somewhat. 

FACTOR scores. As reported by Guilford (5) 
in 1936, the scores on the eight subtests of the 
revised form were intercorrelated for 108 fresh- 
men who were entering a college of engineering 
and who were homogeneous as to sex and age. 
Factor analysis resulted in three factors. Multi- 
ple regression weights were established for pre- 
dicting the score of a person on each factor from 
the scores on the subtests most highly related 
to that factor. Tests 3, 4, 5 and 8 enter into pre- 
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diction of е score on V, the verbal factor. Tests 
2 and б are weighted on N, the numerical- 
computation factor, and tests 1 and 7 give scores 
on R, the factor interpreted as grasping and 
using relationships. 

In only tests 2, 4 and 7 did the three factors 
account for over half of the variance in the test 
scores, The least amount of variance accounted 
for by the three factors was 38 per cent in the 
case of test 1. There is no basis for knowing to 
what extent the unaccounted variance in each 
subtest is due to abilities which are unique to 
the subtest or is due to unreliability. No informa- 
tion is available as to the reliability of any of the 
part scores or even of the total score. 

А table of percentile norms, based upon 243 
University of Nebraska students, is given for 
scores on factors V, N and R, and the total score. 
The manual offers no suggestions as to the 
range of applicability of these norms; it would 
be hazardous, to say the least, to apply them to 
groups which differ from the standardization 
groups with respect to age or education. 

The endeavor to provide more parsimonious 
and yet more meaningful scores on the subtests 
by relating them to scores on factors is note- 
worthy. However, the factors V, N and R as 
revealed by this test are defined very broadly 
within their respective realms. Many refine- 
ments of these factors, and of tests to measure 
them, have been put forth in the past decade. 
Even apart from the fact that we do not have 
information as to the dependability of differ- 
ences in scores on V, N and R, other tests are 
currently available which have more to offer for 
diagnostic use in counseling and guidance or 
for making other differential predictions. The 
Differential Aptitude Test is one of these. 

DISTRIBUTIONS AND NORMS. The table given 
in the manual for converting total scores into 
percentile and mental age norms is based upon 
data from World War I, using the initial forms 
of Army Alpha. The statement is made that 
“The average level of difficulty for each test is 
the same as that for the original forms of Army 
Alpha, and so the customary norms should there- 
fore apply.” This conclusion is clearly unwar- 
ranted, since it is known that the distribution of 
scores on a test is not only a function of the 
difficulties of the items but is also a function of 
the inter-item correlations. The latter may have 
differed quite considerably between this revi- 
sion and the initial forms of the Army Alpha. 
That these norms are of doubtful applicability 


has been questioned in empirical studies (73), 

OVERALL EVALUATION. This test undoubtedly 
represented a significant contribution at the time 
of its release, but it does not compare favorably 
with other tests presently available and appro- 
priate for use at the college level. 

The use of the test outside of the college level 
is not recommended unless separate standardiza- 
tion is planned ; the norms and predicted factor 
scores would be of questionable applicability for 
any population not comparable to college-level 
students. 


For a review by W. D. Commins, see 38:1039. 
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*Army General Classification Test, First Civil- 
ian Edition. Grades 9-16 and adults; 1047-48; also 
called AGCT ; IBM ; 2 editions; 1 form, '47—identical 
to Form та ('40) of the Army Edition; revised manual 
(748) ; separate answer pads or answer sheets must be 
used; 756 per specimen set „of any one edition; cash 
orders postpaid; 40(55) minutes; Science Research 
Associates, Inc, * 

а) FORM AH (HAND SCORING EDITION). 40€ per test 
and answer pad; $1.80 per 25 answer pads. 

) FORM AM (MACHINE SCORING EDITION). 39€ per 
test; $2.90 per тоо IBM answer sheets; $1 per set of 
scoring keys. 

REFERENCES 

1-14. See 3:219. Н 

15. HOMPSON R. B. “Predictive Criteria for Selecting 
p SC a J Am Assn Col Reg 19:492-500 Jl '44. 

18: 

16. Bn. Н. W., лхо Datrenpacn, Калв, M. “A Баду 
of Selective Procedures and Educational Achievement of ASTE 
Trainees Processed by the STAR Unit at the University o! 
Illinois." Am J Psychol 58:1-24 Ја '4s. * (PA 19:1556 1 

17. Harrett, Тномаѕ W. “Army Classification Test Results 
i DE p D AM D TONS Am Psychol 1:292-3 
46. 20:3802, title оп DuC. 

18. BITTNER, “йаа, H. “The Army General Classification 
Test,” pp. 45-55, (PA 21:4174) In New Methods in Ape 
РЕЙ y. diea by George A. Kelly. Proceedings o 1 e 

aryland Conference on Military Contributions to Methodoloi 
in Applied Psychology Held at the University of Maryland, 
November 27-28, 1945 Under the Auspices of the Military 
Division of the American Psychological Association. со 
Park, Md.: University of Maryland, 1947. Pp. viii, зот. Li 


Intelligence and the MES of Test AAA J Consult Psy- 
chol 12:194-5 My-Je "48. у in World 
20. TuppennaM, Reap D. “Soldier Intelligence in 9 

Wars І and ТЇ.” Am Psychol 3:54—6 Е '48. * (РА 22:2054 
21. ALTUS, WILLIAM D. ii 
in Association W. 


thesis, University of Virginia (Charlottesville, Va.), 1949; ,. 
23. BRADLEY, GLADYCE КШ Review of Educationa Prob: 
lems Based on Military Selection and Classification Datani 
World War II.” J Ed Res 43:161-74 N '49. * (РА гар? «үо- 
24. KIRKPATRICK, jb ].. AND Советом, EDWARD Б chol 
cabulary non Difficu tb es tup М J Appl Psy 
3:347-51 Ag ^49. 24:22. / ^ 
3 25. Sirm, Dorato E. sprang Vocational. Fitness Pa 
MO Ur Psychological. Tests, pp. 12432. тахо: 
rothers, 1949. 727. 241213 а 3 
26. DuBors Ри Hj avo Warsow, Ковевт І, “The BA 
lection of Patrolmen.” J Appl Psychol 34:90-5 Ар '50- 
25:2076) 1 ; in Medical 
27. GLASER, ROBERT. Bayonet Achievement in б) 
зао Ј АР CD A Lid CORR CP dV "t 
28. GLASER, ROBERT. e Validity of So 1 
шору Achievement in Medical School." Abstract. Am Psycho! 
129) 51. » General 
- TAMMINEN, А. W. “A Comparison of the Army Ge 
Classification Test and the Wechsler Bellevue Intelligence 
Scales.” Ed & Psychol Meas 11:646-55 w 's1. 
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Joun T. Dairev, Technical Director, Research 
Division, Bureau of Naval Personnel, Wash- 
ington, D.C. 

According to the 1950 SRA catalog, "This 
test is designed as a measure of general learning 
ability. Administered to more than 12 million 
soldiers during the war, it is probably the most 
widely used test ever constructed. Developed by 
a distinguished staff of Army psychologists, it 
presents three types of problems: vocabulary to 
measure the verbal factor, arithmetic word prob- 
lems to measure the number and reasoning fac- 
tors, and block counting to measure the space 
factor." However, according to the Staff, Per- 
sonnel Research Section writing in 1947, "The 
first two forms were constructed under consider- 
able pressure, by an insufficient and shifting 
staff, and on the basis of data which were recog- 
nized as inadequate in several respects" (14). 
According to the same source, Forms та and 
tb, of which Form та became the civilian edi- 
tion, were administered to less than one million 
cases and were replaced by Forms Ic and 1d 
shortly before Pearl Harbor. In other words, 
the civilian AGCT is not the AGCT that was ad- 
ministered throughout World War II. 

In regard to the standardization of the AGCT, 
it should be pointed out that, while the various 
versions of it have been administered to 12 mil- 
lion soldiers as stated, the test was not stand- 
ardized on this large group or even on any repre- 
sentative sample of it. Instead, each successive 
form of the AGCT has been scaled by tying it 
into the previous forms all the way back to 
Form ta. This has been accomplished by equat- 
ing scores on the new forms to scores on the 
earlier forms given previously to the same sub- 
jects, thus equating retest scores to initial test 
scores. 

Form та was standardized on a group of 
3,790 soldiers and 606 CCC enrollees tested in 
September 1940. This group forms the basis 
for the entire superstructure of Army standard 
score scales from that date onward. An attempt 
was made to stratify a sample from this group 
on the basis of age, education, and area of origin 
but was only partially successful because these 
three variables fail to account for a large enough 
proportion of the variance оп the AGCT and 
because the soldiers and CCC enrollees were 
apparently not representative of young men of 
the same age, education and area of origin in 
the general population. At any rate, when Forms 
та and 1b were actually administered to repre- 


sentative samples of inductees they yielded stand- 
ard score distributions with a decided negative 
skew and with a mean score significantly greater 
than тоо. Various attempts have been made since 
to adjust the Army standard score scale to make 
it representative of the general male population 
of military age, but it has never been possible to 
draw such a straightforward sample to form a 
revised basis for the Army standard score scale. 
Аз а result it is still largely a matter of conjec- 
ture as to what is the AGCT mean and variance 
of the general population. In view of this fact, 
it would appear that the early version of the 
AGCT released to the public domain would have 
little to recommend it to the civilian user in 
preference to many of the similar commercial 
tests now on the market. 

The later forms of the AGCT were well con- 
structed tests and accomplished well the purpose 
for which they were constructed: to serve as 
instruments to enable relatively untrained per- 
sonnel to screen huge numbers of inductees in 
terms of their intellectual aptitude. The AGCT 
represented a compromise between a compre- 
hensive classification battery suitable for maxi- 
mum efficiency of differential classification and 
the traditional group intelligence test of the 
1920s. In 1940 the military psychologists had 
the know-how to construct such a comprehen- 
sive classification battery but operational limita- 
tions such as the amount of testing time available 
prevented its construction and use. Since the . 
end of World War II, both the Army and the 
Air Force have adopted such batteries. 


For an excerpt from a review, see 3:219. 


[281] 
Army Group Examination Alpha. Grades 5-16 and 
adults; 1919-39; released by the War Department for 
eneral use in 1919; Forms 5 (’19), 6 (^19), 7 (719), 8 
^19), 9 (19) ; no data on reliability; revised manua 
("39) ; $1.20 per 25; 25% per specimen set; postpaid ; 
221, (35-40) minutes; manual by H. E. Schrammel 
and E. R. Wood; prepared by the Psychology Commit- 


` tee of the National Research Council; Bureau of Edu- 


cational Measurements, Kansas State Teachers Col- 
lege of Emporia. * 
REFERENCES 

—7$. See 3:220. 

76. ВккомАн, Exsrz Oscumiw, "On Converting Scores on 
the Army Alpha Examination Into Percentiles of the Total 
Population.” Sch & Soc 23;695-6 fy 20 "26, * j 
77. TOLL, CHARLES H. “Scholastic Aptitude Tests in Am- 
herst College.” Sch & Soc M e о g^ "28, * (РА 3:512 

8. FRYER, DOUGLAS, AND SPARLING, Б. K “Intelligence and 
Occupational Adjustment.” Occupations 12:55-63 Је 3e Я 

79. HOLLINGWORTII, Lera S., AND KAUNITZ, отн M. "The 
Centile Status of Gifted Children at Maturity." J Genetic 
Psychol 45:106-20 S '34. * (PA 9:451) 

Bo. GuiLronp, J. P. Psychometric Methods, pp. 491-510; 
New Wk: McGraw-Hill Book Co., Inc., 1936. Pp. xvi, 566. 
V teen Turma С., AND Вовімо, Evwin б. "Intelli- 
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Recruits in 1918." J Abn & Social Psychol 39:471-4 О '44. * 
(PA 19:458) А à 
82. Garrett, Henry E. “Comparison of Negro and White 
у y à .” Am J Psy- 
chol 58:48 О 245. * (PA 20:855 
> n м F. Аѕніжү. “Intelligence of Northern 
Negroes and Southern Whites in the First World War." Ат 
J Psychol 58:161-88 Ap '4s. * (РА 19:2299) 
+ L4 AND Bitterman, M. E. “Notes on the 
Results of rum Intelligence e Wb їп World War L" Sci 
21:204 
. "Losses and Gains on the Alpha Group 
Psychol 8 * (PA ) A And 
Physiol & chol 40:1 е '47. 21:4372 
8 xj А Г? Fallacy Underi: ihe Garrett's 
Data of the Army Alpha and Beta Tests—A Com- 
ment.” Letter. Sci Mo БЕТ» О '4 
R. 


Joun T. Daey, Technical Director, Research 
Division, Bureau of Naval Personnel, Wash- 
ington, D.C. 

Nearly 35 years after its development, a re- 
examination of this test reveals a rather startling 
conformity between many features of the Army 
Alpha and much of current test theory. For ex- 
ample, one practice which many test theorists 
now strongly urge is the use of numerous short 
subtests in a test battery or omnibus test rather 
than a restricted number of long subtests. The 
original Army Alpha is a perfect example of 
sucha test with eight short subtests administered 
in a combined testing time of only до to 50 
minutes. In this respect the Army Alpha strongly 
resembles the currently employed General Apti- 
tude Test Battery of the United States Employ- 
ment Service. As a matter of fact, all of the cur- 
rently employed comprehensive classification 
batteries bear many points of resemblance to the 
Army Alpha and contain many subtests identi- 
cal with, or closely similar to, some of the sub- 
tests in it. When examined from the point of 
view of modern factor theory, the various sub- 
tests of the Army Alpha appear to cover fairly 
well most of the factors in the intellectual do- 
main, although the factorial purity of some of 
the subtests could be improved upon. In fact, 
the Army Alpha, with the deletion of items now 
obsolete in content, could well form the nucleus 
of a modern type of comprehensive classification 
battery. If supplemented with measures of the 
various perceptual, spatial, mechanical, and mo- 
tor factors, it could do a fairly good job of pre- 
dicting differential success in a wide variety of 
vocational and military specialties. It would ap- 
pear that the builders of the original Army Alpha 
built even more wisely than they realized. 

While the Army Alpha is an admirable pi- 
oneering product with many features in accord- 
ance with the latest testing theory, it still has 
many obsolete features that limit its utility as 
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an omnibus test of general intellectual aptitude, 
For one thing, even in the revised forms, it fails 
to give sufficient differentiation at the lowest 
literate levels because it does not use an ade- 
quate formula to correct for chance success. In 
a highly speeded test such as the Army Alpha 
with some multiple choice items, it is impossible 
to interpret low scores below or near the mean 
chance score that would occur if subjects guessed 
at each multiple choice item. For the Schrammel- 
Brannan revision this mean chance score is 
approximately 59 and falls near the 15th per- 
centile for the CCC norms. The test scores for 
the lowest 15 per cent of such a population would 
not seem to be interpretable. Those revised 
forms that fail to time each subtest separately 
are even less appropriate for administration to 
low aptitude subjects. In such a case it is pos- 
sible for one slow worker to plod along and bog 
down on an early subtest while another shops 
around for easy items or else responds to all the 
multiple choice items first. 

The highly speeded nature of the test is also 
objectionable in a test designed to be adminis- 
tered to low as well as high aptitude subjects. 
Most modern test theorists lean toward the use 
of a more adequate allotment of working time in 
order to allow most subjects to respond to all 
items. If such a test is in spiral omnibus form, it 
may legitimately be given under relatively un- 
speeded conditions, without separate timing of 
subparts, and be scored rights only, thus be- 
coming more nearly self-administering. 

However, if the Army Alpha were to be re- 
vised in light of the above principles, the scores 
could no longer be employed with the original 
norms. Thus the Army Alpha, even in revised 
form, is not generally recommended except in 
cases where a prime desideratum is to be able 
to compare the test performance of the subjects 
with the norms for World War I soldiers. 


Wirus C. SCHAEFER, Institute for Research in 
Human Relations, Washington, D.C. " 
The Army Group Examination Alpha and its 
revisions have a long history of research applica- 
tion and practical usefulness, and this record of 
service will undoubtedly continue on into the 
future. While there is an extensive bibliography 
of relevant research references, none of this 
background material is referred to in the brief 
manual. This is unfortunate since for many 
users of the test the manual is the only reference 
for test interpretation which is directly avail- 
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able. This reviewer believes that a real service 
could be performed for users of the Revised 


Alpha Examination by the extension of the. 


manual to include a list of selected references 
and a systematic summary of the findings of 
greatest practical value in the use of the test. 
Certainly the test publishers recognize the de- 
sirability of providing readily available back- 
ground interpretive material for effective test 
usage as evidenced by their excellent manual 
for the Differential Aptitude Tests. 

The use of this test involves several practical 
considerations. For one thing, the test is not 
self-administering. It requires careful oral pres- 
entation of all instructions and strict adherence 
to time limits for eight subtests, thus making it 
necessary to limit the test group in size to one 
in which there will be no hearing problems and 
in which effective supervisory control can be 
maintained without difficulty. Then, the test is 
set up for hand scoring only. While this may 
be a relatively rapid clerical process for small 
group administration, it can be quite unwieldy 
if large groups are involved. Probably most im- 
portant, no normative or intercorrelational data 
are contained in the manual for subtest scores 
either singly or combined ; hence, the user must 
do his own research for general or specific com- 
parative group performances. If part scores are 
not to be used, then considering the administra- 
tive difficulties of subtest time limits, some single 
time limit spiral-omnibus test form would ap- 
pear to be justified. Part scores are no end in 
themselves simply because they were historically 
available and persist as a result of the general 
usefulness of their sum total in correlating with 
other general intelligence tests or work criteria. 

Test norms are presented in terms of centile 
ranks separately for high school boys and girls, 
freshmen and seniors, and for army enlisted 
men. The brief explanation of the derivation of 
these norms contained in the manual is neither 
adequate nor clear. The implication that the 
norms presented apply to high school boys and 
girls in general is not justified, nor is the heading 
“General Adult Population (Men)” for norma- 
tive data based on scores of the 768 army en- 
listed men mentioned above, even though these 
cases were selected on the basis of scores on the 
Army General Classification Test “to be repre- 
sentative of all enlisted men.” With the limited 
space available, the section on acknowledgments 
to schools cooperating in the latest standardiza- 
tion might profitably be condensed to allow for 


the provision of more useful interpretative data 
as, for example, occupation equivalents in terms 
of more useful centile rank distributions such as 
are provided for the AGCT. 

The Revised Alpha Examination, Forms 5 
and 7, are equivalent forms of a general intelli- 
gence test which has demonstrated usefulness in 
a wide variety of situations over a long period 
of time. The present usefulness of this test de- 
pends largely on the user’s knowledge of back- 
ground applications relevant to his problem. 
Since its original development, newer tests have 
been produced which are equally reliable and 
valid, as well as easier to administer and to score 
singly or in large group situations. For those 
situations in which specific subtest scores are 
desirable, newer tests with better rationale and 
greater experimental evidence for the specific 
parts are also available. 


[282] 
*California Test of Mental Maturity and Adap- 
tations. Grades kgn-1, 1-3, 4-8, 7-10 and adults, 9-16 
and adults; 1936-51; 5 levels (Pre-Primary, Primary, 
Elementary, Íntermediate, and Advanced) ; 5 editions; 
IBM for grades 4 and over; postage extra; 35€ per 
‘specimen set of any one level of any one edition, post- 
paid; Elizabeth T. Sullivan, Willis W. Clark, and 
Ernest W. Tiegs; California Test Bureau. 
а) CALIFORNIA TEST OF MENTAL MATURITY. Grades 
kgn-r, 1-3, 4-8, 7-10 and adults, 9-16 and adults ; 1936- 
51; I9 scores: 11 subtests (not listed here), verbal con- 
cepts, and combined scores of memory, spatial rela- 
tions, logical reasoning, numerical reasoning, total 
mental, language, nonlanguage; 1 form, ’51; manuals 
(51) ; $275 per 25 of any one level; separate answer 
sheets may be used for grades 4 and over; 4¢ per IBM 
answer sheet; 7¢ per Scoreze answer sheet ; 60€ per set 
of stencils for machine scoring of answer sheets of any 
one level; 4o¢ per set of stencils for hand scoring of 
answer sheets of any one level; Pre-Primary: 50(70) 
minutes; Primary: 65(85) minutes; Elementary, In- 
termediate, and Advanced: 84-92(100-105) minutes. 
b) CALIFORNIA SHORT-FORM TEST OF MENTAL MATUR- 
iry. Grades kgn-r, 1-3, 4-8, 7-10 and adults, 9-16 and 
adults; 1938-50; 13 scores: 6 subtests (not listed here), 
verbal concepts, and combined scores of spatial rela- 
tions, logical reasoning, numerical reasoning, total 
mental, language, nonlanguage; S-Form (50) ; man- 
uals (’50) ; $1.75 per 25 of any one level; separate an- 
swer sheets may be used for grades 4 and over; 4€ per 
IBM answer sheet; 76 per Scoreze answer sheet; 4o¢ 
per set of stencils for machine scoring of answer sheets 
of any one level ; 20€ per set of stencils for hand scor- 
ing of answer sheets of any one level; Pre-Primary: 
20(40) minutes; Primary: 42(60) minutes; Elemen- 
tary: 47(60) minutes; Intermediate: 51(60) minutes; 
Advanced: 52(60) minutes. 
c) CALIFORNIA TEST OF MENTAL MATURITY: LANGUAGE 
SECTION. Grades kgn-1, 1-3, 4-8, 7-10 and adults, 9-16 
and adults; 1942-51; formerly called California Intelli- 
gence Tests, Language Section; 5 scores: 4 subtests 
(not listed here), total language; 1 form, '51; $1.50 
per 25 of any one level; separate answer sheets may 
used for grades 4 and over; 4¢ per IBM answer sheet; 
7¢ per Scoreze answer sheet ; 20€ per set of stencils for 
machine scoring of answer sheets of any one level; 


282 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


per set of stencils for hand scoring of answer sheets of 
any one level; Pre-Primary: 20(28) minutes; Pri- 
mary: 26(34) minutes; Elementary, Intermediate, and 
Advanced: 40-41(46) minutes. 3 
d) CALIFORNIA TEST OF MENTAL MATURITY: NON-LAN- 
GUAGE SECTION. Grades kgn-1, 1—3, 4-8, 7-10 and adults, 
9-16 and adults; 1942-51; formerly called California 
Intelligence Tests: Non-Language Section; 12 scores: 
8 subtests (not listed here), memory, and combined 
scores of spatial relationships, reasoning, total nonlan- 
guage; 1 form, ’51; manual (32 ; $1.75 per 25 of any 
one level; separate answer sheets may be used for 
ue 4 and over; 4€ per IBM answer sheet; 7€ per 

coreze answer sheet; 40€ per set of stencils for ma- 
chine scoring of answer sheets of any one level ; 20€ per 
set of stencils for hand scoring of answer sheets of any 
one level; Spanish editions for grades 7-10 and adults, 
9-16 and adults available at same prices; Pre-Primary : 
30(42) minutes; Primary: 39(51) minutes; Elemen- 
tary, Intermediate, and Advanced: 48-51 (54-59) min- 
utes. 
е) CALIFORNIA CAPACITY QUESTIONNAIRE. Grades 7-16 
and adults; 1941-42; an abbreviated adaptation of 
California Test of Mental Maturity (1936-39); 3 
scores : nonlanguage, language, total ; 2 editions; Forms 
А, B; manual ['42]; $1.50 per 25 of any one edition, 
postage extra. 

1) Hand Scoring Edition. 1941; 30(40) minutes. 

2) Machine Scoring Edition. 1942; 2€ per IBM an- 

swer sheet; 20€ per machine scoring stencil; 20€ per 

hand scoring stencil; postage extra; 35(45) minutes. 
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Меи A Psychocultural Interpretation оѓ the „ШШ 
est Performance of Ceylon University Entrants. 

R 16:371-5 Je 's1. * 

J Consult Psychol 15:516 D '5r. Laurance 
F. Shaffer. [Review of the California Short- 
Form Test of Mental Maturity, 1050 S-Form.] 
The revision of this group mental ability test 
places more stress on the differential aptitudes 
measured, and less on the total MA and IQ. The 
test for each stage of maturity consists of seven 
parts, presumably comparable from level to level 
because of similar content. Scores are obtained 
for total mental factors, language factors, non- 
language factors, spatial relationships, logical 
reasoning, numerical reasoning, and verbal con- 
cepts. Total score reliabilities range from .92 to 
.95, and part-score reliabilities from „8т to .95› 
being generally higher at the upper age levels. 
The subtest intercorrelations, mainly from the 
.20’s to the .40's, give some support to the dif- 
ferential interpretation of the profiles. Norms 
are based on very large groups that were con- - 
trolled with respect to age and school progress: 
No evidence is given, however, about the geo- 
graphic or socioeconomic distribution of the | 
normative subjects. Validity is defended in 4 
terms of high correlation with the Stanford- _ 
Binet, but the exact coefficient is not stated. No 
data are presented on the test's relationships to 4 
subsequent school achievement or to other ар- - 
propriate criteria. In these respects, the informa- | 
tion given in the manuals falls short of meeting 1 
desirable professional standards. 


URRAY А. 


For a review by Henry E. Garrett and 1 1 
cerpts from reviews of the California Test a 
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Mental Maturity, see 3:223; for reviews by 
Anne Anastasi and Emily T. Burr of the Cali- 
fornia Capacity Questionnaire, see 3:222; for 
reviews by Raymond B. Cattell, F. Kuhlmann, 
and an excerpt from a review of the California 
Test of Mental Maturity, see 40:1384; for re- 
views by W. D. Commins, Rudolf Pintner, Ar- 
thur E. Traxler, and an excerpt from a review of 
the California Test of Mental Maturity, see 
38:1042. 
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*Chicago Non-Verbal Examination. Ages 6 and 
over; 1036-47; may be administered orally or in pan- 
tomime; 1 form, '36; manual ['47] : $3.45 per 25; ата 
per manual and scoring keys; $1.25 per specimen set; 
postpaid; (40 minutes; Andrew W. Brown with the 
assistance of Seymour P. Stein and Perry L. Rohrer; 
Psychological Corporation * ~ 

REFERENCES 


_ 1. Brown, ANDREW W. “The Development and Standardiza- 
tion of the EAS Non-Verbal Examination.” J Appl Psychol 
24:36-47, 122-9 F, Ap '40. 


For reviews by Robert G. Bernreuter, Myrtle 
Luneau Pignatelli, and S. D. Porteus, see 40: 


1387. 
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XClassification Test for Industrial and Office 
Personnel. Adults; 1943-49; Forms A ('47—revision 
of 1943 form), В (47) ; mimeographed manual [47]; 
norms (49) ; $2 per 25, postage extra; 50f per speci- 
men set, postpaid; 10(15) or 15(20) minutes; Jay L. 
Otis, Evelyn Katz (A), Robert W. Henderson (A), 
Mary Aiken (A), David J. Chesler (B), and Gardner 
E. Lindzey (В); Personnel Research Institute, West- 
ern Reserve University. * 


REFERENCES 


т, Oris, Jay L., AND Снеѕіев, Davin J. 
SH Ability." J Appl Psychol 33:146-50 Ap '49. 
п 


«А Short Test of 
* (PA 
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XCollege Entrance Examination Board Scholas- 
tic Aptitude Test. Candidates for college entrance; 
1926-51; available only in College Entrance Examina- 
tion Board Admissions Testing Program (see 526) ; 2 
scores: verbal, mathematical; 180(210) minutes; pre- 
pared by the Staff of Educational Testing Service; 
program administered by Educational Testing Service 
for the College Entrance Examination Board. * 
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In 1949 Fall Testing Program in 'ndependent Schools and 


Supplementary Studies. Educational Records Bulletin, No. 53. 


New York: Educational Records Bureau January 1950. Pp. 


EST трей е cid ie Ар Test—Items, 
Scores, and Coaching.” Col Board R 15:235-9 N '51. * 
Freperick В. Davis, Professor of Education, 
and Director of the Educational Clinic, Hunter 
College, New York, New York; and Director, 
Test Research Service, Bronxville, New York. 
[Review of Forms XSA2-Z,, XSA1, YSA 
ZSA1, ZSA2,.] 

"This test includes five 30-minute speeded sec- 
tions. Three of these appear to measure prin- 
cipally word knowledge and reasoning ability 
in verbal materials, and two of them principally 
knowledge of arithmetic, algebra, and elemen- 
tary geometry plus ability to reason in quantita- 
tive terms, The items in all of the sections show 
evidence of careful construction and editing. 
However, to the reviewer they lack originality 
and interest. The passages on which the reading 
items are based seem dull and, in general, poorly 
written. The difficulty of the items based on 
them often seems to derive from the muddiness 
of the passages rather than from the necessity 
for close reasoning ог sensitiveness to implica- 
tions on the part of the examinee. In all types of 
verbal items, the examinee who has a well- 
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rounded assortment of clichés will do well to 
consider them in selecting his answers. 

As in the case of most tests, some faulty items 
can be found. One poorly edited item, referring 
toa load of baled hay, asks, “If each bale averages 
110 pounds, how many bales of hay are there in 
his load?” Naturally, one cannot get an average 

` weight for a single bale except by weighing it 
more than once. What is meant is, “If the bales 
average IIO pounds each, how many bales are 
there in his load ?" A number of items are faulty 
in that the keyed answer is not so nearly cor- 
rect as could probably be provided. This fault 
is pointed up by the fact that the directions ask 
the examinee to mark the one correct answer, to 
choose “the correct completion" or “the one 
correct completion." The usual directions to se- 
lect the best answer would be better. 

Otherwise, the directions seem satisfactory 
except, perhaps, for their treatment of the prob- 
lem of guessing. They encourage shrewd guess- 
ing "since a shrewd guess is more often right 
than wrong," but continue, *Do not, however, 
waste time in haphazard guessing." This sen- 
tence implies that the scores are corrected for 
chance success; unless this is true, time spent 
in “haphazard guessing" is far from wasted—a 
fact widely known among high school students 
and almost universally known among prepara- 
tory school students especially coached to take 
the College Board examinations. 

Reliability and validity data have been ob- 
tained for the test. The reliability data available 
to the reviewer were computed by methods that 
may yield spuriously high coefficients for speeded 
tests like most sections of the Scholastic Apti- 
tude Test. However, it seems safe to say that the 
reliability coefficients of the verbal and quanti- 
tative scores are so much higher than their va- 
lidity coefficients that the reliabilities could vary 
considerably without appreciably affecting the 
test’s effectiveness for predicting academic suc- 
cess. 

The validity data indicate that this test is about 
as useful for predicting academic success as a 
test of its limited scope can be. Tests that have 
been commercially available for twenty years 
are about equally useful. Any unique merit in 
using the Scholastic Aptitude Test lies in the 
fact that one can be reasonably sure that the 
examinees will not have had access to it before- 
hand and that it will be administered and scored 
with scrupulous care and accuracy. 
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*College Transfer Test. Candidates for college 
junior year entrance; 1950-51; formerly called College 
Ability Test; 2 scores: verbal comprehension and Eng- 
lish expression, quantitative reasoning; test adminis- 
tered five times annually (December, January, March, 
May, August) at centers established by Educational 
Testing Service; application form and bulletin of in- 
formation may be obtained from Educational Testing 
Service; fee, $6 per student; fee includes scoring serv- 
ice and reporting of scores to any three designated col- 


leges; scores not reported to examinees; $1 per addi-' 


tional report; 180(210) minutes; test administered by 
Educational Testing Service for the College Entrance 
Examination Board. * 


REFERENCES 


1. ENGcELHART, Мах D. “Examinations to Facilitate Trans- 
fer of Junior College Graduates to Senior Colleges." Jun Col J 
20:332-6 F 'so, * 


[287] 
Culture-Free Test. Grades 9-16 and adults; 1944- 
47; 1 form, '44; manual ['47] ; $4 per 25; 35¢ per speci- 
men set; postpaid; 72(80) minutes; Raymond B. Cat- 
tell; Psychological Corporation. * 
REFERENCES 

1-4. See 3:228. 
RALEIGH M. Drake, Professor of Psychology 
and Head of the Department, Kent State Uni- 
versity, Kent; Ohio. | 

There is no doubt that a culture-free test is 
needed. The elimination of the influences of 
training and environment would make scores 
from intelligence tests much more meaningful 
than they are at present when a score is made up 
of unknown proportions of innate capacity and 
environmental effects. This test is made entirely 
of nonverbal materials. Mental ability is meas- 
ured by the amount and goodness of eduction 
of relations among the nonverbal symbols pre- 
sented. That most of the common cultural in- 
fluences are eliminated is obvious. Even the di- 
rections can be given in pantomime if necessary. 
Very little validity and reliability data are given. 
Correlations with the Army Alpha run from .50 
to .60 for four groups of high school students. A 
reliability coefficient, based on only one high 
school freshman class numbering 121, is given 
as .88. Percentile norms are given for high school 
freshmen and high school seniors based on 1009 
and 918 cases respectively. Since the test is ар- 
propriate for levels through the adult, more 
norms would be very desirable. 7 

The reviewer has given the test to various 
groups during the past five years. For a grouP 
of college seniors and graduate students at Kent 
State University, it correlated .84 (N = 28) 
with the Wechsler-Bellevue Intelligence Scale А 
83 with the Army General Classification Test 
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.84 with the Otis Self-Administering Test of 
Mental Ability; .44 (N — 49) with the Miller 
Analogies Test, which is a rather high level 
verbal reasoning test; .13 (N = 25) with the 
KSU Graduate Qualifying Examination, a 210- 
minute comprehensive examination covering the 
various fields of psychology; and, for specific 
courses, .11 (N = 22) with Systems, .19 (N= 
24) with Learning Theory, .31 (ЇЧ = 62) with 
Mental Tests, .31 (N = 25) with Social, and 
.45 (ЇЧ = 35) with Statistics. 

Correlation matrices including this test indi- 
cate that it is measuring very little of the memory 
factor, which is more prevalent in verbal tests, 
but that it correlates with what appear to be the 
eductive factors measured by other intelligence 
tests. 

It does not correlate as well (.36) with high 
school grades as does the Henmon-Nelson (.80) 
or the Otis (.62), nor as well (.36) with teach- 
ers’ ratings of intelligence as does the Henmon- 
Nelson (.71) or the Otis (.71). The difference 
could be accounted for by the absence of the 
memory and verbal factors in the Cattell and 
their presence in the two verbal tests, and by the 
usual overemphasis in school on verbal and 
memory factors as represented by school grades 
and teacher ratings of intelligence. Since its 
high correlations with various recognized intelli- 
gence tests indicate that it is measuring mental 
energy (general intelligence) as usually defined, 
the low correlations with grades may not be a 
serious drawback. In fact, its use in school might 
bea distinct advantage in that it would emphasize 
relation finding (eduction), reasoning, con- 
structive thinking, and problem solving, rather 
than memory. If teachers would think of intelli- 
gence in these terms, they might emphasize the 
attitudes and techniques conducive to logical 
thinking. The Learning To Think Series by 
Thelma Thurstone is an attempt in this direction. 

A communication from C. M. Bhatia, director 
of the Bureau of Psychology for the State of 
Uttar Pradesh, Allahabad, India, tells how the 
test was used for selecting applicants for gov- 
ernment service. Only Matrices I and п were 
given, Correlations with two other intelligence 
tests were .62 and .s9 (N —228). He says, 
“The use of Cattell's test thus appears to us to 
be perfectly justified in our present group. He 
is not, however, sure that the test is one hundred 
per cent culture free. He suggests that giving it 
to rural illiterate masses might indicate how free 


it is of culture and training. A. Edwin Harper, 
Jr., psychologist at Ewing Christian College in 


. Allahabad, India, has given the whole test to 


56 college students and reports the following : 


Test was administered to First and Second-Year Col- 

lege students, and correlated with marks in two-year 
course in General and Educational Psychology (which ` 
is % of their college work). Sigma is almost identical 
with Cattell’s, but Mean is at his 1oth percentile. Dis- 
tribution is normal, not skewed. Correlation with two- 
year average marks is .416 + .07. The large number 
not finishing suggests that this test is not “essentially 
an untimed test" for our group. It is also cumbersome 
to administer as instructions often take 10 to 15 min- 
utes, before understood. 
At least this test can be given to literate-illiterate, 
educated-uneducated, and to other groups of 
widely varying cultural backgrounds; it pro- 
vides a means for studying the effects of racial, 
cultural, and educational influences on mental 
ability test scores and perhaps even of obtain- 
ing some evidence concerning these influences 
on the constitutional factors responsible for men- 
tal capacity. In view of the widesweeping claims 
and poor experimental evidence in the Iowa 
Studies and in the Schmidt report, it is evident 
that the respective roles of nature and nurture 
have never been adequately evaluated and that 
the true relationship between an IQ, any other 
score on a mental ability test, and real mental 
ability (energy) has not been clearly conceived, 
let alone even approximately determined. A 
wealth of revealing research results awaits the 
intensive use of this and other similar relatively 
nonverbal and culture-free tests. 


For reviews by L. S. Penrose, Walter C. 
Shipley, and David Wechsler, see 3:228. 
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Рон Intelligence Tests.] Grades 2-4, 5-9, 9- 
16; 1924-42; 3 levels; $4.20 per roo of any one evel, 
postage extra; 18¢ per specimen set of any one level, 
postpaid; Harry J. Baker; Public School Publishing 


d) DETROIT PRIMARY INTELLIGENCE TEST. Grades 2-4; 
1924; Forms C, D; no data on validity; 30 (35) min- 
utes. 
b) DETROIT ALPHA INTELLIGENCE TEST. Grades 4-8; 
1924-42; Forms S ('41—a revision of Form M), T 
(’42—a revision of Form R); manual (41); 45 (50) 
minutes. 
с) DETROIT ADVANCED INTELLIGENCE TEST. Grades 9-16; 
1924-42; Forms V (25), W t ) ; no data on valid- 
ity; revised manual ('42) ; 40(45) minutes. 
REFERENCES 
т. Kuntmany, Е. “The Kuhlmann-Anderson Intelligence 
Tes Compared with Seven Others." J Appl Psychol 12:545-94 
2 


2. Sracor, М.У. "An Evaluation of Certain Intelligence 
Tests." J Appl Psychol 18:432-6 Je '34. 


For a review by W. Line, see 40:1393. 
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*Duplex Series of Ability Tests. Ages то, II, 12, 
13; 1947-50; 6 scores: verbal ability, logical reason- 
ing, numerical reasoning, spatial relationships, nonlan- 
guage factors, total; 4 levels; 3s. 6d. per revised man- 
ual ('50) ; 7s. per specimen set of any one level; pur- 
chase tax (British purchasers only) and postage extra ; 
Frank М. Earle; George С. Harrap & Co. Ltd. * 
а) NO. I. Age 10; I form, '47; 12s. 6d. per 25; non- 
timed (60) minutes. 
b) No. 2. Age 11; 1 form, '47; 12s. 6d. per 25; non- 
timed (80) minutes. 
с) NO. 3. Age 12; І form, '50; 17s. 6d. per 25; non- 
timed (90) minutes. | 
d) NO. 4. Age 13; 1 form, '50; 20s. per 25; nontimed 
(180) minutes. 


REFERENCES 
ificance of Ability Differences at 
sychol 21:193-202 О '47. * (PA 


Secondary UE Bee RE as d рррех Т ere s еа ux 
W. С. Emmett, Reader in Educational Re- 
search, University of Edinburgh, Edinburgh, 
Scotland. 

These tests were constructed to give guidance 
in selecting children for different types of sec- 
ondary school, and in particular for different 
courses within such schools. The issue of 1950 
differs from that of 1947 only in some altera- 
tions to Tests 3 and 4, and in the amendment of 
answers and norms in the manual. 

Test No. 1, suitable for 10-year-olds, starts 
with 20 easy practice items forming Part I. Part 
II contains 50 verbal items of types common in 
intelligence tests, while Part III has 5o arith- 
metical and spatial items. Marking is objective, 
except for Items 21 to 30 of Part II, where the 
marker is likely to meet difficulty, especially in 
cases of bad spelling. Percentile norms are given, 
indicating a median raw score of 49 with a 
standard deviation of то. The test is thus of a 
suitable order of difficulty ; the spread of scores 
is not very high. 

Test No. 2, for 11-year-olds, has 70 verbal 


‘items of usual type in Part I and 50 arithmetical 


and spatial items in Part II. Marking of scripts 
should proceed without much difficulty though 
the lay-out of the answers in the manual is cer- 
tainly not helpful. Half marks are awarded here 
and there for partially correct answers, and some 
items receive 2 marks and some 3. The maximum 
possible score is 145, with a median of 68 and 
standard deviation of about 26. Economy of 
effort and time on the part of both children and 
markers is best achieved when test scores give 
a standard deviation of from 20 to 25 points for 
every 100 points of raw score. From this aspect 
the test is not highly discriminative. 


Test No. 3, for 12-year-olds, contains 90 
verbal items in Part I and 75 arithmetical, Spa- 


. tial and mechanical items in Part II. Instruc- 


tions to the children in one or two places are 
liable to be misinterpreted, and quite a few of 
the answers given in the key cannot be accepted 
as unique, or even as correct. The maximum 
possible score is 196; the median score is 80 and 
the standard deviation is 34. For a similar test 
of 100 items the standard deviation would be 
about 17, showing relatively poor discrimina- 
tion. The 99th percentile score is only 148; sev- 
eral items must therefore be too difficult for 
children aged 12 and 13. 

Test No. 4, for 13-year-olds, is divided into 
four sections. Section A of roo items tests at- 
tainments in English; Section B is a mixed bag 
containing verbal classification items, analogies, 
reasoning and general knowledge, 9o items in 
all. These are followed by 25 arithmetical prob- 
lems and 35 spatial items in Section C. Then 
comes Section D with 35 geometrical items, 50 
mechanical and 15 spatial items of three dimen- 
sions. Some items receive two marks, others 
three; the maximum possible score for all 350 
items is 395. Marking must be a laborious 
process, and in some items the marker's judg- 
ment must be exercised. The test leaves the im- 
pression that much of it is too difficult for chil- 
dren of 13, even of grammar school standard. 
This is confirmed by a median score for boys 
of 134 (out of 395) and a 99th percentile score 
for boys of 285. The standard deviation of raw 
scores is 52, corresponding to 13 for an equiva- 
lent test of 100 items ; only a very low discrimina- 
tion is thus attained. j 

Considerable ingenuity in the construction of 
items is evident, but it is obvious that many 
items would not be present if they had been 
subjected to the scrutiny of preliminary trial. 
To allow unlimited working time must cause 


difficulties of administration. 


The reviewer has been in correspondence 
with the author about statistics missing from the 
manual. Norms were based on 1,000 unselecte 
children in each age group from an English 
county borough; as the mean IQ in different 
boroughs may vary by as much as то points the 
basis of standardisation is not very secure. Per- 
centile norms are given for each test, and ad 
each broad type of item within each test ; menta 
age and IQ norms are given for test total scores 
Reliability coefficients are not available. " 

As the main purpose of the tests was to classy 
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children for different courses in the secondary 
schools, the validity of different groups of items 
was ascertained by forming contingency tables 
comparing the numbers of children who would 
have been placed in certain courses on the basis 
of their test scores with the numbers actually 
taking such courses, it being assumed that the 
children have been correctly placed by the school 
staff, Agreement between the test results and 
teachers’ placings on a five-point scale occurred 
in from 8o to 90 per cent of the cases. 

In general, tests of this type should serve a 
most useful purpose. The tests under review, 
however, should undergo considerable revision 
and should be substantiated with more statisti- 
cal information before they can be recommended. 


SraNLEY D. Nisser, Professor of Education, 
University of Glasgow, Glasgow, Scotland. 

These tests are designed for educational guid- 
ance during the period of transition from the 
primary to the secondary school, Earle has for 
long insisted (z, 2) that the prevalent British 
practice of making a sharp division, about the 
age of тт, into two groups (those fit to go to a 
Grammar or Senior Secondary school and those 
who are to go to a Modern or Junior Secondary 
school) largely on the basis of all-round general 
ability is absurdly oversimplified and in fact 
fundamentally unsound. His emphasis has al- 
ways been on special aptitudes as opposed to 
general mental ability, and his tests have been 
devised to give profiles rather than single total 
scores, He claims that the tests can predict, with 
fair accuracy, the type of secondary school course 
in which a pupil is likely to be most successful, 
but his classification, unlike the usual dichotomy, 
has as many as 15 different types and levels of 
course, The Duplex tests represent his latest 
attempt to provide a battery which can be used 
to guide pupils into the courses most appropriate 
for them. 

The directions given in the manual for using 
the tests are naturally more complicated than 
those for tests with a more limited aim, since the 
classification depends essentially on profiles of 
subtest scores. The necessary figures are never- 
theless clearly and definitely presented in tabu- 
lar form. The tests have been carefully con- 
structed, and the provision of separate sets for 
each age group makes for finer discrimination, 
and takes account of psychological differences 
between the younger and the older children. 

I believe these tests are very valuable, but not 


so much, at their present stage of development, 
for regular routine use in schools as for research 
workers, including heads of schools who are ex- 
perimenting with more scientific ways of classi- 
fying, or at least giving guidance to, pupils em- 
barking on their secondary education. I am 
convinced that Earle's approach is the right one 
and look forward to the time when tests of this 
kind may be confidently recommended for gen- 
eral application: that is why I hope research 
workers will use these tests extensively, I hesi- 
tate, however, to recommend them for general 
use because I am not satisfied that the process 
of validation and refinement is yet complete. 
Admirable as have been Earle's earlier experi- 
mental trials, the numbers he has been dealing 
with are too small to admit of adequate stand- 
ardization or follow-up. The final determination 
of the best test profiles for each type of course 
will have to be made as the result of numerous 
and complex follow-up investigations. The re- 
liability both of tests and of subtests must be 
accurately assessed, in order to let users know 
the standard errors of individual scores; and 
many item analysis studies will have to be made 
and reported. At present I do not feel that the 
profiles given can be surely enough established 
for general acceptance (for instance, the nega- 
tive weighting of nonverbal subtests in select- 
ing pupils for “linguistic” courses would require 
more investigation), though they are obviously 
on the right lines; reliability coefficients are not 
given in the manual, and it is probable that the 
tables of percentiles, especially for small subtests 
of 10 or 15 items, give a false impression of the 
stability of the raw scores. I have doubts, too, 
about a few individual items and should like to 
see new item analyses carried out on them. 
Finally, there is no indication in the manual of 
the size and composition of the standardizing 


+ groups on which the norms for the Duplex tests 


were drawn up. 

To sum up, therefore, I believe these tests 
represent an important contribution to research 
in the scientific educational guidance of children 
entering the secondary schools, but I am not 
convinced that they are ready yet to be issued 
for general use in the schools. 


Occupational Psychol 22:161-2 JI *48. P. E. 
V (ernon). * The manual....does not give as much 
information as one would like on the principles 
of construction * By 13 years, although g is still 
the main source of individual differences in 


12 


abilities, additional types of ability—probably 
influenced by interests and temperament—may 
be distinguished, the main ones being: (4) 
Knowledge of words and comprehension of 
sentences. (B) Reasoning—seeing relations be- 
tween objects and ideas, recognizing and describ- 
ing attributes of persons and things. (C) Recog- 
nizing relations between, and carrying out opera- 
tions with, (i) numbers, (ii) shapes. (D) Com- 
prehending the structure and functions of shapes, 
mechanical and other objects, and carrying out 
operations with them. Among younger children 
abilities are less specialized. А and B tend to 
coalesce, also C and D, and g is relatively more 
predominant, so that only very tentative diag- 
noses can be made before 12 years. For each of 
the main kinds of secondary course—two or one 
language, scientific, technical, commercial, and 
general with or without a practical bent—there 
exists an optimum range of scores on g (i.e. 
the test as a whole) and on one or more of the 
sets of sub-tests corresponding to these ability 
types. How successfully do the four batteries 
of tests achieve these'aims? If this review appears 
somewhat critical, it should not be regarded as 
depreciating Dr. Earle's ingenious and pioneer- 
ing approach to an extremely difficult problem. 
As an overworked headmaster he should be 
congratulated for having progressed so far in a 
field which needs years of full-time research in 
a large number of schools. In the first place the 
reviewer is doubtful, in the absence of factorial 
evidence, whether these particular four types 
are really self-consistent and distinctive. Num- 
ber abilities tend to be more closely linked with 
verbal than with spatial and mechanical among 
average adults, and only coalesce into a scientific 
factor at very superior levels, American research 
also indicates that geometry ability does not de- 
pend on k. Secondly, it is well known that ability 
structure depends to a large extent on the teach- 
ing or training received. Thus the classification 
reached in a Scottish multilateral school may or 
may not apply to different educational systems. 
(That the tests may retain their prognostic value 
under different conditions has, however, been 
indicated by research reported in Dr. Earle’s 
1944 book.) Thirdly, it is not always easy to 
see the connexion between some of the sub-tests 
and the abilities at which they are aimed. Thus 
printed Directions items occur in the 10+ and 
12+ tests of CD-type abilities. No doubt it is 
convenient to have different batteries for each 
. age group, but the later tests do not always de- 
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velop clearly from earlier ones. Thus Mechani- 
cal Comprehension items, and arranging graded 
objects in order, occur at 12+ but are omitted 
at 13+. The writer would prefer to see the (ap- 
proximately) 30 different types of items from 
all four batteries all tried out on 11 to 13-year- 
olds, factorized, and followed up by multiple 
correlation technique. Each such sub-test would 
be better given with a time limit. The present 
batteries have no limits and last up to бо, 80, 
90 and 180 minutes, respectively. But the writer 
knows of no evidence that untimed tests are more 
prognostic than timed among children, and won- 
ders whether the D items at the end of the 13+ 
battery receive a fair share of the children's zest 
under untimed conditions. Lastly, there are sev- 
eral minor defects, indicative of hurried con- 
struction. The range of difficulty in the sub-tests 
varies enormously; the sSoth percentile scores 
fall anywhere from o per cent. to 75 per cent. 
correct. There is a misprint in one test which 
has to be corrected in the Manual, and two foot- 
notes on p. 43 of the Manual are omitted al- 
together. In occasional instances the scoring is 
unclear (e.g. Do items 61 to 63 in Test 3, Part 
II, receive 3 or 4 marks in all?), and in many 
cases it is unnecessarily elaborate. Some of the 
items have “open” answers, where doubtfully 
correct alternatives might be given. Others tend 
to be "tricky." This makes one wonder whether 
full item-analysis has been carried out, since 
such items are usually shown thereby to possess 
poor validity. The arrangement of the sub-tests 
is far from convenient ; a great deal of searching 
is needed to determine which sets of items are 
supposed to represent which types of ability. 
Surely it would have been possible to demarcate 
them clearly by letters or numerals. The re- 
viewer's conclusion then is that the Duplex Tests 
provide admirably varied and promising ma- 
terial fora thorough investigation of classifica- 
tion at 11 to 13+, rather than a finished product. 
Nevertheless it is greatly to be hoped that psy- 
chologists and qualified teachers (overcoming 
their scruples at having to pay 7d. to лой. for 
each test paper) will try out this material, since 


‘it is only by collection of results on a large scale 


under diverse conditions that further progres? 
can be made. і 


[ 290 ] TU 
*The Essential Intelligence Test. Ages уа. 
1040-49; title on manuals is Essential Junior "Mor 
gence Test; Forms [A] (40), B (49), C (48) § 401 | 
bution of Form В is restricted; Form A manual 1 40! 
Form B manual ['49] ; 25-49 copies, 5160. each; 
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single сору; 15. 6d. per manual; postage extra; 45(55) 
minutes; Fred J. Schonell and R. H. Adams; Oliver & 
Boyd Ltd. * 

Е. W. WARBURTON, Lecturer in Educational 
Psychology, University of Manchester, Man- 
chester, England. 

This is a verbal test of general intelligence 
for children between the ages of 7 and 11 plus. 
The authors claim that it also gives valid results 
with dull or backward pupils of 11 years or 
more, provided that their reading age is at least 
71, years. It seems likely, however, to be more 
useful within the middle range of ages than at 
the extremes. 

There are 100 questions, most of which are 
multiple choice, but in some cases a single word, 
letter, or number has to be written by the child. 

Particular attention has been given to the 
attractiveness of the test items, and the layout 
of the questions is excellent. Instructions to the 
children and to the tester are clear and contain 
some valuable advice on testing method. The 
children are introduced to a book of “puzzles” 
and are told, amongst other things, that there 
are no catches (an important point) and that 
spelling mistakes will not count. The tester is 
told to begin by working out the test himself, to 
encourage a spirit of enjoyment throughout 
the test session, and is warned against divulging 
IQ's to the class. 

The back page of the booklet is used for prac- 
tice. The tester works through 11 typical ques- 
tions orally with the class. Help to individual 
children is presumably not allowed. 

Scores are converted into mental ages from 
which IQ's can be obtained. The dangers in- 
herent in this method are largely avoided, since 
the standardisation sample gives a very close 
fit to a normal distribution and has a standard 
deviation of 16.1, which means that IQ's are 
fairly strictly comparable, in the statistical sense, 
with those given by the Binet and the widely 
used Moray House tests. The mean IQ is 103.1, 
however, so that some error is introduced by this 
method of obtaining IQ's. This is not likely to 
be due to the sample chosen for the standardiza- 
tion, as the authors state that careful considera- 
tion of the groups chosen make up for limitations 
in the number of children tested (3,200). No 
details of this sample are given in the handbook. 
A stencil to facilitate marking would be an 1m- 
provement. : 

The reported split-half reliability coefficient of 
.92 is based on an inadequate group of only 66 


pupils, confined to the 11-year-old age group. 
The manual also reports a reliability coefficient 
of .92 based upon retesting an unspecified group 
of pupils after an interval of one year. One thus 
suspects that the test is reliable. 

There are two parallel forms of the test. Form 
A is on general sale; Form B is on the “secret” 
list and is meantime for sale only to education 
authorities. The two forms correlate .95. Form 
B was standardized on 7,759 pupils, and has a 
test-retest reliability after an interval of three 
months of .92. Form В is distinctly more diffi- 
cult than Form A (presuming that equivalent 
groups were used for standardising the two 
forms). 

There is a shortage in Britain of good intelli- 
gence tests for the age range 7 to 11. This test 
has already done a great deal to bridge the gap. 
The authors have achieved very well the pur- 
pose for which the test was designed. 

е [291] 

*Figure Reasoning Test: А Non-Verbal Intelli- 
gence Test. Ages 10 and over; 1949; I form; 3s. 6d. 
per test, postage extra; separate answer sheets must 
be used; answer sheets must be duplicated by test pur- 
chaser; 30(40) minutes; John C. Daniels; Crosby 
Lockwood & Son Ltd. * 


REFERENCES 
т. Daurzzs, Jon C. Study in the Variability of Intelligence 
Tests, Master’s thesis, Durham University (Durham, Eng- 
land), 1948. 


E. J. С. Вклрғокр, Senior Lecturer in Educa- 
tion, The University of Sheffield, Sheffield, Eng- 
land. 

This test could best be described as a variant 
of the Progressive Matrices: indeed, some of 
the matrices are practically identical, but the 
new test is intended to differentiate more ac- 
curately among the higher ranges of intelligence. 
The author claims that his test is highly reliable 
and heavily saturated with the intelligence fac- 
tor, but unfortunately no experimental data are 
offered in support of these claims. Distributions 
of raw scores, means, and standard deviations 
of each age group would have given more con- 
fidence to psychologists who: wished to use the 
test, than do the tables of standard scores which 
are all expressed in terms of a standard deviation 
of 15 per cent. 

An item validation was carried out on the 
original version of the test; the result was not 
impressive. Although the author rejected items 
giving a correlation of less than .25 with the 
criterion because they were below the I per 
cent level of significance, half the remaining 45 
items gave a correlation of less than .35. An item 
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whose correlation with the criterion is definitely norms for ages 12 to 15 will be the most reliable, 
not due to chance is not necessarily a satisfactory Data are being collected for more broadly based 
measure of the variable which the criterion norms. 
represents. The test would have been a better The split half reliability of the test is .96, and 
instrument of measurement if it had been con- retest after a year's interval gave a reliability 
structed from only 20 of the more valid items. of the order of .7. Both of these must be con- 
This would have meant scrapping Items 40 to sidered as satisfactory. In a factorial analysis 
45, the very ones which, presumably, were in- of eight miscellaneous tests of scholastic and 
tended to differentiate the higher intelligences. intellectual ability the general factor loading of 
Items 4, 7, 12-4, 17 and 26 all give validity the test was of the order of 0.85, with an addi- 
correlations above .55. The reviewer is of the tional nonverbal factor present. 
opinion that a close examination of the charac- The test is produced in a small, sturdy book- 
teristics of these matrices would well repay the . let; the diagrams are, in general, boldly drawn, 
author or anyone else who wished to improve though at times the paper is not quite opaque 
the test. enough. The norms, instructions for administra- 
Since the validation of the original version of tion, answer key and specimen answer sheet are 
the test was carried out, one of the examples and all printed on one folded sheet; a small booklet 
nine of the items have been modified. One is left would be an improvement. The cost of 3/6 per 
to wonder what effect such modifications have test is not, we suppose, excessive as these things 
had. go nowadays, and the booklet can be used more 
The instructions for administering the test than once. 
might satisfy a practising schoolmaster but The test appears to be a fairly useful general 
hardly a psychologist. If followed literally, some purpose test of intelligence but without specific 
groups of children might, by proffered solutions value in any field. With so wide an age range, 
and by many questions, get more tuition than no high degree of discrimination is likely. The 
other groups who were shy or dull. items in the test also have a wide range of diffi- 
If, instead of applying the complete test to culty, the earlier items being very easy ; but the 
thousands of additional children, fresh norms last, even with the answer key available, gave 
were calculated from, say, 15 or 20 items with the reviewer considerable difficulty. 
the highest validity coefficients, a really valuable Brit J Ed Psychol 20:206 N ’50. * a non- 
addition might be made to the present very lim- verbal group test which....has forty-five items 
ited number of good nonverbal tests. and is designed for use with subjects over the 
age of ten years. Each item consists of two com- 
James MAXWELL, Principal Lecturer in Psy- pleted series of three diagrams each and a third 
chology, Moray House, Edinburgh, Scotland. incomplete series which can be finished by dis- 
This test, described as a “new non-verbal In- covering the relationships which exist between 
telligence Test,” is constructed on principles the drawings in the completed series. The an- 
similar to those of Raven's Progressive Matrices. swer is selected from six possible solutions. 
Each of the 45 items consists of a 3 X 3 square Standardisation was on 4,500 secondary school 
set of diagrams, with the last one missing. The children and norms are given in the form of 
subject selects the missing diagram from a choice standard scores having a mean of 100 and à 
of six. Six practice items are given, and the need standard deviation of 15. By extrapolation the 
for adequate explanation is stressed in the in- norms have been extended beyond the schoo 
structions. The order of items is a compromise range to include a rating for adults. There is also 
between logical sequence and degree of difficulty. a table showing a special classification based on 
_ Norms for ages то to 16 and for adults are - 8 grammar school population, but this will prob- 
given in the form of standard scores. These ey) 4 


Д 4 аге be of little use outside the area where it was 
derived from a population of English secondary . 


dardised as in the population given here 30 
school children (4,500 cases) adjusted to give cent. of the plis eon less us 108 an 
the prevailing proportions of different types of {һе distribution has been given an artificial nor- 
secondary schools. In such a population there mality which would seem to be hardly justified. 
will be a tendency to selection bias with the older The test is easy to administer and very quick an 
and younger children, and though some allow- easy to mark. Answers are written on a separate 


ance has been made for this, it is likely that the sheet of paper so that the test booklet can be 
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used over and over again. It is a great pity that 
it has been printed on rather transparent paper 
which allows one subtest to interfere with an- 
other. The binding, too, could be improved. 
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Goodenough Intelligence Test. Grades kgn-3; 
1926; also called Draw-a-Man; т form; $1 per 25 chil- 
dren's drawing sheets, postage extra; $3.12 per man- 
ual, Measurement of Intelligence by Drawings (see 1 
below), postpaid ; specimen set not available; nontimed 
(10) minutes; Florence L. Goodenough; World Book 
Co. 
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fornia (Los Angeles, Calif.), 1 38. 3 a У 

27. Fincert, Hyman H.; GAN, JULIA R.; AND SCHILDER, 
PauL. "The Goodenough Test in Insulin and Metrazol Treat- 
po RS Sd J General Psychol 21:349-65 О 39. 

14:322 Бл 

28, BENDER, LAURETTA. "The Goodenough Test (Drawing A 
Man) in Chronic Encephalitis in Children." J Nerv & Mental 
Dis 91:277-86 Mr '4о. * (РА 15:841) 

29. OAKLEY, C. А. "Drawings of a Man by Adolescents," 
Brit J Psychol 1:37-60 Jl °4о. * (PA 14:4800) 

зо. Srozrt, DomorHv TILDEN. “Тһе Drawing Ability of 
Mentally Retarded Children.” J Genetic Psychol 57:259-77 
D '40. * (PA 15:2427) 

31. Cuasz, JANz Mustard. “A Study of the Drawings of a 
Male Figure Made by Schizophrenic Patients and Normal 
Subjects." Char & Pers 9:208-17 Mr '41. * (PA 15:3400) 

34. SPRINGER, N. Norton. "A Study of the Drawings of 
Maladjusted and Adjusted Children." J Genetic Psychol 58: 
131-7 Mr ‘41. * (PA 15:4081) 

33. Dennis, WAYNE. Wi Berfordiance of Hopi Indian Chil- 
dren on the Draw-a-Man Test.” Abstract. Psychol B 39:578 
O '42. * (PA 17:449, title ору; 

34. Dennis, WAYNE. “The Performance of UD Children 
on the Goodenough Draw-a-Man Test." J Comp Psychol 34: 
341-8 D '42. * (PA 1751254) 

35. RussELL, Rocer W. “The Spontaneous and Instructed 
pnr 08 дам Children." J Comp Psychol 35:11-5 Е '43. 

17:1664 

36. GEIL, GEORGE А. “The Use of the Goodenough Test for 
Revealing Male Homosexuality.” J Crim Psychopath 6:307-13 
О '44. * (РА 19:1692) 

7- HAVIGHURST, ROBERT J., AND Janke, Leora Lona. “Re- 
lations Between Ability and Social Status in a Midwestern 
Community: 1, Ten-Year-Old Children.” J Ed Psychol 35: 
357-68 5 '44. * (PA 19:476) A 

38. McCartuy, DomorHEA. “A Study of the Reliability of 
the Goodenough Drawing Test of Intelligence.” J Psychol 
18:201-16 О '44. * (PA 19:270) 

39. NegpuaM, Norma R, "А Comparative Study of the 
Pertormance of Feebleminded Subjects on the Goodenough 
Drawing, the Goldstein-Scheerer Cube Test, and the Stanford- 
Binet." 4m J Mental Def 49:155 61 о + * (РА 19:2774) 

40, Bernie, RALPH Е, "Measurement of Adult Intelligence 
by оаа J Clinical Psychol 1:288-95 О '45. * (РА 20: 

74. 
Ё т. МсНосн, GeLoro. "Changes in Goodenough IQ at the 
Public School Kindergarten Level.” J Ed Psychol 36:17-30 
Ja °45. * (PA 19:1816) 

42. МсНосн, Grroro. “Relationship Between, the Goode- 
nough Drawing a Man Test and the 1937 Revision of the 
Stanford-Binet Test." J Ed Psychol 36:119-24 F '45. * (PA 


19:181 
edi отт, SiNA M. “Muscular Activity As an Aid in 
r-Je '45. * (PA 


Concept Formation." Child Dev 16:97-109 
20:598) 
44. Wurtz, Manet Frances Roporm. A Study of the Per- 
formance of A Di Feeble-Minded, and Norm Children 
on the седо Intelligence Test. Master's thesis, Uni- 
* versity of Iowa (Iowa City, Iowa), 1945. 

4s. Havicuurst, ROBERT J.; GUNTHER, Minna KOROL; AND 
Pratt, Inez Е1л1ѕ. "Environment and the Draw-a-Man Test: 
The Performance of Indian Children.” J Abn & Social Psychol 
41:50-63 Ja '46. * e a aum 

46. RosENZWEIG, SAUL. “Тһе ielo of an Amnesic Per- 
sonality.” J Personalit, 18:121-42 '46. * (PA 21:3062) 

47. SHRIMALI, P. L. “The ‘Standardization of the Goode- 
nough Intelligence Test for India.” Abstract. Proc 34th Indian 


:9 '46.* 
48. McCurpy, HAROLD ua: “Group and Individual Varia- 
bility on the, Goodenough Draw-a-Man Test.” J Ed Psychol 


0. DARKE, бү А,, AND Gri!) слонот A. “Homosexual Ac- 
ole to the Goodenough Test 
and to the Cornell Selectee Index," J Nerv & Mental Dis 


8:217-40 S '48. * (PA 23:8 
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54. ROSENZWEIG, SAUL; WITH THE COLLABORATION OF 1 
Levine Kocan. Psychodiagnosis: An, Introduction to Tests in 
the Clinical Practice of Psychodynamics, pp. er Ban tort 
23:3 

с. STONESIFER, FRED А, “A Goodenough Scale Evaluation 

of Human Figures Drawn by Schizophrenic and Non-Psychotic 
Adults.” J Clin Psychol 5139678 О '49. * (PA 4413813) 

56. D'Anczto, Rita Y. A Comparison of White and Negro 
Pre-School Children in Goodenough M and Language De- 
Mme Master's thesis, Fordham University (New York, 
N.Y.), 1950. 
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57. GoopenoucH, Frorence L., Np Harris, Dare В. 
“Studies in the Psychology of Children’s Drawings: II 1928- 
1949." Psychol B $7 3597433 S 'so. * (PA 25:2456) 

58. Јонмѕох, A. P.; ELLERD, ARTHUR A.; AND LAHEY, 
Tuomas H, “Тһе Goodenough Test as an Aid to Interpretation 
of Children's School Behavior.” Am J Mental Def 54:516-20 


Ap 'so. * (PA 25:364) H 
R 5 [A o RUSSE “Chan in Goodenough Drawings 


ges Р, 
аР With Changes in Social. Adjustment." J Clin Psychol 
6:282-4 Jl 'so. * (PA 25:7052) , 
бо. ROTTERSMAN, Leon. A Comparison of the IQ Scores on 
the New Revised Stanford-Binet, Form L, The Wechsler In- 
telligence Scale for Children, and the Goodenough “Draw a 
Man” Test at the Six Year Age Level. Master's thesis, "Uni- 
versity of Nebraska (Lincoln, Neb.), 1950. 


Naomi Stewart, Educational Testing Service, 
Princeton, New Jersey. 

Although the recent literature pertaining to 
the Draw-A-Man test reflects considerable in- 
terest in its possible uses for the study of per- 
sonality, this test, as developed and standardized 
by Goodenough, is primarily a measure of in- 
telligence and most appropriately evaluated on 
its merits as such. 

During the 25 years the test has been in 
existence, a number of studies have confirmed 
that it compares favorably in test-retest relia- 
bility with most group tests of intelligence ap- 
plicable in the same age range. It also compares 
favorably in validity, as demonstrated on the 
basis of its correlations with the Stanford-Binet 
within the age groups for which it was designed, 
while yet possessing the advantage of being non- 
verbal in character. However, any test whose 
major interpretation must depend on certain 
assumptions concerning its score distribution 
within various subgroups of the population 
cannot help but lose in value as these assump- 
tions lose in accuracy. This test is no excep- 
tion. 

The raw score, which gives the number of 
points out of a possible total of 51 on which a 
given drawing has received credit, is not ordi- 
narily used directly. Raw scores are converted 
into “mental ages,” and the ratio between the 
subject’s mental age and his chronological age 
is then taken as his Goodenough IQ. The table 
for converting raw scores into mental age equiv- 
alents derives from the norms for ages 3-13 
published by Goodenough in 1926. These were 
based on 2,306 children, age 4—10, in grades 
appropriate to their age, drawn almost entirely 
from schools in four cities in New Jersey and in 
New York City. Norms were obtained by 
smoothing mean raw scores for the children at 
each age from 4 to то, and then extrapolating 
downward to age 3 and upward to age 13. The 
values found by smoothing and extrapolating 
were then verified, without any change, on data 


for 1,445 additional children of normal grade for 
their age, mainly in the 6-12 age range, and 
drawn from schools in three other localities, The 
drawings of 1,876 children who were accelerated 
or retarded in grade placement were also ob- 
tained and examined in the course of the stand- 
ardization, bringing the entire number of cases 
involved to the total of 5,627 which is sometimes 
cited, although not all of this total number ac- 
tually entered into the means on which the norms 
were based. 

Have the mental age equivalences thus es- 
tablished remained unchanged for 25 years? This 
seems unlikely in light of the startling changes 
found to have occurred during the period be- 
tween World War I and World War II with 
respect to the distribution of scores among the 
general population on such tests as the Army 
Alpha—as revealed, for example, in data by 
Tuddenham * which show a rise of 50 points in 
median Alpha score during that time. Apart 
from the fact that the Draw-A-Man age norms 
are now 25 years old, a check on their accuracy 
would seem to be worthwhile, considering that 
even at the time of their original publication cer- 
tain investigators (73, 14) questioned whether 
the norms for ages 10 and above were not un- 
duly high. 

A norms verification study would also pro- 
vide opportunity for checking upon possible 
changes in validity of any of the 51 scoring points 
used in the Goodenough scale. With the wide 
dissemination of comic books among present-day 
moppets, the drawing instruction provided for 
them on television, etc., it is not unlikely that the 
development of certain drawing features ПО 
longer proceeds along the same lines as at the 
time of the original study. Here, too, apart from 
the possible effects of influences of this sort, 
certain investigators (6-8) have taken 18516 
with some of the original scoring points on 
grounds that they penalize subjects who draw 
full-face rather than profile figures, and that this 
penalty is unwarranted, reducing the differentia 
tion afforded by the instrument. er 

These suggestions are not intended to imp» 
any fundamental defect in the test or in its basic 
scoring rationale. The Goodenough Draw-A- 
Man test is undoubtedly a most valuable instru- 
ment and as such is well worth the effort T€ 
quired to keep it fully up-to-date. 
ence in World Wars 


1 Tuddenham, Read D. “Soldier Intelli 
I and IL” Hw Psychol 3:54-6 F '48. * (PA 22:2954) 
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*Graduate Record Examinations Aptitude Test. 
Senior year college through graduate school and can- 
didates for graduate school; 1949-51; available only 
in Graduate Record Examinations programs (see 
527) ; 2 scores: verbal ability, quantitative ability; 180 
(220) minutes; prepared by the Staff of Educational 
Testing Service; Educational Testing Service. * 


REFERENCES 
1. Saum, James A. “The Graduate Record Examination and 


Its Application in the Stanford School of Education." Abstract. 
Calif J Ed Res 2:183 S 751. * ) 

2. STAFFORD, JoHN W. “The Prediction of Success in Gradu- 
ate School.” Abstract. Am Psychol 6:298 Jl 's1. * 

7. P. GUILFORD, Professor of Psychology, Uni- 
versity of Southern California, Los Angeles, 
California. 

This test was designed to predict success of 
students in graduate work. But nowhere in the 
manual, or in other literature accompanying 
the tests, is there a discussion of the nature of 
aptitude for graduate study. The content of the 
test and its “verbal” and “quantitative” scores 
indicate that aptitude at the graduate level is 
virtually the same as that at the undergraduate 
level where similar tests have been used for 
many years. Graduate-school catalogs commonly 
state that it is not enough for the student to earn 
good grades in courses and that graduate work 
stresses skills in independent scholarship and 
research. Although common practice in gradu- 
ate work may belie such statements, the fact 
remains that aptitude tests should be aimed at 
something more than grade getting. 

It is to the credit of the publisher of this test 
that criteria other than grades have been used 
in the validation of its graduate tests—ratings 
of seminar and research performances. No vali- 


dation data have as yet been reported for this. 


particular test. From earlier validations of a 
verbal score in a number of universities and sub- 
ject fields, we find indications of what to expect 
from the verbal score in this test. The correla- 
tions with grades ranged from —.19 to .67, with 
a median of .32. The correlations with ratings 
ranged from .14 to .77, with a median of .34. 

The manual is extremely explicit and de- 
tailed with respect to procedure for administra- 
tion of the test. There are norms, based upon 
samples of adequate size. There is very little 
other information of the kind we usually expect 
in manuals. 

From information provided the reviewer but 
not published generally, it can be reported that 
reliabilities (Kuder-Richardson 20) of the 
scores are high (verbal, .97 and .95; quantita- 
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tive, .89 and .93; in Forms X and Y, respec- . 
tively). Intercorrelations of verbal and quanti- 
tative scores are sufficiently low (about .50) to 
justify differential predictions. No indications 
are given as to what differential predictions can , 
be made or as to their margins of error. The 
quantitative part was found to be highly speeded 
(only 25 and 17 per cent completed the part), 
but the test is essentially a power test in spite of 
this fact. Other parts are less speeded. More 
time, however, might well be given to the more 
speeded parts. } 

Although the items in the verbal part are 
somewhat varied in form (synonyms, antonyms, 
analogies, sentence completion, and reading com- 
prehension), the prevailing variance in the score 
is probably that of the verbal comprehension 
factor. If only one score is to be given, it might 
as well be based upon a vocabulary test. If other 
factors, probably reasoning, are believed to be 
important, it would be desirable to bring them 
out strongly in other types of tests and to give 
them separate scores so they can be weighted 
independently. 

One or two minor technical points concerning 
items might be mentioned. In the reading com- 
prehension items the alternative answers are 
generally about as long as the stem of the item ; 
the item and the answers read much like a series 
of true-false items. This raises the question 
whether it might not be just as well to substi- 
tute five true-false items for each five-choice 
item (with more than one true statement in five, 
of course). There is also the question, applying 
to any multiple choice test, whether five choices 
are enough better than four to justify the extra 
printing and time. Stated otherwise, would the 
time and space devoted to fifth responses not 
be better devoted to additional items? 

The standard scale originally adopted for the 
Graduate Record Examinations has a mean of 
500 and a standard deviation of 100. This has 
probably been justified at some time and some 
place, but it seems a presumptuous exhibition of 
refinement of measurement, for even with re- 
liabilities as high as these, errors of measure- 
ment are appreciable in the raw scores and are 
multiplied many times over in going to the stand- 
ard scale. 

In summary, this test aims at the selection of 
graduate students in the same ways in which we 
select undergraduate students. Its scores show 
high to very high reliabilities. The two forms 
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have undergone considerable refined statistical 
study, the results of which have not yet been in- 
cluded in the manual. No information concern- 
ing validities of these forms is given, but similar 
verbal scores have demonstrated substantial 
value in predicting grades and ratings in some 
situations. It is urged that improvements will 
depend very much upon progress in understand- 
ing and arriving at criteria of success in gradu- 
ate work which go considerably beyond course 
gradés. 


Cart I. Ноуглмр, Sterling Professor of Psy- 
chology, Yale University, New Haven, Con- 
necticut. [Review of Forms XGR and YGR.] 

This newly developed scholastic ability test 
for use in the selection of students for graduate 
school is the successor to the Verbal Factor and 
Mathematics subtests in the Profile Test series 
of the Graduate Record Examinations, and ac- 
cordingly yields two scores, one for verbal ability 
and one for quantitative ability. The verbal abil- 
ity portion of the test includes vocabulary defini- 
tion, verbal reasoning, and reading comprehen- 
sion questions, and the test of quantitative 
ability contains items on arithmetical reasoning, 
algebraic problems, and the interpretation of 
graphs and descriptive data. 

Administration and scoring of the tests are 
kept under careful control. The tests can be 
taken only at centers established by the Educa- 
tional Testing Service, but a substantial number 
of these have been set up here and abroad. Scores 
on the test are reported both to the individual 
and to the graduate schools specified by the ap- 
plicant. Although the tests are kept highly con- 
fidential, arrangements are available for gradu- 
ate faculties considering their use to inspect 
them under appropriate conditions of control. 

Two forms of the test are available, Forms 
XGR and YGR. These are in the process of 
development and are not parallel forms with re- 
spect to content or timing. No data appear to be 
available with respect to their relative difficulty 
determined from matched samples. Scores are 
determined to some extent by speed. The pro- 
portion of candidates completing the test varies 
from 17 per cent for the quantitative portions of 
Form YGR to 80 per cent for the vocabulary 
comprehension portion of Form XGR. 

RELIABILITY, The reliability of the verbal por- 
tion of the tests is very satisfactory considering 
the rather narrow range of ability represented by 
college graduates applying to graduate schools. 


, 
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The odd-even reliability is .97 for XGR and 95 
for YGR. These results are based on samples 
of 500 candidates for admission to graduate 
schools. Slightly lower reliabilities have been ob- 
tained for the quantitative portions of the exami- 
nation (.89 for Form XGR and .93 for Form 
YGR). 

VALIDITY. Since the test is so new, there are 
as yet no validation data available, but arrange- 
ments have been made by the Educational Test- 
ing Service to secure information on this prob- 
lem from a number of institutions. The similarity 
of the verbal reasoning items to those of the 
Miller Analogies Test and the vocabulary items 
to those of the earlier Verbal Factor subtests of 
the GRE makes it appear likely, however, that 
the new test will be of real value in graduate 
student selection. Similarly the fact that the 
methods and procedures used in developing the 
mathematics portion of the Profile Test were 
closely followed in the construction of the pres- 
ent Quantitative subtest makes the new test 
appear promising. In the twenty-odd studies of 
the original Mathematics test in the Profile Test 
series, a median correlation of about .40 was 
obtained between test score and grades in grad- 
uate school. The predictive value of the new test 
may be expected to vary greatly for different 
fields of specialization and for different institu- 
tions, with their varying admissions policies and 
criteria for success. | 

- Unfortunately no manual is yet available de- 
'scribing the technical aspects of the construction 
of the test. It is hoped that one will soon be forth- 
coming, and that it will contain full infor Е 
on reliability, difficulty and discrimination o 


` items, intercorrelations between subtests, corre- 


lation with other tests, validation studies -— 
on survival in school and (hopefully) postschoo 
success, the contribution of speed vs. power, an 
other technical information to permit proper 
evaluation. The Educational Testing Service 
has done an outstanding job of preparing mant- 
als for examiners and for supervisors. pil 
clearly state the policies and procedures govern 
ing the standardized administration of the tes ©) 
SUMMARY, This test appears to have been й 
veloped їп а careful and competent та sn 
Both subtests have an adequate reliability, i d 
ticularly in the light of the narrow range wie 
among candidates for graduate school. The ila 
is so new that no satisfactory data are yet ae 
able concerning its validity, but the similarity e 
content to that of earlier forms in the РГО 
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Test series and to other similar tests makes the 
test promising as an aid in graduate student 
selection, particularly when used in conjunction 
with other tests in the GRE battery. 
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*Group Test of Learning Capacity: Dominion 
Tests. Grades kgn-I, 4-6, 7-9, 9-12 and adults; 1934— 
50; formerly called Group Test of Intelligence; 4 lev- 
els; 25€ per manual for any one level; postage extra ; 
prepared by the Department of Educational Research, 
Ontario College of Education, University of Toronto; 
distributed by Vocational Guidance Centre. * 
a) PRIMARY. Grades kgn-1; 1934-50; Forms А (44), 
B (744); a practice exercise (44) must precede ad- 
ministration of test; manual (46); revised norms 
('50) ; $1.50 per 25; 356 per specimen set; practice ex- 
ercise: (20) minutes; test: (45) minutes. 
b) juNIOR. Grades 4-6; 1940-41; Forms А (40), B 
(^40) ; manual (41) ; $1 per 25; 35€ per specimen set; 
25(45) minutes. 
с) INTERMEDIATE. 
Forms A, B. Ve 1 

1) [Original Edition.] 1934; $1.50 per 25; 356 per 

specimen set; 37(50) minutes. 

2) (1950 Edition, Omnibus Type). 1950; $1 per 25; 

45€ per specimen set; 30(45) minutes. 
d) ADVANCED, Grades 9-12 and adults ; 1039-40; Forms 
A (130), В (39) ; manual (40) ; $1 per 25; 45€ per 
specimen set ; 30(45) minutes. Н 

REFERENCES 

т. PHILLIPS, ALEXANDER J. “Comparison of Methods of Cal- 
culating Mental Age Equivalents.” J Ed Psychol 34:152-65 
Mr '43. * (PA 18:328 

2. Selection and Tratning of Shorthand Students in Ontario 
Secondary Schools. A study conducted by The Shorthand 
Survey Committee of The Ontario Commercial Teachers’ As- 
sociation and The Department of Educational Research, On- 
tario College of Education, University of Toronto. Toronto, 
Canada: Sir Isaac Pitman & Sons (Canada) Ltd., 1949. Pp. 
vii, 68. Paper. * А 

3. Yure, D. L. С. A Study of the Relation. М, 
Reading Ability and Performance on the Dominion du Test 
of Learnin Capacity, Master’s thesis, University of Toronto 
(Toronto, a), 1950. Pp. 53. 


Grades 7-9; 1934-50; 2 editions; 


Between 


ana 


W. С. Emmett, Reader in Educational Re- - 


search, University of Edinburgh, Edinburgh, 
Scotland. 

PRIMARY. Each form of this test contains four 
separately timed sections with 54 items in all. 
Emphasis is laid on the necessity that all chil- 
dren should understand what they have to do; 
the layout of the test items and the admirable 
instructions for administration certainly comply 
with this requirement. One feels uncertain 
whether items such as joining dots to reproduce 
a given pattern, or finding from a random ar- 
rangement of drawings those which appear in a 
given picture, measure “learning capacity” effi- 
ciently, and no supporting correlations with cri- 
teria are given to dispel one’s doubts. The corre- 
lation between Forms A and B based on 933 
cases was .85, a reasonable figure for a short 
test for such young children. Norms are given 
in terms of mental ages for each point of raw 
score; the revised norms of 1950 based on 1,800 
Ontario children give a mean IQ 8.1 points be- 
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- 
low the earlier norms of 1946, based on 933 
cases. The original publication of the test was 
obviously premature. 

JUNIOR. This test has five separately timed 
sections, containing 81 well chosen verbal and 
arithmetical items of established types. The in- 
structions contain one flaw: in the first three 
sections the moment at which the children are 
to start work is not clear. Correlations between 
the parallel forms vary from .83 to .92 for differ- 
ent grades from 3 to 7, and the standard devia- 
tion of raw scores within each grade is about 12. 
These figures denote rather poor discrimination. 
A higher reliability coefficient (.95) and stand- 
ard deviation (17.5) resulted from pooling the 
data for the five grades, but no great reliance 
can be placed on the figures as the spread of age 
and ability in the samples is not quoted. 

INTERMEDIATE (ORIGINAL EDITION). The 
tests are in eight separately timed sections, com- 
prising arithmetical problems, opposites, “does 
not belong,” analogies, number series, reasoning, . 
sentence completion and syllogisms. The items 
have obviously been carefully chosen and should 
give a good measure of general intelligence. The 
sections, which are of unequal length, are 
weighted so as to give approximately the same 
maximum score for each. Standard deviations 
of raw scores are not quoted, but the reliability 
coefficient of .95 for 100 cases of one grade only 
suggests adequate dispersion. Mental age norms 
are based, as far as one can judge, on 5,000 pu- 
pils from all parts of Canada; we are not told 


"how these children were selected. The only clue 


given to the validity of the test is its correlation 
with Spearman's “g,” .93, apparently an internal 
criterion and therefore of no great value. The 
nature of the items, however, should ensure the 
usefulness of the test. 

INTERMEDIATE (1950 EDITION, OMNIBUS 
түрк). This is a test of 75 questions without in- 
ternal timing. Thirty minutes are allowed for 
working. It is constructed for Grades 7, 8 and 
9, and appears in two parallel forms, A and B. 
It comprises opposites, analogies, number series, 
arithmetic problems, “does not belong," syno- 
nyms and a few of miscellaneous type. There are 
14 number series items and 19 arithmetical prob- 
lems, undue proportions which probably favour 
boys. Mental age norms based on the scores of 
3,300 pupils are provided. The standard devia- 
tion of raw scores was about 13.5, a low figure 
for a 75-item test. The correlation between the 
two forms of the test was .93 for 391 children, a 
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figure higher than the standard deviation of raw 
scores would lead one to expect. It may have 
been increased by a wide age range, but we are 
not told. 

ADVANCED. There are 75 items in each form, 
presented without internal timing. They were 
selected by elimination of unsuitable items from 
an original 486. The items are mostly verbal of 
standard type; r4 items are “number series," a 
rather excessive proportion of the whole test, 
especially as their solution depends largely on 
previous experience with this type. Reliability 
coefficients run from .8т to .9o and standard 
deviations of raw scores from 9.5 to 13.4 for 
grades 9 to 13 taken separately ; the discrimina- 
tion is therefore not particularly good. 

GENERAL CRITICISMS. The errors associated 
with individual scores are given in terms of the 
old-fashioned “probable error," while standard 
deviations are employed for dispersions of raw 
scores. The authors quote the stability of the 
probable errors of individual raw scores from 
one grade to another with satisfaction. But it is 
obvious from their data that this stability results 
from a low reliability coupled with a low stand- 
ard deviation of the score distribution in one 
case and opposite conditions in another case. The 
relevant statistic should be the standard (or 
probable) error of the individual IQ, which re- 
fers the errors of measurement to the same scale ; 
the reviewer believes that marked differences 
would then be noted. 

All correlations are based on children within 
certain grades and cannot be interpreted with- 
out knowledge of the spread of ability and age 
in the particular grade, which is not available. 
Where standardization has been effected by a 
comparison with other intelligence tests, the 
regression line has been taken as expressing the 
relation between the two tests ; it is probable that 
this relation would be better expressed by the 
major axis of the elliptical frequency contours. 

In the reviewer’s opinion the use of mental 
age norms is to be deprecated, for they yield 
different standard deviations of IQ and therefore 
different scales with different tests. This may 
occur even within the same test, as for example 
the Revised Stanford-Binet Scale, where the 
SD of IQ for six-year-olds is 12. 5 and for 12- 
year-olds 20.0. There is much to be said for the 
establishment of an international scale based on 
standardised scores with a mean of 100 and an 
agreed SD at, say, 15. The position at present is 
chaotic. The authors of the tests give no clue as 
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to the scale on which the resulting 10s аге 
measured. 

In general, the tests are well constructed, the 
instructions are on the whole excellent, and 
marking is entirely objective. However, more 
statistical information should be available before 
the tests can be recommended with confidence. 


For a review by F. T. Tyler, see 3:231. 
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*Group Test 33 (Intelligence): National Insti- 
tute of Industrial Psychology. Ages 14 and Over; 
1923-46; т form ['23] ; no data on reliability, validity, 
and normative population in manual; data available on 
request; revised manual (’46) ; 3s. 8d. per 12; rs. per 
single copy; Is. per manual; postage extra; 29(35) 
minutes; Cyril Burt; National Institute of Industrial 
Psychology. * 
REFERENCES 


„Farmer, E. ano CiawpxRS, E. С. The Prognostic Value 
"Some. " Medical Research Council, In- 


[296] ў 
*Group Test 36 [Intelligence]: National Insti- 
tute of Industrial Psychology. Ages 10-14; 1937- 
45; 1 form ['37]; no data on reliability, validity, and 
normative population in manual; data available on 
request; mimeographed manual ['37]; norms [45]; 
бз. od. per 12; 15, 6d. per single copy; Is. per manu i 
stage extra; 20(30) minutes; National Institute о 
Industrial Psychology. * 


[297] 
*Group Test 70 [Nonverbal Intelligence]: Na- 
tional Institute of Industrial Psychology. eas 
14 and over; 1939-50; 2 tests; no data on dpi 33] 
validity, and normative population in manual; Nac 
available on request ; norms ['50] ; postage extra; 
tional Institute of Industrial Psychology. eh 
а) GROUP TEST 70/1. 1950; 1 form ['50] ; 4s. 6d. per sg 
Is. per single сору; 1s. 3d. per manual; 5(10) minut А 
b) Group TEsT 70/23 rie EDITION]. 1939 30 a 
form ['46—same as test copyrighted in 1943] ; beer 
['46]; 21s. per 12; 1s. od. per single copy; pues 
answer sheets must be used; 15. 2d. per 12 ап 
Sheets; 1s. per manual; 13(20) minutes. 
REFERENCES 8 dary and 
I. SLATER, Patrick. “Tests for Selecting Secondary 
Techical School Children." Occupational Psychol 15:10-25 Ja 
Lus Tis, UT Properties of a Space Test. Экии 
thesis, University of Edinburgh VEdinburgu, Scotland tien 
з. Vernon, P. E. “The Structure of Practical " 
Creupestone! Psychol 23:81-96 Ap '49. * (PA 23 3313), uet Se 
4. VERNON, Pun E. AND Parry, Joun B., Ө London 
lection i the ES. ЖОО отбор: TU IET o д 
ress Ltd., . le . 24:776). 4 id- 
S. “ТЬе Human Problems of the Building Industry: GOD. 
ance, Selection and Training." Occupational Psyci 
104 Ap 'so. * (PA 24:5914) 


Grorce Westsy, Head of the Psychological 
Laboratory, University College of Hull, Ни, 
England. | ; 
E TEST 70/1. This test is described j^ 
Vernon and Parry, on the basis of experience 
the British Armed Forces as “а very simple 4 
surprisingly effective test" (4), an entirely JUS 
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fied description in view of the fact that it has a 
single principle and is easily administered in ten 
minutes including the pretest practice (the ac- 
tual scored test has a time limit of five minutes). 

The principle is that five standard figures 
numbered from І to 5 composed of two lines 
bisecting each other at various angles are given ; 
and the testee has to number correctly as many 
as possible of the part figures (printed in the 
test booklet below the key figures). Practice on 
two sets of figures using the same key is given 
beforehand. Factor analysis suggests that this 
test is loaded with “g” or relational thinking to 
about the same degree as Raven's Progressive 
Matrices 1938, namely .78 (Matrices .79) with 
a small spatial-perceptual factor. If a P or per- 
ceptual speed factor exists one would certainly 


except it in such a test, although Vernon’s 


Services analyses showed no evidence of it. In 
addition, capacity for continuous high-pressure 
concentration and ability quickly to memorize 
the key figures seems, a priori, to be involved. 
Little work has been reported with the test, how- 
ever, and considerable further experimentation 
with it seems desirable. If this is an almost pure 
“g” test as factor analysis suggests, then a speed 
factor in any mental testing seems to be more 
than doubtful, yet Armed Services scores for 
Matrices used as a timed test and N.LI.P. Group 
Test 70 were proved to correlate most highly 
with visual signalling, radar, and antiaircraft 
jobs of a perceptual type. 

The absence of data on reliability is remark- 
able (none is reported by Vernon and Parry even 
from their experience in the Services), but it is 
presumably not high. Norms for the general 
population of young adults issued by the N.LI.P. 
(1950) are based on scores of a stratified sample 
of 1,000. Percentiles for school ages (unpub- 
lished) have been established for restricted use 
and are based on 5,500 cases. The test has also 
yielded percentiles for other special groups, such 
as *Higher Level Executives," but these again 
have not been released as yet since the N.L.LP. 
uses the test extensively in its commissions in 
industry and vocational guidance. No validatory 
studies are available. The likeliest future for this 
test, which is certainly intriguing, is in а com- 
prehensive battery; it is certainly too brief to 
be used alone, and the test principle involves 
processes which are almost certainly more com- 
plex than the apparent simplicity of the task and 
would bear careful research investigation. 

GROUP TEST 70/23. This test, composed of 53 


non verbal items simply constructed geometri- 
cally and based in the main on Matrix princi- 
ples, is again remarkable for its economy of time, 
taking in all about 25 minutes (13 minutes only 
of test time) to administer. Here again the test 
is highly loaded with “g” or relational thinking 
and a small visuo-spatial factor. Here again, 
also unfortunately there is a total lack of data 
on reliability and validity. The norms issued by 
the N.LI.P. are based on 2,454 cases from the . 
Institute's ihdustrial work and 5,147 school cases 
done by vocational guidance sections. Separate 
norms are given for unselected young male adults 
and adolescents and for Grammar School boys 
from 14.0 to 16.0. Again the Institute has es- 
tablished useful special norms which it cannot 
yet release. 

The principle and construction of the test 
seem excellent; it is safeguarded like Group 
Test 70/1 by adequate pretest practice, but the 
absence of necessary data will handicap its wide 
dissemination until, no doubt, it is superseded 
by better tests at the N.LI.P. We shall then be 
allowed the benefit of its use with necessary 
safeguards and knowledge of which we are at 
the moment deprived. 


[298] 

xGroup Test 90A [Intelligence]: National In- 
stitute of Industrial Psychology. Adults; 1948; 1 
form [48] ; no data on reliability, validity, and norma- 
tive population in manual; data available on request; 
mimeographed manual; 4s. 6d. per 12; Is. рег single 
copy; 9d. per manual; postage extra ; 20(30) minutes ; 
National Institute of Industrial Psychology. * 


[299] 

*The Henmon-Nelson Tests of Mental Ability: 
The Clapp-Young Self-Marking Tests. Grades 
3-8, 7-12, 12-16; 1931-50; IBM for grades 7-12; im 
els; postage extra; 30(35) minutes; у. А. С. Неп- 
nion and М. J. Nelson; Houghton Mifflin Co. * 

а) ELEMENTARY SCHOOL EXAMINATION. Grades 3-8; 
1931-46; Forms A ('31), B (32), С (32); revised 
manual (46); $1 per 25; 28€ per specimen set, post- 


aid. 
b) HIGH SCHOOL EXAMINATION. Grades 7-12; 1931-50; 
IBM; 2 editions; revised manual (46); { 
1) [Clapp-Young Self-Marking ition.) Forms A 
(31), B (32), C (35); $1 per 25. T 
2 [Machine Scoring Edition.) Forms A ('50, ¢’31), 
(48, c'35) ; $1.50 рег 50; separate answer sheets 
must be used; $1 per 50 IBM answer sheets ; 20€ per 
machine scoring stencil; 28€ per specimen set, post- 


paid. 
C) INTELLIGENCE TEST FOR COLLEGE STUDENTS. Grades 
12-16; 1931-32; Forms А (31), B ('32); manual 
(^32) ; $1 per 25; 25% per specimen set, postpaid. 
: REFERENCES TEN 
.B Еркн A. “А Method of Selecting Subnormal 
Children for а Vocational School." J “Juvenile Res 21:188-92 


*37..* (PA 12: T 
UEM у= $03), AND RENON V. А. C. “The, Predic- 


. Drake, Lewis е 
Чоп of Scholarship in the College of Letters and Science at 


the University of Wisconsin.” Sch & Soc 45:191-4 Е 6'37. 


* 
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3. Henmon, V. A. C., AND Мох, M. J. The Measurement 
of Intelligence. Educational Progress Bulletin, Vol. 13, No. 2. 
Boston, Mass.: Houghton Mifflin Co., September 1937. Pp. 21. 

aper. Y 

р Morris, Cuartes M. “Ап Experimental Analysis of Cer- 
tain “СБ ишсе Ты Air Psychol B 34:716-7 
"37. 12:1661, title only б 

5. ALTENEDER, Louise E. The Value of Intelligence, Per- 
sonality, and Vocational-Interest Tests in a Guidance Program. 
Doctor’s thesis, New York University (New york, Bem 
1938. Pp. 130. (Abstracts of Theses....[School of Educa- 
tion] 1938, pp. 41-4.) { y 

6. Curist, Austin F, The Relation of Mental Test Ratings 
to Achievement of High-School Pupils. Master's thesis, Law- 
rence College (Appeton, Wis.), 1938. Pp. 105. 4 L 

. EMxRICK, Luctrre Mae, Predicting Success im Music 
Education for Adults. Doctor's thesis, New York University 
New York, N.Y.), 1938. Pp. 239. (Abstracts of Theses... 
School of Education] 1938, pp. 101-5. nec 

8. BECKHAM, ALBERT SIDNEY, "The ee of a Negro 
High Sohon! Бор анод in a Northern City." J Genetic Psychol 

4:327-: е '39. 14:400 : 
iin бй. Louise E “The Value of Intelligence, Рег. 


pP ‚ and Vocational Interest Tests in a Guidance Program.” 
J Ed P S '40. * (PA 15:1480) 


ychol t 2449-82 j 
10. Fartom, Leona Mae. An Evaluation of Intelligence Tests 
Used in Industrial Schools and Reformatortes. Doctor's thesis, 
University of Nebraska (Lincoln, Neb.), 1940. (Abstracts of 
Doctoral Dissertations....1940, m 59-65.) ` 
ir. FArLog, Leona МАЕ. n Evaluation of Intelligence 
Tests Used in Industrial Schools and Reformatories." Abstract. 
Psychol B 38:711 O '41. * (РА 16:364, title only) \ 
12. FnaozHLICH, Gustav J. “Academic Prediction at the Uni- 
versity of Wisconsin." J Am Assn Col Reg 17:65-76 O "41. * 
13. Hartson, L. D., AND Sprow, A. J. “The Value of Intelli- 
gence Quotients Obtained in Secondary School for Predicting 
Сее. EOS VE Ed & Psychol Meas 1:387-98 О "41. 
16:1174 


High School Subjects." Trans Kans Acad Sar 1417-8 '45.* 

16. Hartson, L. D. “Influence of Level of Motivation on the 
Validity of Intelligence Tests." Ed & Psychol Meas 5:273-83 
au '4s. * (PA 20:2500) 

17. Hutt, ESTHER. “Study of Achievement in Educational 
Psychology." J Exp Ed 13:17. AUS "4s. * (PA 19:3494) 

18. KNEZEVICH, STEPHEN j. “The Constancy of the IO of 
NDA EAT School Pupil.’ J Ed Res 39:506-16 Mr '46. * 

2033329 

19. Lins, Leo ]озЕРН. “The Prediction of Teaching Effici- 
ciency.” J Exp Ed 15:2-60 S '46. * (PA 21:610) 

20. BusHoNc, Grorce E. College Entrance Examinations, 
Master's thesis, Southern Methodist University (Dallas, Tex.), 
1947. (Abstracts of Theses....1946, 1947, p. 10.) 

21. Davis, W, ALLISON, AND Havronvnsr, Rosert J. “The 
Measurement of Mental Systems: Can Intelligence Be Meas- 
ured?” Sci Mo 66:301-16 Ар '48. * (PA 22:3381) 

22. Haas, Leonard. “Four Year Studies of the Freshman 
Classes of 1936 and то4о at the Eau Claire State Teachers 
College." J Ed Res 42:54-61 S '48. * (РА 23:2917) 

23. Froox, W. F. “А Statistical Study of 224 Seffior Stu- 
dents Graduated in June, 1949." J Ed Res 43:101-9 O '49. * 
(PA 24:2806) 

24. ALLEN, Ковевт M., AND Bessetr, HamoLp. “Intercorre- 
lations Among Group Verbal and Non-Verbal Tests of In- 
telligence.” J Ed Res 42:304-5 Ta 'so. * (PA 24:4841) 

25. Emprer, Коул, B., Tr. “A Longitudinal Studv of Per- 
farmance on the Henmon-Nelson Test of Mental Ability, the 
Differential Aptitude Tests, and the American Council on 
Education Psychological Examination for College Freshmen.” 
Abstract. Am Psychol 5:352 Jl 'so. * (PA 25:1276, title only) 


Н. M. Fowrzn, Associate Professor of Educa- 
tional Research, University of Toronto, To- 
ronto, Ontario, Canada. 

Soon after the first world war, test construc- 
tors, following the lead of Binet and Terman, 
concerned themselves with measuring that dif- 


ficult-to-define capacity which we call general - 


intelligence. Tn the last 20 or 30 years, however, 
there has been a trend toward attempting to 
construct tests which could give an analytical 
picture of mental ability. The California Tests 
of Mental Maturity, for example, were designed 


to give both a verbal and a nonverbal intelli- 
gence quotient. Carrying this trend for mental 
analysis even further, the factor theorists have 
been trying to isolate, and eventually measure, 
the primary mental abilities. 

The Henmon-Nelson Tests of Mental Ability 
belong to the class of group intelligence tests 
which yield a single global score, an overall 
estimate of general intelligence ; having no part 
tests, they are not diagnostic and they provide’ 
no measure of specifics. In the manual, the au- 
thors say “a wide variety of types of questions 
is used, thus furnishing a test of many kinds of 
ability.” But the 90 items of Form A of the test 
for elementary grades include at least 52 items 
which are strictly verbal—vocabulary, word 
classification, verbal analogy, anagrams, and 
'sentence completion. The items may be different 
in form, but the function being measured would 
appear to be almost entirely verbal comprehen- 
sion. At all three levels, these tests are patently 
measures of that highly verbal component some- 
times called "general intelligence" but perhaps 
more aptly labelled “scholastic aptitude." — 

Each of the three elementary and three high 
school forms have 9o items arranged in order 
of increasing difficulty in spiral omnibus form 
and printed on two sides of a two-page, 10” by 
12", answer booklet. The inside pages of the 
sealed booklet are reserved for the special Clapp- 
Young self-marking device. The two college 
forms are similar but are printed on 8% by 
11” pages. The paper om. which the tests are 
printed is inferior and the printing is rather 
small, especially for certain items such as geo" 
metrical analogies which become unduly am- 
biguous when not clearly printed in large figures: 
The space allowance for marking the items 15 
none too generous, and this may unfairly handi- 
cap students with poor eyesight. 

For those who prefer a single score measure 
of intelligence, the Henmon-Nelson Tests 0) 
Mental Ability have some definite advantages: 
their administration is simple, they can be ee 
quickly, and, most important, as far as one ds 
judge from the available evidence, they do pu 
ure scholastic aptitude, particularly at the low н 
levels where verbal comprehension is ris 
a more important constituent of school d А 
thamis either originality or initiative. Wort ds 
praise, too, is the manner in which the tests W б 
constructed in the first place; the items Tie 
selected rather carefully and only those b де 
survived rigorous statistical tests were іпс 0 
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in the final forms. This reviewer's chief com- 
plaint is that the Henmon-Nelson tests are now 
showing signs of wear. The inadequacy of some 
of the information given in the manuals and the 
lack of other information give the impression 
that the authors are not applying to the tests the 
constant revision which is necessary to keep 
them at their full effectiveness. Although the 
Henmon-Nelson tests were soundly constructed 
for a specific, if somewhat narrow, purpose orig- 
inally, and although it would be too much to 
say that they have outlived their usefulness, one 
feels that they now need a norm adjustment, an 
overhaul of the manuals, and certain other 
changes and improvements to bring them back 
to par. 

Perhaps most important in the upkeep of an 
intelligence test is constant attention to repair or 
renewal of the norms. Are the Henmon-Nelson 
intelligence quotients and mental ages too high, 
as is commonly averred? The manuals for the 
elementary and high school tests show that for 
reasonably large groups the mean mental ages 
and the mean IQ's of the Henmon-Nelson agree 
fairly closely with the corresponding averages 
obtained on other group tests of intelligence, but 
some recent investigations indicate that the 
Henmon-Nelson does give mental ability esti- 
mates which are too high. For example, Kneze- 
vich reports that the average Henmon-Nelson 
IQ for a group of forty-eight students was about 
three points higher than the average for the same 
students on the California Test of Mental Ma- 
turity, a test which is reputed to give rather 
high estimates (78). But this small, though sta- 
tistically significant, difference could be ac- 
counted for in terms of practice effect or changes 
in the students since the Henmon-Nelson was 
given about a month after the California test. 
This reviewer made a comparison of the Hen- 
mon-Nelson with the Dominion Group Test of 
Learning Capacity, Intermediate (Omnibus 
Type) by using mental ratings obtained on the 
two tests from 70 grade 9 pupils in a small On- 
tario town. Although this group is too small to 
provide final conclusions, the comparison pro- 
vided a check of the Henmon-Nelson norms 
against mental age equivalents recently obtained 
on a large representative Ontario group (June 
1950), The correlations between 1Q’s and be- 
tween mental ages, .88 and .70 respectively, sug- 
gest that the two tests are comparable, a con- 
clusion which is reinforced by an inspection of 
the tests. They are both omnibus type tests yield- 
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ing one overall measure of general intelligence, 
their content is highly verbal, and their items are 
similar in form and appearance. Nevertheless, 
the average mental ages were quite different: 
the Henmon-Nelson gave an average of 187 
months compared with 184 months given by the 
Dominion despite the fact that the Henmon was 
given first. The Henmon-Nelson IQ’s also av- 
eraged about three points higher than the Do- 
minion IQ's. Although these results are not 
conclusive, they do support the view that the 
Henmon-Nelson norms, at least for the high 
school test, are out of line for use today in On- 
tario schools. А 

In fairness to the authors and publishers of 
the Henmon-Nelson tests, it must be said that 
the reported norms are based on a very large 
number of cases—for the elementary test, “220,- 
ооо pupils from every state and parts of Canada," 
and for the High School Test, "about 300,000 
pupils in widely separated areas of the United 
States." Unfortunately this is all the informa- 
tion that is given in the manuals. In a letter to 
the reviewer one of the authors states: “Еасһ 
year users send in many thousand scores and 
these are tabulated every two years and com- 
pared with the norms. When variations are 
found, a complete study is made and norms are 
corrected as necessary." A statement of what 
the authors are doing to keep the norms up-to- 
date should appear in the manuals, which at 
present do not even give the date of the last 
norm revision. This would correct any impres- 
sion that the norms are completely outmoded. 
Even so, it is questionable whether checking 
norms by means of scores obtained almost at 
random by solicitation can take the place of a 
planned attempt to restandardize the test under 
controlled conditions. 

The Henmon-Nelson gives two sets of norms, 
a table of mental age equivalents which can be 
used to convert raw scores to intelligence quo- 
tients, and a table in which the percentile ranks 
of the raw scores are reported by grades. The 
authors deserve praise for giving percentile 
norms as well as mental age equivalents. Intelli- 
gence quotients obtained on a carefully stand- 
ardized test are probably not too misleading in 


{һе lower grades, but their use for students over 


the age of 14 can be seriously challenged. 
Mursell * has criticised the Henmon-Nelson for 
presenting percentile norms by grade groups. 


1 Mursell, James L. Psychological Testing, Revised Edition, 
pp. 154-5. nee York: Longmans, Green & Co,, 1949. Pp. xvi, 
488. * (PA 25:190) 
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He argues that the meaning of “grade six” shows 
such variation from one locality to another that 
to use grade norms is to define the unknown in 
terms of something equally unknown. This re- 
viewer believes that teachers find grade norms 
definitely helpful ; the differences in grades from 
one place to another may be smaller than 
Mursell suggests. Data recently collected by the 
Department of Educational Research on the 
Dominion Group Test of Learning Capacity, 
Junior, suggest that, for the lower grades at 
least, the between-province differences for 
grades 3 to 7 are not large. There is, however, the 
difficulty that age groups do not coincide with 
grade groups, particularly in the upper grades 
where there may be a considerable disparity be- 
tween the two. It would seem to be useful, and 
in keeping with the original conception of the 
purpose of mental measures, to report percen- 
tile norms by age groups as well as by grade 
groups. 

Of less import than the suspicion which at- 
taches to the norms, but still significant, are limi- 
tations and weaknesses of the manuals. These 
are four-page booklets which give descriptions 
of the tests, directions for administering and 
scoring them, and data for interpreting the 
scores, Neither in appearance nor in content do 
these manuals measure up to the best psycho- 
logical test manuals of the present day. In the 
first place, too much is crowded into too few 
pages; since the inner pages are taken up with 
a table for reading intelligence quotients di- 
rectly from raw scores, only the outer two pages 
are left for the description of the tests, the ad- 
ministration directions, and a number of tables 
containing data which may be used for inter- 
preting scores. It may be advantageous to cut 
a teacher's manual to the minimum, but certain 
information should not be omitted. The Henmon- 
Nelson manuals have practically no description 
of the norm groups or reliability groups, so that 
the reader is unable to decide how applicable the 
norms may be or how much confidence he may 
place in the reliability estimates. Criticism may 
be levelled at the authors not only for their sins 
of omission but also for the way in which they 
have presented some of the material that is in- 
cluded in the manuals. The use of the split half 
procedure for estimating the reliability of a 
30-minute test is particularly reprehensible, es- 
pecially since no other reliability estimates are 
given, and the critical ratio technique described 
by the authors in difficult technical language 
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definitely “dates” the statistics. The use of out- 
moded statistics in the manuals is unfortunate 
because it serves to undermine the reader's con- 
fidence in the validity of the tests for use today, 
The table for converting raw scores to intelli- 
gence quotients directly without reference to 
the mental age equivalents table will be warmly 
welcomed by the busy teacher. However, it is 
impossible to decide from an inspection of this 
table whether the authors have determined by 
research the age of arrest of mental growth for 
the Henmon-Nelson tests or have simply fol- 
lowed the Stanford-Binet precedent in adjust- 
ing chronological age divisors. 

In designing an elementary test for grades 3 
to 8 and a high school test for grades 7 to 12, 
the authors of the Henmon-Nelson tests have 
attempted, in the opinion of this reviewer, to 
cover greater age ranges than can efficiently be 
tested by single group tests of general intelli- 
gence. The percentile norms show that the ceil- 
ing of the elementary test is too low for eighth- 
grade children and that the floor is too high for 
third-grade children. The elementary test works 
best for grades 4 to 7 and its use should be con- 
fined to those grades. Still better, the authors 
might provide separate tests for grades 3-5 
grades 6-8, and grades 9-12. 

In summary, it may be said that the Henmon- 
Nelson tests still rank among the better group 
tests of general intelligence despite the fact that. A 
they have now been on the market for about ki 1 
years. Although the tests provide only overal 
estimates of intelligence, they can still be recom- 
mended to those who want a quickly and fairly 
cheaply obtained rough measure of that highly 
verbalized factor sometimes called scholastic ap- 
titude. They are inappropriate for use in clini 
situations in which diagnosis is desired, and n 
will probably give but poor assessments of t 7 
mental ability of children with language or ed 
ing difficulty or of children who are not es d 
inclined. The tests appear to have most use u 
ness in the elementary school where they «i 
particularly valuable in predicting school st! 2 
cess. Because of the apparent lack of а Mer 
ous and concerted research program, the d 
now give every sign of being in need of Ане 
and renewal although they were well construc à 
originally. It is claimed, and evidence heo 
found to support the view, that the norms | 
now out of line, with the result that ап un "A 
high proportion of above average intelligi 
quotients is being obtained by testwise childr 
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The manuals need to be revised to bring them 
up to date and to include some vital information 
which is now lacking. Other suggested changes 
are the addition of percentile norms by age 
groups and the provision of three separate tests 
{ог the junior, intermediate, and high school 
grades. 


For reviews by Anne Anastasi, August 
Dvorak, Howard Easley, and J. P. Guilford and 
an excerpt from a review by Francis N. Max- 
field, see 40:1398. 
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xThe IPAT Culture Free Intelligence Tests. 
Ages 4-8, 8-adults, 14-superior adults; 1933-50; title 
on test booklet is Test of g: Culture Free; IBM for 
ages 8-adults and 14-superior adults; 506 per manual 
['50] for any one level; cash orders postpaid; R. B. 
Cattell and A. K. S. Cattell (2, 3) ; Institute for Per- 
sonality and Ability Testing. * 
a) SCALE I. Ages 4-8; 1933-50; identical to the Cattell 
Intelligence Tests, Scale O: Dartington Scale (see 40: 
13862) published by George G. Harrap & Co. Ltd. ; in- 
dividual; 1 form, ’50; $2.80 per 25; 25¢ per scoring 
key; $3 per set of cards for classification test; 55¢ per 
specimen set; (30) minutes. 
b) scare 2. Ages 8-adults; 1949-50; IBM; Forms A 
(49), В (249) ; $2.50 per 25; $4.60 рег 25 sets of both 
forms; 70€ per specimen set; separate answer sheets 
may be used; $1.50 per 50 IBM answer sheets; design 
for machine scoring stencil included in manual ; 256 per 
stencil for hand scoring of answer sheets for any one 
form; 14(30) minutes. 
с) SCALE 3. Ages 14-superior adults; 1950; Forms A, 
B; prices same as for Scale 2; 14(30) minutes. 
REFERENCES 

т. Prerce-Jones, Jonn, AND TYLER, FRED T. “A Comparison 
of the A.C.E. Psychological Examination and the Culture-Free 
Test." Can J Psychol 4:109-14 S 'go. * (PA 25 3422) 

2. CATTELL, RayMonp B. “Classical and Standard Score I 
Standardization of the I.P.A.T. Culture-Free Intelligence Scale 
2." J Consult Psychol 15:154-9 Ap 's1. * (PA 26:6255) 


Rarrrcn M. Drake, Professor of Psychology 
and Head of the Department, Kent State Uni- 
versity, Kent, Ohio. 

This test has the same purpose as Cattell's 
original Culture-Free Test published in 1944 by 
the Psychological Corporation. It differs in that 
it is shorter and more convenient to give and in 
that it is available in three levels. Scale 1 is not 
really culture-free because it consists of rela- 
tion-finding among culture-bound pictorial ma- 
terials with rather specific verbal directions. 
Within our own culture, however, it is relatively 
free of the verbal and ordinary schooling fac- 
tors, Scale 2 and Scale 3 are the same in effect 
as the 1944 edition except that the subtest Pool 
Reflections is replaced by a subtest called Con- 
ditions which is a more complex relation-finding 
test and which should, therefore, be a better 
measure of mental capacity. All three scales can 
be administered in groups, and the cost is reason- 
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able. The provision of answer sheets would have 
been desirable for Scales 2 and 3. 

The wide range of ability from the feeble- 
minded to the superior adult, the availability of 
two equivalent forms, and the freedom from 
memory and training factors make it a very 
valuable measure of mental energy. The mime- 
ographed manuals (there is a manual for each 
scale) give little data on reliability and validity 
but references to published data are given. Both 
reliability and validity are about as high as one 
could hope for with such tests. 

The reviewer has given Scale 2 to the third 
and fourth grades in a summer school where 
about 50 per cent of the students were making 
up failures or low grades of the year before; 
r’s of .56 and .36 (N = 28) respectively were 
obtained with the Revised Stanford-Binet Scale 
and the Stanford Achievement Tests. The Cat- 
tell r of .36 with the Stanford Achievement Test 
is 11 points higher than is the Stanford-Binet r 
of .25. In another study, of ten different groups 
from high school to college, the Cattell 7's (Scale 
3) with grade point averages ranged from .37 to 
91, with an average of .51 (№ = 8-49 рег 
group) ; the r’s between the Otis Self-Adminis- 
tering Tests of Mental Ability and grade point 
averages ranged from zero to .92, with an aver- 
age of .35 for the same groups. For the same ten 
groups, the correlation of Cattell with Otis aver- 
aged .73, with a range of .34 to .90. For a college 
group of 32 students the Cattell correlated .59 
with the American Council on Education Psy- 
chological Examination. In view of the facts that 
(a) school training would have practically no 
direct influence on the Cattell scores and yet the 
Cattell correlates higher (on the average) than 
the Otis with school success and (b) that the 
Cattell correlates an average of .73 with other 
recognized tests of general ability, it would ap- 
pear that a test which is almost culture-free has 
many advantages over the usual culture-bound 
tests. Its advantages should be recognized as a 
possible way of equating all subjects for past sub- 
ject-matter-training effects and thus obtaining 
a more accurate score representing each subj ect’s 
innate ability. Used together with a test having a 
high saturation of verbal ability, it might reveal 
much more concerning a particular individual 
than does the usual global score of general ability. 

Scale 1 was standardized on only 117 chil- 
dren, with 20 being the largest for any year. It 
is therefore very tenuous. Since at this age a 
difference of one or two scale points, as well as 
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one or two months' variation in age, makes a 
great difference in the resulting IQ, the relia- 
bility of the scale is not very high. No evidence 
of reliability is given. 

Scale 2 is standardized on 713 pupils in two 
midwestern university towns and 2,584 pupils 
in an industrial city in England. The SD of the 
Culture-Free Test is 24, a considerably higher 
figure than the Revised Stanford-Binet Scale 
SD which averages 16 (range 12-20). It should 
be noted that IQ's which are very distant from 
the mean will not be comparable between distri- 
butions which vary in SD. 

Scale 3 is standardized on 886 high school 
pupils and 600 students from three different 
colleges and universities. For the same reason as 
for Scale 2, IQ’s some distance from the mean 
will be much higher or much lower than those 
usually obtained. For this reason too, all IQ's 
calculated from these scales must not be taken 
as necessarily the same as Stanford-Binet or 
Wechsler-Bellevue IO's. No actual compari- 
sons are given with other tests. 


GraApvs C. ScHWESINGER, Senior Clinical Psy- 
chologist, California Y outh Authority, Ventura, 
California. | 

This test consists of three scales each appro- 
priate to different age and intelligence levels. 
Directions for administering and scoring each 
of the three scales are fully and clearly given in 
the separate test manuals which also include dis- 
cussion of what the test measures, structure of 
the test in relation to purpose, and essential and 
peripheral standardization tables. No informa- 
tion is presented in the manuals on the relia- 
bility of the present scales. These tests have 
higher validity for low cultural or intelligence 
levels than do other tests, while for highly cul- 
tured groups, tests charged with scholastic skills 
show a higher saturation of general ability. How- 
ever, on the average, the culture free test is 
highly valid for all situations. Thomson (40: 
1386) called attention to the wider scatter 
_(SD’s of 20 to 27) of the Cattell Intelligence 
Tests (which include Scale 1 of Test of g) as 
compared with the standard deviations of the 
Binet (SD's of 15 to 16), a fact of which test 
users should be aware, and-one which should be 
made subject to more research. This scatter 
means that the Scale 1 IQ's may be either too 
high or too low, assuming Binet as a standard! 

The purpose of the Cattell test is, as its name 
implies, to have an instrument free of cultural 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK: [ 402 
F 


influence, and one which can more readily be 
applied to different kinds and degrees of culture 
throughout the world. The underlying principle 
is perceptual rather than apperceptual, thus do- 
ing away with a dependence on particular cul- 
tural associations—and incidentally also meet- 
ing the proverbial anthropologist’s objections to 
most of our intelligence tests that similar objects 
are invested with different meanings in different 
cultures. Geometrically rather than pictorially 
meaningful material is used as the substance of 
the subtests, with general ability for any indi- 
vidual being detected according to the level of 
complexity of relations he can handle. Per- 
ceptual appreciation attracted English psycholo- 
gists about 1937-1938 and during World War 
II became the standard for their wholesale test- 
ing program. In this respect, the United States 
is technically behind Oxford and other English 
universities dealing with intelligence meii 
ment. Ku, 
These tests do not require reading ability, 
while a section of Scale 1 and all of Scales 2 and | 
3 do not require pictorial recognition of objects 
of our culture. They are designed to measure 
general mental capacity, Spearman's “g,” the 
second order factor behind Thurstone’s primary 
abilities. Precise technical data on the relative 
“g” saturation of the various kinds of subtests 
are not yet available. However, although not 
enough data have been assembled to allow B i 
vagaries of sampling, etc., studies by Rimoldi 
in Thurstone’s laboratory in Chicago already 
show that general intelligence, that is, second 
order factor measurement in these tests, is rather 
strongly weighted in the reasoning factor Wm 
Thurstone's first order abilities. The fourth sub- 4 
test in the IPAT series of scales is entirely new 
and has not yet been properly evaluated among 
perceptual tests, but the others have about the 
same saturation on whatever is common to the 
subtests. . hos 
The tests are interesting to children and primi- 
tive peoples, and for the latter should be admin- 
istered on an individual basis, with minimum 
needed motivation being provided for by the 
field worker. Younger children increase more 
in test sophistication on these tests than do older 
children, while within one age group, increase 
in test sophistication is slightly less for the 
lower IQ's. Repeated exposure to the tests, pi 
ever, would result in a greater overall advance! 
for the dull than for the bright. 


The particular contribution of these tests 5$ 
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that they are as saturated with “g” as are any 
tests, and that they are free of the scholastic as- 
pect of culture. In other words, they are as free 
of culture as any performance test, and as full 
of “g” as verbal intelligence tests like the Binet, 
ACE, etc. The perceptual type test represents 
an entirely new tool for intelligence testing, a 
radical departure introduced by Spearman two 
decades ago, which is now well appreciated in 
England, although still failing to catch the in- 
terest and imagination of American psychome- 
trists. 
[301] 
*Kentucky Classification Battery. Grades 9-11, 
12-13; 1948-51; 4 scores: general ability, English, 
mathematics, total; 2 levels (labeled Forms 11, and 
2); Kentucky norms only; tests rented rather than 
sold; rental fee, 24¢ per student ; rental service includes 
rental of booklet, answer sheet, electrographic pencil, 
and scoring and reporting results; specimen sets avail- 
able on loan basis only; 90(105) minutes; Lysle W. 
Croft, Robert D. North, Ordie U. Davis, and Howard 
B. Lyman; Kentucky Cooperative Counseling and 
Testing Service, * 
a) LOWER LEVEL. Grades 0-11; 1951; .Form 1L; no 
data on reliability and validity; no manual; no norms; 
directions for administering [51]. 
b) [UPPER LEvEL.] Grades 12-13; 1948-50; a battery 
consisting of revisions of Kentucky General Ability 
Test (see 3: 234), Kentucky English Test (see 3:132), 
and Kentucky Mathematics Test (see 3: 311) ; Form 2 
(’so—a revision of Form 1); reliability and validity 
data for Form 1 only; manual ('50) ; earlier editions 
by E. J. Asher, T. E. McMullen, and P. L. Mellen- 
bruch. 
REFERENCES 
1-3. See 40:1402. “ 

“ ilizati ial College Abilit: 
ssim H Le “the ЛС үзү 
J Am Assn Col Reg 18:14-22 О '42. * (PA 17:2166) 


Davip V. TiEDEMAN, Lecturer on Education, 
Harvard University, Cambridge, Massachusetts. 
[A review of Upper Level Form 24 

This battery is in three sections, General 
Ability, English Fundamentals, and Mathemat- 
ics. Thirty, 25, and 35 minutes are allotted to 
each section respectively. The test authors' claim 
that the time limits are long enough to allow 
most students to complete all of the items seems 
reasonable. f 

Although this test is called a “classification 
battery” the authors completely ignore any po- 
tential for classification which the test may 
possess. No information concerning problems 
like those of vocational or curricular classifica- 
tion has been given by the authors. 

The battery “is designed for the dual purpose 
of evaluating high school achievement at the 
twelfth grade level and estimating aptitude for 
academic work at the college freshman level.” 
It is doubtful that many English teachers would 


wish to evaluate the English achievement of 
their students by means of a 45-item, 25-minute 
test of grammar. The 45-item, 35-minute mathe- 
matics section contains a fair sample of the con- 
cepts which are frequently taught in an intro- 
ductory algebra course, but none of the concepts 
of geometry, trigonometry, or advanced algebra 
are included. Thus, these two sections and the 
General Ability section, which by definition 
would not be recommended for evaluating high 
school achievement, fall far short of providing 
a valid framework for “evaluating high school 
achievement at the twelfth grade level.” 

The purpose which the test can probably serve 
best is that of “estimating aptitude for academic 
work at the college freshman level.” However, at 
the present time, the degree to which the test 
fulfills even this purpose is unknown. We are 
told that the correlation between total scores of 
a random sample of 100 students on Form 1 of 
the battery and grade-point standing at the end 
of the first year at the University of Kentucky 
was .63. That is all well and good, but the im- 
portant question of how the new Form 2 corre- 
lates with grade-point standing remains unan- 
swered. 

The reliability of Form 2 also remains inde- 
terminate. It would be nice if the authors’ hopes 
for high reliability were realized but perhaps we 
should postpone using the test until the facts are 
clear. й 
А few of the items in the General Ability Sec- 
tion can be answered without too great a knowl- 
edge of the words concerned. For instance, in 
looking for the opposite of “benevolence” as is 
called for in Item 14, one might choose the cor- 
rect option, “malevolence,” because it is the only 
option with a suffix like that of “benevolence.” 
A similar criticism may be leveled at Item 17 
where “heterogeneous” is the correct opposite 
of “homogeneous” and at Item 21 where “hei- 
nous” is the correct synonym for “atrocious.” 
In the mathematics section Items 113 and 127 
may prove confusing because the exponents are 
not raised sufficiently. 3! 

In taking the test, the reviewer found himself 
at one time recording answers in the wrong 
places on the answer sheet. The authors of the 
test would have done well to leave space after 
questions 15, 30, 45, 75» 90, 120, and 135 and 
to admonish the student to verify his position on 
the answer sheet at those places. 

In discussing the administration of the battery, 
the authors state, “It is desirable that the ad- 
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ministrator be a person who has had experience 
in giving group examinations and who appre- 
ciates the necessity for carefully controlled con- 
ditions, especially with respect to seating, tim- 
ing, and giving directions." With the poor in- 
structions for administering the test which the 
manual contains experience is not only desirable, 
it is downright necessary. 

Percentile ranks associated with part and 
total scores are given. Ranks are reported for 
two groups, 8,042 cases which were taken from 
throughout the state of Kentucky, and 775 en- 
tering students at the University of Kentucky, 
fall 1950. In using the ranks for this latter group, 
it must be remembered that the rank refers to 
the percentage of the group who made poorer 
scores on the fest and not to the percentage of 
the group who did poorer in college. 

At the time this review was written the only 
evidence concerning the utility of Form 2 of the 
Kentucky Classification Battery was that it is 
one of a family of tests that have been reliable 
and have had some validity for prediction of 
success in academic work at the college fresh- 
man level. However, in a strict sense, the bat- 
tery at present is a "classification battery" with 
no information as to what it classifies and how 
well it does so; itis a battery that was constructed 
to increase the reliability of a former test but the 
reliability of the present test is unknown ; it is 
a battery which is recommended for use in evalu- 
ating achievement at the high school level, and 
all that it includes for such a purpose is some 
English grammar and some algebra ; it is a bat- 
tery for estimating aptitude for college work 
and yet its validity for this purpose is unknown. 
The authors would have done well to withhold 
release of the battery until some information 
concerning this specific test were known. 


[302] 
*Kuhlmann-Anderson Intelligence Tests, Sixth 
Edition. Grades kgn, т, 2, 3, 4, 5, 6, 7-8, 9-12; 1927- 
52; 9 levels (labeled K, A, B, C, D, E, F, G, and Н); 
I form, ’52—same as 1942 edition; directions for ad- 
ministering (52); handbook ('52); master manual 
('52) ; $2.25 per 25 of any one level; roo or more, $2 
per 25 of any one level ; $1.50 per master manual ; post- 
age extra; $1 per specimen set (does not include mas- 
ter manual), postpaid; (30-45) minutes; F, Kuhl- 
mann and Rose G. Anderson; Personnel Press, Inc. * 
(Formerly published by Educational Test Вигеап, 
Educational Publishers, Inc.) 


REFERENCES 

1-15. See 40:1404. 

16-40. See 3:236. а 

41. Sawyer, Ciirrorp В. A Comparison of the Detroit and 
Kuhlmann-Anderson Intelligence Tests as Applied in the Grand 
Forks Public Schools. Master's thesis, University of North 
Dakota (Grand Forks, N.D.), 1931. Pp. бо. е 

42. Муврнү, Laura Wurtz. "The Relation Between Me- 
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chanical Ability Tests and Verbal and Non-Verbal li 
Tests.” J Psychol 2:353-66 '36. * (PA 11:3928) Totligenee 

43. Сіввомѕ, CmuaRLES C. “A Comparison of. Ki 3 
Anderson Test Scores and Teachers’ Estimates,” Sch & Soc 
D Ms West Y оа of Ach 

44. 5ЕАСОЕ, May V, “Prediction o chievement in i 
Lan es." J Appl Psychol 22:632-40 D '38. * Qaem 
Vr G Do Y E. An Attemp Predict | \ 

5. GENTRY, ROTH . Ап empt to Predict 
Mu From a Variet of Objective Tests: College Abt 
Ni -Denny Reading Test, Kuhlmann-Anderson Intelligencd 
Test, and English Placement Test. Master's thesis, Ohio Uni. 
yersity (Athens, Ohio), 1943. Pp. 33. (Abstracts of Masters’ 
Theses....1943, 1944, pp. 36-7.) 

46. Hartson, L. D. “Influence of Level of Motivation on 
the Validity of Intelligence Tests." Ed & Psychol Meas 
5:273-83 au '4s. * (PA 20:2500) 

4], BanwowE, Tuxopore Joserm. The Influence of Scoring 
Technique and Construction on the Validity of Mental Те, $, 

or's thesis, University of Washington (Seattle, Wash.), 
1946. (Abstracts оў Theses....1945-1946, 1946, рр, 11-2.) 3 

8. Тилох, M W. “The Relation Between IQ and Trait - 
Difference as Measured by Group Intelligence Tests.” J Ed. 
Psychol 38:343-52 О '47. * (PA_22:2066) 

49. Davis, {у? ALLISON, AND HavromumsT, Ковевт J, “The 
Measurement of Mental Systems: Can Intelligence Be Meas- 
ured?” Sci Mo 66:301-16 Ap '48. * (PA 22:3381) EA 

so. Влпжу, HeLeN К. “A Study of the Correlations Be- 
tween Group Mental Tests, the Stanford-Binet, and the Pro- 
gressive Achievement Test Used in the Colorado Springs Ele- 
mentary Schools.” J Ed Res 43:93-100 О '49. * (PA 24:2825) 


Henry E, Garrett, Professor of Psychology, 
Columbia University, New York, New York... 

The 1952 edition of the Kuhlmann-Anderson 
tests represents the sixth edition of this well 
known test series first published in 1927. As be- 
fore the test battery consists of 39 tests, each test 
with 6 to 24 items or trials. The tests cover the 
ability range from the first grade through high 
school; Booklet K is for kindergarten and grade 
one, and Test Booklets A to H cover the first 
grade through the high school. Tests have been 
arranged in order of difficulty based upon the 
median chronological age of children able to pass 
half of the trials and no more. The total scale is _ 
arranged in 9 separate booklets, each with its 
own instructions and scoring key. All booklets 
but two contain то tests so that each battery Ш- 
cludes a few of the tests found in the next higher 
level as well as a few in the next lower level. 
In this way the 9 booklets constitute overlapping 
measures of the total scale of 39 tests. A child's 
score is the median of the mental ages earned on 
the tests administered to him. The abilities of 
School children are relatively undifferentiated, 
at least as compared with adults. Hence the 
median mental age is a satisfactory way of de- 
termining the level at which a child typically 
functions in tests of this sort. 

Instructions for administering the tests are 
clear and easy to follow, and the discussions 9 
validity and reliability are far more satisfactory 
than was true in former editions. Validity 1s "M 
fined in terms of discriminative capacity, P» 
is, of the ability of tests to detect differences 1 
mental development over the age range covere : 
While the tests are given under timed condition 
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they seem to be primarily measures of power. 
Statistical evidence indicates little difference be- 
tween timed and untimed scores from grades 
3 to 9. It seems clear, also, that the test battery 
is able to differentiate satisfactorily among chil- 
dren who are old-for-grade, young-for-grade, 
and on-grade. 

Тһе reliability of the test is high in terms of 
the split half coefficient and the standard error 
of a score (5.5 points of IQ) compares favorably 
with the same error of measurement in the 1937 
Stanford-Binet. 

The reviewer can find very little to criticize 
in the construction and development of these 
test batteries. One statement on page то is, per- 
haps, subject to qualification. Our authors state 
that when a subtest has a very high correlation 
with another subtest in the battery, both may be 
measuring the same thing and hence one at 
least is superfluous. On the other hand, they state 
that if the intercorrelations are low or negative, 
the subtests may be measuring different abili- 
ties and hence do not belong in the same battery. 
They conclude from this that moderate inter- 
correlations are to be sought for in that they 
imply overlap but not identity. The reviewer 
would agree with these statements though he 
would point out that high correlation between 
two subtests and the low correlation of one of 
them with the criterion may be desirable if the 
second test is "suppressing" part of the non- 
valid variance of the first test. Better evidence 
that the battery is measuring a homogeneous 
ability (called “intelligence”) might have been 
forthcoming had a factor analysis been made of 
overlapping test batteries, much as was done by 
McNemar and by Jones with the 1937 Stanford- 
Binet. 

The tests provide a measure of mental age as 
well as of IQ, and it is concerning this latter 
measure that the reviewer is most dubious. Test 
data reported in Table 11, page 27 of the manual, 
indicate that the standard deviations of the IQ 
distributions at different chronological age levels 
vary from approximately 10 to 16. This means 
that some part at least of the variability in IQ 
upon retest can be attributed to differences in 
IQ variability from one age level to another. It 
would probably be advisable for our authors not 
to use the term IQ at all. If they feel that they 
must, their IQ's should be labeled “Kuhlmann- 
Anderson IQ's" to distinguish them from Stan- 
ford-Binet IQ's. 

In general the format of the test is good al- 
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though it seems to the reviewer that some of the 
picture items in the early age batteries are rather 
small for young children. For the kindergarten 
and first grade child accustomed to large draw- 
ings, several of the picture items in Tests 3 and 
4 may not be entirely clear. 

The authors, as in earlier editions of these 
tests, provide mental growth unit equivalents for 
trials passed in each of the 39 tests of the scale. 
Heinis Mental Growth Units, although they 
seem to work surprisingly well, are subject to 
definite qualifications as was indicated by this 
reviewer in his discussion of the 4th edition of 
the Kuhlmann-Anderson tests (see 40:1404). 
Apparently the authors do not make as much 
of this feature in scoring their tests as they did 
in earlier editions. 


Davin SEGEL, Specialist in Tests and Measure- 
ments, Office of Education, Federal Security 
Agency, Washington, D.C. 

These tests are the same as those of the Fifth 
Edition. What is changed besides the publisher 
are: (a) a redesignation of the test booklets to 
indicate their level by letter (for particular ages) 
rather than by grade; (5) revision of the general 
directions for administering and scoring to make 
them simpler and easier to follow; and (c) the 
development of new statistical evidence on the 
validity and reliability of the tests. 

The Kuhlmann-Anderson Intelligence Tests 
have been in part unique in (а) the use of median 
mental age scores on separate tests to arrive at 
the mental age of the pupil, thus obtaining IQ’s 
less affected by extreme scores on any subtest 
caused by some particular extraneous influence; 
and (b) the use of a greater number of separate 
booklets for varying levels of competence which 
make tests better adapted to the different groups 
of pupils. It also has somewhat less verbal ma- 
terial than most group intelligence tests. 

The claim for validity rests on (a) data show- 
ing that the intercorrelations among the subtests 
and the correlations between the subtests and the 
total test are neither high nor low which the au- 
thors interpret as meaning that the tests are in- 
dependent enough to add to the intelligence 
measure but not so independent that it is meas- 
uring something else entirely; (b) on statistics 
showing considerable variation in average score 
as among groups of retarded pupils, on-grade 
pupils, and accelerated pupils; and (c) on the 
progressive increase in pupils attaining a larger 
number of correct answers through the grades. 
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In general, these are acceptable statistical in- 
dices concerning intelligence tests for the pur- 
poses for which such tests are now used in 
schools. The reliability of the tests at various 
age levels shows that the results may be used 
in the guidance of individual pupils. The relia- 
bility coefficients were found to be .91 for grade 
3, 88 for grade 5, .92 for grade 7, and .95 for 
grade 9. The standard errors of measurement of 
about six IQ points indicate considerable sta- 
bility. | { 

These tests are among the best as all-round 
group intelligence tests that give an overall 
mental age and only one caution need be made. 
As the authors themselves emphasize, the sub- 
tests should not be used as measures of this or 
that mental function and the profile of trials 
passed should not be used for guidance purposes. 
The profile may, however, be examined for pos- 
sible erratic test performance. 


For areview by Henry E. Garrett of the fourth 
edition, see. 40:1404; for reviews by Psyche 
Cattell, S. A. Courtis, and Austin H. Turney 
of the fourth edition, see 38:1049. 


[303] 
*The Mill Hill Vocabulary Scale. Ages 4-14, 11- 
14, I4 and over; 1943-48; 3 levels; 2 editions; manual 
('48) ; 15s. per 50 of any one level; 2s. 6d, per manual ; 
17s. 6d. per specimen set including 5 copies of each 
form and a specimen set of the complementary Pro- 
gressive Matrices (1938) (see 314); purchase tax 
t British purchasers only) and postage extra; nontimed 
Ly rds minutes; J. C. Raven; Н. К. Lewis & Co., 
t 

а) JUNIOR. Ages 11-14; Forms 1. (748), 2 (748). 

b) SENIOR. Ages 14 and over; Forms І (48), 2 ('48). 
C) ORAL DEFINITIONS FORM. Ages 4 and over; consists 
of all words in both forms of the junior and senior 
levels; individual; І form (’48). 


REFERENCES 

Hrs iced ОТДА ante C. “Intell b 

. Fourps, G. A., AND Raven, T. C. “Intelleci ili 
Ostupational Grade.” Occupational Psychol PRA raa Us and 

23:1961 

5. Fours, G. A., AND Raven, J. C. “Normal Changes in the 
Mental Abilities of Adults аз Age Advances.” J Mental Sci 
94г1зд-42 Ja '48. E 22:4882) 

6. Raven, J. C. “The Comparative Assessment of Intellectual 
Ability.” Brit J Psychol 39:12-9 S '48. * (PA 24:1063) 

7. Ѕтлтев, Patrick, “Comment on "The Comparative Assess- 
ment of Intellectual Ability.’ Brit J Psychol 39:20-1 S '48, * 

24:1065. y 

. Fourps, С. A. “Variations in the Intellectual Activities 
of Adults.” Am J Psychol, 62:238-46 Ap '49. * (PA 24:1716) 

9. Fourps, С. A. “Variations in the Intellectual Activities 
of Adults: Their Assessment and Significance," pp. 61-2. Ab- 
stract. In Pete and Papers of the Twelfth ЖК НК) al 
Congress of Psychology Held at the University of Edinburgh, 
July 23rd to 29th, 1948. Edinburgh, Scotland: Oliver and 
Boyd Ltd., 1950, Pp. xxviii, 152. Paper. * 

то. PETRIE, ASENATH, AND PowELL, Моют. B. “The Se- 
lection of Nurses in England." J Appl Psychol 35:281-6. * 
(PA 26:3090) 


For a review by David Wechsler, see 3:230. 
[304] 


*Miller Analogies Test. Candidates for gradügte 
school; 1926-50; Form H ('50); Form G ('47) dis- 


continued, but available for special purposes approved 
by ublisher; manual ('47); supplementary 

(^50) ; distribution is restricted and the test is admin- 
istered only at specified licensed testing centers; details 
may be obtained from publisher; 50(55) minutes; 
W. S. Miller; Psychological Corporation. * 


REFERENCES A 

1. LovxoREN, Levi ALTON. Students im the Lowest Decile of 
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Forms G and H of this test are culminations 
of about 25 years of development, aimed at 
measuring scholastic aptitude at the graduate 
level. Very extensive item analyses have Боп 
into the preparation of both forms. The manua 
is carefully prepared, with a few oversights (0 
be mentioned later. 

The reliability of Form G (odd-even) Wi 
estimated $ò be .92 to .93. Form H correlated .83 
to .9o With Form G in three groups of graduate 
students. "=" ж b 

Practical validity for Form G is indicated by 
several procedures. With grades in 24 iie 
courses at Minnesota the median correlation RH 
.55. At three other universities, 12 yaks y 
with ртайез and grade-point averages ha e 
median of .35. At Minnesota and Syracuse; co 
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relations with 15 part scores on comprehensive 
masters’ examinations had a median of about 
.40. In the clinical training program at the Uni- 
versity of Michigan, the mean scores on four 
groups— completed the Ph.D.,” “still in train- 
ing,” “voluntarily dropped," and “dismissed”— 
were 78, 72, 69, and 62, respectively. Centile 
norms are given for 13 different graduate and 
professional school groups, with number of cases 
ranging from 70 to 1,084. The same norms apply 
to Form H, with minor corrections. 

The most important criticism that one can 
offer for this test is that it was not based upon 
any explicitly stated philosophy of education at 
the graduate level, Scholastic aptitude at the 

. graduate level is not even defined in the manual, 

"The operational definition, as indicated by the 
emphasis upon course grades in validation 
studies, is whatever course grades indicate. 
Those of us who train students beyond the M.A. 
level well know that there are often great dis- 
crepancies in both directions between grade 
getting abilities and capacities for independent 
thinking and research. 

The indications are that the Miller Analogies 
Test is almost entirely a measure of the verbal 
comprehension factor. When we aim a test at 
predicting grade getting, we are likely to come 
out with such a measure. Even at the graduate 
level, courses emphasize concept formation and 
the highly verbal student surpasses. The correla- 
tions of the test with the Verbal Factor part of 
the Graduate Record Examination clustered 
about .75 to .8o. Its correlation with a vocabulary 
test (reported by R. R. G. Watt in a personal 
communication) was .86. 

Some minor technical details occurred to the 
reviewer as he tried the items. Unlike standard 
analogies tests, the four alternative answers may 
come at any one of the four positions, not al- 
ways at the fourth position; as is customary. No 
explanation for this is given in the manual ; it 
would have to be a good one to outweigh the risk 
of confusion this practice entails. Some analo- 
gies are very loose. On the one hand, this may 
irk the examinee who demands tighter logic. On 
the other hand, it may encourage accepting 
wrong loose analogies in other items. The alter- 
native answers are not always of the same part 
of speech in an item. This sometimes serves as an 
irrelevant cue for elimination of wrong answers. 
On rare occasions more than one item hinges on 
knowing the meaning of а certain word or princi- 
ple. This extra source of interdependence of items 


may add to the apparent reliability of scores 
without adding to effective measurement. 

In summary, it can be said that this test has 
been prepared with the careful use of the techni- 
cal procedures now available to test makers, 
overlooking a few minor defects of item writing 
and editing. It has shown substantial value in 
predicting "scholarship at the graduate level" 
as indicated by grades in courses and compre- 
hensive examinations. Its very strong compo- 
nent of verbal comprehension variance assures 
that outcome. A thorough study of what it takes 
to make a successful graduate student beyond 
the master's level will show much room for 
supplementary predictive aids. The user of this 
test should give special attention to the state- 
ment on the cover of the manual: "for use as 
one factor in the selection of graduate students." 
The reviewer would add a recommendation that 
if a fixed minimum qualifying score is used at 
all, it be not too high. We do not yet have the 
information needed to say what such a score 
should be. The reviewer would also add the un- 
gracious suggestion that a good vocabulary test 
would probably do as well and do it more effi- 
ciently. 


Cart I. HovrAND, Sterling Professor of Psy- 
chology, Yale University, New Haven, Con- 
necticut. Ч * 

Development of the Miller Analogies Test 
was begun over 25 years ago by W. S. Miller 
аз ап aid in selecting students for graduate work 
at the University of Minnesota. As a result of 
interest on the part of other institutions, the test 
has been made available more generally through 
licensed centers by the Psychological Corpora- 
tion. 

The test consists simply of 100 analogy prob- 
lems in verbal form. Thus it lends itself readily 
to self-administration and to group testing. It 
can be easily scored by hand or by IBM equip- 
ment. Nearly all students complete the test 
within the time limit of 50 minutes. A nearly 
perfect correlation has been found between 
scores with a 50-minute time limit and with a 
go-minute limit. The items are arranged in order 
of difficulty and the score is the number of items 
correct. ` we ost 

Two forms are currently in print. Form G, 
developed by Miller on the basis of two earlier 
forms, is now restricted for special uses. Form 
Н, prepared by Miller and the staff of the Psy- 
chological Corporation, is being extensively 
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used at present. Work is underway on Form J. 
Considerable care appears to have been exer- 
cised to make the types of items used in Forms 
H and J closely comparable in difficulty and 
content area to the original form. Nevertheless 
Form H is slightly more difficult than Form 
G. The publishers recommend that two points 
be added to Form H scores in the range from 
30 to 70 to permit use of the percentile equivalent 
tables developed for Form G. At no point does 
the 2-point difference in raw score change the 
percentile values by more than five points. The 
two forms have approximately equivalent stand- 
ard deviations and correlate .89. 

RELIABILITY. The reliability of Form С ap- 
pears to be satisfactorily high considering the 
narrow range of ability represented by graduate 
School applicants. The data presented in the 
- manual show odd-even reliabilities of about .93 
(corrected for length by the Spearman-Brown 
formula). So far, similar information has not 
been released for Form H although such data 
must now be available in considerable profusion. 
It appears to the reviewer that many institutions 
will be working with more restricted ranges than 
those used by Miller in his computation of the 
reliability coefficient, and appropriate considera- 
tion must be given this fact in their interpreta- 
tion of test-retest results. 

CORRELATION WITH OTHER TESTS. А con- 
siderable body of information is available on 
the correlation of the Miller test with other tests, 
but no systematic factor analysis of its compo- 
nents appears to be available. Since successful 
performance on the test requires knowledge of 
vocabulary and familiarity with the content of 
courses in science and the humanities in addi- 
tion to reasoning ability, it is not surprising that 
it correlates. with both the Verbal Aptitude 
(about .80) and Quantitative Aptitude (about 
.60) portions of the Graduate Record Examina- 
tion, with the Ohio State University Psycho- 
logical Examination (.82), with the Terman 
Concept Mastery Test (.73), and with a variety 
of tests of knowledge of school subjects (about 
:50-.80). (Data from manual and mimeographed 
memorandums prepared by the publisher.) 

vALIDITY. The main data on validity are those 
bearing on the correlation bétween Miller tests 
scores and performance in various graduate 
courses. In a variety of courses in education at 
the University of Minnesota, correlations with 
course grades were predominantly in the range 


ш 
2 


from about .40 to .60. Somewhat similar data 
are reported elsewhere for grades in chemistry, 
psychology, biology, and language. A recent 
study of Glaser (73) indicates a correlation of 
.38 between Miller scores and first year grades 
in medical school. The data on validity are prob- 
ably of necessity rather fragmentary and scat- 
tered since as Miller points out: 


One must expect that the validity will vary from 

school to school, from course to course, and from year 
to year. It depends in part on the factors which go into 
graduate school marks, the number of applicants for 
admission to a school or department, and the relevance 
of the ability measured by the test to the qualities re- 
quired by courses for which it is predicting. 
The data from the studies reported in the litera- 
ture are often difficult to interpret because rarely 
is information presented on the reliability of the 
criterion, ‘ 

Significant data are being accumulated on the 
relation between Miller scores and survival in 
graduate school. The most comprehensive data 
currently available are those of Kelly and Fiske 
(15) obtained in connection with the selection 
of clinical psychologists in the Veterans Ad- 
ministration training program. The test battery 
was administered to a large group of students 
selected for training, and the result compared 
with careful follow-up data. The data indicate a 
clear relationship between Miller scores and the 
completion of graduate training. Trainees who 
subsequently completed their studies and ob- 
tained the Ph.D. averaged 77.6 (SD=84) 
while those who were dismissed from the pro- 
gram averaged only 62.4 (SD = 13.7). The dif- + 
ference in means is highly significant (t= 5.9). 

SUMMARY. The Miller test appears to be a 
carefully constructed instrument for use in а 
very difficult area of selection where the cri- 
terion itself is far from satisfactory. The relia- 
bility is adequate even for the narrow range of 
ability typically involved and a large number of 
studies indicate that the test has considerable 
predictive value for academic success at the 
graduate school level. The limiting of the use 
of the test to licensed centers is an important 
safeguard for its proper use. 
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*Modified Alpha Examination Form 9. Grades 77 
12 and adults; 1941-51; 3 scores: numerical, ver! ie 
total; Form 9 ('42); revised manual ('51); Mie s 
mentary norms [47]; $4 per 25; separate answe 
sheets must be d $1.50 per 50 answer sheets ;, 3 
per specimen set; postpaid; 24(35) minutes; revis 
by F. L. Wells; Psychological Corporation. 
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This test has been changed from the original 
Alpha format in a number of respects. It is in a 
reusable booklet with a separate (hand scored) 
answer sheet. Antiquated items have been 
revised. The eight subtests are: addition, follow- 
ing directions, arithmetic problems, verbal anal- 
ogies, number series, mixed-up sentences, high- 
est common divisor, and same or opposite 
words. With four verbal and four numerical sec- 
tions, verbal and numerical scores as well as the 
total score are obtained. 

The old trusting attitude that examinees 
would not look at the test items until time to 
start the test is partly gone. But not entirely ; a 
person getting the idea of a test form quickly 
can, on some parts, solve a number of the items 
while the examiner is reading the directions. 

The reliability is indicated by correlations of 
from .87 to .92, with scores on Revised Alpha 
Examination, Forms 5 and 7. Correlations with 
several other variables are given. For example, 
correlations with Otis Self-Administering Tests 
of Mental Ability run from .64 to .79. Numerical, 
verbal, and total score norms are given for boys 
and girls combined in each of grades 7-12. There 
are also norms for adult men applying for execu- 
tive positions and for four groups of “male engi- 
neering personnel of a large airplane company" 
and for these four groups combined. 

We now demand more information about a 
test's construction, item analysis, and validation 
than the author and publisher have provided. 
"There is, however, a wealth of information about 
the original Army Group Examination Alpha 
and its various revisions not found in the manu- 
als; that literature can provide anyone who is 
interested with much more extensive informa- 
tion about this still good test than that which is 


available concerning all but a few of its succes- 
sors. * 
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*Moray House Picture Intelligence Test 1. Ages 
78-102 months; 1944-48; 1 form ['44] ; manual (48) ; 
20s. per 25; Is. per single copy; 1s. 3d. per manual; 
Is. 9d. per specimen set; postage extra; 25(75) min- 
utes; Margaret А. Mellone under the supervision of 
Godfrey Н. Thomson and an Advisory Committee of 
Edinburgh Headmasters and Infant School Mistresses; 
University of London Press Ltd. * 
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GERTRUDE KER, Lecturer in Psychology, Uni- 
versity College, London, England. 

The test consists of nine nonverbal subtests : 
Directions, Doesn't Belong, Completion, Ab- 
surdities, Sequence, Reversed Similarities, Al- 
ways Has, Analogies, and Series. The drawings 
are well spaced, and clear, though in some there 
is a little too much detail for children of this 
age. There are practice items for all the nine 
subtests with the exception of Directions. Each 
subtest is timed. 

The manual gives very complete and clear di- 
rections to the teacher for the administration of 
the test. The examiner's instructions for each 
subtest do not suffer from lengthiness more than 
is usual in group tests for young children. The 
manual also contains directions for scoring, an 
answer key, and tables for the conversion of raw 
scores into IQ's. The user is warned that the 
IQ's are not computed by means of mental age, 
and so are not comparable to the 1Q’s of the 
Binet test. In the 1948 edition of the manual 
the IQ's differ by about one point from those 
given in the original manual, being based on a 
more representative sample, hence those using 
the test should see that they use this, or later, 
manuals. For the convenience of clinical psy- 
chologists who “frequently require a measure of 
a child's brightness in terms of mental ages" a 
table of mental age norms is given at the end of 
the manual. “The figures have been extrapolated 
graphically outside the range 6 years 9 months 
to 7 years and 10 months and therefore become 
less secure towards the extremes." 
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The test has been constructed with great care. 
It consists of the most discriminative items from 
three trial batteries given to more than 600 chil- 
dren. 

The manual states that “the test shows a bias 
in favour of boys," the average superiority of the 
boys being about 2% points of IQ. This sex bias 
has not been confirmed by later statistics gath- 
ered by W. G. Emmett. Further scores for 2,289 
boys and 2,150 girls of complete age groups 
gives the mean for boys 57.56 and for girls 57.54 
(SD about 20). The reliability coefficient, ob- 
tained by the Kuder-Richardson formula 20 from 
a random sample of 250 scripts of 5,415 Edin- 
burgh children, is .96. Using Thurstone's cen- 
troid method, the author carried out a factorial 
analysis on the results of an early form of the 
test applied to 414 boys and girls. This form 
consisted of 14 subtests, of which 9 were retained 
for the final version. The results of boys and 
girls were analysed separately. A. general factor 
was found to be present in both. A second factor, 
a space factor, appeared in the boys’ results in 
addition to specifics; this was not found in the 
girls’ results, only general and specifics being 
present. The test appears to have good reliability 
and a fairly good degree of validity. The size of 
the correlations between this test and other group 


_ tests of intelligence, lower than might be deemed 


altogether satisfactory, is no doubt due to the 


'. age of the children tested and also to the fact 


that spatial ability appears to be measured by 
it to some extent. Nevertheless, it is among the 
best group tests of intelligence in this country 
for younger children. 


M. L. KELLMER PRINGLE, Lecturer in Educa- 
tion, and Acting Deputy Head, Remedial Edu- 
cation Centre, University of Birmingham, Birm- 
ingham, England. 

Designed to test intelligence at the transition 
stage when children move from the infant to the 
junior school, the test is being widely used in 
this country as it is the only group test for this 
young age group produced by trained and ex- 
perienced educationists. It consists of nine sub- 
tests each of which has a number of items (vary- 
ing from 8 to 15). With the exception of the 
first one, all subtests are preceded by three prac- 
tice items for which no score is given, while all 
correct items receive one point; except for the 
first and last subtest, all the items are of the 
multiple choice type. 

In view of the age of the children for whom 
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the test is intended, it is an important advantage 
that neither reading nor writing is required ; the 
instructions are given orally by the "supervisor" 
(to use Mellone's term), and to solve the var- 
ious problems, patterns, designs or pictures are 
to be ticked, crossed out, or encircled. Accord- 
ing to the general instructions in the manual, 
no more than 30 children should be tested at 
one time and two adults should be present, one 
to read out the instructions, the other to super- 
vise the children's work. In the writer's experi- 
ence 30 is too large a number with such young 
children. Copying is very easy and liberal spac- 
ing out is needed; 15 to 20 children would seem 
the maximum number, 

The first subtest, Directions, not only contains 
some rather poorly drawn pictures (the ring and 
the house for instance are neither typical nor 
representative of the child's own world) but 
also seems psychologically unsound, since it 
tends to arouse in some children anxiety at the 
beginning of the test. Being asked to draw lines 
across a page which looks neat and tidy with 
competently drawn pictures goes right against 
the usual training given at school where defac- 
ing and scribbling in printed books is frowned 
upon. As none of the other subtests are based 
on this manner of response, there seems to be 
no reason why this test should not be replaced 
by one similar to the other eight subtests. 

The test has been standardised on a total 
sample of 8,107 children aged between 6 years . 
9 months to 7 years 10 months, one group being 
children from Edinburgh and another from some 
English mining and country areas. The intelli- 
gence quotients derived from this test are 50 
arranged as to have an average of 100 with a 
standard deviation of 15. They differ from the 
Binet IQ's in that not being based on the usual 
calculation from chronological and mental ages, 
the IQ conversion tables are not self-consistent 
if interpreted in terms of mental age. 

Only the scantiest statistical data are offered 
despite the fact that the test has been widely used 
over the past six years or so in many parts 0 
England and Scotland. The reliability coefficient, 
“calculated from an answer pattern drawn up 
from a random sample of 250 scripts from EE 
complete year group of 5,415 Edinburgh goal 
dren,” is given as .96. Correspondingly, the 
standard error of the IQ is 3.1 which is smaller 
than that of most group tests but larger than 
the standard error of individual tests. Gorisider 
ing the high variability of performance wit 
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young children and the narrow age range, it is 
fairly moderate. No data are given for the valid- 
ity of the test or for its correlation and inter- 
correlation with other tests. 

It is stated in the manual that it must not be 
assumed on the evidence of this test alone (or 
of any group test) that an IQ of less than 7o 
necessarily implies that the child is mentally 
defective. But surely it ought to be stressed even 
more that group tests are unreliable with young 
children, and that they often give a serious un- 
derestimate of intelligence not only with the dull, 
but with the emotionally disturbed, the sensitive, 
the discouraged, or excitable child. This test can 
be considered a useful tool for predicting a child's 
readiness to respond to more formal methods of 
teaching. A low result should be accepted only 
as an indication that a child is not ready for such 
an approach but the reason for this must be 
found through individual tests and case study 
procedure. 
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*Non-Verbal Test. Ages 10-11; 1947-51; title on 
test booklet is A Scale of Non-Verbal Mental Ability; 
2 forms (listed below) ; use of the practice exercise, 
Preliminary Practice Test 1 ['50], is optional; revised 
manual ['51]; 70s. per 100; 9s. per 12; A an single 
copy; 255. per 100 practice exercises; 35. 6d. per 12; 
4d. per single сору; 2s. 9d. per manual; postage extra; 
practice exercise: 10(15) minutes; test: 30(35) min- 
utes; J. W. Jenkins and D. M. Lee (П); National 
Foundation for Educational Research in England and 
Wales, * , 
а) NON-VERBAL TEST I. 1047. 
b) NON-VERBAL TEST II. 1948. 
REFERENCES 

1. Mitus, L. Е. The Properties of а Space Test. Bachelor's 
thesis, University of Edinburgh (Edinburgh, Scotland), 1947. 

2. Pert, A. “Symposium on Selection of Pupils for Dif- 
ferent Types of Secondary Schools: VI, Evidence of a Practical 
Factor at the Age of Eleven.” Brit J Ed Psychol 19:1-1 SF 49. 

fi Scorr, G. C. “Measuring Sudanese Intelligence. Brit J 
Ed Psychol 20:43-54 Е '50. * (PA 25:316 


E. A. Peet, Professor of Education and H ead 
of the Department, University of Birmingham, 
Birmingham, England. 

These parallel scales of nonverbal ability may 
be applied according to two procedures, I and 
II. In the first procedure the test is applied with- 
out preliminary practice in similar material, and 
in the second a practice test is given and the 
instructions for the main test are presented 
orally. The second procedure is to be preferred. 

Scale I was devised by the late J. W. Jenkins 
as a measure of nonverbal intelligence. It has 
been used extensively in England and where 
applied by research workers in a mixed battery 
of verbal, nonverbal, pictorial and spatial tests, 
has generally come out as a measure of pure g. 
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Thus when the results of such batteries are 
analysed by Thurstone's centroid method and 
the unrotated factors examined, the communal- 
ity of Jenkins' test appears to be almost com- 
pletely explicable in terms of the first factor. 
Its loading in the second bipolar factor is us- 
ually very near to zero. The test is therefore to 
be recommended as a useful reference test where 
problems relating to different qualities of intel- 
ligence are being investigated. It is to be dep- 
recated that the publishers of this test do not 
present some of the interesting factorial infor- 
mation concerning the test. 

The test is composed of five separately timed 
subtests making up an overall time of 30 min- 
utes. Jenkins used the mathematico-logical prin- 
ciples of classification, ordered sequence and an- 
alogy, and applied them to nonverbal pattern 
material. Subtests 1 and 3 use the device of like 
and unlike as a test of the principle of classifica- 
tion ; Subtests 2 and 4 utilise the familiar device 
of ordered sequence ; and Subtest 5, the analogy. 
These are familiar enough concepts but the main 
feature of the Jenkins test is that they lead to 
high g saturation and negligible or insignificant 
k loadings. 

When used without practice tests the instruc- 
tions at the head of each subtest have to be read 
by the testee. These are brief but admit the pos- 
sibility of verbal interference, and it would be 
preferable if they were orally presented by the 
tester. The key is much smaller than the test 
booklet and is difficult to relate to the test. In- 
structions and information about the test are 
provided separately. Norms are provided over 
the age range 10:0—12:11, and there is a marked 
gradient in the scores with respect to age. For 
example, the standardised score of 100 at 10:0 
is equivalent to a raw score of 28, and at 12:11 
to a raw score of 45. The test is therefore dis- 
criminative with regard to age. The standardisa- 
tion is based on a mean of 100 and a standard 
deviation of 15, and the scores are therefore 
roughly equivalent to IQ scores. 

It would be advantageous if the information 
about the test, instructions, key and norms, were 
all given in a single test manual of the same size 
as the test booklet. The possibility of sex differ- 
ences is not discussed and the norms are based 
on a mixed population of boys and girls. The 
test reliability is .95 by the split half method, but 
no test validity is given either in terms of other 
tests of intelligence or independent estimates of 
general intellectual ability. As in the case of the 
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relationship of the test to other tests of intelli- 
gence there is much material available about the 
relationship of this test to outside estimates of 
intelligence and the publishers might well have 
produced some of this information to guide the 
would-be user of the test. For all that, the test is 
to be recommended as a measure of g. It should 
not, however, be used to measure spatial ability 
or practical ability. In this respect the title may 
be a little misleading. 

Scale II was devised by Lee and Jenkins as 
a parallel form of the first issue and embodies the 
same logical principles. It may be administered 
according to either procedure, that is, with or 
without practice material. The new test manuals 
now printed are a great improvement and the 
key is of the same size as the test booklet. This 
makes for rapid, easy scoring of the test. Reli- 
ability of the test is .92 and the two parallel 
forms correlate to the extent of .87. As in the 
case of the first test no information is given 
about the relation of the test to other published 
tests of intelligence and also no correlations are 
given between the test and independent esti- 
mates of intellectual ability. Norms are given 
over the same age range and standardised to the 
same basis as the first form. The second form 
is not so discriminative with regard to age. At 
10:0 the standardised score of 100 is equivalent 
to 22 points of raw score, and at 12:11 to 38 
points. This gradient, however, makes fair dis- 
crimination with regard to age. The norms are 
based on a mixed population, but the authors 
might well have given some indication of the 
Sex differences of the test, particularly in view 
of the accumulating evidence that this type of 
material may show significant differences. In 
spite of these criticisms both forms of the test 
are to be recommended as useful for research 
workers investigating mental ability and for 
routine grading of children on the basis of non- 
verbal general ability. For school selection pur- 
poses the material should be supplemented by 
verbal material. 


The review excerpt presented in the first 
printing of The Third Mental Measurements 
Yearbook ( 3:260) is a review of the SRA Non- 
Verbal Classification Form. 


[308] 
*Ohio State University Psychological Test. 
Grades 9-16 and adults; 1910-50; 2 scores: reading, 
total ; IBM; Forms 18 (733), 20 (’37), 21 (740), 22 
(43), 23 (47), ^r (750) ; forms are successive revi- 
sions, not parallel forms; separate answer pads or an- 
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swer sheets must be used; 66 per answer pad; sé per 
IBM answer sheet (Forms 21, 22, 23 only) ; 5¢ per 
scoring stencil; postage extra; worklimit (120) min- 
utes; Herbert A. Toops; Ohio College Association, * 
а) FORM 24. 10¢ per test and answer pad or IBM an- 
swer sheet. 

b) Forms 18, 20, 21, 22, hf per new test; 2¢ per 
used test. (Form 21 also published in 2 editions by al 
ence Research Associates, Inc.: Form AH (hand scor- 
ing edition) : 49€ per test and answer pad; $2.15 per 
25 answer pads; Form AM (machine scoring edition) : 
39€ per test; $2.90 per 100 IBM answer sheets; sof 
per scoring stencil; 75€ per specimen set; cash orders 
postpaid.) 

REFERENCES 

1-28. See 3:244. 

29. Harvin, RongRT ALLEN. A Study of the Maturity of 
High School Seniors, Junior Colle: e, and University Students 
as Measured by the Ohio State niversity Psychological Test 
and the Pressey Interest-Attitude Test. Doctor's thesis, Uni- 
versity of Nebraska (Lincoln, Neb.) 985: 

30. Mayor, C. L. “The Percentile Ran ing on the Ohio State 
University Psychological Test as a Factor in Ботен the 
Success of Teachers in Training." Sch & Soc 47:582-4 Ар 30 
'38. * (PA 12:4361 T 

31. LAUGHLIN, ILLIAM Patrick. Changes of Intelligence 
Scores of College Students Over a Period of Approximately 
Four Years. Doctor’s thesis, University of Minnesota (Minne 
«0010 Мїап,), 1940. (Summaries of Ph.D. Theses, 1949, pP. 


2. Gentry, Doxorny E. An Attempt to Predict. Scholar- 
ў Variety of Objective Tests: College Abili Test, 
Nelson-Denny Reading Test, Kuhlmann-Anderson Intelligence 
Test, and English Placement Test. Master's thesis, Ohio Uni- 


versity (Athens, Ohio), 1943. Pp. зз. (Abstracts of Masters’ + 


Theses....1943, 194. . 36-7.) ү 
p THOMPSON. RP ?g? “Predictive Criteria for Selecting 
A.S.T. Students." J Ат Assn Col Reg 19:492-500 Jl '44. 
ся our L. D. "Inf f Level of Motivation on 
34. Hartson, . "Influence o evel o oti’ 
the Validity of Intelligence Tests.” Ed & Psychol Meas 5:273- 
83 au ’45. * (PA 20:2500) З E h 
35. Starr, Hener Cannon. A Comparative Analysis oh is 
Ohio State Ps chological Scores of the Yearly Groups o n 
versity of Utah Freshmen. Master's thesis, University of U A 
(Salt e City, Utah), 1945. the 
36. PAULSEN, Gace B. “The Intellectual Demands of he 
уаш кн а сае IRIS Am Рзусі 
2:422 47. 22:1 , title only * "oO 
7. Соорвтрев, Н. Y AND Lastett, Н. К. "Predictive 
Values of the Stanford Scientific and the Enginceringe d 
Physical Science ҮҮ Tests." Ed & Psychol Meas 8:683-7 
w - 24:14! А É 
38. HznrzL, J. P., AND Dr Vesta, Francis J. “Ап Evaluation 
of Five Factors for Predicting the Success of Students) М 
ing the New York aie Collec of RUE Ed & Psy 
eas 8: au '48. 23:4412 3 
39. INGRAM Lois! M. The Prediction of Scholastic sie 
From a Variety of Measures of Ability and Achievement. d 
ter's thesis, Ohio University (Athens, Ohio), 1948. Pp. 
(Abstracts of Masters’ Theses... 1948, pp. 66-7.) in Intro- 
40. Portenter, LrLtLIAN G. “Predicting Success in 148, * 
ductory Psychology." Ed & Psychol Meas 8:117-26 sp '48. 
(PA 22:3730) кт in Intro- 
41. Porrenrer, Їлїллхм С. “Predicting Success у Mr 
duct Psychology.” Abstract. J Colo-W'yo Acad Sci 3: 
4 


4 } AND 

42. Dr Vesta, Francis J.; Woopmurr, Asant D.; AND 
Herter, Jomw Р. “Motivation as a Predictor of College Sue 
cess." Ed & Psychol Meas 9:339-48 au '49. * (PA 2 e Sur- 

43. Macvonatp, Сокроч LUuNDY. Predicting Collegia: York 
vival From Pre-Admission Data. Doctor’s thesis, ‘Microfilm 
University (New York, N.Y.). 1949. Abstract: Fe 
Abstracts 10:42-4 по т "so. * (РА 24:6063, title оп Ys Suc 

44. RiNsLAND, Henry D. “The Prediction of Veterans, Part 
cess From Test Scores at the University of Oklahoma Al 
1, рр. 59—72. In The Sixth Yearbook of the Nationa, си 
оп Measurements Used in Education, 1948-1949. Fairmont 
W.Va.: the Council, Fairmont State College, 1949. FP V» 
(variously numbered). Paper. mimeographed. ion of à St 

45. Онввкоск, RICHARD STEPHEN. Construct Ohol 41: 
lection a for College on J General Psy и 

dd 24:4874 in the 

ub Wate, с Сбплват. “Potential Research Talente. Rec 
Sciences: Based әп Intelligence Quotients of Ph.D's. е 

14-2 3 ; u- 
er? Ар Warren. Selection of Grano tate 
dents: (1) The Adequacy of Certain Measures for J The Value 
ing Between Two Grouds of Master Candidates (2) | Achieve- 
of These Measures in. Prognosina Graduate Academici N.Y). 
ment. Doctor's thesis, New York University (New York. үр 
1950. Abstract: Microfilm Abstracts 11:53-74 no 1 '51. 
24:2428, title only) 
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ui Раана, PEPESdu a9 9 ЧА CREE 
5008) Ryan, Ѕогек ErpoN. Some Characteristics of the 1948 
Freshman Class at the University of Missouri and the Rela- 
tion of These Characteristics to Academic Success. Doctor's 
thesis, University of Missouri (Columbia, Mo.), 1950. Abstract: 
Microfilm Abstracts 10:84-5 no 3 'so. * (PA 25:4828, title 
саа) Ѕлвен, RAvMOND, Comparisons of the Wechsler-Bellevue 
Adult Intelligence Scale With the Ohio State Psychological 
Examination in Predicting Academic Success for College Fresh- 
men. Master’s thesis, Ohio University (Athens, Ohio), 1950. 
Pp. 40. (Abstracts of Masters’ Theses...., 1950, p. 126.) 

51. CARTER, The Missouri College Aptitude Testing 
Program. Universit of Missouri Bulletin, Vol. 52, No. 30; 
Education Series, No. 40. Columbia, Mo.: the University, ӧс 
tober 20, 1951. Pp. 76. Paper. * (РА 26:3043) 

GrorcE А. Fercuson, Professor of Psychology, 
McGill University, Montreal, Canada. [Review 
of Form 24] 

This test, which is widely known and used, 
has been revised many times since work was 
first initiated on it in тото. Few, if any, tests 
have been the object of more careful, prolonged, 
and continuous study. Further, a number of im- 
portant contributions to psychological test theory 
have accompanied its development. Its authors 
over the past 30 years have employed the most 
exacting technical procedures in the develop- 
ment of new forms. Form 24 is not markedly 
dissimilar in format and content from a number 
of earlier forms. Three subtests comprise the 
test: same-opposites, word relationships, and 
reading comprehension. There are 150 items in 
all. No time limit is specified. The test is of the 
power-limit or power type. This, the authors 
contend, provides a much more accurate evalua- 
tion of a student’s scholastic potentiality than 
similar time-limit tests. The test is self-marking, 
the testee employing a pin stylus to record his 
answers on an answer sheet. A score is obtained 
by simply adding up the right answers. IBM 
answer sheets and scoring may also be employed. 

Percentile norms for total scores and subtests 
separately are available based on a sample of 
3,790 freshmen in Ohio colleges. Norms are 
available also for Ohio high school freshmen, 
sophomores, juniors, and seniors. 

The unusual technical effort expended in the 
development of this test has yielded dividends 
in terms of increased validity. Recent forms of 
the tests are reported to have correlations with 
a criterion of scholastic performance consistently 
of the order .бо with an occasional coefficient 
greater than .70. No reliability data are reported 
for Form 24. The reliability of Form 21 (N= 
300) is .93. P NT 

This test emphasizes verbal ability which is 
of undoubted importance in scholastic achieve- 
ment. My impression is that the authors have 


approached a limit in the prediction possible 


with this type of test material. It is probable that 
future technical improvements, however exact- 
ing, will not result in an appreciable increase in 
validity. This test serves as an excellent illus- 
tration of what can be accomplished by the 
rigorous appliance of sound technical proce- 
dures. It illustrates, likewise, the limitations of 
that accomplishment. A correlation of .70 with a 
criterion, which is high for tests of this class, 
still leaves about 5o per cent of the criterion 
variance unaccounted for. The technology that 
has evolved around the construction and use of 
mental tests, however rigorously applied, can 
carry us only so far in the prediction of human 
behavior. Progress in this area will to а con- 
siderable extent depend on the development of 
new types of test material which define proper- 
ties of human behavior radically different from 
those defined by many of the more widely used 
types of tests. These observations are in no we 
intended to imply criticism of the present test, 
which clearly stands out as a model of technical 
excellence among tests of this class. 


For areview by J. P. Guilford of Form 21, see 
3:244; for reviews by Louis D. Hartson, Theos 
A. Langlie, and Rudolf Pintner of Form 20, see 


38:1051. 


[309] 

*Ога1 Directions Test: An Intelligence Test on 
Phonograph Records. Grades 6-12 and adults; 1945- 
50; also called O.D.T.; 1 form, '45; revised manual 
(’50) ; 3 types of vinylite recordings : standard 78 rpm 
(four 12-inch records), transcription 33/4 rpm (опе 
16-inch record), “LP” microgroove 33% rpm (one 12- 
inch record) ; each record set includes manual, plastic- 
covered key, and 100 answer sheets; $18 per standard 
record set; $17 per transcription record set; $12 per 
microgroove record set; $4 per 100 answer, sheets ; 
postpaid; 28(35) minutes ; Charles R. Langmuir; Psy- 
chological Corporation. * 


REFERENCES 
. Seasnorr, Hanorp С. “Psychological Testing With Phono- 
Pant Recordings.” Abstract. Am Psychol 1:248 Jl '46. * (PA 
20:3932, title only) 


For reviews by Charles D. Flory, Irving 
Lorge, and William W. Turnbull, see 3:245. 


[310] 

XOtis Employment Tests. Applicants for employ- 
ment; 1943, c1922; identical to Forms A and B of Otis 
Self-Admunistering Tests of Mental Ability (see 
3:250) except for manual and cover page; 2 levels ; 
Yorms A, B; no data on reliability and validity in 
manual (for data presented elsewhere b the author, 
see manual for Otis Self-Administering Tests of Men- 
tal Ability) ; $1.60 per 25 of any one level, postage 
extra; 356 per specimen set, postpaid; 30(35) min- 
utes; Arthur S. Otis; World Book Co, * 

a) test 1. Applicants for jobs requiring a grammar 
school level of ability. 
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b) test п. Applicants for jobs requiring a high school 
level of ability. 

[311] 
*P.C. Verbal Capacity Sampler. Male adults (ages 
20 and over); 1950-51; Form AH ('s1) ; reliminary 
peri (51) $1.25 per 25; 256 per serine PETEA ў 
25¢ per manual; 75€ per specimen set; postpaid; 5(10 
minutes; Byron B. ЖаҢы» and Gerald Р. Bodily; 
Psychological Counseling, Inc, Tampa, Fla. * 

[312] 
*Pattern Perception Test. Ages 6 and over; 1943; 
1 form; no data on reliability and validity ; no manual ; 
mimeographed directions for administering [4 я 
mimeographed directions for self-administering [’43] ; 
305. рег 100, postage extra; Is. per specimen set, post- 
paid; 30(40) minutes; L. $. Penrose; Galton Labora- 
tory, University College, Gower St, London W.C.r, 
England. * 


REFERENCES 


1. Penrose, L. S. “An Economical Method of Presenting 
Matrix Intelligence Tests.” Brit J Med Psychol 20:144-6 pta 
^44 (issued Е 45). " (РА xp:4067) 1 б 

2. Penrose, L. S. “The Pattern Perception Test: Experi- 
How ЖӨН.” Brit J Med Psychol 20:339-43 pt 4 '46. * 

21:2800 

3. PETRIE, ASENATH, AND PoweLL, Murret В. “The 
lection of Nurses in England.” J Appl Psychol 35:281-6 
"51. * (PA 26:3090) 


Атлсе W. Herm, Medical Research Council Ap- 
plied Psychology Research Unit, Psychological 
Laboratory, University of Cambridge, Cam- 
bridge, England. : 

This is a paper and pencil intelligence test 
consisting of 64 items. These are preceded by a 
set of eight examples which the tester explains 
and answers for the subject. Each problem con- 
sists of a row of five diagrams; the subject is 
instructed to find four diagrams in each row 
which form a perfect pattern (or matrix) and 
to strike out the odd one. The problems are 
arranged in eight sets of graded difficulties. In 
each set they begin with an elementary presenta- 
tion of a theme which is subsequently developed. 
In some problems, the matrix of four consists of 
two more or less closely related pairs of dia- 
grams. .— 

The score is the number of correct answers 
given. It is unnecessary to penalise wrong an- 
swers or to adjust for correct guesses since low 
scoring subjects were found to answer the dif- 
ficult problems correctly in much less than 20 
per cent of cases. Evidently failure to grasp the 
principle, especially for the harder questions in 
this test, results in the choice of a wrong answer 
more often than would be expected by chance. 

The test has very wide application since it 
effects discrimination at both ends of the intelli- 
gence scale (i.e., among mental s on the 
one hand and among university students or army 
officer candidates on the other). Its correlations 
with other standard intelligence tests range from 
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73 (with a random sample of 597 army males) 
to .43 (with 67 medical students). The reviewer - 
has no information as to its test-retest consist- 
ency. (1) 

Some interesting differences have been found 
(т) with respect to individual test items in the 
scores of male and female subjects and of normal 
and psychotic subjects. These differences sug- 
gest (a) that whilst men tend to excel on items 
requiring rotation and superimposition for their 
solution, women do better on those in which 
(to quote the author of the test) "the pattern 
itself is intriguing"; (b) that distinctions be- 
tween psychotic and normal thinking, exem- 
plified in test results, do not necessarily require 
a verbal test for their detection. Psychotic sub- 
jects were found to do relatively well on items 
which involved rotation or reflection of figures 
or mere counting and to do relatively poorly on 
items requiring “common sense or constructive 
thought.” (т) 

The main weakness of Pattern Perception as 
a test of general intelligence lies in its having 
only one “bias” : not only is every problem dia- 
grammatic in nature but the form of the subject's 
task is exactly the same throughout the test. This 
inevitably results in certain subjects scoring 
“unduly” high and others scoring “unduly” low 
marks. Furthermore, the lack of variety makes 
difficult the production of those discrepant scores 
which, in mixed tests, often prove illuminating. 

A further criticism derives from the theoreti- 
cal basis of the construction of the problems. It 
is an extremely ingenious system based on com- 
bination theory ; but as a direct result of this the 
subject may acquire, or fail to acquire, a knack 
which seems to the reviewer still more specific 
than that required by most “pure” intelligence 
tests. It is perhaps the logical rather than the 
psychological outcome of attempting to assess 
mental calibre by means of one homogeneous 
measure. For example, some of the problems 
require four of the diagrams to be recognised as 
two pairs, the fifth being extra because it 4 
unpaired. But in many such items, one of the 
members of a pair is, for instance, the only ire 
gram of the five which is all black. It may i 
argued that all blackness as the criterion shoni 
be rejected since another of the paired diagrams F 
is all white. This is indeed always the Sa BA 
such questions. In the reviewer's opinion, Pc 
ever, this point should be brought out gna 
preliminary examples. These examples ue Ші 
no illustration of the particular type of probie 
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criticised above; they are all therefore easier 
than the items on the last pages of the test proper. 
Finally, the subject does not complete these ex- 
amples on his own account. These seem to the 
writer to constitute serious weaknesses of the 
test. 

Pattern Perception is however an exception- 
ally welcome test because of its unique suitability 
for subjects of every range of intelligence. 


[313] 

*The Peel Group Test of Practical Ability. Ages 
8-14; 1951; for selecting "children who might benefit 
from a technical craft, or art school education"; non- 
language; I form [51] ; manual [51] ; 7s. 6d. per 25; 
ба. per single copy; Is. 3d. per manual; purchase tax 
(British purchasers only) and postage extra; 25(30) 
minutes; E. A. Peel; Thomas Nelson & Sons Ltd. * 


REFERENCES 

т. Lampert, Constance M. “Symposium on Selection of 
Pupils for Different Types of Secondary Schools: VII, A Sur- 
vey of Ability and Interest at the Stage of Transfer.” Brit J 
Ed Psychol 19:67-81 Je '49. * 

2. Pret, E. A. "Symposium on Selection of Pupils for Dif- 
ferent Types of Secondary Schools: VI, Evidence of a Prac- 
es Factor at the Age of Eleven," Brit J Ed Psychol 19:1-15 
F '49. 


Скокск Westsy, Head of the Psychological 
Laboratory, University College of Hull, Hull, 
England, 

This is a novel and economical pencil and 
paper test consisting of 54 perceptual items in 
the form of well designed and printed black and 
white patterns, mainly but not exclusively geo- 
metrical (some organic configurations are in- 
cluded) and arranged in three subtests ANE 
Z requiring 10, 5, and 10 minutes respectively 
to administer. In each of the X and Z items there 
is an imperfection in the "Gestalt" or “Struc- 
ture” and the task of the testee is first to discover 
the law of the configuration or pattern-principle, 
then observe the fault and mark it with a cross 
in the test booklet, a task which may well in- 
volve some special ability beyond simple per- 
ceptual discrimination and comprehension of 
relations of similarity. Subtest Y is a simple 
matching test. The testee compares a correct 
with a faulty pattern where both are printed side 
by side and again marks the imperfection with 
a cross, It would be interesting to know how this 
item correlates with the two other subtests and 
with total score in view of what seems “a priori” 
to be a significant difference in the tasks. 

The test designer observed in 1949 that an 
aesthetic factor seemed to be involved in some 
nonverbal tests and hazarded a guess that the 
measurement of this along with the spatial factor 
might be relevant for some selection problems. 
This led to the construction of the test now pub- 
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lished with the claim that it measures "sensitive 
reaction to form and pattern" and that this en- 
ters into "all technical and craft operations." 
Further, that "high correlations are attained 
with success in woodwork, metalwork, technical 
drawing and art school design, indicating that 
the test is a true technical measure." Correla- 
tions given (without, however, any data of popu- 
lations), are .5 to .6 with woodwork and tech- 
nical drawing, .68 with art school drawing, and 
д2 with art school design. Correlations with the 
Alexander Performance Scale, an individual 
test with much the same purpose, are given as 
between .4 and .6, and with verbal tests, as be- 
tween .5 and .7, but again no information is 
given as to the populations from which these 
coefficients were obtained. The reliability is given 
as .86 which is around the minimum for a good 
mental test, but again the method used to arrive 
at this is not indicated. There is no published 
information as to the correlations of the test 
with other superficially similar tests which claim 
only to measure relational thinking by means of 
visual perceptual tasks such as Raven's Progres- 
sive Matrices, the Cattell Culture-Free Test, or 
the British N.LLP. Group Tests 70. Such a 
comparison would enable ап assessment to be 
made of the special claim of this test to be meas- 
uring something new. 

The main uses claimed for the test are: (а) To 
help in the selection from among British pri- 
mary school children aged 10.3 to 11.2, those 
likely to benefit from a technical, craft, or art 
school training, at the point when a transfer is 
made to secondary education. (b) To show 
whether pupils between the ages of 12.3 and 
14.0 already receiving academic secondary edu- 
cation should be transferred to technical, craft, 
or art school. (c) To provide some guidance in 
the choice of vocation after leaving school. With 
these ends in view a thorough standardization of 
the test to a mean of 100 and SD of 15 involving 
some 15,000 children has made available the 
following adequate and independent norms : (а) 
Norms for each sex separately, in the age group 
10.3 to 11.2, the usual age range for selection for 
secondary education in British schools. (It is 
worthy of note that no sex differences ap- 
peared.) (b) Norms for a mixed group of 15, 
ооо aged from 12.3 to 14.0, the transfer group. 
(c) Norms for a mixed group of about 1,000 in 
the age group 8.6 to 9.6. The last group was 
selected with the hope of using the test to assess 
the emergence of “special abilities” in the early 
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school years, presumably for research or guid- 
ance purposes as there is no need for selection 
at this stage. 

Any assessment of a new and relatively un- 
tried instrument must obviously be provisional, 
but it seems doubtful whether the loading of 
the “К” or "f" or any spatial or “practical” fac- 
tor, or of any special "aesthetic" factor, or of 
any P or "perceptual speed" factor will be large 
enough- to implement its purpose. Reports by 
Vernon and Parry? suggest that a nonverbal 
test of relational thinking along with suitably 
designed measures of mechanical and practical 
interest (through what may be largely a test of 
mechanical information and attainment along 
lines of the Bennett Test of Mechanical Compre- 
hension), along with an arithmetic test for tech- 
nical entrance would provide the most satisfac- 
tory method of attaining the ends outlined above, 
insofar as psychological tests can play a part. 

In the absence of much more explicit and ade- 
quate data of validation, any programme as eco- 
nomical as this test must remain in the reviewer's 
opinion incautiously optimistic and calculated 
once again to weaken the recovering reputation 
of the mental tester among educationists who are 
able to assess psychometric claims without preju- 
dice. 


[314] 
*Progressive Matrices. Ages 5-11, 6 and over, 11 
and over; 1938-51; nonlanguage; 3 levels; separate 
answer sheets must be used; purchase tax (British 
purchasers only) and postage extra; J. С. Raven; 
Н. К. Lewis & Co., Ltd. * 
а) PROGRESSIVE MATRICES (1938) : SETS A, B, C, D, AND 
E. Ages б and over; 1938-50; 1 form, ’38; 1685. per 25; 
75. 6d. per 25 answer sheets ; 25. 6d. per manual 50); 
17s. 6d. per specimen set including 5 copies of each 
form of the complementary Mill Hill V ocabulary Scale 
(see 303) ; nontimed (60) minutes. 
b) COLOURED PROGRESSIVE MATRICES (1947): SETS A, 
Ab, AND B. Ages 5-11; 1947-51; I form, '47; 1685. per 
25; 7s. 6d. per 25 answer sheets; 25. 6d. per manual 
(51); 17s. 6d. per specimen set including 5 copies of 
each form of the complementary Crichton Piu 
Scale (see 337) ; nontimed (30) minutes ; also distrib- 
uted by George С. Harrap & Co. Ltd. 
C) PROGRESSIVE MATRICES (1047) : SETS I AND II. Ages 
II and over (especially with above average intelli- 


.gence); 1943-47; 2 editions; 1 form, '47; no data on 


reliability and validity. 

I) Set I. For use either as a practice test for Set II 
or as a rough screening test; no norms; 50s. per 25; 
5s. per test; (10) minutes. 

2) Set II. For use either as “а test of intellectual 
capacity" when used without a time limit or as “а test 
of intellectual efficiency" when used with a time limit 
("usually of 40 minutes") ; distribution is restricted 
to registered users who after signing an agreement 
and paying an annual service fee may borrow tests 
1 Vernon, Philip E., and Parry, John B. Personnel Selection 


im the British Forces. London: University of London P. 
Ltd., 1949. Pp. 324. * (PA 24:776) x ae 
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for indefinite periods at 150s. per set of 25 tests or 
for 4 weeks at 205. per set of 25 tests; 7s. бй. per 100 
record forms; specimen set not available. 


REFERENCES 
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24. “Raven's Intelligence Test" Editor's reply to query, 
Brit Med J 4506:706 My 17 '47. * Reply by John C. Raven: 
Brit Med J 4510:074 4 ДУ t 

25. Мплз, L. F. The Properties of a Space Test. Bache- 

r's thesis, University of inburgh (Edinburgh, Scotland), 


b Fouros, С. A, ano Raven, T. C. “Intellectual Ability 
lonal 


and Occupat: Grade." Occupational Psychol 22:197-203 
О '48. * (PA Ано) 
27. Fouros, б, А., AND Raven, J. C. “Normal Changes in 


, 
the Mental Abilities of Adults Age Advances.” J Mental 
Sct 94:133-42 Ja '48. * (PA 22:4882) 
28. Raven, f. C. “The Comparative Assessment of Intellectual 
Ability." Brit J Psychol 39:12-9 S '48. * (PA 24:1063) 2 
29. Rimoni, Н. J. А "A Note оп Raven's Progressive 
Жаса Test." Ed Psychol Meas 8:347-52 au '48. * (РА 
23:42. 
oe US H. J. A. "Study of Some Factors Related to 
Intelligence." Psychometrika 13:27-46 Mr '48. * (PA 22:13820) 
31. Корор, @ ре M. “Тһе Kent and Other Tests Used on 
thé Same Subjects.” J Mental Sci 44:452-8 Ар '48. * (PA 
23:12 1 
uox LATER, Patrick. “Comment on ‘The Comparative 
ON Ry ae Ден! Ability.” Brit J Psychol 39:20-1 
"48. Hi n 
33: таке ўан S. Биши Findings оп Testini 
the Cerebral Palsied With Raven's ‘Pro; 


gre Matrices. 
J Excep Child 15: + D '48. * (PA 24:733). 

34., CASSEL, Bokar H. “ анын Evaluation of the Pro- 
gressive Matrices Tests." Ed & Psychol Meas 9:233-41 su 749, 
(PA 24:184) к iviti 

35. Fourps, С. A. “Variations in the Intellectual Ау 
of Adults." Ат J Psychol 62:238-46 Ар '49. * (РА 24:171 
36. JasrAK, Joseru. “Problems of Psychometric 


Analysis." Psychol B 46:1 My '49. * (PA 24:188) — .. 
3 x Kerr, ERTRUDE fle Progressive atrices as S 
to School Children.” Brit J Psychol, Stat Sect 2:140-50 N 49. 


(PA 24:3213) Qi a 
38. Norcorr, Bernard, “The Distribution of Scores 
Rare Progressive Matrices Test.” Brit J Psychol 40:68-70 

e, A 2 " ipee” 
39 аш 7 E? “The Structure of Practical Abilities.” 
Occupational Psychol 23:81-96 Ap "49. * (РА 23:5313) е by 
40. VERNON, Sanus Е. “Occupational Norms Eus 293 
Мїпше Be ayy lh) Minis Test.” Occupational Psy 
A ag he ЫЫ AND paan Тоня B. геа 
selection in the UN HE X73 pondon: niversity o! 
td. . Pp. 324. 24:77 на 
Чэ, Fovips, б. А, 39 riations in the Intellectual uis 
of Adults: Their Assessment and Significance, рр, ternational 
stract. In Proceedings and Papers of the Twelfth "s Tinburghy 
Congress of Psychology Held at the University of & Boy 
gny 23rd gA 29th, 1948. еннан Scotland: Oliver 
td., . Pp. xxviii, 152. Paper. “ + mental 
43. Роз, G. A. anp Raven, J. C. "Am Experiment 
Survey With Progressive um (1947)." Brit 
1104— 'so. 25:10 " T 
pO TE Male, Mental Hospital Ps 
tients." J Mental Sci 96:726-33 Jl 'so. * (PA 2 сы of Re- 
45. Herm, A, W., AND Wattace, J. G. “The "etelligence 
eatedly Retesting the Same Group on the Same a 8) 
est." Q J Exp Psychol 2:19-32 F i * (PA 35 озо Deaf.” 
46. Oranj P. “A Study of ASET о и 
: . 24:6 ўе 
күс Алы түн Е 
the Progressive Matrices Test. aster’ , chol 
London (London, England), RESET Brit J Ed Psy : 
E 'sr, * :4812, title o: к AN 
^ 2350158 9s g COLLABORATION witH N. O Core High; 
дүй Cnawronp, “The Abilities of Adolescent ап О 'so. * 
Grade Male Defectiyes.” J Mental Sci 96:888-007 ba 
nalysis 


us icati rial A: 
. . E. "An Application of Facto "I5 
to ‘the Em vut LOS Ttems." Brit J Psychol, Stat Sect 3 т 


*so. * i -Analysis 
EG etre AND SINHA, UMA. “An Item pt 
of the Progressive Matrices Test." Brit J Psychol, 


4:91-4 Је 's1. * (РА геш) hods їп Testing 
. "Improved Methods Q 'st- 
cols Нон, OaD py Ат J Mental Def $6:349-53 ^ 
3 : m itu 
Кы 9. С. R, anp Pest, E, A. “Predicting Apt 


1 


417 | 


TESTS & REVIEWS: INTELLIGENCE—GROUP 314 


E 2 ———————————-—-————-—— 


i репе Occupational Psychol 25:192-9 Jl ’51. * (PA 
26:367 ы 
ГҮЛ 8'connor, N., AND TIZARD, d “Predicting the Occupa- 


tional Adequacy of Certified Mental Defectives." Occupational 
Psychol 25:205-11 Jl '51. * (РА 26:3487) 


CHARLOTTE BANKS, Department of Psychology, 
University College, London, England. [А re- 
view of Sets A, Ab, and B.] 

This edition of Progressive Matrices is usu- 
ally known as the “Coloured Matrices.” It has 
been specially adapted for use with children of 
5 to 11 years of age. This has been done by 
interposing, between Sets A and B of the 1938 
scale, anew set of problems Ab, of approximately 
the same difficulty as those in A or B, and by 
reproducing a large number of the items in bril- 
liant colours. Thus, of the total of 36 items, only 
12 are, in fact, new. This version of the test is 
presented in both booklet and board form, though 
it may be purchased only in the booklet form; 
the board form can only be borrowed from the 
Crichton Royal Department of Psychological 
Research “for a limited period," and only “for 
certain experimental purposes.” The owner of 
the test claims that “for the majority of routine 
purposes, when coloured grounds are used, the 
Book and Board forms of the test give prac- 
tically the same results.” 

The Coloured Matrices have been produced 
in order that a wider dispersion of scores may 
be obtained from children of this age range than 
was obtainable with the 1938 edition of the test. 
Thus, a great deal depends on the efficacy, for 
this purpose, of the 12 new items in Set Ab. Un- 
fortunately, no item analysis, and, still more im- 
portant, no measures of scatter have yet been 
published. It is not possible, therefore, to state 
whether or not the coloured version does in fact 
fulfill the main purpose for which it has been de- 
signed. However, comparison of the scores given 
at the end of the manual to the coloured edition, 
with published figures obtained with the 1938 
edition, suggests that the range and standard 
deviation of total scores for the age range of 
5 to 9 may not be appreciably greater for the 
1947 version. Insertion of the twelve new items, 
Set Ab, would seem to have raised the mean 
scores, but not to have increased the scatter to 
any great extent. 

Norms in percentile form are given separately 
for the book and board versions of the test ; those 
for the book form are based on results of ap- 
proximately 600 children, for the board form on 
the results of about half this number. “For prac- 
tical purposes,” the percentile figures are classi- 
fied into five grades, ranging from Grade I or 


“intellectually superior,” the top five per cent, 
to Grade V or “intellectually defective," the bot- 
tom five per cent. 

Reliability coefficients are quoted only for the 
results of children under 7 and children aged 9. 
With the children under 7, we are told, both 
forms of the test “have, so far, shown a low re- 
test reliability in the neighbourhood of 0.65. * 


` By the age of 9, the re-test reliability of both... 


has been found to increase to at least 0.80.” The 
tests were repeated after an interval of two 
months, and the numbers of children tested were 
small: “groups of only 35 to 100.” The author 
claims that, with children under 7, both forms 
of the test correlate about .5 with the Crichton 
Vocabulary Scale and the Revised Stanford- 
Binet Scale; and, with children aged 9, about .65 
with these two other tests. 

Detailed and helpful instructions are given 
in the manual for administration of the test, 
which may be given as a group, individual, or 
self-administered test. When given with the 
Crichton Vocabulary Scale, the two tests to- 
gether usually take about half an hour to com- 
plete. The author also claims that the test is 
suitable, not only for use with children, but also 
with “persons who are for any reason mentally 
sub-normal or impaired," and with those “suf- 
fering from partial paralysis, deafness or de- 
fective speech." However, no results for groups 
such as these have been published. 


W. D. Watt, Reader in Education, University 
of Birmingham, Birmingham, England. [Re- 
view of Sets A, Ab, and В.] 

This is a development of the Progressive 
Matrices (1938) in a form suitable for children 
and for mental defectives. This new test con- 
tains additional problems to those in sets A and 
В of the original version, all the problems are 
presented in colour to make them more attrac- 
tive, and the whole can be given individually 
either in booklet or in board form. For most 
children over the age of eight, the author claims, 
the test can be fairly satisfactorily used as a 
self-administered or group test. — | 

The rationale of the test, like that of others 
of the series, is that it tests, with а series of in- 
creasingly difficult pattern completion problems, 
the "intellectual capacity to form comparisons 
and reason by analogy." 

Administration and scoring, either as an indi- 
vidual or as a group test, is simple; and in the 
writer’s experience the problems are attractive 
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to children—especially in board form. It is, how- 
ever, important to realise that the test is still 
experimental and that too great a dependency 
cannot as yet be placed upon the norms, nor 
indeed upon the general reliability of the test 
for use with very young children or low grade 
defectives. The norms given in the handbook ac- 
companying the test are based only upon 608 
Scottish children for the book form (age range 
5-11, roughly 50 per 6-months age group), and 
on 291 Scottish children for the board form (age 
range 5-IO, roughly 32 per 6-months age 
group). Reliability with young children is not 
high (test-retest .65 with children under seven). 
The author claims, however, a test-retest relia- 
bility of .9 over the whole range of development 
for which the test is constructed but states 
frankly that this is based on relatively small 
groups. 

Clinically, the author claims, the test will dif- 
ferentiate clearly between different degrees of 
intellectual deficiency or impairment. In associa- 
tion with the Crichton Vocabulary Test, he sug- 
gests that it can be used to assess intellectual 
deterioration. 

The test is likely to be a valuable adjunct in 
the clinical study of young children, defectives, 
cases of deterioration, and possibly—though 
much research is required here—in the differ- 
ential diagnosis of cerebral injury. It can, how- 
ever, be regarded as an adjunct only; even with 
the addition of a vocabulary scale, it is not a 
substitute for more thorough measurement and 
diagnosis. 


Сковсе WzsrBv, Head of the Psychological 
Laboratory, University College of Hull, Hull, 
England. , 
PROGRESSIVE MATRICES (1938), SETS A, B, 
C, D, E. The concerted attack by Spearman, Burt, 
and the British “g” school of factor analysis upon 
the empirically composed heterogeneous test as 
developed by Binet and his American followers 
secured its first indubitably spectacular success 
in Raven's Progressive Matrices (1938) devel- 
oped in his work with Penrose with mental de- 
‚ fectives at the Colchester Institution. Here was 
a test simple to administer, usually enthralling, 
and universally acceptable to dull and clever, 
child and adult, friendly and suspicious alike, 
claiming to provide a "measure of a person's 
capacity to form comparisons, reason by analogy 
and develop a logical method of thinking regard- 
. less of previously acquired information." More- 
” 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 418 


over, the testee need not have it presented as an 
"intelligence test," and he need not work under 
time pressure, valuable assets indeed in clinical 
work with disturbed personalities. АП subjects 
seem to derive motivation from the built-in train- 
ing in each set of tasks. A highly intelligent 
schizoid subject described it as “like a cross- 
word puzzle with shapes, but it gets more diffi- 
cult and interesting instead of getting simpler 
and boring.” 

A generation should be time enough to assess 
even a test as original and as far reaching as this, 
especially now that clinics have amassed con- 
siderable experience, and data аге to hand from 
the testing in British Armed Forces in the Sec- 
ond World War. Unfortunately, the test seems 
to have been little worked in the U.S.A., an un- 
justifiable neglect of an unusual contribution of 
which even Raven’s British critics may feel in- 
clined to complain, for there is no doubt in this 
reviewer’s mind that the method developed by 
Raven is capable of the finest refinement in test- 
ing practice. 

The test is described in the current (1950) 
manual as follows: “P.M. 1938 is a non-verbal 
test of a person’s capacity at the time of the test 
to apprehend figures presented for his percep- 
tion [Spearman’s Principle of Experience], see 
relations between them [Spearman’s Eduction 
of Relations], and conceive the correlative fig- 
ures completing the systems of relations prê- 
sented [Spearman's Eduction of Correlates]. 

Sixty well drawn “matrices” or patterns are 
divided into five sets (A, B, C, D, and E) of 12 
problems each and are well printed in strong 
covered booklets which can be used again and 
again. Each matrix is a network of logical rela- 
tions between simple and more complex visual 
forms, mainly of geometrical design; and each 
matrix has a "gap" which has to be filled by in- 
dicating on the printed score sheet the number 
of the correct choice from the alternatives printe 
below the matrix. The relations within the ma- 
trix usually allow of more than one mode 
analysis of the problem. The aim of the seit 
designer was to produce five sets of items prO 
gressively graded in difficulty both between n 
within sets and of sufficient range of complexity 
to discriminate in a short testing time (about a 
minutes at the 5oth percentile) a sample of eu 
general population in Raven's words vint 3 
fancy to maturity." This is an ambitious proJ s 
which one is not surprised to find has been ane 
in better perspective by the designer himself, bY 
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his own subsequent development out of PM 1938 
of the two more specialized 1947 tests. 

Factor analysis in the Services suggests that 
the test is an almost pure "g" test with a small 
loading of some spatial perceptual factor and the 
latest data for its reliability agree exactly with 
the figure of .88 which Services testing revealed, 
adequate perhaps, but certainly to be improved 
and not supporting the original claims of the 
designer. Slater's probit analysis and indeed 
Raven's own admission show that too many easy 
items have resulted in a markedly skew distri- 
bution; and a recent study suggests that too 
many poor items and a low general factor be- 
tween the five sets (despite apparent homoge- 
neity) militate against very high validity. The 
crucial test, however, for any selective instru- 
ment is the external criterion; and the weak- 
ness of a theoretically based "g" test is usually 
just that it claims to measure a psychological 
"thing" rather than to predict successful be- 
havior in some more or less special social situa- 
tion. The author warns us against taking the 
test as one of "general intelligence," but has 
nevertheless himself proceeded on some such 
basis to draw what appear to be inappropriate 
conclusions about “The Development and De- 
cline of Mental Ability" (reviewer's italics) and 
specifically describes his five rough screening 
grades as “intellectually superior" (at or above 
the 9sth percentile), “definitely above average 
in intellectual capacity" (between the 75th and 
95th percentiles), “intellectually average" and 
"definitely below the average in intellectual 
ability" (at or below the 25th percentile), and 
“intellectually defective" (at or below the sth 
percentile). : 

Without accepting in detail the supporting 
argument, it is certainly better as Raven has 
aimed in his test design, “to evaluate an adult's 
ability in terms of the percentage frequency with 
which a similar degree of ability is found to 
occur amongst people of the same age." This is 
another way of formulating Wechsler’s well 
known claim that intelligence be defined in terms 
of the normal distribution of individuals of the 
same age group as the testee. 

An important claim is made that the score 
pattern, that is, discrepancies between actual 
and expected scores on each of the five sets for 
a given total score, “have considerable psycho- 
logical significance,” but there 1s still no data 
published to support this clinical claim after 15 
years experience with the test. It is clear that 
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the streamlined nature of the test, which is one 
of its special features, limits its usefulness as 
compared with the Wechsler-Bellevue Intelli- 
gence Scale in providing appreciable informa- 
tion of a clinical nature for score profiles. 

Now our hypotheses on the nature of intelli- 
gence, our preferences for "oligarchic,". “hier- 
archic,” or “democratic” theories, need not en- 
tirely determine our evaluation of this test. It 
is quite evident that reliable, carefully con- 
structed, and standardized intelligence tests de- 
vised upon the most diverse theoretical bases 
range a given population in much the same 
order. But even if we accept the psychological 
reality of a unitary and general “g” factor of 
relational thinking, it does not seem entirely per- 
missible to assume that geometrical matrices are 
entirely culture free in our geometry ridden ma- 
chine civilization. It is even more likely that 
cultural factors might account for group differ- 
ences in respect of “abstraction-tolerance” within 
our own western societies. This would be ex- 
pected from the apparent differential decline in 
intelligence between those adults whose jobs 
make no demands on abstract relational think- 
ing and those whose jobs do. It might be well to 
use the test with greater scepticism about its 
freedom from practice and cultural influences. 
Unfortunately, as the designer may well answer, 
it is impossible here in the sphere of mental 
testing, as elsewhere, to find the “neutral non- 
sense-syllables” which would provide the perfect 
counters to manipulate, and Raven may be said 
to have done as well as is possible to avoid effects 
of previous learning and established attitudes, It 
might, however, be well to consider a programme 
of tailoring the test to the testee as shown in 
case history and interview wherever this is pos- 
sible and practicable as it is sometimes in clinical 
work, There is much to be said for the common 
practice in guidance work in Great Britain of 
using the test with a good mixed scale or verbal 
test. It is certainly imperative at the very least 
to use the Matrices, as the designer suggests, 
with a vocabulary test such as his Mill Hill Vo- 
cabulary Scale. This reviewer would stress the 
above considerations rather than the reason ad- 
vanced by Raven for this practice, viz., that vo- 
cabulary is a “measure of present mental func- 
tioning” as against the normally untimed Matri- 
ces score which he claims measure “mental ca- 
pacity.” The graceful will o' the wisp of "innate 
capacity” is perhaps even more enticing in the 
geometrical jungle, but recently published re- 
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sults of the Heim’s experiments at Cambridge 
and other evidence seems to suggest that con- 
siderable gain in scores may be expected in well 
motivated repeated testing in PM 1938 without 
any knowledge of results. Moreover, it does not 
seem possible to draw a clear-cut logical distinc- 
tion between capacity to reason by analogy as 
in the PM 1938 and capacity to recall informa- 
tion by finding a word similar in meaning to a 
given word. The relational thinking here is no 
less evident than in some of the simpler items in 
the PM 1938; it is merely that the counters are 
different, verbal instead of perceptual. 

Whatever weight is given to the above criti- 
cisms, the test must be considered as having 
opened a most important exploratory path in 
purely nonverbal testing and to have established 
a special place for itself in Great Britain as a 
highly convenient rough grading instrument for 
the mass testing of adults as used in the British 
armed forces with a 20-minute time limit. It is 
a pity that in devising the new tests it has not 
been found possible to improve the test further 
by better grading of the items, but it may still be 
recommended warmly to our American friends, 

PROGRESSIVE MATRICES (1947), SETS I, II. 
This test is a forthright attempt to develop out 
of PM 1938 an instrument which will discrimi- 
nate usefully a population of testees within the 
top quarter of a sample of the population. The 
construction of the new test and the way in which 
it is presented reveal a spirit both critical and 
empirical, this latter especially welcome per- 
haps in view of the traditional theoretical ap- 
proach of pure “g” test construction. (Progres- 
sive Matrices (1947), Sets A, Ab, B, isa similar 
attempt to deal with the lowest quarter.) If we 
must deplore the fact that it has not yet been 
found possible to improve the PM 1938 to any- 
where near the limit of its possibilities, the de- 
signer is to be congratulated upon the develop- 
ment of these three refinements of the original 
test, each of more limited aim, but each distinctly 
promising and frankly and scientifically pre- 
sented. The first two tests which constitute Parts 
I and II of PM 1947 are based upon 
PM 1938 and an earlier version of the 
high level test first drafted in 1943 for use 
in British War Office Selection Boards. The 
final 1947 version is composed of items very 
similar to the 1938 test and is published with 
the claim that it provides *a means of assessing 
all the ordinal, analytical and integral opera- 
tions involved in the higher thought processes" 
© 


and that it “differentiates clearly between people 
of even superior ability." In so far as "speed of 
accurate intellectual work" (as Raven defines 
“Mental Efficiency") is a factor in any training, 
the test is designed, when given with a time 
limit of 40 minutes, to distribute scores normally 
and provide useful information concerning “а 
person's probable rate of progress and success 
in any course of study." Raven, however, also 
suggests that since mental efficiency so defined 
varies with health and practice, it is most impor- 
tant to be able to obtain “а more valid estimate 
of a person's real capacity for intellectual ac- 
tivity under normal conditions where clarity of 
thought and accuracy of judgment, rather than 
speed of work during a fixed period of time, are 
important." This alternative purpose of the test 
may be achieved, he claims, by giving it without 
time limit. 

Part I consists of 12 matrices designed to 
cover in method all the different types of prob- 
lems the testee will meet in the 48 problems of 
Part IL. It is essentially a pretest training, but 
it is held to be in itself a useful quick test (10 
minutes) for rough grading the normal popu- 
lation into those below the roth percentile, those 
above the goth percentile, and those between 
these limits. The items of Part II are probably 
rightly claimed to be of such difficulty “that no 
one of even outstanding intellectual capacity 
[“efficiency” surely?] solves them all in less 
than half an hour.” Estimated norms only are 
available as yet for Part II; for ages 11.6-12.6 
above the 75th percentile, for ages 13-30 above 
the soth percentile, for ages 30-40 above the 
75th percentile only. The difficulty of establige 
ing accurate norms around the g5th percentile 
is emphasized, but in due course it is hoped Йй 
establish reliable norms for the g5th percentile 
to the soth. i al 

Useful prepublication work on the interpa 
consistency, reliability, and validity of the tes 1 
based on the testing of 1,844 school children "ei 
adults is reported in some detail in the шары 
a commendable but rare feature in the ipar 
publication of a test. Unsatisfactory Pomme 
tions between percentages of passes and ш 
scores appear in a few tests and an alterna! s 
key or. a “cook,” as the chess problemist Ше 
describe it, appears in connection with pro 

6. 
л Four problems are placed too high, four i 
too low, and minor improvements in respec не) 
the groups of alternatives might be made in 
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spect of ten of the items. Test-retest reliability is 
unsatisfactory below 11 years at .76, not good 
at 12.6 years at .86, but with adults of more than 
average ability Raven's claim is again high at .gr. 
No really satisfactory conclusions as to va- 
lidity are possible on the basis of school children 
aged 10-13 years, (the only studies reported 
besides the data on university undergraduates 
which do not bear directly on the question), but 
the author is no doubt well aware of this in- 
adequacy, though we must impatiently await 
further validatory evidence. It should be added 
that the figures given for the distribution of 
scores among university students suggest an ap- 
proximately normal distribution about a mean 
of 33-34 right out of 48 items with a standard 
deviation of about four and no ceiling effects. 
There can be no doubt that this is a valuable 
new instrument of research and the inclusion of 
critical analysis in the first edition speaks well 
for the scientific conscience of the designer and 
is a good augury for its future development. 
There is undoubted need for an acceptable, prac- 
tical, reliable, and valid test for predicting high 
level intelligent behavior as required in uni- 
versity education and in technical and mana- 
gerial situations. This reviewer, however, is not 
satisfied that it is the best plan in tackling this 
problem to set out by isolating “g” intelligence, 
defined as relational thinking, from the other 
requirements of complex social-intelligent be- 
havior evidenced in the above: situations for 
which we are selecting. It seems that if “g” tests 
must be supplemented in high level entry selec- 
tion in the Armed Forces and the Civil Service 
in order to ensure right choice (47), it is equally 
likely that we shall need some sort of battery or 
conceivably a portmanteau test using a variety 
of problem “counters” for manipulation (e.g. 
verbal, numerical, and pictorial besides geometri- 
cal) and also perhaps tests of memory function- 
ing with different counters as well as tests of 
special interests. The 1947 Matrices, it would 
seem, ought to evolve an even more stripped and 
streamlined model to take its place in such a 
battery. In this respect, it has the advantage in 
its simplicity of administration and construction 
over the Heim Ahs with a similar aim. It is also 
perhaps more easily pruned down to say a 30- 
minute testing time. On the other hand, and this 
may very well be a crucial defect, it tests with 
only one set of counters and so covers less of the 
ideal testing breadth at this level. At present and 
until adequate validating data are available for 


both tests, it is impossible to know whether 
either is superior. 

PROGRESSIVE MATRICES (1947) SETS A, AB, B. 
This test is designed to disperse the scores of 
children from 3 to 10 years of age or of the men- 
tally defective or impaired. An extra set of items 
is introduced to link Sets A and B of the 1938 
scale by problems of intermediate difficulty, and 
most of the problems are printed in colour to 
make the task more attractive to young children 
although in the reviewer’s experience this was 
scarcely necessary and may be a distracting and 
irrelevant stimulus. The test can also be used 
in а formboard version which does seem to serve 
the requirement of an activity method for the 
more efficient testing of preschool children. It 
can easily be given without verbal instructions. 
The test designer himself is rightly sceptical of 
the predictive value of tests of children below 
the age of 6, but is moderately sanguine about 
the Board Form of the Matrices. It is recom- 
mended especially for the qualitative informa- 
tion it may give by “rendering visible the process 
of approximations by trial and error.” Warning 
is given that for judging mental deficiency as 
legally defined in Great Britain it must be sup- 
plemented by tests of attainment. 

The norms at present available for this test 
are unsatisfactory, being based upon only 291 
Dumfries school children, and it is not shown 
how these few cases are distributed over the age 
range from five to ten years. Regional differ- 
ences established during Services testing placed 
Glasgow and Scotland lowest of the main Brit- 
ish geographical areas and although there are 
pitfalls in drawing conclusions from these fig- 
ures, Raven himself expects that norms for a - 
general population of English children will 
probably be higher. However, with the estab- 
lishment of regional differences among adults, 
the question naturally arises whether for some 
selection purposes the logical continuation is not 
to construct norms regional to the population 
from which selection is made. This is not so 
much a criticism of this test as of the tacit as- 
sumption of most testing programmes. 

In general conclusion, a word of appreciation 
on the part of the many British users of the 
Matrices Tests seems appropriate. The develop- 
ment of these instruments by the Psychological 
Research Department of the Crichton Royal is 
a fruitfully specialized piece of work which 
ought to be known more widely in America. In 
the opinion of this reviewer, the complicated and 
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intimidating arrangements made for the safe- 
guarding (in itself a laudable object) of the 
PM 1947, Set II do not encourage such wide 
recognition; this is a pity. The annual loan 
charge seems excessively high besides being 
highly questionable in principle. 


For reviews by Walter C. Shipley and David 
Wechsler of the 1938 edition, see 3:258; for a 
review by T. J. Keating of the 1938 edition, see 
40:1417. 

[315] 
*Revised Alpha Examination, Forms 5 and 7. 
Grades 9-12 and adults; 1932-51; Forms 5 ('32), 7 
(^33) ; revised manual ('51) ; $3 per 25; 35€ per speci- 


men set; postpaid; 2144(30) minutes; revision by 
F. L. Wells; Psychological Corporation. * 


REFERENCES 

1-7. See 3:220(43, 53, 54, 62, 64, 65, 67) 

iF 8. ^ oxi к? йн, бы in Industry." B Can Psychol 
ssn 5: о2 D '45. 20:203. 

9. 1 ees с. a pictures of the Kiina.” J Rehabil 13:4-10 
Je '47. * (PA 21:363) A n 1 

10. TuppenuaM, Reap D. “Soldier Intelligence in World 
Wars I and IL." Ат Psychol 3:54-6 Е '48. * (PA 22:2954). 

11, Gurvitz, Mitton Ѕоіомох. An Experimental Pplica- 
tion of Wechsler-Bellevue Type Tests in an Attempt to Dis- 
criminate and Di se Psychopathic Personality Types Resi- 
dent in a Penal Institution. Doctor’s thesis, New York Uni- 
versity (New York, N.Y.), 1949. Abstract: Microfilm Ab- 
stracts 10:234-5 no 3 'so. * 

Dart WOoLrrE, Director, Commission on Human 
Resources and Advanced Training, 2101 Con- 
stitution Ave., N.W., Washington, D.C. 

This test consists of eight parts: addition, 
arithmetic problems, common sense questions, 
same or opposite words, mixed-up sentences, 
number series, verbal analogies, and following 
directions. Several of these item forms are 
rarely used now. A few of the individual items 
are ambiguous, for example: 

' If it rains when you are starting to go for a doctor, 

what should you do? 1, stay at home. 2, take a 
raincoat. 3, wait until it stops raining: 


- Some are out of date, for example: 


A dealer bought some used cars for $1000. He sold 
them for $1,200, making $20 on each car. How 
many cars were there? 

-| Directions and test items appear on the same 
page. Ап examinee who gets the idea of a test 
form quickly can solve a number of the items 
while the examiner is reading the directions. An- 
swers are to be written in the test booklet. Thus 
the booklet cannot be re-used and the tests must 
be scored by hand. 

By current standards, the statistical informa- 
tion given in the test manual is quite inadequate. 
“About nine hundred high school freshmen were 
given both Modified Alpha Examination Form 
9 and one of these tests," according to the man- 

. ual. Percentile norms for high school freshman 


boys and girls were set by formula on the basis 
of Form 9 and norms for high school senior boys 
and girls obtained by extrapolation, also on the 
basis of Form 9 norms. That is all the informa- 
tion given about the norms. 

The Personnel Research Section of the Ad- 
jutant General's Office during World War II 
administered Revised Form 5 to “768 Army en- 
listed men selected on the basis of Army General 
Classification Test scores to be representative 
of all enlisted men, The correlation of scores on 
Alpha 5 and AGCT for this population is .9o." 
The correlation of .90 indicates that the test 
would still serve fairly well the military classifi- 
cation purposes for which it was developed. Al- 
though AGCT has long since deservedly re- 
placed Alpha, this correlation is a tribute to the 
pioneers who devised the World War I test. 

There are many newer tests and tests more 
specifically designed for special purposes. These 
can be used in most testing programs. But Alpha 
will always be the test to use in showing stu- 
dents how our group testing developments 
started. And if one wants to use it practically, 
under appropriate conditions it is still a good 
test. 

[316] 

Revised Alpha Examination Form 6, Short 
Form. Grades 9-16 and high-grade adults; 1933; à 
short form of the experimental edition of Revised 
Alpha Examination, Form 6, consisting of subtests 2, 
4, 7, and 8; Form 6, Short Form; no data on reliabil- 
ity and validity; no description of normative popula- 
tion; $1.65 per 25; 35¢ per specimen set; postpaid; 13 
(20) minutes; revision prepared under the supervision 
of C. R. Atwell and F. L. Wells; Psychological Cor- 
poration. * 

REFERENCES 


т. ATWELL, Снлвівѕ R., AND Wetts, Е. L. “Army АШЫ 
Reviaed—Short Form.” Personnel J 12:160-3 О '33. 
11844 


Epwin R. Henry, Personnel Research Ad- 
visor, Employee Relations Department, sae 
ard Oil Company (New Jersey), New York, 
New York. — 1 

This test may be “easier and shorter to 3€ 
minister and score" but there are no data to 1" 
dicate that it is “more appropriate for educa- 
tional and business use," nor that it is апу jus 
“out of date" than original Army Alpha. Noe 
Tor the general population start at 50th E. 
centile (N is not given) and those for СА 
school freshmen and seniors were derived fro 
another test. 


3 n uman 
Dart Wotrte, Director, Commission on H a | 
Resources and Advanced Training, 2101 C 
stitution Ave., N.W., Washington 25, D^ 
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Revised Form 6 of the Army Alpha is a 
shortened version consisting of only four tests: 
problems, same or opposite words, following di- 
rections, and analogies. This form is intended 
for use with high grade adults and high school 
students. In format, method of administration, 
and scoring, it resembles Revised Alpha Exami- 
nation, Forms 5 and 7. Percentile norms for 
high school freshmen and seniors are given with 
no more explanation than “as derived from 
Modified Alpha Examination Form 9." Even 
greater trust is shown in the method of securing 
general population norms. Those norms (for 
scores from the 5oth to the ggth percentile) are 
presented simply “аз obtained by F. L. Wells." 
No other statistical information is given in the 
manual. 


[317] 
*Revision of Army Alpha Examination. Grades 
7-16 and adults; 1925-47; Forms A (746), B C 5); 
manual ('47) ; no data on reliability ; $3 per 25; 35 per 
specimen set; postpaid; 2174(35-40) minutes ; revision 
by Elsie O. Bregman; Psychological Corporation. * 
REFERENCES 
т. Lorce, Irvine. “A Table of Percentile Equivalents, for 


Eight Intelligence Tests Usa Ure With Adults. 


Appl Psychol 20:392-5 Je '36. * 10:5552) 
‚2. BITTNER, REIGN ay Ks WORCESTER, pi A. “Тһе Selec- 
tion or ме o the Wim Беу SOR J Consult Psychol 
3:57-64 Mr— B 13:382 Г 

атин ХА, “А Comparison of the Ratings of Men- 


tally Retarded Prisoners on Army Ape and Stanford-Binet.” 
І 


J Appl Psychol 23:608-11 О '39. * 4:1106) 

' Epwarp Е. Советом, Professor of Psychology, 
The University of Tennessee, Knoxville, Ten- 
nessee. 

The manual states that: 

Each revision of the Army Alpha Examination, 
Form A and Form B, is a combination of the five forms 
of Alpha used in the United States Army during World 
War I. The questions best for general use have been 
selected by order-of-merit method. To make these 
revisions more suitable for use with civilians than any 
of the original forms, items addressed particularly to 
men recruits have been excluded and military terms 
modified. Otherwise the revisions are identica with 
the original forms. The revised examinations are соп- 
structed to be of average difficulty of the original Army 
Alpha and to yield average scores. 

The statistical data on which these statements 
rest are not given in the manual. Taking them at 
face value, the test is still essentially the original 
Army Group Examination Alpha. The World 
War I Army norms are given in the manual, 
along with norms for grades 7-12 inclusive (N’s 
1372 to 4126), freshmen at Ohio State Uni- 
versity and at the College of the City of New 
York, and student nurses. Norms are given for 
total scores and also for total scores omitting 
the administratively troublesome Directions Test 
(Test 1). Most of these norms are out of date, 


and the author rightly mistrusts an attempt 
(reported in the manual) to equate the scores 
on this test indirectly to those of the Army Gen- 
eral Classification Test of World War II. 

The manual gives no data on reliability or 

validity. Since more recent group intelligence 
tests are more convenient to administer and are 
provided with more modern interpretative aids, 
we may conclude that this test, in common with 
the earlier revisions of Army Alpha which tried 
to preserve the original World War I norms, is 
now less useful than these more recent tests, even 
though it probably measures about the same 
combination of abilities with about the same re- 
liability. 
Epwin R. Henry, Personnel Research Ad- 
visor, Employee Relations Department, Stand- 
ard Oil Company (New Jersey), New York, 
New York. 

If one has need for a form of Army Alpha 
different from the originals, this is probably as 
good as any. According to the manual the relia- 
bilities should be better than the originals but 
no data are presented. Norms for grades 7—12 
were collected before 1931, those for college 
freshmen at Ohio State, before 1926. Norms for 
freshmen at CCNY and for student nurses are 
undated. The author draws attention to possible 
changes in norms since hers were collected. 


[ 318] 

*SRA Non-Verbal Form. Grades 9-12 and adults; 
1946-47 ; formerly called SRA Non-V erbal Classifica- 
tion Form; preliminary school edition; IBM; 2 edi- 
tions; т form, '47; manual (’47) ; 50¢ per specimen set 
of any one edition; cash orders postpaid; 10(15) min- 
utes; prepared by Robert N. McMurry and Company ; 
Science Research Associates, Inc. * 
п) FORM AH (HAND SCORING EDITION). $2.65 per 25. 
B FORM AM (MACHINE SCORING EDITION). $2.65 рег. 
25; $2.90 per 100 IBM answer sheets; so¢ per set of. 
scoring keys. 
W. D. Commins, Associate Professor of Psy- 
chology, Catholic University of America, Wash- 
ington, D.C. 3 

A nonverbal “objective index of student in- 
telligence" might seem at first glance to have 
much to recommend it. Unfortunately, serious 
criticisms can be made of all tests of this kind. 
We scarcely know whether "student intelli- 
gence” has anything of practical value left over 
after “verbal” intelligence has been subtracted. 
It is possible to think of some ideal "intelligence" 
without academic associations, but in practice 
we commonly find that the available nonverbal 
test is basically not so different from the verbal 
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variety. The present test, for example, based as 
it is upon the classification of pictured objects 
and figures, contains many examples of a princi- 
ple of classification that is strongly academic. 
Many studies suggest that our ordinary non- 
verbal tests are not less, but actually more, in- 
fluenced by cultural factors than are verbal tests. 
The manual of the present test gives no validity 
data bearing on the "purpose of the SRA forms," 
measuring ability to learn, career aptitude, school 
course aptitude, college caliber, etc., and where 
the nonverbal test might be successful in these 
respécts, it probably duplicates the verbal kind 
but measures ability with a typically lower re- 
liability. The time limit of the present test, IO 
minutes, seems unduly short for an adequate 
measure of intelligence, and other available non- 
verbal tests offer a greater variety of items. 

The present reviewer doubts the value of non- 
verbal tests in general. Research is needed to 
tell us what they actually measure and to give 
us concrete validity data. Instead of using the 
SRA Non-Verbal Form as “а re-check on those 
students scoring below Percentile 25 on the 
Verbal Form," as the manual recommends, it 
would be better, in the opinion of the reviewer, 
` todepend on the best group verbal and individual 
tests to improve the accuracy of measurement 
within the school situation. 


For an excerpt from a review, see 3:261 (in 
the first printing of The Third Mental Meas- 
urements Yearbook, the excerpt is incorrectly 
listed under 3:260). 


[319] 

*SRA Verbal Form. Grades 9-12 and adults; 1946- 
47; formerly called SRA Verbal Classification Form; 
an abbreviated edition of Thurstone Test of Mental 
Alertness (see 3:265) which is an abbreviated adap- 
tation of American Council on Education Psychologi- 
cal Examination for High School Students, 1940 Edi- 
tion; preliminary school edition ; 3 scores: quantitative, 
linguistic, total; IBM; 2 editions; 1 form, "47; manual 
(47) ; 50€ per specimen set of any one edition; cash 
orders postpaid; 15(20) minutes; Thelma Gwinn Thur- 
stone and L. L. Thurstone; Science Research Associ- 
ates, Inc. * e 
a) FORM AH (HAND SCORING EDITION). $2.65 per 25. 
b) FORM AM (MACHINE SCORING EDITION). $2.65 per 
25; $2.90 per 100 IBM answer sheets ; 50€ per set of 
scoring keys. 


W. D. Commins, Associate Professor of Psy- 
chology, Catholic University of America, Wash- 
ington, D.C. 

This test has a number of features that would 
recommend its use for class sectioning, personnel 
selection, or other survey purposes when con- 
venience and speed are desirable. It is self-ad- 


ап objective index of student intelligence 
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ministering, has a short time limit of 1 5 minutes, 
is easy to score, and the items are clearly phrased, 
For many group comparisons, it would seem 
adequate. There are, however, some features of 
the test, its standardization, and published norms 
that would incline one to doubt its ability to 
discriminate between individuals in manly cases 
that would have educational and practical sig- 
nificance. The time limit of 15 minutes seems 
unduly short, not necessarily for the test as it 
stands, but as a feature of any adequate test of 
intelligence. The Q and L scores, separately de- 
rived for “quantitative” and “linguistic” intelli- 
gence, can scarcely be accepted at the present 
time as offering any practical distinction. The 
value of the age norms, from 12 to 17 inclusive, 
is difficult to estimate when all the manual tells 
us is that “500 representative students at each 
of the age levels” were employed to furnish the 
data. Whether they were representative as stu- 
dents or as age groups should be known to the 
user of the test. Р 

The amount of mental growth shown by this 
test between the ages of 14 and 15 is presented ] 
as being about one third as great as that be- 
tween 15 and 16. This would be a very peculiar 
and anomalous finding if the standardization 
groups were representative of age. They might, 
of course, represent student years, but it would 
be well to have the facts on this, as well as оп ш 
the scaling of the items and the steps separating 
them. s 

The use of the "standard deviation" method of 
finding the IQ should make the user of this test 
very careful in his comparison of the IQ so found 
with an IQ calculated in the Binet manner. Thus, 
a 12-year-old child who would receive an 10 
of 114 according to the method of the manua 
would obtain an IQ of 125 if we computed M 
MA from the conversion table in the manua 
and divided it by his CA. The reliability vd 
efficients as published are high, but since we d 
not know the range of ability used in the calcu- # 
lation, it would be more meaningful if we [^ 
given the standard error of measure d 
Further^work on scaling of the items and dit E 
obtaining and reporting evidence of validity 
seems desirable. 


WiLuis C. SCHAEFER, Institute / i d 
Human Relations, Washington, ЕС: SRA 
The SRA Verbal Form (as well as the 94 


Non-V erbal Form) isa test designed “to ch 
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is “sometimes called the ability to learn, to solve 
problems, to foresee and plan, to use initiative, 
to think quickly and creatively.” It is noted that 
neither the term “test” nor “intelligence” ap- 
pears in the title, nor, for that matter, does the 
term “intelligence” appear anywhere in the test 
booklet. This practice in test construction of 
avoiding words which may provoke emotional 
attitudes to the detriment of some students’ per- 
formance in the testing situation is an example 
of a small consideration which may be of marked 
practical consequence for fair evaluation. 

The excellent overall appearance of both test 
and manual in terms of content, typography, 
format, and usefulness is understandable in 
terms of the long experience of the Thurstones 
in test construction and the modern publishing 
standards of Science Research Associates. The 
page of fore-exercises presents adequate instruc- 
tions and practice for self-administration. The 
test is designed in spiral-omnibus form consist- 
ing of blocks of seven items ; each cycle or block 
contains two same-opposite, one arithmetic- 
reasoning, two vocabulary-recall, and two num- 
ber-series items. Items increase in difficulty 
throughout the test. A total of 84 items with a 
strict 15 minute time limit makes this a speed 
test for which students are warned that they are 
“not expected to finish in the time allowed.” 
While the four item types would permit separate 
part scores, only language (L) and quantita- 
tive (Q) subscores are computed following the 
practice with ACE from which this form was 
developed. The sum of these two part scores is 
the “index of intelligence” raw score. Both hand 
and machine scored forms are identical in con- 
tent, thus utilizing the same normative tables 
and interpretations. The hand scoring form 
utilizes a carbon transfer to a precoded answer 
sheet which greatly simplifies scoring. In fact 
since carbon marks cannot be duplicated by di- 
rect pencil, and since the carbon marks cannot be 
` easily erased, this form may be scored by the 
person tested minimizing any possible cheating. 
The direct equivalence in interchangeability of 
these two forms is a very useful feature in the 
practical situation. In addition, the availability 
of the supplemental SRA № on-Verbal Form for 
cases in which there are reading difficulties, or 
for cases scoring very low and requiring recheck, 
makes this group of test forms widely useful 
as a series of coordinated measuring instru- 
ments. 

Jormative data for part, total scores, 


and 


TESTS & REVIEWS: INTELLIGENCE—GROUP 


320 


Non-Verbal Form converting directly to centile 
ranks, quotient ranks (IQ equivalents), and 
stanine ranks are very effectively presented in a 
single table. Score equivalents are presented 
separately for each chronological age group from 
12 through 17 and over. This manner of pre- 
senting extensive standardization data in a rela- 
tive simple format for maximum usefulness is 
recommended as a model for test publishers. The 
basis for this standardization, unfortunately, is 
limited in manual presentation to the statement 
that the “Conversion Table was prepared from 
scores made by 500 representative students at 
each of the age levels. It furnishes the range of 
general intelligence found in a typical cross- 
section of the student population." It seems de- 
sirable to demand that test standardization data 
include specific characteristics of the groups 
which are sampled. A general statement which 
implies representative sampling of the (entire) 
student population when it may in fact be only a 
sample of those students who happened to take 
a particular examination in 1940 does not meet 
the high standards which test publishers speak 
about for themselves. 

An additional feature of broad general use- 
fulness in interpreting general intelligence test 
scores is the inclusion in the test manual of а 
Job Chart showing the average and interquartile 
range of centile scores for 150 major occupa- 
tional groups. While these data were obtained 
on a different test, the Army General C lassifica- 
tion Test, in World War II, the equivalence of 
centile ranks for these two tests may reasonably 
be assumed as a working hypothesis in test score 
interpretation, It would be desirable for this 
hypothesis to be clearly pointed out in the man- 
ual together with some estimate of the correla- 
tion between these two tests. 

In general the SRA Verbal Form is an ex- 
cellent test, well constructed, self-administer- 
ing, simple scoring, and broadly useful. Where 
a short form (20 minutes) estimate of general 
intelligence with possible part scores for linguis- 
tic and quantitative abilities is desired, the test 
can be highly recommended for general use. 

" [320] 


*Scovill Classification Test. Adults ; MELU an 
adaptation of Army Group Examination dipha (19) 
and Army Group Examination Beta (’20) ; 2 parts; 1 
form; no data on reliability and validity; norms (30 

—publishers recommend the use of local norms; man- 
ual ['47]; 35€ per specimen set; postpaid ; prepared by 
Scovill Manufacturing Co.; Psychological Corpora- 
tion, * 7 
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а) PART т. Illiterates and literates; can be administered 
in pantomime; $4.25 per 25; (30) minutes. ? 
b) part п. Literates; $3.30 per 25; (20) minutes. 


REFERENCES 


1. Ромо, MILLICENT, AND Brits, Marton A. “Intelligence 
and Clerical jobs Held: Two Studies of Relation of Test Scores 
to Job Held.” Personnel J 12:41-56 Је 33. * (PA 7:4734) 


Rosert С. BERNREUTER, Director, The Psy- 
chological Clinic, and Professor of Psychology, 
The Pennsylvania State College, State С ollege, 
Pennsylvania, 

This is an early test which was based upon the 
original Army Alpha and Army Beta and 
brought up to date by more recent norms, The 
manual presents norms for populations ranging 
from 1,924 to 1,939. Of the seven scored parts, 
four are from the Beta (picture completion, 
number checking, digit symbol, and paper form- 
board) and three are from the Alpha (arithme- 
tic reasoning, synonym-antonym, verbal analo- 
gies). The eighth unscored part is a test of hand 
coordination. The final score is the average of 
the converted scores on each part of the test, 
which permits any combination of the verbal 
or nonverbal tests to be used. For the convenience 
of the administrator the tests are published in 
two separate booklets. Normally only the first 
booklet is used with illiterate subjects; both 
booklets are used for those who can read and 
write. 

No attempt has been made to make the ad- 
ministration of the nonverbal tests any easier 
than it was for the original Army Beta. Demon- 
stration boards must be locally made for use with 
'the four nonverbal tests. This arrangement per- 
mits the use of pantomime instructions with only 
a minimum of verbal instructions but requires 
awkward equipment and a considerable amount 
of practice on the part of the administrator in 
order to give the tests in a standardized manner. 

The manual does not give any data on re- 
liability and validity. However, the well known 
validity of the original parts of the Army Alpha 
and Beta apply to this test also. 

The norms provided in the manual are based 
upon large industrial populations from one com- 
рапу only and cover jobs which are not de- 
scribed. Consequently the recommended pro- 
cedure of constructing local norms for each new 
industrial user must be followed. , 

The Scovill Classification Test is.an adaptation 
of the originalArmy Alpha and Army Beta but 
it is no improvement over the original tests. No 
attempt has been made to utilize the knowledge 
of test construction which has been developed 
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since 1918. As а consequence this test can be 
recommended for use only in industrial Concerns 
where local norms have already been con- 
structed. Other industrial concerns which are 
now initiating testing programs will undoubtedly 
prefer to use some of the newer tests that are 
now available. 


Epwarp E. Cureton, Professor of Psychology, 
The University of Tennessee, Knoxville, Ten- 
nessee. 

This test is composed largely of items from the 
Army Alpha and Beta tests of World War I. 
One booklet, designed to be used with both lit- _ 
erate and nonliterate subjects, contains picture 
completion, number checking, digit-symbol sub- 
stitution, paper form board, and cross-out tests 
(the last an almost pure motor-speed test). À 
second booklet, for literate subjects only, in- 
cludes arithmetic reasoning, same-opposite, and 
verbal analogies tests. . 

The applicability of the number checking, 
digit-symbol substitution, and cross-out tests as, 
indicators of general intelligence is open to ques- - 
tion, but they may still be useful as indicators of 
performance on particular jobs. ШҮ. 

No data on reliability or validity are given in 
the manual. Normative data are confined to 
samples from one company and are presented 
(aside from one table) only in somewhat crypti- - 
cally labelled graphs. The test may still be valu- 
able, however, to those who wish to conduct 
their own validity studies and prepare their own 
norms. Few other tests include so wide a апр 
of subtest types in a single battery. д, x 

Test authors should study the directions for 
administration given in the manual for this test. 
They are models of simplicity, clarity, and Wi 
herence to the basic principles of group test a 
ministration. 


[321] 
*The Simplex Group Intelligence Scale, Be 
and over; 1922-39; І form, '39; no data on г KA aa 
manual ('34) ; tentative norms; 85. 6d. per 25; NS 
per scoring stencil; rs. 3d. per manual; 35. Pd 
men set; purchase tax (British purchasers sy 
postage extra; 90(95) minutes; C. A. Ri 
George G. Harrap & Co. Ltd. * 


poy [322] Ў 
*[The Simplex Junior Intelligence Те 
KE 1932-51; 2 forms (listed below) ; purci As (50) 
(British purchasers only) and postage extra iy Со 
minutes; C. A. Richardson; George G. Harrap. 
Ltd. * ‚ i 
а) THE SIMPLEX JUNIOR INTELLIGENCE ea 19321 
6s. рег 25; 6d. per single сору; 8d. рег тап эт. 10204 
b) THE SIMPLEX JUNIOR 'A' INTELLIGENCE Eds 
355. per 100; 6d. per single copy; Is. per ma 


sts.] Ages 
chase tax 
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REFERENCES 


т. Kerr, Севтворе. “The Progressive Matri i 
to School Children.” Brit J Psychol, Stai Sect чен A plied 
(PA MN) James T. "A Consi f 
2. Wi А " onsideration of the Abili 
Algebra. ; SHEER SCORES SEXT Entrants. A aos 
si lasgow University lasgow, Scotlan 1 Abst: E 
Brit J Ed Psychol 20:65-6 К *so. * (PA Mandy ini 


[323] 
Southend Group Test of Intelligence. Ages 10.5- 
13; 1939; 1 form; no data on reliability and validity ; 
no description of normative population; 7s. 6d. per 25; 
той. per handbook; postage extra ; selected from tests 
originated by William Stephenson; George G. Harrap 
& Co. Ltd. * 


REFERENCES 


1. Peet, E. A. "Symposium on Selection of Pupils for Dif- 
ferent Types of Secondary Schools: VI Evidence of a Practical 
Factor at the Age of Eleven." Brit J Ed Psychol 19:1-15 F 


"49. * 
Gertrupe Keir, Lecturer in Psychology, 
University College, London, England. 

This test is “designed to grade normal chil- 
dren who are about to enter the Senior School * 
It is not suitable for children who are markedly 
dull and backward.” It consists of eight subtests, 
four verbal and four nonverbal, each verbal sub- 
test being followed by its,nonverbal equivalent. 
Each subtest is preceded by a teacher’s demon- 
stration on the blackboard. The tests fall only 
into two types, classification and analogies. The 
test as a whole suffers from a certain monotony 
ag the number of mental processes involved is 
imited, 

The manual contains instructions for the ad- 
ministration of the test, a key to answers and 
scoring (laid out in a somewhat involved way) 
and tables for the conversion of scores into mental 
ages and IQ's. It gives no details as to test con- 
struct on and standardisation procedures. 

The test items, selected from tests already de- 
vised, were arranged in order of difficulty and 
administered to the entire school population of 
Southend between the ages of 10% and 13 
years. “The scores are medians of each of the 
monthly age groups * The S.D. in terms of 
crude scores lies at between 8 and 9 points. * 
The upper limits of mental age were assessed 
by comparing scores after application of this 
test and the 1938 revision of the Binet Scale to 
the same subjects." * The reliability of the test 
is said to be.93 (split half method). No informa- 
tion is available as to whether this is the cor- 
rected figure, nor is there any on the sample on 
which the study of reliability was carried out. 
The author states that “Factor analysis suggests 
it discriminates more on the non-verbal items 
than the verbal.” The test correlates .7 both 


1 Personal communication. 
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with Jenkin's Non-Verbal Test and with Ra- 
ven's Progressive Matrices. The size of the sam- 
ple used in the correlational studies is unknown. 
Details of the results of item analysis are not 
available. i 

The scanty information regarding the con- 
struction, standardisation, reliability, and validity 
of this test should make one hesitate to use it 
as a valid and reliable measure of intelligence. 


For an excerpt from a review, see 40:1423. 
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*Terman-McNemar Test of Mental Ability. 
Grades 7-12; 1941-49; a revision of Terman Group 
Test of Mental Ability (see 40: 1424); IBM; Forms 
C (41), D (41) ; manual (49); $1.90 per 25; 35¢ per 
specimen set, postpaid; separate answer sheets may 

used; 856 per 25 IBM answer sheets; 20€ per stencil 
{ог scoring answer sheets; postage extra; 40(45) min- 


utes; Lewis M. Terman and Quinn McNemar ; orld 
Book Co. * 
REFERENCES 


1-25. See 40:1424. 

26-50. See 3:263. noa 

$1. TRASTER, STELLA. A Study of the iid of the 
Scattering of the Scores of the 10 Subtests of the Terman rou 
Intelligence Tests—Form B in Relation to Psychopathic Tend- 
ency in 102 Students of the Fredonia, Kansas Higi School, 
Master's thesis, University of Kansas (Lawrence, Kan.), 1927. 

52. TOLL, CHARLES H. “Scholastic Aptitude "Tests in Am- 
herst College.” Sch & Soc 28:524-8 О 27 '28. * (PA 3:532 

53. TRIGG, CHARLES WAILDERMAN, An Analysis of Individual 
Responses to the Items of the Terman Group Tests, Form 
Master’s thesis, University of Southern California (Los An- 
geles, Calif.), 1931. Pp. 132; { , 

4. RICHARDSON H. D. “The Intelligence Quotient and Sec- 
ondary-School Guidance." Sch R 43:49-59. а 35. * 

ss. MURPHY, LAURA Wuirz. "The Relation Between Me- 
chanical Ability Tests and Verbal and Non-Verbal Intelligence 


Tests.” J Psychol 2:353-66 ^36. * (PA 11:3928) М 
ip D L. 153 Influence of Level of Motivation on 


56. Hartson, 
the Validity of Intelligence Tests.” Ed d Psychol Meas 
§:273-83 au "45. * (РА 20:2500), Ў 
. TILTON, T ‚ “The Relation Between IO and Trait 
Difference as Measured by Group Intelligence ests.” J Ed 
Psychol 38:343-52 O "47. * (PA 22:2066 AR 
ALTER. “Intelligence Examinations, in 


Peru With the Lima Revision of Terman's Group Test, J 
Genetic Psychol 73:251-67 D "48, + (PA 23:3106 р 

59. Davis, W. 'ALLISON. ano HAVIGHURST, ROBERT J. "The 
Measurement of Mental Systems: Can Intelligence Be Meas- 
ured?" Sci Mo 66:301-16 Ap '48. * (PA 22:0980 М 

60. Bayiey, Nancy, “Consistenc and ariability in „the 
Growth of Intelligence From Birth to Eighteen ears.” J 
Genetic, Psychol 75:165-96 D.'49. * (PA sr 

61. YELA, MARIANO. „Аррісаіов of the Concept of Simple 
Structure to. Alexander's ata," Psychometrika 14:121-35 Je 


bs 11066 
a. da [LR T. А Comparison a Results of Three In- 
telligence Tests. Test Service Notebook, No. t1. Yonkers, N.Y.: 


World Book Co., [1951]. Pp. 4. Paper. • 

For reviews by Carl I. Hovland апа Robert 
L. Thorndike, see 3:263; for reviews by Anne 
Anastasi and Howard Easley of the original edi- 


tion, see 40:1424. 
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xTests of General Ability: Conperatime Inter- 
American Tests. Grades 1-3, 4-7, © 13; 1950; IBM 
for grades 4 and over ; 3 levels ; 2 editions ; English edi- 
tion: Forms AE, BE; Spanish edition: Forms AS, 
BS; tentative norms; $2.50 per 25; so¢ per series 
manual; cash orders postpaid; 50¢ per RA set, 
postpaid; prepared under the auspices О the Com- 
mittee on Modern Languages of American Council on 
Education, Herschel T. Manuel, Director of Test Con- 
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struction 4 Cooperative Test Division, Educational 
Testing Service. * 

а) PRIMARY LEVEL. Grades 1-3; 2 scores: oral vocabu- 
lary, total; nontimed (3 ) minutes. 

b) INTERMEDIATE LEVEL. Grades 4-7; 3 scores: non- 
verbal, verbal, total; ІВМ ; separate answer sheets must 
be used; ооё per 25 IBM answer sheets; per set of 
stencils for scoring answer sheets; 34(50) minutes. 

с) ADVANCED LEVEL. Grades 8-13; 3 scores: nonverbal, 
verbal, total; IBM ; separate answer sheets must be 
used; 90¢ per 25 IBM answer sheets; 30€ per set of 
stencils for scoring answer sheets; 34(50) minutes. 


REFERENCES 

1. SrovALL, FRANKLIN L. An Evaluation of Three Non- 
Verbal Tests of Mental Ability. Master's thesis, University of 
Texas (Austin, Tex.), 1942. : 

2, McCraniz, JOSEPHINE., A Study of Four Inter-American 
Tests Applied to High School Seniors. Master's thesis, Uni- 
versity of Texas (Austin, Tex.), 1944. 

3. KeLLey, Frances. A Study of the Inter-American Tests 
at the High School Level. Master's thesis, University of Texas 
(Austin, Tex.), 1945. 1 

4. WILLHAUK, Tcr C. A Study of Inter-American Tests 
in Relation to the Stanford-Binet, Master's thesis, University 
of Texas (Austin, Tex.), 1946. Э 

4 Zaccaria, MICHAEL ANGELO, An Item Analysis of the 
Advanced Inter-American Tests of G Ability. Master's 
thesis, University of Texas (Austin, Tex. b 

6. Dunnam, Cuartes V. A Study oj 
Tests of General Ability With Special 
Ame even and Eight, Doctor's thesis, 

C o Tex.), 1947. 


1946. 

the Inter-American 
‘erence to Children 
niversity of Texas 


The Teaching of English in Puerto Rico, pp. 171-313. 337-410. 
for th 1 on а. San Juan, 
P.R.: Department of Education Press, 1951. Рр. xix, 410. * 


RatetcH M. Drake, Professor of Psychology 
and Head of the Department, Kent State Uni- 
versity, Kent, Ohio. 

With the large number of general mental 
ability tests available nowadays, one must ask 
what justifies the appearance of a new series on 
the market. The only claim that this series le- 
gitimately has is the fact that it offers identical 
tests in English and Spanish from the primary 
level to the level of the first year in college. There 
is no evidence stated, implied, or evident that 
any improvement has been contributed to the 
field of intelligence testing. 

Most of the test content is of nonverbal ma- 
terial Scoring directions mention verbal and 
nonverbal scores, but the norms in the manual 
do not provide any means of interpreting or 
comparing them. In fact, in spite of the fact that 
the tests have been given to 6,000 pupils in 
Mexico, 20,000 pupils in Puerto Rico, and 10,- 
ooo pupils in the United States, the only norms 
data consist of "approximate medians" for 
grades one to twelve. These medians are based 
on the results of testing done in 1943 with pre- 
liminary forms of the tests which were later 
revised but not given again. The original me- 
dians have since been adjusted for changes made 
in final forms of the tests. 

There is in the manual considerable discus- 
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sion of validity in general and of the Steps that 
were taken to insure the validity of all the tests 
in the Inter-American battery. The statement js 
made that in the construction of the several tests 
“an exercise which did not discriminate satis. 
factorily (between high scorers and low scorers) 
was rejected or revised.” No statistical data are 
offered to substantiate this statement, and no 
correlations with other tests, with grades, or 
with ratings are reported. The meaning of re- 
liability is also discussed in the manual, but not 
one reliability coefficient is presented ; it would 
seem that for tests administered as early as 1943 
ample data would have been provided before 
this for extensive studies of both reliability and 
validity. 

The authors have tried to select test items 
which, since they are either nonverbal or verbal 
analogies employing identical concepts in both 
languages and identical mental processes, are 
apparently free from cultural influences. How- 
ever, whether the content of the tests is actually 
culture free is questionable. If it is, and if the 
tests are equivalent in difficulty for both lan- 
guage groups, then, according to the medians 
provided, children in the United States are st 
perior in general ability to children in Mexico 
and Puerto Rico, and children in Mexico are 
superior to children in Puerto Rico. One won- 
ders whether the tests are actually equiva- 
lent, whether children in the United States are 
superior in intelligence to children in Mexico 
and Puerto Rico, whether the sampling was 
equivalent for the three groups, whether the mo- 
tivation was different for the three groups, 
whether other cultural factors are involved, oF 
whether the differences may be due to a combi- 
nation of several of these factors with each one 
contributing an unknown proportion to the pi 
effect. No mention is made in the manual of M 
wide national differences in the scores p : 
children tested so far, or of d problems of m 
terpretation posed by these differences. 

T is to pal а that these tests, already 
given to a total of 36,000 subjects, have a s 
companying manual which offers so little й 
what опе needs to know if the tests аге P 
used, and the results interpreted, intellige: 
The manual states only that the norms Ea a 
tremely approximate and will be replace TR 
soon as it is feasible." It is hoped that Mis 
adequate norms will be supplied, and ood dies 
tests will be subjected to many critical $ 
to determine their usefulness. If it is foun 
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their content is relatively culture free and that 
satisfactory validity and reliability can be dem- 
onstrated, there is no obvious reason why the 
tests should not be offered in German, Chinese, 
or any other language, and so ultimately fulfill 
a purpose far beyond that for which they were 
intended. But all that rests upon research that 
is to come. For the present, these tests can only 
be used with extreme caution. 


Warrer N. Durost, Director, Test Service and 
Advisement Center, Dunbarton, New Hamp- 
shire. 

In order to “accent the positive” this reviewer 
would like to state initially that, while fully 
aware of the difficulties involved, he believes 
heartily in the type of international test develop- 
ment represented by the Inter-American series. 
The fact that the committee was aware of these 
difficulties is clearly stated in the manual and at 
other appropriate places. This is about as much 
as one can say, however, for the material pro- 
vided by the publishers for the reviewer's use. 
Direct correspondence with Herschel T. Man- 
uel, chairman of the committee responsible for 
the construction of the test, brought additional 
information. The availability of this informa- 
tion does not free the publishers from the re- 
sponsibility which is solely theirs for publishing 
the tests for general use without adequate sup- 
porting information. If experimental editions 
are to be made available for further study (and 
these must be considered experimental at the 
present time), due cautions and restrictions on 
their use should be exercised by the publishers. 

Validity. Validity is never general: it is al- 
ways specific to the situation in which a test is 
to be used, and data presented can be generalized 
only on logical, not statistical, grounds. The evi- 
dence in this case indicates that great care was 
taken to secure parallelism in Spanish and Eng- 
lish editions from the point of view of vocabu- 
lary, intrinsic difficulty of the concepts involved, 
etc. This care should do much to heighten the 
meaningfulness of comparisons from one lan- 
guage group to another. However, the ultimate 
usefulness of the instruments calls for their vali- 
dation in both Spanish and English speaking 
situations following along lines already estab- 
lished by the series of masters and doctoral 


1 The most pertinent information came from Chapters, 10 
and 11 of The Teaching of English in Puerto Rico e by Rob- 
ert Н. Fife and Herschel T. Manuel, pus made available, 
in manuscript form, by Pablo Roca irector, Division of е; 
search and Statistics, Department of Education, Government о 
Puerto Rico. 
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studies being carried out at the University of 
Texas. 

From the point of view of validity, it seems 
clear that these tests are.superior to tests cur- 
rently available in the United States for measur- 
ing mental ability for Spanish-English groups. 
Unfortunately, little specific evidence has been 
made available concerning the validity statistics 
obtained at the time of test construction, such as 
distributions of the phi coefficients which were 
computed or even distributions of difficulty 
values in English or Spanish, or both. 

Some data have been made available concern- 
ing the subsequent use of the test. In a study 
by Dunham (б) intercorrelations of the parts of 
the Primary test are reported to fall between .23 
and .55; and intercorrelations of the parts of 
the Intermediate test, between .28 and .57. Cor- 
relations of total scores (for the Primary test 
presumably) with the Otis Quick-Scoring Test 
of Mental Ability were from .69 to .76; with 
Binet mental ages, from .42 to .71. Otis scores 
are reported to correlate with Binet mental ages 
between .51 and .71. Since the implication is that 
several values were computed without clear-cut 
definition of what all the variables are, these 
data are not so valuable as they might be. How- 
ever, it would appear that they are typical of 
other similar correlations in the literature. The 
same investigator reports split half reliability 
coefficients of between .69 and .94 on single 
age groups. Nothing is said about correcting 
the test intercorrelations for attenuation, so 
presumably the values reported are uncor- 
rected. 

In another study by Willhauk (4) the Inter- 
mediate test was correlated with the Binet on 
the basis of the results of 50 fifth and sixth grade 
pupils in Austin, Texas. The correlation with 
the Binet was .75 ; with school marks the correla- 
tion was .58. 

In another study by McCranie (2) Form BE 
of the Advanced test was given to 212 Louisiana 
high school seniors. This investigator found a 
split half reliability coefficient of .88 on this 
group. She also found a correlation of .35 with 
high school marks. Correlations with other tests 
in the Inter-American series are reported as 
follows: with Reading, .51; with Natural Sci- 
ences, .47; with Social Studies, .56. 

In still another study, Kelley (3) reports cor- 
relations between .40 and .64 for the Advanced 
test with the subject matter tests in the series. 
She also reports correlations between .26 and 
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.29 with total school marks but a correlation of 
.44 with algebra marks alone. 

One can best summarize the reported informa- 
tion on the correlation of scores on the Tests of 
General Ability with achievement measures 
of various types by saying that they fall within 
the typical range of such values. They provide 
no basis for thinking that these tests are any 
better than other widely used intelligence meas- 
ures for measuring learning capacity in the typi- 
cal American school. y 

RELIABILITY. The manual gives no statistical 
evidence on reliability for the present tests, say- 
ing “Because the Cooperative Inter-American 
Tests are being offered for general use for the 
first time, reliability statistics are not available 
at present.” This is inexcusable from the point 
of view of good publishing practice, but the 
situation is somewhat retrieved by the avail- 
ability of data on the earlier experimental edi- 
tions even though these editions were somewhat 
different in content. Pertinent data from Table 
26 of the Fife and Manuel report indicate that 
the within-grade reliability coefficients range 
from .86 to .93. The nature of the grade groups 
or the number of cases on which these reliability 
coefficients were based is not available. How- 
ever, other reports of reliability contained in the 
same study on more clearly identified groups are 
consistent with the information given. One can 
conclude that the tests probably are about “run- 
of-the-mill" with respect to reliability. 

NORMS. The normative data provided in the 

manual are of little practical usefulness. This 
means that the interpretation of the scores, for 
the present at least, must be based upon local 
distributions. So long as the interpretation is 
concerned only with variations within the popu- 
lation tested, this is not necessarily bad, but it 
must be considered a weakness in the tests with 
respect to good publishing standards. 

MECHANICAL DETAILS. One would be happy 
indeed if he were able to say that mechanically 
the tests are all that they should be. Instead, the 
writer feels that the art work is not very good 
and that sometimes the intent of the picture is 
very hard indeed to determine. Even granting 
that the primary purpose in a test of this sort is 
representation, there is need for great care to 
prevent confusion of meaning. Larger type size 
could have been used for the printed sections of 
the test without increasing the amount of space 
required. ў 

Nothing is done to simplify the use of the 
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separate answer sheet in spite of the fact that it 
will normally be used with populations not gen- 
erally testwise. Cycling the numbering of re- 
sponse spaces through a cycle of ten digits in- 
stead of five would be helpful. For example, it 
would be better if the numbering of the alterna- 
tives for Item 1 ran 1 2 3 4 5; for Item 2,678 
9 10; for Item 3, 12 3 4 5; for Item 4678 
9 10; etc. This simple device tends to break up 
the possibility of an individual misplacing his 
answers by one set of answer spaces and thus 
having all of his answers scored as chance re- 
sponses. 

Nothing is done to make the scoring keys, 
which are designed primarily for machine scor- 
ing, easy to use for hand scoring. The inclusion 
of guide arrows for positioning the answer key 
over the answer sheet would greatly improve 
this situation. 

CONCLUSION. At the present time, one must 
conclude that there is very little about the tests 
as presented in the materials available which 
would argue in favor of their use in this country 
in place of widely used standardized intelligence 
measures. However, their continued use with 
populations of bilingual children at the borders 
of English- and Spanish-speaking countries 1s 
certainly desirable. Even though the results may 
not be so immediately useful as they might be, 
the tests undoubtedly represent the best that 15 
available for use in Spanish-speaking countries. 
With the task of making good tests for use in 
a unique situation having been carried so i 
would appear only reasonable that the work be 
carried to the point where it will receive the y 
fessional recognition which it deserves. P ү 
be hoped that much additional research wi of 
forthcoming to implement further the jes 
these tests, and that the publishers will do "im 
is necessary to secure more adequate reliability, 
validity, and normative information. 
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*Thurstone Test of Mental Alertness. cada ^ 
12 and adults; 1943-48; an abbreviated айа Exam- 
American Council on Education Psychologica tr. 4 
ination for High School Students, 1040 editions: 
scores: quantitative, linguistic, total ; IBM; e set of 
I form, '43; manual (48); sof рег specim minutes; 
any one edition; cash orders postpaid ; 20(25) пе; Sci- 
Thelma Gwinn Thurstone and L. L. Thurstone; 
ence Research Associates, Inc. * 8s per 25; 
а) FORM AH (HAND Кын EDITION). $1.85 
r set of scoring folders. 2 
ge AM MARE SCORING EDITION). 52 En 
IBM test-answer booklets; $2 per set of scor! 
REFERENCES Success 
iction of College Success 
seis sen, Kaneen Unzan, Prediction of Cents Mon 
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КЕЗЕ ТА CAbetracts of Theses 1040, О ыйл. 
кн Chokes, delen Program” 7 Я Вазе: 
211602767 Блан 22235 а Relating to Art School 
Success." J Ed Res 43:376-84 Ja 'so. * (PA 24:4811) 

For reviews by Anne Anastasi and Emily T, 


Burr, see 3:265. 
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V.G.C. Intelligence Indicator. Grades 3-8, 7-12; 
1943; an adaptation of Henmon-Nelson Tests of Men- 
tal Ability (see 299) ; 2 levels; Forms A, B; prelimi- 
nary manual, revised; $1.95 per 25 of any one level; 
separate answer sleeves must be used; 40€ per 25 an- 
swer sleeves for any one level; 10¢ per manual for any 
one level; 85¢ per specimen set of any one level; post- 
age extra; available only to Canadian schools; 30(35) 
minutes; adaptation by M. D. Parmenter; Vocation: 
Guidance Centre. * 


GromcE А. Frercuson, Professor of Psychol- 
ogy, McGill University, Montreal, Canada. 
These tests are an adaptation of the Henmon- 
Nelson Tests of Mental Ability. The 9o test 
items in each form are identical with those of 
the Henmon-Nelson and include vocabulary, 
sentence completion, verbal analogies, figure 
analogies, and the like. They differ, however, in 
format, method of administration, and scoring. 
The test forms are printed on heavy card- 
board. The testee records his answers on a sepa- 
rate cardboard answer sheet or sleeve which is 
fitted over the test form and subsequently in- 
serted into a plastic sleeve for scoring purposes. 
The format and the scoring method are in- 
genious and should have application to other 
types of tests. My impression is, however, that 
this test does not represent an improvement, in 
terms of ease of scoring, over the original forms 
of the Henmon-Nelson which used the Clapp- 
Young scoring system with its carbon impres- 
sions. The advantage of this test over the Hen- 
mon-Nelson resides in economy. The test forms 
are of substantial manufacture and may be used 
many times. In testing fairly large numbers of 
individuals, the cost would work out at very 
roughly a little more than half the cost involved 
in using the original Henmon-Nelson forms. 
Tables of mental age equivalents for given raw 
scores are provided. All data relating to the 
Henmon-Nelson such as norms, reliability in- 
formation, correlations with other tests and the 
like, are presumed to be directly applicable to 
the V.G.C. Intelligence Indicator. There is no 
reason to doubt this presumption. The differ- 
ences in format between the Henmon-Nelson 
and the present test should have little or no 
bearing on the performance of subjects. On the 


whole I prefer the format of the Henmon-Nel- 
son. If, however, economy is a consideration, 
then the V.G.C. Intelligence Indicator is recom- 
mended. 


For a review by Н. М. Fowler of the Hen- 
mon-Nelson Tests of Mental Ability, see 200; 
for reviews by Anne Anastasi, August Dvorak, 
Howard Easley, and J. Р. Guilford and an ex- 
cerpt from a review of the Henmon-Nelson Test 
of Mental Ability, see 40:1398. 
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*Verbal and Non-Verbal Test. Ages 10-11, 12-13; 
1951; 2 levels; 1 form [’51]; a practice exercise, en- 
titled Verbal and Non-Verbal Practice Test I, must 
precede administration of test; manual ['51]; 90s. 64. 
per 100 sets of test and practice exercise for any one 
level; 125. ба. per 12 sets of any one level; rs. 2d. per 
single set of any one level; postage extra; practice ex- 
ercise: nontimed (10-15) minutes; test: 40(45) min- 
utes; National Foundation for Educational Research 
in England and Wales. * 
а) VERBAL AND NON-VERBAL TEST I. Ages 12-13; no 
data on reliability. 
b) VERBAL AND NON-VERBAL TEST П. Ages IO-II; no 
data on reliability and validity. 

[329] m 

XVerbal Intelligence Test. Grades 12-16 and appli- 
cants for executive positions; 1948-50; formerly called 
Verbal Intelligence Test for Business Executives; 1 
form ’s0—same as test copyrighted in 1948; manual 
(49) ; supplementary manual ['50] ; по data on relia- 
bility; $2.50 per 25; 50¢ per specimen set; sample test 
free to qualified test users; cash orders postpaid; 20 
(25) minutes; William J. Morgan and Antonia Mor- 
gan; Aptitude Associates. * + 
WiLLIAM B. $5снкАРрЕв, Assistant Director, De- 
partment of Statistical Analysis, Educational 
Testing Service, Princeton, New Jersey. 

This vocabulary test is designed to be accept- 
able in form and content to well educated adults. 
The items are printed across the center spread 
of a 4-page booklet in clear, blue, 12-point type. 
The front cover is used for identifying informa- 
tion, and the back cover presents an adequate 
set of instructions. The words which constitute 
the test do not appear to be excessively rare, 
pedantic, or technical. On the whole, this test 
should make a favorable impression on the per- 
son taking it. 

Five items, each of which includes 10 words 
to be matched by a synonym or antonym from 
a list of 12 words, comprise the test. The total | 
possible score is 50. In four of the items, each key 
word is substantially different in meaning from 
the other key words; the last item, however, has 
one alternative (No. 7) which is synonymous 
with two key words and with one alternative. 
This would seem to require an excessively subtle 
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discrimination for a test of this kind. The test 
does not seem to be arranged in order of diffi- 
culty. 

The 3-page manual published in 1950 is well 
adapted to the proposed uses of the test. Clear 
and sufficiently detailed instructions for ad- 
ministration and scoring are provided. Norms 
for each decile and for the 95th and 99th per- 
centiles are presented for two samples: 312 high 
School graduates and 575 college graduates. 
Both norms groups are based on adults aged 20 
to 49. It is regrettable that the authors described 
the samples as "random"; a few words about 
how the samples were obtained would have been 
far more helpful. Although a more precise state- 
ment of the limitations of the norms would be 
desirable, the data should have considerable 
value if used with common sense. 

Superior performance by college graduates as 
compared with high school graduates and rea- 
sonably high (.67 to .84) correlation coefficients 
with the Henmon-Nelson Test of Mental Ability 
and the V score of Thorndike’s CAVD are of- 
fered as evidence of validity. These correlation 
coefficients indicate at least fair reliability for 
the test, but no direct evaluation of reliability 
is presented. The absence of sex differences and 
age differences in performance within the groups 
studied, as reported in the manual, is a finding 
of some interest. The statistical studies are pre- 
sented in a condensed but acceptable style; one 
hopes that a fuller technical account will be pub- 
lished. 

Since this test is pitched at a fairly high dif- 
ficulty level, and since vocabulary has often 
proved to be a useful predictor of certain kinds 
of success, this test has considerable potential 
value. Its usefulness would be increased if the 
method used in its construction were specified 
and if studies of its validity for particular uses 
in business and industry were available. 
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*Verbal Test I. Ages 10-11; 1951; I form; a prac- 
tice exercise, entitled Preliminary Practice Test for 
Verbal Test ['51], must precede administration of test; 
no data on reliability and validity; manual [’5r]; 50s. 
per IOO sets of test and practice exercise; 75. per 12 
sets; 8d. per single set; 1s. 6d. per manua ; postage 
extra; practice exercise: 10(15) minutes; test: 50(55) 
minutes; І. Macfarlane Smith and J. Darroch; Na- 
tional Foundation for Educational Research in Eng- 
land and Wales. * 
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*Wesman Personnel Classification Test, Grades 
8-16 and adults; 1946-51; 3 scores: verbal, numerical, 
total; Forms А (746), B (747) ; revised manual C51); 
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$1.65 per 25; 35€ per specimen set; postpaid: 28 
minutes; Alexander G. Wesman; Psychological e» 
poration. * 
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Joun C. FrANAGAN, President, American In- 
stitute for Research; and Professor of Psychol- 
ogy, University of Pittsburgh, Pittsburgh, Penn- 
sylvania, 

The first part of this test consists of verbal 
analogies items of the form “blank is to water as 
eat is to blank.” Four options are provided for 
each of the two blanks, and the correct response 
is found by selecting both the number cor- 
responding to the key word in the first series, 
drink (2), and also the letter corresponding to 
the key word in the second series, food (C). The 
author points out that this design provides for 
reasoning through analogy, perception of rela- 
tionships, a wide variety of subject matter and 
a very small chance of getting the item correct by 
guessing because of the double series. He con- 
cludes this discussion with the statement, “The 
net effect of these advantages is a more efficient, 
more accurate measure of verbal reasoning 
ability than is obtained by other types of verbal 
tests.” No experimental evidence is provided to 
substantiate this statement. Perhaps the author 
meant to offer this as a hypothesis. However, it 
appears unmodified in the 1951 revision of the 
manual as well as the 1948 edition. 3 

The second part of the test contains 20 items. 
These involve the addition, subtraction, multi- 
plication, and division of integers, decimals, an 
fractions. Percentage, square root, and ratios аге 
also included. The items stress “the use of n 
merical concepts,....the ability to perceive Pes 
tionships and to operate with ingenuity,” rat E 
than “sheer figure-handling speed.” The zer 
correlation reported between the total score qe 
the numbers score of the Minnesota Clerica 
Test tends to indicate success in this regard. hk 

Throughout the test the items appear H x 
of good quality and only a very few appeare 
this reviewer as somewhat ambiguous. ied 

The manual states “The test is essentia. Оп 
measure of power rather than of speed. the 
high school senior was given the test using E 
standard time limits. He read 33 of the 40 ite 
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in the Verbal Part in the 18 minutes allowed, an- 
swering 31 of them and getting 3 wrong. On the 
Numerical Part he completed 19 out of 20 in 
the time limit with 18 correct. He answered the 
remaining 7 items on Part I and the 1 item on 
Part II in an additional 4 minutes getting 4 of 
them right. This is a change of slightly more 
than half a standard deviation in his total score. 
This one case, of course, does not "prove the test 
is not a power test for most of those who take 
it. It does suggest that the author might do 
well to supply a more quantitative and objec- 
tive statement than the sentence quoted at 
the beginning of this paragraph about the 
extent to which speed is a factor for those 
tested. 

Norms are provided for six groups of stu- 
dents, three groups of employees, and eight 
groups of applicants. These groups vary in size 
from 93 to 1,476. The median size is about two 
hundred. A paragraph description of each of the 
17 groups is provided. This information should 
be of value to test users. Certainly more of this 
type of normative information for tests of this 
type is needed, and the author and publisher 
should be commended for this detailed informa- 
tion. However, it would be useful in many cases 
to know a little more about the groups. In try- 
ing to interpret the score obtained for the high 
school senior mentioned above, it was useful to 
know how he compared with seniors in a New 
York high school. The high schools of New York 
vary greatly in the quality of their students, and 
there is no hint as to whether this is one of the 
better or worse ones. Similarly, it is valuable to 
be able to compare his score with “freshmen men 
ina small New England college of average stand- 
ing.” This point of reference seems a little vague. 
The mean score for this group is only a point 
and a half higher than the mean for applicants 
to nursing schools. ч? 

The discussions of the reliability and validity 
of the test in the manual are comprehensive and 
competent. The test is shown to have a correla- 
tion of about .70 with the Otis and Wonderlic 
intelligence tests. Although this can probably be 
accepted as typical, the failure to provide means 
and standard deviations for the groups con- 
cerned makes it difficult to estimate the effects 
of the variability of the groups on these coeffi- 
cients. 1 

In general the average scores increase with 
educational and occupational level. The data 
appear convincing on this point. However, the 
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reader is given no information as to how the 
seventeen groups were selected. 

Reliability data are provided for a large num- 
ber of groups using split half methods and for 
two groups using alternate forms. The reliability 
coefficients for the same groups are only slightly 
higher when split half procedures are used than 
when alternate forms are correlated. This sug- 
gests that speed was not a large factor in de- 
termining the scores in these two groups. The 
reliability coefficients for total scores computed 
in single grade ranges are іп the low .80's. 

Forms A and B are said to be equivalent. The 
author reports that these forms "showed very 
similar cumulative percentile distributions and 
high test-retest reliability coefficients.” It would 
be helpful if he had reported either the distribu- 
tions or means and standard deviations by which 
someone else might get an idea of what he re- 
gards as “very similar.” 

In conclusion it appears that Wesman has pro- 
vided a test which should be useful in many 
situations. He has not provided evidence that 
# is better than similar available tests. It ap- 
pears that it may be almost as difficult to show 
superiority for one “intelligence” test over an- 
other as it is to show superiority for a particular 
brand of soap or cigarettes. In spite of this com- 
petitive situation the general quality of the tech- 
nical data supplied with the Wesman Personnel 
Classification Test is excellent. It is hoped that 
authors and publishers will continue to improve 
their test manuals so that factual data will com- 
pletely replace the type of claims so typical of 
commercial advertising. 


Erwin К. TAYLOR, Director, Personnel Re- 
search Institute, Western Reserve University, 
Cleveland, Ohio. 

The title of this test implies to this reviewer 
that it was constructed primarily for use in the 
selection and placement of personnel in business 
and industry. The 1951 revision of the manual 
contains a lengthy discussion on the validity of 
intelligence tests. It is prefaced by the statement 
“їп the final analysis, its [an intelligence test] 
validity rests on the validity of the basic philoso- 
phy underlying the test and the extent to which 
the test’s content faithfully exemplifies that phi- 
losophy." 

This reviewer takes a point of view diametri- 
cally opposed to that expressed in the foregoing 
paragraph. Tests are marketed neither as ex- 
amples of basic philosophies nor to provide 
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mental gymnastics, but rather to provide de- 
vices for prediction in a given area. This pre- 
diction may concern academic achievement or 
occupational success, among many other things. 
The test consequently has no such things as ab- 
stract “validity.” It is valid for this, that, or the 
other purpose. If it is independent in a corre- 
lational sense of the criterion for a specific situa- 
tion, a test, regardless of how good a measure of 
"intelligence" it is, is useless for that purpose. 
This correlational relationship, as the manual 
appropriately notes, may be of a product-mo- 
ment, biserial, or tetrachoric nature, as the situa- 
tion requires. It may even be that a curvilinear 
relationship between the test and the criterion 
exists, although appearances of these in the avail- 
able literature are so rare as to render them in- 
consequential. 

The test manual presents three types of “va- 
lidity" evidence. First it considers the relation- 
ship of PCT to the Otis Self-Administering 
Tests of Mental Ability, the Otis General In- 
telligence Examination, and the Wonderlic Per- 
sonnel Test. These correlations on six samples 
of between 100 and 200 cases range from .68 
to .84. Secondly, it is demonstrated that the 
mean score on each of the parts, and the total, 
increases for populations of successively higher 
educational achievement. The same relationship 
is demonstrated for a number of vocational 
groups. Finally five validity studies on industrial 
populations are summarized. In three of these, 
validities are expressed in correlational terms 
and are of a magnitude adequate to justify con- 
sideration of the test on an experimental basis 
in situations where measures of verbal and nu- 
merical ability appear to hold promise of validity. 

The reliability of the test appears to be ade- 
quate for any use for which it might prove valid. 
The manual provides norms for 11 vocational 
groups ranging in size from 116 to 1,496 cases. 
Norms are also presented for six student popu- 
lations. The size of these varies from 93 to 436. 

In summary, this instrument seems to be an 
excellent experimental intelligence test. Its use 
for research purposes, where measurement of 
verbal and numerical facility is indicated, is 
recommended. Its validity has not, in this writ- 
er's opinion, been firmly established for any 
purpose, and there are no circumstances in which 
it would appear that its operational use, without 
a specific validation study, could be justified. 


For an excerpt from a review, see 3:253. 
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[332] ; 
*“West Yorkshire" Group Test of Intellige 
Ages 7.5-15; 1940; 2 forms Pao] ; no data on validity: 
no description of normative population; od, per man- 
ual for any one form; 2s. 3d. per specimen set; Postage 
uw d J. P. Tomlinson; University of London Press 
td. 


d) FORM W. 9s. 3d. per 25 sets of test and answer 
sheet ; 7d. per single set of test and answer sheet; 
arate answer sheets must be usd; 15, ЗУ. per 12 an- 
swer sheets; 54(70) minutes. 
b) FORM X. I25. per 25; 8d. per single сору; 60(70) 
minutes, 

[333] 
Word-Number Test of Scholastic Aptitude, 
Grades 4-8, 9-16; 10, 9-40; 3 scores: verbal, quantita- 
tive, total; IBM; 2 levels; 1 form, '40—same as test 
copyrighted in 1939; no norms for part Scores ; manual 
(740) ; tentative norms; $2.50 per 30 of any one level; 
Separate answer sheets may be used; 114 per IBM an- 
swer sheet; 25¢ per specimen set; ostpaid; 30(35-40) 
minutes; Н. Т. Manuel, James night, J. A. Floyd, 
and R. C. Jordan; Steck Co. * 
a) FORM X, Grades 4-8. 
b) Form A. Grades 9-16. 


JANE LorviNGER, Research Associate in Psy- 
chology, Washington University, St. Louis, 
Missouri. 

This test is an excellent example of a psycho- 
logical test. The accompanying manual is 
straightforward and explicit in describing what 
the test purports to accomplish and what evi- 
dence exists as to how well it fulfills its pur- 
poses. 

Stress is laid on the fact that the test meas- 
ures present ability. “The relative contributions 
of heredity and environment, if ascertained Е 
all, must be discovered by supplementary study. 
A number of practical illustrations are given of 
situations in which the test can be useful for 
educational and vocational guidance. 

Data on reliability and validity of the test аге 
presented with clarity. Reliability is described 
in terms of forms correlated, nature of the groups 
tested, and number of cases. A significant omis- 
sion is the time interval between testings for 
these coefficients. Standard deviations might 
well have been included also. Corrected odd-even 
correlations for single grade, with roo cases In 
each grade but one, range from .82 to .95. Ё a 
X tends to have higher odd-even coefficien 
than Form A. sie. 

Data on validity include a number of corr : 
tions of .7 and .8 with other recognized ш, 3 
intelligence, and a correlation of .4 with И 
semester grade-points of 51 college freshm я 
These correlations would also be more шаш 
ful if accompanied by standard deviations 
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a statement concerning time intervals between 
tests. 

Concerning grade norms for Form X, the test 
booklet states, “The norms for grade 8 have been 
adjusted upward on the basis of results with 
Form A.” The precise reason for this adjustment 
and how it was accomplished should have been 
stated. 

As to content, one might question the inclusion 
of “to demise” in Form A. The reviewer has 
failed to find a college professor familiar with 
the expression, and it seems to be uncommon 
even among lawyers. One item in Form X is 
also open to question. The first wrong alterna- 
tive for “to observe” is given as “keep.” There 
is, however, one usage of keep, perhaps col- 
loquial, in which the words are equivalent: to 
keep a holiday is to observe a holiday. With these 
small exceptions, the content of the test appears 
to be unambiguous and well chosen. 

The two forms of this test deserve wide use 
in view of the excellence of the content and the 
clearness of the supporting data. One can only 
hope that the modesty with which the claims for 
this test are presented will not lead test users 
to prefer tests with more pretentious but less well 
documented claims. 


INDIVIDUAL 


[ 334] 

Alexander Performance Scale: A Performance 
Scale for the Measurement of Practical Ability. 
Ages 8-18; 1935-46; individual; 3 tests; 1 form; man- 
ual (46); 1475. per set of testing materials; 25. pet 
50 record sheets; purchase tax (British purchasers 
only) and postage extra; (40-50) minutes; 
Alexander ; Thomas Nelson and Sons Ltd. * dx 
а) THE PASSALONG TEST. 1932-37; may be adminis- 
tered alone as a quick measure of practical ability ; 
(15) minutes. 
b) KOH'S BLOCK DESIGN TEST. 1919-36. 
с) CUBE CONSTRUCTION TEST. 1918-25. 
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This performance scale consists of three tests, 


the Passalong Test, a shortened version of Kohs’ 
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Block Design Test (similar to that used in the 
Drever-Collins Performance Tests of Intelli- 
gence), and the Cube Construction Test. It has 
much to recommend it. Each of the tests contains 
a series of items graded in order of difficulty, and 
the score is thus not so subject to chance effects 
as are scales containing tests that are single items. 
The test materials are attractively coloured and 
supplied in a box convenient for actual adminis- 
tration of the test. The three subtests themselves 
have proved quite useful in other contexts apart 
from the scale. 

The criticisms that can be made are on points 
of detail rather than on the scale as a whole. 

a) It has been pointed out elsewhere that with 
some subjects difficulty is experienced in the 
Block Design Test when there is a changeover 
from a 4-block to a 9-block design and from a 
g-block design to a 16-block design. In these 
cases the score may be influenced by an emo- 
tional reaction against the abrupt change. The 
inclusion of 6-block and 12-block designs would 
help to overcome this. 

b) The scoring of the С ube Construction Test 
represents an advance over the usual method 
with respect to speed only or to accuracy only ; 
i.e., the score is based on time taken and on the 
number of blocks correct. One wonders, how- 
ever, whether a similar method could not be 
developed for the Block Design Test. It quite 
often happens that the majority of a design is 
completed within a time limit, but no score is 
given because the result is not perfect. 

с) The description of the scoring of the Cube 
Construction Test is made rather confusing by 
the choice of words concerned. A simplification 
could be introduced by calling the “full” time- 
score the “actual” time-score and the so-ealled 
“actual” time-score (which has no real exist- 
ence) the “corrected” time-score. Apart from 
minor points such as these, the scale, princi- 
pally designed to measure the practical ability 
of children between the ages of 11 and 16, can be 
well recommended. 

For a review by John Cohen and excerpts 
from reviews, see 3:270; for a review by J. M. 
Blackburn, see 40:1376; for reviews by James 
Drever, T. J. Keating, and Grace H. Kent of 
the Passalong Test, see 40:1414; for related re- 
views, see 38:B293 and 36:B3. 

Р [335] 
Arthur Point Scale of Performance Tests. Ages 
4.5 or 5.5 to superior adults; 1925-47; individual; 2 
forms; (45-90) minutes; Grace Arthur. * 
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а) FORM I. Ages 5.5 to superior adults; 1925-43; 10 
tests; $78 per complete set of testing materials; postage 
extra; C. H. Stoelting Co. : 
b) REVISED FORM Пп. Ages 4.5 to superior adults; 1947 ; 
5 tests; $60 per complete set of testing materials in- 
cluding manual and 100 score sheets; $3.50 per 100 
Score sheets; $3.50 per 100 score cards; $2.50 per man- 
ual; $4.50 per carrying case (when ordered with com- 
plete set) ; postpaid; sychological Corporation. * 
1) Knox Cube Test, Arthur Revision. $3.60 per test. 
2) Sequin Form Board, Arthur Revision. $22 per 
test. 
3) Arthur Stencil Design Test I. $3 per test. 
4) Porteus Mase Test, Arthur Printing. $11.50 per 
complete set of 14 pads. Y 
5) Healy Picture Completion Test II. $22.50 per 
test. 
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МУ/плллм R. Grove, Chief, Clinical Psychology 
Service, Veterans Administration H, ospital, 
Palo Alto, California. 

It is impossible to review this scale adequately 
without relating its history of development 
through the years. It is a notable and proud his- 
tory of painstaking research that documents the 
professional life of Grace Arthur, a clinical psy- 
chologist of distinction. 

In my student days the contribution of Binet 
held the center of the stage in the embryonic 
field of clinical psychology. At that time the 
Stanford-Binet (1916) was the tool par excel- 
lence of the clinician. But discerning psycholo- 
gists of those days realized that the Binet told 
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only part of the story. A child’s abilities соц 
not always be adequately assessed by a test that 
relied so heavily upon verbal responses alone 
There were subjects who had a Specific disa. 
bility or handicap in the verbal area—such as 
deaf children, those who spoke a language other 
than English, or those with severe speech defects 
as in cerebral palsy—for whom the Stanford- 
Binet was entirely unsuitable. 

And so we were taught to supplement the 
Binet rating on a child by administering non- 
verbal tests called performance tests, Such tests 
typically involved the use of such materials as 
formboards, pencil mazes, blocks, and other 
paraphernalia of a similar sort. Using these, we 
presented the subject with a variety of tasks and 
problems believed to test his general intelligence, 
Such tests were often validated by using the 
Binet as a criterion. In the early days there was 
a profusion of performance tests from which the 
psychologist could select. Each was separately 
standardized, and there was no way to be sure 
of the comparability of the norms. Then various 
workers began to assemble groups of perform- 
ance tests into test batteries intended to be used 
together. The Arthur Point Scale of Perform- 
ance Tests was one of the better of these bat- 
teries. 

The Arthur scale consisted of two forms. 
Form II as originally standardized was designed 
for retest purposes only, and the norms i 
were applicable only to those subjects who ha 
already been tested with Form I. Although Form 
I proved fairly satisfactory from the first, Form 
II was less so because its standardization Ж 
less adequate and it had several technical w 
nesses. It was never extensively used and was 
eventually discarded. 

The paria Point Scale of Р Тате 
Tests, Revised Form II (1947) isa Шош 
revision and restandardization. In many ways 
is an entirely new scale. It consists of Шу 
tésts, each separately standardized, combi 
into a single point scale. 

The ii isa one that can be used by Т 
untrained person. The examiner needs E 
thoroughly versed in the administration 0 | 
dividual tests in a clinical setting. In ad d 
order to manage the examination smooth hr » 
adequately, he must have specific experien BS 
the use of the test materials. The materials HA 
vided are of excellent quality and are сое 
ient and easy to use. Directions for айа! 
tion are entirely adequate. The clinical 
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have been worked out by an author who has 
devoted years of study to the efficient manage- 
ment of such examinations. Instructions for 
scoring are easy to follow. The tasks presented 
are intrinsically interesting to children, and, in 
our experience, easily stimulate adequate effort. 

Norms are based upon “the scores of 968 pu- 
pils from the same middle-class ‘American’ dis- 
trict used in standardizing Form I." An effort 
was made to have the selection of these children 
representative and not biased by fortuitous se- 
lective factors. In spite of this care, Arthur be- 
lieves that Form I is slightly easier than the 
Revised Form II. This she accounts for by the 
high motivation that was characteristic of the 
children used in the restandardization. 

In my judgment, the reliability of the scale as 
a whole is probably fairly high, although the 
test manual does not present adequate data on 
this point. 

The question of the basic validity of the scale 
is less clear. Arthur relies for evidence of va- 
lidity upon the “discriminative value” of each 
subtest at successive age levels and upon “the 
agreement of ratings obtained from the Revised 
Form II, with those obtained from Form I and 
with those obtained from the Binet scale." In my 
opinion such evidence is inconclusive and un- 
satisfactory. This scale is now in need of rigor- 
ous validation studies. Arthur presents "Some 
Notes on the Use of the Scale" in the test manual 
that are helpful and suggestive. These could well 
be used as a basis for further validation studies. 
In correspondence, Arthur has indicated that 
she has other unpublished validity data. 

In closing, we should note that Arthur has 
recently completed an adaptation and restand- 
ardization of the Leiter International Perform- 
ance Scale which she recommends for use with 
the Arthur Scale, Revised Form II “to reinforce 
the measurement of the ability of preschool chil- 
dren.” This should prove a welcome addition, 
for, in our experience, the Arthur scale in both 
forms is too difficult for adequate measurement 
of dull preschool children. 


For reviews by Andrew W. Brown and Car- 
roll А. Whitmer, see 40:1379; for excerpts from 
reviews, see 3:271 and 40:1379; for related re- 
views, see 40:B830, 38:B304, and 36:Br9. 

[336] 
*Canadian Intelligence Examination, 1947 Re- 
vision. fr 3-16; 1940-47; а modification of the 


1916 Stanford Revision of the Binet-Simon Intelli- 
gence Scale; individual; 1 form, ’47; no data on relia- 


bility; 60€ per set of test materials; $1.50 per m. 1 
(47) ; $1 per 50 record forms; се вор т { 
nontimed (25-45) minutes; Harry Amoss, Charles С. 
Stogdill, and Carman E. Stothers; Ryerson Press. * 


Gwen F. AnNorp, Assistant Professor of Edu- 
cation, University of Wisconsin, Madison, Wis- 
consin. , 

"This test, published in 1940 and revised in 
1947, is basically a revision of the original Stan- 
ford-Binet Scale. The revision was undertaken, 
according to the authors, “to iron out certain 
existing discrepancies [in the Stanford-Binet 
Scale] and to adjust the content more precisely 
to the needs of both rural and urban [Canadian] 
communities.” The’ major changes indicated are 
these: 

a) “Language of instruction used in the pres- 
entation of various problems has been simpli- 
fied.” In many instances, this appears at face 
value to have also simplified the task, as com- 
pared with the task in the Stanford-Binet. 

b) “Attempt has been made more vigorously 
to define acceptable responses.” 

c) “Certain tests in which the element of 
chance seemed unduly involved have been 
dropped.” These revisions are commendable 
since they attempt to eliminate two of the weak- 
nesses of the Binet Scale—chance success and 
subjectivity of scoring. 

d) “With a view to the avoidance of local and 
literary expressions, the word list in the vocabu- 
lary test has been selected from conversational 
language common to both rural and urban com- 
munities rather than from a dictionary.” While 
there are advantages in utilizing conversational 
vocabulary, there is the possibility of less pre- 
cise differentiation in the higher levels of in- 
telligence. 

e) “To reduce the weighting heretofore given 
acquired literary skills, more mathematical and 
constructional problems have been introduced.” 
This has greatly increased the nonverbal aspects 
of the scale, a questionable result, since it tends 
to reduce its value as a predictive measure of 
academic aptitude (if this is one of the purposes 
for which the scale is to be used). It might have 
been more advantageous to maintain the verbal 
aspects and to use the Ontario School Ability 
Examination, from which the authors have bor- 
rowed a number of items, as a supplementary 
test if an estimate of nonverbal abilities was de- 
sired. 

f) “Partial answers have been allowed credits 
for the purpose of obtaining a more finely graded 
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result as well as for the purpose of eliminating 
overleniency in the interpretation of responses," 
The need for partial credits is one that has been 
recognized for some time by psychologists who 
have used the Binet scale. The actual value of 
the credits set here can only be determined 
through use of the scale. 

д) "The mean age of mental maturity has 
been fixed at 15 instead of 16. This change was 
made in consideration of the fact that practically 
all retests of children at 16 and 17 who had been 
previously tested at 8-13 showed a falling off 
in the intelligence quotient. The use of 180 as a 
maximum denominator largely rectifies this 
diminution and tends to keep the I.Q. constant." 
In his 1937 revision of the Binet scale, Terman 
also established 15 as the mean age of subjects 
I5 years and older. 

The Canadian scale was found to have a cor- 
relation of .92 with the Stanford-Binet Scale 
when administered to 1,099 children of Canadian 
school population in grades т to 8 inclusive. This 
would indicate that while this scale is radically 
revised, it gives highly comparable results with 
the Stanford-Binet. The scale has proved satis- 
factory in measuring the intelligence and the 
School ability of normal, English-speaking Ca- 
nadian children between the ages of 3 and 16, 
superior children between the ages 3 and 14, and 
subnormal children above age 4. 

This scale is well standardized and can be 
used as a substitute test for the Stanford-Binet 
Scale in Canada. The same cultural and vocabu- 
lary differences which necessitated the prepara- 
tion of the Canadian scale would preclude its 
general application for children in the United 
States, however; with these considerations in 
mind it might be used on occasion as an alter- 
nate scale for Stanford-Binet. 


For excerpts from reviews, see 3:272. 
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*The Crichton Vocabulary Scale. Ages 4-11; 
1950; individual; 1 form; 15s. рег 50; 2s. 6d. per man- 
ual; 175. 6d. per specimen set including 12 tests and a 

imen set of the complementary Coloured Progres- 
sive Matrices (1947) (see 314) ; purchase tax (British 
purchasers only) and postage extra; nontimed (15- 
20) minutes; J. C. Raven; Н. K. Lewis & Co., Ltd. 
(Also distributed by George G. Harrap & Co. Ltd.) * 


CHARLOTTE Banks, Department of Psychology, 
University College, London, England. 

The Crichton Vocabulary Scale, designed spe- 
cifically for use with the Coloured Progressive 
Matrices, “is constructed to cover as nearly as 
possible the same range of intellectual develop- 


. 
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ment,” namely, that from age 5 to age 11, In: 
deed, the author claims that it is "suitable for 
use with persons of sub-normal intelligence, ag 
well as for all children [reviewer's italics] under 
II years of age.” 

The test consists of two series or sets of words, 
which the child has to explain, each set bei 
arranged in an approximate order of difficulty, 
The words in Set 1 are identical with the first 
40 words in the Mill Hill Vocabulary Scale, 
Those in Set 2 are nearly all new, and have been 
chosen as being roughly equivalent to the words 
in Set 1, although “towards the end, the words 
of Set Two are slightly more difficult to explain 
than the equivalent words of Set One.” 

The scale is given as an individual test. When 
testing young children the words are read to 
them, one by one, by the tester. Children over 
10, however, may read the words aloud to the 
tester, explaining them as they go. 

Although the two series are claimed to be 
“equivalent,” norms are given only for the re- 
sults of the two sets added together. These norms 
are based on the results of 608 children, апае 
in the form of percentile points, which are given 
for each half year of age from 4% to 11. It is 
recommended that, as in the case of the Coloured 
Matrices, the percentiles should be interpreted 
in terms of five grades, ranging from Grade I 
or “verbally superior,” the top five per cent, to 
Grade V or “verbally defective,” the bottom five 
per cent. К я 

Test-retest reliability coefficients are given in 
the manual, but only for age 6, age 9, and for 
the years 5 to ro inclusive. These are as follows: 
for year 6, r = .87; for year 9, г = .95; and for 
the years 5 to то, r = .98. The final figure is, of 
course, spuriously high, and it is also very prob- 
able that memory plays a considerable part n 
raising the figure at age 6 and age 9. These vb 
relations are based on a very small number 0 
children. Only 35 children were tested at ages 
and 9, and only 60 over the whole range from 
5 to ro—that is, то children at each year. | 

Correlations between this scale and both t i 
Coloured Progressive Matrices and the Kevise 
Stanford-Binet Scale are also given for the id 
children. These аге: between the Crichton Sc& : 
and the Coloured Matrices, at age 6, r = -53 а 
age 9, г = .65, and for the ages 5 to IO pt 
г = 94; between the Crichton Scale and 
Stanford-Binet, at age 6, r — .84, and at age 9 
г = .79. $ :ne the 

Careful instructions are given for using 
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. test, and examples of acceptable and unaccept- 
able replies to each word,are provided, since, as 
with all open answer tests of this type, the mark- 
ing is largely subjective. 

The Crichton Vocabulary Scale and the Col- 
oured Progressive Matrices are sold together as 
a single set, and a single record form is supplied 
on which the results of both tests may be entered 
for each child. Raven claims that : 


While the Matrices Test indicates a person's present 
capacity for. intellectual activity whatever knowledge 
he has acquired, the Vocabulary Test provides an in- 
dex of his general cultural attainments. By using the 
Matrices and Vocabulary Tests together instead of a 
single verbal test of general intelligence, a child's pres- 
ent capacity for intellectual activity and his acquired 
fund of verbal information can be assessed separately. 
As a result, it is possible to estimate more accurately 
both the validity of the assessments made and the psy- 
chological significance of any discrepancy between the 
two. 

It is perhaps unfair to point out that it is 
hardly an advantage to give two tests together 
in order that what they both measure may then 
be assessed separately. Raven has not clearly 
stated what he means. In any case, few modern 
psychologists would content themselves, in 
assessing a child's intelligence, with a purely 
verbal test, as Raven seems to suggest. There is 
no doubt, however, that in publishing a test of 
vocabulary to supplement the Coloured Pro- 
gressive Matrices, the author is meeting a widely 
recognised demand, although many psycholo- 
gists would probably agree that the addition of 
a vocabulary test adds more information about 
general intelligence than about cultural attain- 
ments, For example, in the Revised Stanford- 
Binet Scale, vocabulary shows а consistently 
high biserial correlation with total test score, 
from age 6 to the superior adult level. 

Until some systematic results are obtainable, 
it is impossible to assess the “validity of the 
assessments made” by these two tests. Raven 
suggests, in the manual, some interesting con- 
clusions which may be drawn in cases where 
there is a wide discrepancy between the Vocabu- 
lary and Matrices scores. These, however, ap- 
pear to be based on casual observation rather 


than on any systematic analysis. 


W. D. Watt, Reader in Education, University 
of Birmingham, Birmingham, England. j 
This scale of 80 words for oral presentation 
to children up to the age of 11 is a development 
of the Mill Hill Vocabulary Scale. It is intended 
to be used in association with Progressive Mat- 


rices (1047) Sets A, Ab, and B, and the two 
are claimed by the author to be an adequate 
measure of intelligence. Differences between ma- 
trix score and vocabulary score, he holds, indi- 
cate differences between the child’s present ca- 
pacity for intellectual activity and his acquired 
fund of verbal information. 

This claim should be accepted with some cau- 
tion. In the handbook to the test, the author gives 
norms of performance based on 608 children 
between 4 and 11 years. If my calculations are 
correctly derived from his data, none of the 
single increments of score for each half-year 
group constitutes a significant difference. The 
same applies to the norms for Progressive Mat- 
rices. Hence, differences in level between the 
two tests would have to be considerable to be 
significant statistically and therefore worth eval- 
uating diagnostically. 

A further point is worth making here. In 
groups of narrow age range, the correlation of 
vocabulary and matrix scores is relatively low 
(.53 for 6% year olds, .65 for 9% year olds). 
Hence, observed discrepancies between status 
on the two tests may well be regression phe- 
nomena merely. 

Reliability data are given the handbook, and 
again the exiguousness of the sampling suggests 
caution. For 35 children of 644, the test-retest 
correlation is .87; for 35 children of 9%, it is 
‚95; and for 60 children from 5 to то years, 98. 
In themselves these figures are satisfactory ; most 
of us, however, would regard them as inconclu- 
sive evidence that the test can be accepted as any- 
thing but an experimental tool. 

All this indicates that although the test is 
promising and convenient in form for clinical 
work, much research upon it has still to be done, 
In the meantime, it is a useful supplement or 
alternative to the vocabulary scales in such bat- 
teries as the Revised Stanford-Binet Scale or 
the Wechsler Intelligence Scale for Children. 
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Curtis Classification Form. Grades 12-16 and 
adults; 1951; a disguised intelligence test administered 
as an interview; individual; Forms А, B; Form A 
recommended for selection purposes, Form B for clini- 
cal use; $3.75 per 100, cash orders postpaid; specimen 
set free to qualified test users; (10) minutes ; James 
W. Curtis; Teleca Research Associates, Inc. 


Hanorp С. SEASHORE, Director, Test Division, 
The Psychological Corporation, New York, 


New York. f 3 
This test consists of 15 questions which are 
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asked of the examinee in the manner of an em- 
ployee interview. It is supposed to foster rap- 
port and to prevent resentment or "blocking" ; 
nervousness and attempts to "beat the test" are 
said to be virtually eliminated. 

Rating the mental abilities of individuals from 
the content of interviews has been common prac- 
tice for many years, with good and bad results. 
Curtis’ test is an attempt to formalize the pro- 
cedure by providing a scoring method for 10 of 
the 15 questions asked. The interviewing itself 
is intended to be informal. Some of the questions 
are highly structured and presumably the ex- 
aminer should ask them just as printed; for ex- 
ample, “What is your height?” After recording 
the reported height, the examiner states the 
height (as printed on the form) of the average 
person of appropriate sex and then asks, “Ex- 
actly how much above (or below) average is 
your height?” The reply to this mental arith- 
metic problem is recorded. After recording the 
response of the person to a question about the 
specific type of work in which he is interested, 
the examiner asks, “In general, what are the 
most important advantages of that type of 
work?” The answer to this question is scored 
2, I, or О points, according to the criteria men- 
tioned in the manual. 

As noted above, Forms A and B are recom- 
mended for different purposes. Although the 
difference in use of the two forms is iterated 
several times in the manual, the rationale is not 
given. Six of the 10 scored questions are identi- 
cal or essentially the same. Perhaps the reason 
that Form B is not usable in personnel selection 
is that it contains questions on religion; these 
are illegal in many places and in bad taste in 
most. 

The reviewer also wonders why the question 
“What was the highest level of school you com- 
pleted?” is not scored when level of school at- 
tainment is an obvious basis for an estimate of 
intelligence, one that has been used informally 
so long that it is almost a classical basis for a 
first estimate. The manual gives no information 
regarding the selection of the items or the method 
for evaluating their contribution to the total 
score. 

The scoring key presents criteria for evalu- 
ating each question on the 2, т, or о basis, but 
evidence is lacking concerning the objectivity 
of the scoring, a matter of utmost importance 
in view of the brevity of the test. The manual 
does point out that with 33 cases given without 
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careful study of the manual, correlation between 
Curtis scores and Wechsler IQ's was only .74, 
The standardization is based on 500 individuals, 
about half of each sex, in the age range of 15 
to 61; apparently there was an approximately 
equal number in each of the five-year intervals 
from ages 15 to 50. The author states that a 
correction for chronological age proved superflu- 
ous, 

Norms for interpretation are presented (a) 
as ranges of scores which differentiate stated 
levels of intelligence, (b) as an expectancy table 
in which the test scores are on one axis and the 
percentages of persons expected to equal or ex- 
ceed IQ’s of 125, 105, 90, and 70 are on the 
other; and (c) in terms of suggested job levels 
for persons with certain scores, this being a 
dubious extrapolation from Army General Clas- 
sification Test data. 

The author estimates that the reliability of the 
test is at least .80; odd-even values of .76 and 
.77, for A and B, are given. He asserts that when 
Form C is ready, test-retest reliability will be 
obtained ; considering the similarity of the Forms 
A and B, one may ask why this study has not 
been possible. 

The basic validity data for the Curtis test re- 
side in its correlation with the W echsler-Bellevue 
Intelligence Scale, Form 1. This correlation 1s 
reported as being .89, N = 120. The correlation 
of Curtis scores with IQ’s from either the 
Wechsler or the California Capacity due 
naire is given as .86, N = 250. In the light o 
the reliability of this short test, these values are 
so high as to be suspect, a criticism anticipat 
and remarked upon by the author. 4 

The reviewer feels that Curtis has attempte 
to fill an important gap in testing. A method - 
quick determination of mental ability in a $e 
ural interview situation is needed. One omy 
wishes that there were more evidence of erit 
item analysis and that the rationale of the соп 
tent were established. 


[3391 test for 
*The FR-CR Test. Adults; 1949; a eir form; 
use in evaluating brain injury; individua i тоо rec 
memory selections included in manual; $3 pe Psychol 
ord sheets; $ per eee. d sd Козы (3-5 
ervice Center J 1: I S '49) ; po: tel Gerv- 
minutes; Russell Graydon Leiter; Psychological Se 
ice Center Press. * 


REFERENCES 


T! 
F es el AL Psycnotocica. Service CEN valitative 
> est 


441 ] 


WirLIAM S. Kocan, Chief Psychologist, Veter- 
ans Administration Hospital; and Kate LEVINE 
Kocan, Clinical Psychologist, 6034 44th Ave., 
N.E.; Seattle, Washington. 

The publishers of this brief memory test, con- 
sisting of one paragraph, claim that it is a useful 
addition to the psychologist’s armamentarium. 
This is to be questioned for several reasons. (a) 
It duplicates in type of material and method of 
scoring a part of the Wechsler Memory Scale. 
The latter has the advantage of more extensive 
standardization and a varied sampling of dif- 
ferent forms of behavior involving memory. The 
FR-CR is too brief and limited a sample df be- 
havior from which to obtain clinical impressions. 
(b) Although several inferences are made con- 
cerning the usefulness of the test in eliciting 
qualitative clinical material effective in delineat- 
ing individuals with organic brain defect, the 
only substantiating material presented in the 
test manual is one reference to the literature; 
this states in general terms that it was found 
to be “fairly sensitive” to such things as cir- 
cumstantiality, blocking, pseudoprofundity, and 
so forth. It would seem that an experienced clini- 
cian would be in a position to obtain this ma- 
terial from tests useful in other ways as well. 
(c) It is inferred that the addition of a con- 
trolled recall section makes possible the evalua- 
tion of personality organization. The test manual 
notes that this addition was made 15 years ago, 
but comments further in another place that a 
shortage of time made it impossible to evaluate 
the hypothesis. (d) Although reservations are 
made concerning the practice, provision is made 
for the calculation of an MA and 10. It is inter- 
esting to note in this connection that the FR-CR 
is termed a useful addition to the verbal section 
of the Wechsler-Bellevue with implied equiva- 
lence to Wechsler IQ's, yet the FR-CR has been 
standardized against the Revised S tanford- 
Binet, whose IQ scores are known to deviate 
from those of the Wechsler-Bellevue in specific 
ways. 

Because this test is new the reviewers ar- 
ranged to have it administered experimentally 
to a series of adult veterans by three examiners. 
They were in agreement that although the test 
was brief and easy to use, it did not provide in- 
formation or elicit behavior which was not se- 
cured by procedures already in routine use. 

In summary the publishers of the FR-CR test 
infer that this 3- to 4-minute sample of behavior 
is probably useful in eliciting information con- 
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cerning intellectual level, personality organiza- 
tion, and organic brain impairment. Poorly sup- 
ported inferences are an unfortunate basis for 
the publication of a test procedure for general 
use, Sa 


[340] 


xFull-Range Picture Vocabulary Test. Ages 2 
and over ; 1948; individual ; Forms A, B; no data on 
reliability and validity with test (for data presented 
elsewhere by the authors, see 1-9 below) ; no manual ; 
$5 per set of testing materials; separate answer sheets 
must be used; $1.10 per 25 answer sheets ; $4 per 100; 
postage extra; nontimed (10-15) minutes; obert В. 
Ammons and Helen S. Ammons; Southern Universi- 
ties Press. * 
REFERENCES 

1. Ammons, Rozert B., AND AMMONS, HELEN S. “The Full- 
Range Picture Vocabulary Test." Abstract, Ат Psychol 
4:267-8 Jl '49. * (PA 23:6202, title only) 

2. AMMONS, ROBERT В. лхо Homes, JAMES Стлрғовр. “The 
Full-Range Picture Vocabulary Test: ITI, Results for a Pre- 
соге Population." Child Develop 20:5-14 Mr '49. * (PA 
24:22: 

p AMMOXS, Rosert B., ann Hutu, Вснлкр W. “The Full- 


Range Picture Vocabulary Test: I, Preliminary Scale.” J 


Psychol 28:51-64 Jl '49. * (PA 24:424) 

4 Corrias mie WiLsoN. The Full-Range Picture Vo- 
cabulary Test: A Normative Study о, Southern Negro Chil- 
Master’s thesis, Tulane University (New Orleans, La.), 


Rosert S. "The Full-Range Picture ухаар тч IVR 
sychol 6:164-9 


7. AMMONS, ur m Larson, WiLLIAM L.; AND SHEARN, 


8. Ammons, Ковевт B., AND MANAHAN, New. “The Full. 
for a Rural 


13154) 
o. Ammons, Ковевт B., AND КАСн1кгЕ, ТЕО B. V he Full- 
Range Picture Vocabulary Test: ТІ, Selection of Items for 
Final Scales.” Ed & Psychol Meas 10:307-19 su '50, * (PA 


25:5821) Y 4 
10. Hoven, RavMoxp H, “Improved Methods in Testing 


Cerebral Palsied Children.” ‘Am J Mental Def 56:349-53 О '51. 
* (PA 26:2184) 


WiLtLrAM D. Artus, Associate Professor of 
Psychology and Chairman of the Department, 
Santa Barbara College, University of California, 
Santa Barbara, California. 

One may infer from the publications of Am- 
monses that they wished to construct a multiple 
choice picture vocabulary test which would meas- 
ure the same functions as does the vocabulary 
section of the 1937 Stanford-Binet. They have 
reported data which indicates that the two forms 
of the picture vocabulary test correlate quite well 
with this criterion. In their discussion of the 
standardization of the picture test in the mimeo- 
graphed sheet which accompanies the test, the 
authors report an 7 of .95 with the Stanford- 
Binet vocabulary test. This correlation coeffi- 
cient, based on an N of 44, looks spuriously 
high when one considers that two fallible instru- 
ments are involved. However, the correlations 
reported for samples of a rural population (8) 
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and a bilingual, Spanish speaking group (5) 
run in the .80's. These latter figures appear to 
be more reasonable and conservative and doubt- 
less represent the relative validity of the picture 
test when the vocabulary section of the Stan- 
ford-Binet is used as the criterion. When the 
criterion is changed to the vocabulary subtest 
of the Wechsler-Bellevue Intelligence Scale, the 
same sort of correlations emerge for a sample 
of the adult population, .86 for Form A of the 
picture vocabulary test and .85 for Form B (7). 

Although the number of words involved in 
the picture vocabulary test is relatively small, the 
reliability is quite adequate, a finding which is 
not surprising in the area of vocabulary testing. 
For an adult sample, the authors report an r 
of .93 between Forms A and B. For the stand- 
ardizing groups, the median odd-even reliability 
for the various age levels is .81 (6). The co- 
efficient for the full range of talent came to the 
spuriously high figure of .987. Reliability would 
seem to be sufficiently taken care оѓ; the correla- 
tions reported in the preceding paragraph for 
"validity" would be prima facie evidence of 
fairly adequate reliability anyway. 

The authors have, then, constructed a test 
based on pictures, which measures approxi- 
mately the same factor or factors which underlie 
the respective vocabulary tests of Terman and 
of Wechsler. In order for it to be acceptable for 
anything except research purposes, the authors 
must show that their test can serve certain 
groups of people or certain desired ends more 
effectively than can the criterion tests which 
have been used in deriving the new one. The au- 
thors have not yet done this. They emphasize the 
speed with which the test may be administered, 
saying that “it takes only fifteen minutes or less 
to administer.” If the authors wish the picture 
vocabulary to supplant the Stanford-Binet vo- 
cabulary, this “fifteen minutes or less” is much 
too slow. During World War II, the writer was 
chief psychologist in an Army training center 
through which over 15,000 adult male illiter- 
ates passed. All of these who could speak Eng- 
lish were given the Stanford-Binet vocabulary 
test. Those who administered this test had to 
study the acceptable and unacceptable responses 
in the Terman-Merrill manual until they could 
score the individual answers of illiterates plus or 
zero without writing down the answers. The av- 
erage time for administering the Terman vo- 
cabulary test in this fashion was three minutes, 
not “fifteen or less.” Thus, the criterion test 
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against which the authors validated this new one 
is much more efficient in terms of time of admin- 
istration—if the person administering the Stan- 
ford-Binet vocabulary is properly and thor- 
oughly trained in its use. This point would not 
be made were it not that the authors in their pub- 
lications and in the mimeographed sheet which 
accompanies this test make much of the rapidity 
with which the test may be given. 

Ammons and Agüero note in one article that 
Spanish speaking American children of Mexi- 
can ancestry do considerably poorer on the pic- 
ture vocabulary test than do monoglot Ameri- 
cans#of the same chronological age. Similar 
findings have been reported numerous times by 
many psychologists, including the writer whose 
conclusions were based on his work with Army 
illiterates. The picture vocabulary test suffers 
the same limitations as do other personally ad- 
ministered vocabulary tests when they are given 
to certain bilingual groups. Setting up new 
norms for deficient bilinguals, as the Ammonses 
propose for those of Mexican ancestry, does not 
improve the efficacy or usefulness of the test. 


Wituram М. СкшскзнАхк, Director, Edu- 
cation for Exceptional Children, School of Edu- 
cation, Syracuse University, Syracuse, New 
York. 

This test, designed to give a rapid estimate of 
verbal comprehension, has been in use for only 
two years and as yet has not met the crucial test 
of clinical experimentation. It consists of 16 
plates each with four separate cartoon-like draw- 
ings on it. The testee is asked to indicate by word 
or gesture which of the four pictures best illus- 
trates the meaning of a given word. Approxi- 
mately 80 words are used in each of the two par- 
allel forms which cover the range of verbal abili- 
ties from early infancy to the superior adult 
level. Items are given an approximate age place- 
ment, but this is only intended to facilitate the 
mechanical aspect of testing since the test i5 
actually a point scale. р k 

The test meets a need felt by clinicians work- 
ing in the field of the mental testing of persons 
with speech handicaps since it is essentially non 
verbal in the sense that the testee need not 8m 
a word during the performance. The cartoon- 
like nature of the drawings should have an ү 
trinsic interest value for children. Also, as y 
author mentions, it is very easy to pmo 
The author reports that proficient up adi 
were able to test three or four children an ho 


443 ] 


TESTS & REVIEWS: INTELLIGENCE—INDIVIDUAL 


342 


with both this test and the Stanford-Binet or 
Wechsler vocabulary. 

Admirable also is the designers’ thorough and 
meaningful standardization procedure. The lim- 
itation of a circumscribed geographical sample 
from around the Denver area and from a rural 
area of Nebraska does not appear to be a serious 
handicap. A careful breakdown of the standardi- 
zation population made it possible to include in 
the final norms an adequate sample of rural sub- 
jects. Then too, 80 Spanish-American and 80 
Negro children have been included in the final 
standardization norms. The author rejected items 
for the final scale if they showed inadequate dis- 
crimination in per cent passing between succes- 
sive age levels, regional bias, sex differences in 
difficulty, ambiguity of denotation, or poor dis- 
crimination at one age level even though the 
same item was satisfactory at another level. 

For Form A of the final scale, the correlation 
with the Stanford-Binet vocabulary raw score 
was .91 and for Form B it was .93. Since Stan- 
ford vocabulary scores have a correlation of .81 
with the mental age of their scale as a whole, the 
validity of the Ammons as a measure of verbal 
intelligence is not as outstanding as the correla- 
tion might suggest since it is twice removed. 
Taking this into account, however, the scale still 
offers a satisfactory estimate of the desired 
verbal comprehension. The total score on the 
test can be transcribed into mental age scores at 
the younger levels and percentiles at the adult 
level. 

The present reviewer considers this test valu- 
able as a quick verbal intelligence screening de- 
vice in industry and in the armed services and 
as a means of testing persons seriously handi- 
capped by a speech defect or by upper extremity 
disabilities. 

[341] 
*Gesell Developmental Schedules. Ages 4 weeks 
to 6 years; 1925-49; individual; also called Gesell Ma- 
turity Scale, Gesell. Norms of Development, Gesell 
Tests, Preschool Child Development. Scale, Preschool 
Child Test, Yale Psycho-Clinic Developmental Sched- 
ules, Yale Tests of Child Development, and other vari- 
ations; (20-40) minutes; Arnold Gesell and others. 
а) 1925 SERIES. 1925; the authors consider the 1925 
Series out of date and superseded by the 1940 Series 
(see c below); manual, The Mental Growth of the 
Preschool Child (see 1 below) is out of print; $65 per 
set of testing materials (listed in publisher’s catalog 
as Pre-School Child Test), postage extra; C. H. Stoel- 
ting Co. 
b) 1938 SERIES. 1925-38; manual, The Psychology of 
Early Growth ('38—see 13 below), out of print; test 
materials out of print; Marietta Apparatus Co. 


с) 1940 SERIES. 1925-49; manuals: The First Five 
Years of Life ('40—see 20 below) and Developmental 


Diagnosis, Second Edition ('47—see 31 below) ; $68.50 
per set of testing materials; 80¢ per specimen set of 
printed materials and record forms; 50# per revised 
general instructions booklet (749) ; postage extra; Psy- 
chological Corporation. * 

REFERENCES 
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32. ESCALONA, SreyrLe. “The Predictive, Value of Ps cho» 
logical Tests in Infancy; A Report on Clinical Findings.” Ab- 
stract. Am Psychol 3:281 Jl m * (PA 22:5366, title only) 
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dictive Purposes." B Menninger Clinic 14:1 17-28 Jl 'so. * (РА 
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For reviews by Nancy Bayley and Florence 
M. Teagarden, see 3:276; jor related reviews, 


see 3:277-80 and 40:B912-6. 
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xThe Immediate Test: A Quick Verbal Intelli- 
gence Test. Adults; 1951; individual; orally admin- 
istered; 1 form; $1 per 25, postage extra; 256 per spec- 
imen set, postpaid; (5) minutes ; Raymond J. Corsini: 
Sheridan Supply Co. * 
_ REFERENCES 

1. Corsint, Клумохр. “Тһе Immediate Test." J Clin Psychol 
7:127-30 Ар '51. * (PA 25:7778) 

Jerome E. Doppett, Assistant Director, Test 
Division, The Psychological Corporation, New 
York, New York. 

This is a vocabulary test consisting of 66 
words grouped into 11 sections. Each section 
corresponds to a “mental year,” with years 10 
through 20 represented. The score on the test 
is the number of words below the basal year plus 
the words actually defined correctly by the sub- 
ject. This raw score may be converted to MA or 
IQ by means of a table printed on the test. There 
is no scoring key. It is simply suggested that the 
examiner ask himself, “Does the subject know 
what the word means?” 

Since the determination of the MA or IQ is 
ultimately dependent on the proper allocation of 
words to mental years, the procedure used to 
assign the words is of considerable importance. 
However, the manual is woefully inadequate on 
this point. It is stated that the difficulty of words 
was determined by noting “the percent of passing 
individuals at each mental age.” The numbers 
and kinds of cases are not given. A journal arti- 
cle cited as a reference gives a rather confusing 
picture of several different difficulty analyses. 
One of these studies was apparently made with 
104 subjects distributed over at least eight mental 
age levels. One can only say that the number of 
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cases at each mental age must have been remark- 
ably small. 

АП of the statistical data are based on male 
prison populations. The test-retest reliability is 
reported as .90, and correlations with other tests 
such as Otis, Terman List and Wechsler-Belle- 
vue range from .77 to .91. Unfortunately, the 
author does not give means and standard devia- 
tions either on the other tests or on his own test, 
so the reader is left to wonder about the effects 
of range on the correlation coefficients. Before 
these coefficients are accepted, further verifica- 
tion of the findings based on other populations, 
preferably with means and standard deviations 
given, should be obtained, 

The reviewer finds great difficulty in accepting 
the author's statement that vocabulary “is less 
affected by culture than most kinds of material." 
This difficulty is enhanced by finding words like 
‘borsht” and “fid” in the test. One can hardly 
accept these words as “culture-free.” 

There are a number of inconsistencies in the 
short manual. It is stated that the test should be 
announced in a manner that does not mention 
"intelligence" or "test." Yet at the top of the 
list of words, which the subject is permitted to 
see if he wishes, is the subtitle, “A Quick Verbal 
Intelligence Test." There is also a certain incon- 
sistency between the statement that the test “is 
a laboratory technique and requires proper ad- 
ministrative procedures" and the subjective 
scoring system. 

In the reviewer’s opinion, there is nothing in 
the manual which would justify confidence either 
in the mental age equivalents or the IQ's ob- 
tained from this test, 


Ivan Norman Mensu, Assistant Professor of 
Medical Psychology, Washington U niversity 
Medical School, St. Louis, Missouri. 

This test is described by its author as a quick 
verbal intelligence test-—“an emergency clinical 
instrument.” Consisting of 66 words, six at each 
age level from 10 through 20 and ranging in dif- 
ficulty from “zebra” to “gecko,” the test is a 
vocabulary measure designed as an age scale. 
Mental age and IQ values are immediately avail- 
able on the single-sheet form on which the sub- 
ject’s identifying data and responses are re- 
corded. The two-page manual consists of brief 
statements on the advantages of a vocabulary 
test as a measure of intellectual status; a descrip- 
tion of the test ; its construction in terms of word 
selection, difficulty, discriminating power of the 
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individual words, validity, and reliability ; and di- 
rections for administration and scoring. The au- 
thor suggests that, according to the data from 
the standardization samples, which were male 
reformatory and prison inmates, The Imme- 
diate Test is suitable for adults with IQ's be- 
tween 75 and 125. The standardization samples 
had also been administered the Otis, the Ter- 
man vocabulary, Form L of the Stanford-Binet, 
and Forms I and II of the Wechsler-Bellevue, 
With N's of 50 to 300 the correlations of The 
Immediate Test with the criterion tests range 
from .77 to .91 for 12 samples. Test-retest reli- 
ability on 137 prisoners is reported as .90. 

Unlike many brief tests, this rapid measure is 
not an abbreviated technique derived from a 
parent test but was specifically designed as a 
short test. Thus there are no previous data avail- 
able on which to judge the merits of Corsini’s 
vocabulary scale. Only cross validation on sam- 
ples other than those used in the standardization 
can provide adequate demonstration of the va- 
lidity and reliability of the scale for other ele- 
ments of our test population. The test has been 
introduced so recently that this necessary evi- 
dence has not yet been accumulated. Finally, one 
caution recognized and mentioned by the test 
author should be stressed. He states that the 
simplicity of administration of the scale suits it 
to the needs of workers without training in psy- 
chometrics, “but its simplicity ought not to be 
confused with the great difficulty of adequately 
determining intelligence.” There is danger in the 
evaluation of intelligence by untrained indi- 
viduals who do not recognize the limitations of 
brief measures sampling a single area of intellec- 
tual functioning. 
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"Intelligence Tests for Children. Ages 2-11; 
1945-50 Е formerly called Intelligence Tests for уо 
Children; 1 form; requires only such Merge" me"; 
can readily be found or made in the school or [see H 
5s. per manual (’50—same as the 1948 ese thuen 
below]); (30) minutes; C. W. Valentine; Me 

& Co. Ltd. 


REFERENCES scence 
1. WAKELAM, B. B. “The Application of a New I ari 
Test in an Infant School and the Prediction of Ва УЗ ; 
Brit J Ed Psychol 14:142-50 N "44. * ga 19:1355 ung Chil- 
2. VALENTINE, С. wW ntelligence ' 
dren, First and Second Editions. London: 
1945, 1946. Pp. xii, 67. * (PA 19:3520) [The $ 
а reprinting of the first edition. Children, Third 
3 Fourth Ед. London Methuen & Coy Li 1946 е 
an. м Изопз. оп: м, ыер inti: 
Pp. їй Вт. * (РА p :4596) [The fourth edition is а repr 
of the third edition.] 


: 4 at 
For excerpts from reviews, see 345; for О 
cerpts from reviews of an earlier edition, 52е 3 
and 3:283. 
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[Re Intelligence Tests for Children.] VALENTINE, 
C. W. Intelligence Tests for Young Children, 
First and Second Editions. London: Methuen & 
Co. Ltd., 1945, 1946. Pp. xii, 68. * (PA 19:3520) [The 
second edition is a reprinting of the first edition.] For 
latest edition, see 345. 


Lancet 249:80 Jl 21 45. Professor Valentine 
has done the educationists a good service by col- 
lecting together a number of standard tests for 
young children, adding to them some new tests, 
and making a new scale of tests. These range 
from tests for children of eighteen months to 
children of eleven years, and they are meant to 
be given by teachers. They call for only the sim- 
plest equipment and his hope is that in this way 
every child will be tested three times in the years 
from his nursery school up to his entry into the 
senior school. In a sense, it is "utility" intelli- 
gence testing. The reader is given a clear idea 
of what a test situation should be like, what are 
the necessary safeguards to be observed, and the 
commoner pitfalls and misconceptions are as 
honestly faced as they can be in the brief space 
of so small a book as this. The instructions, both 
for giving the test and for scoring it, are clear 
and concise. They are nevertheless not entirely 
unambiguous; and here and there the author 
rather breezily disposes of the need for absolute 
standardisation. He wisely makes the point that 
at the very young end of the scale, the rapport is 
all-important, and in his readiness to approxi- 
mate and subsequently correct at a later test, he 
probably underestimates the high degree of ac- 
curacy obtainable by an expert working with say 
the Gesell material. He will, however, be able 
to add to his knowledge of the young child, and, 
wisely used, these tests should go far to prevent 
any failure in early recognition of marked de- 
viations from normal ability. 

Nature 156:349 S 22°45. John Cohen. * Al- 
though the scale is intended chiefly for teachers, 
it is also meant for the use of that rara avis, an 
"intelligent parent," eager to weigh the native 
talent of his offspring. Apart from four new tests, 
all the material is borrowed, often with slight 
modifications, from Burt, Gesell, Stutsman and 
from the Merrill-Palmer and Terman-Merrill 
scales. In many cases new age assignments have 
been given, As in the Binet scale, the tests are 
scored on a pass-fail principle, and all test items 
are given equal weight, though no evidence is 
offered that they deserve it. A practical advan- 
tage is the simplicity of the apparatus required. 


The reader may be surprised not to find in a 
book of this kind, where the emphasis is upon 
intellectual ability, any reference to the work of 
Dr. Charlotte Bühler, for Dr. Bühler's aim is to 
judge the degree of maturity attained by the 
child's personality as a whole, including its cog- 
nitive aspects. A few of the “tests,” for example, 
bowel and bladder control, are scarcely tests at 
all in the sense of sampling responses in a stand- 
ardized situation, for success in them depends on 
habits acquired under varied conditions of train- 
ing. Without further evidence, one cannot there- 
fore regard them as indexes of basic ability. 
The validity of the “sentence completion” test 
at age six is also questionable, since it requires 
the highly specific ability to use two preposi- 
tions. One intriguing point is the author's state- 
ment (p. 10) that he found a “very close cor- 
respondence between very early tests (at only 
12 or even 6 months) with tests of intelligence 
at 10 years and later.” One would like to know 
precisely which tests were given at the age of 
six months to obtain such good predictions of 
later ability. * Prof. Valentine admits that the 
age assignments are not altogether satisfactory, 
especially in the very early years. Thus the “ac- 
tion agent” test at age three requires the infant 
to know the meaning of such words as “gallops,” 
“aches,” “explodes,” “roars,” etc. This surely 
deserves a higher age assignment than the Binet 
test in which the child is required to point to its 
nose, eyes, mouth, hair and knee, or the one in 
which the child has to put on his shoes. Some 
brief experiments by the reviewer confirm this 
view. Nevertheless, the scale may be warmly 
recommended for further experimentation and 
clinical use. It may be justified as a device for 
roughly gauging the general ability of young chil- 
dren, the estimates improving in accuracy às 
the child grows older. Much careful work is, 
however, needed, as the author would doubtless 
agree, before all the tests could claim a sound 
theoretical foundation. 


For an additional review, see 3:283. 


[345] 

*[Re Intelligence Tests for Children.] VALEN- 
ды C. W. Intelligence Tests for Children, Third 
and Fourth Editions. London: Methuen & Co. Ltd., 
1948, 1950. Pp. xiii, 81. 55. ж (PA 25: 4596) [^I have 
added tests for the ages of 12, 13, 14, and 15, which 
should provide ample material for the bright 11-уеаг- 
olds and the average 12- ог 13-year-olds.’—Preface to 
the Third Edition. The fourth edition is a re rinting of 
the third edition which is a revision of Intelligence 
Tests for Young Children.] 
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Brit J Ed Psychol 18:169 N °48. C(yril) 
B(urt). * In its original form the book sought 
to provide a variety of simple and interesting 
tests for children from the ages of 114 to 11. The 
new edition includes tests for older children up 
to the age of 15. Thus the book can now be used 
for testing pupils of r1 plus who are being se- 
lected for an appropriate type of secondary edu- 
cation ; and the addition of problems for higher 
mental ages makes it possible to examine 
brighter children at the age of transference. One 
feature of the original series has appealed par- 
ticularly to teachers in these post-war years; it 
dispensed with all elaborate apparatus that might 
require to be imported from abroad or purchased 
at some expense. Most of the test-material 
needed was furnished by the book itself in the 
shape of pictures or diagrams. For the rest noth- 
ing was required except a little simple apparatus 
such as can easily be found or made in the school 
or at home. This admirable principle has been 
preserved in selecting new tests for the higher 
ages. They include problems of well-established 
types, such as Reasoning, Analogies, Absurdi- 
ties, together with Performance Tests including 
Mazes; but the mazes and other diagrams are 
printed in the book itself, and nothing else is 
wanted but cards cut to the appropriate shapes, 
and occasionally a pencil and paper. In the past, 
many of the test-scales produced for school 
pupils have been constructed either by labora- 
tory psychologists with little or no teaching ex- 
perience, or by teachers not very familiar with 
modern scientific principles for standardizing 
such tests. Professor Valentine's tests have a 
double advantage; they can claim the scientific 
merits of a scale that is constructed by one who 
is a leading educational psychologist; at the 
same time they show a first-hand practical ap- 
preciation of the needs of the teacher in the 
classroom. 

Brit J Psychol, Gen Sect 39:172 Mr ?49. 
S. J. F. P (hilpott). * an enlarged edition of a 
well-known and valued book. The additional ma- 
terial consists of tests for children beyond the 11- 
year range. As the author notes in his preface, it 
is nowadays advisable to have tests available for 
assessing the bright youngsters likely to be en- 
tering Grammar Schools or Technical High 
Schools. By the addition of tests up to the 15- 
year level enough material has been given ade- 
quately to stretch the brighter II-year olds and 
of course the more ordinary children at а higher 
level. Full instructions are given with all neces- 
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sary diagrams. The tests involve little other ap- 
paratus and none which cannot easily be found 
in school, 

J Ed (London) 80:700 D '48. E. M. Bari- 
lett. * contains little that is new, but is a collec- 
tion of well-standardized individual tests, from 
other scales, for children to the ages of 15. This 
book in its turn may be “dangerous.” Prof, Val- 
entine is himself aware of the danger, as is in- 
dicated in his remark in the preface that “for a 
precise assessment of a child's intelligence quo- 
tient the expert tester is needed,” but his own 
suggestion that the scale can be used by teachers 
and even by an "intelligent parent" may open 
the way to all sorts of abuse, especially as it seems 
generally true that the less one knows about 
the theory of intelligence the greater one's faith 
in the result of a single test. 

Scottish Ed J 31:757 D 24’48. Ina July 1946 
issue of this Journal Emeritus Professor Valen- 
tine's new battery of intelligence tests was wel- 
comed as a useful alternative to the Terman- 
Merrill scales. By adding tests for the ages 12, 
13, 14 and 15, Prof. Valentine has in the new 
enlarged edition still further increased the use- 
fulness of the original work. 


For reviews of the first edition, see 344 and 
3:283. 


[346] Pn 

Kent Series of Emergency Scales. Ages 5-7, 6-5 
7-10, 9-14; 1932-46; also called E-G-Y Scales; iis 
vidual; 4 levels; 1 form, '46; $2.50 per 100 of any on 
level (Scales В and С are combined) ; боё per mH 
(’46—contains the four scales) ; postpaid; (15) X 
utes; Grace H. Kent; Psychological Corporation, don 
а) SCALE A. Ages 5-7. 1944-46; formerly called Am 
ver School-Entrance Test. 

) SCALES В AND C. Ages 6-8, 7-10; 1946. { Emm 
C) SCALE D. Ages 9-14; 1932-46; a revision 0 
gency Test (also called Kent E-G-Y Test). 


REFERENCES 
1-26. See 


:284. ; SOLOMON, 
- Hunt, W. A.: Wirtsox, C. L.; Harris, H. 13, 
Р.Р AND EE M. M. ""Psychomeiric Procedures in NS Ps 
tection of $ Nav 


oe T 
of the Kent and Buck Screen Tests of Mental Ability Pem- 


:187-9 My-Je '48. * (РА 22:4943) ~. exem 
ed € MUERE КЕЕДЕ SEYMOUR С. Ме Д , 
Ivan N.; AND WiLLIAMS, Meyer. “The Vali LANES "Psychol 


, Jo j Rican, б. гр 
Application of Brief Tests of Intelligence in tep A 23:77 
n 


title only) Tests Used on 
QR G. M. “The Kent and Other Test? (рд 

the Same Subjects.” J Mental Sci 44:452-8 Ар '4® 2 

23:12 cs, S. 


S rtr. H. F.; MacPuer, Н. М.; AND CUMMIN Bellevue 
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opc MM SA Psychol 44:223-30 Ap '49. * 
А .; Cannicott, RicHARD G.; UMBERGER, 
CUR e ME DM n 
S (PA 25:8099) 2 7:123-6 Ар ’51. 
Ivan Norman Мемѕн, Assistant Professor of 
Medical Psychology, Washington University 
Medical School, St. Louis, Missouri. 

Kent’s manual for her series opens with a 
statement which immediately defines her goals 
in devising the test. She has carefully stated that 
the E-G-Y was offered “as a preliminary meas- 
ure....useful in helping the examiner to decide at 
what mental level to start the more formal exami- 
nation; and occasionally it may serve to indi- 
cate how exhaustive an examination is required.” 
Introduced in 1932, revised in 1942, and in- 
creased by three new scales with additional 
norms added in 1944 and 1945, the Kent Series 
were widely used during World War II. The 
press of military and naval demands often re- 
quired a brief, “simple and informal mental test” 
аз a screening device in selection of recruits. In 
this setting the Kent tests were extremely valu- 
able as a step beyond the initial screening of 
group intelligence measures, providing a clinical 
tool for face to face examining of suspected 
mental defectives. 

In actual practice, when used with service 
personnel the Kent Series have shown correla- 
tions of .73 and .74 with the Wechsler-Bellevue 
Verbal Scale and the Stanford-Binet, respec- 
tively, both in the U.S. and in England. These 
correlations have led various investigators (e.g., 
Wright, MacPhee, and Cummings (33) and Ru- 
dolf (32)) to conclude that the E-G-Y does not 
sufficiently discriminate levels of mental func- 
tioning and cannot be used in place of longer 
tests. Against these studies are those by Hunt, 
Wittson, Harris, Solomon, and Jackson (27), 
Lewinski (13, 16, 21), Cofer and Biegel (29), 
and others. Hunt, for example, reported that 
predictions of Wechsler-Bellevue mental age 
from Kent E-G-Y performance showed an aver- 
age error of six months or less. It should be re- 
called that Kent did not intend the E-G-Y as an- 
other test, but rather as a preliminary measure. 
With this specific goal in mind, the arguments 
against the Kent Series seem to have less weight. 

Finally, unlike many, if not most, test authors, 
Kent has devoted a significant portion of her 
manual to "Suggestions for Further Develop- 
ment." She points out that she “is not prepared 
to carry its development further, but is hoping 
that it will be taken up by some younger person." 
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The critical thinking about test construction 
which has gone into Kent's suggestions for 
further development can well serve as a model 
for other authors. 


This method of using the available material—the 
arrangement of questions covering a wide range of 
difficulty in overlapping scales which carry independ- 
ent norms—is believed to be a more important contri- 
bution than any actual test-material offered by the 
writer. Almost any particular question is likely to lose 
its significance with the passage of time, and probably 
the best of scales should be brought up to date and 
restandardized at intervals of not over twenty years; 
but the system here proposed is capable of in efinite 
expansion. This set-up of scales is one which effects 
the maximal usefulness of every question and the mini- 
mal use of inappropriate questions, while at the same 
time permitting the utmost economy of time. 


For a review by Charles N. Cofer, see 3:284. 


[347] 
XThe Leiter Adaptation of Arthur's Stencil De- 
sign Test. Adults; 1945-49; а subtest of Leiter- 
Partington Adult Performance Scale (sce 350) ; indi- 
vidual; 1 form, '49; manual ('49) ; $3 per set of sten- 
cils and designs; $3 per 100 record cards; $3 per 100 
qualitative check lists for evaluating brain injury; $1 
per manual; manual and record cards same as їог 
Leiter-Partington Adult Performance Scale; postage 
extra; (25-30) minutes; Russell Graydon Leiter; Psy- 
chological Service Center Press. * 
REFERENCES 
т. ADJUTANT GENERAL'S OFFICE, CLASSIFICATION AND RE- 


PLACEMENT BRANCH, PERSONNEL RESEARCH Section. “The New 
Army Individual Test of General Ability." Psychol B 41:532-8 
O '44. * (PA 19:581) 

J Consult Psychol 14:162 Ap '50. Laurance 
F. Shaffer. This upward revision of the stencil 
design test is substantially the “shoulder 
patches” item of the Army Individual Test. As 
a measure of ability in male adults, it shows a 
correlation with the Revised Stanford-Binet of 
62. Norms, in terms of MA and IQ are for adult 
young men only. An interesting clinical check 
list (by Neal Watson) calls attention to ob- 
servations of ability to shift, synthesize, plan, 
and analyze, that may be useful indicators of 


brain injury. 


[348] 
xThe Leiter Adaptation of the Painted Cube 
Test. Adults (especially with IQ's over 110); 1941— 
49; a variation 0! Imaged Cube Test which appeared 
about 1900; æ subtest of Leiter-Partington Adult Per- 
formance Scale (see 350); individual; 1 form, '49; 
manual (40); $6 per set of 24 cubes; $3 per set of 
model cards; $3 per 100 record cards; $1 per manual ; 
manual and record cards same as for Leiter-Partington 
Adult Performance Scale; postage l 
(10-15) minutes ; author of original test not known; 
adaptation by Russell Graydon Leiter ; 
Service Center Press. * 
REFERENCES 
т. ADJUTANT GENERAL'S Orricm, CLASSIFICATION AND Re- 


NCH, PERSONNEL RESEARCH SECTION. “The 
ута 3 M dual Tet of General Ability." Psychol B 41: 


532-8 О ’44. * (PA 19:581) 
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J Consult Psychol 14:162 Ap '50. Laurance 
F. Shaffer. Although some versions of the cube- 
assembly tests are about 50 years old, the present 
compact and useful form was devised by Leiter 
in 1941, and served as a part of the Army Indi- 
vidual Test. The revision differs from the well- 
known World War I form by using colored pic- 
tures of cube assemblies as the models, instead 
of solid blocks. Data on 256 male veterans show 
that the test correlates .57 with the Revised 
Stanford-Binet, and that it has superior discrimi- 
nation at the upper IQ levels, above 110. Norms 
are available only for adult young men. 


[349] 

*Leiter International Performance Scale. Ages 
2-18; 196-48; individual; 54 tests which can be given 
without language or pantomime; 1 form, revised, '48; 
revised manual (’48) ; test materials are in three trays: 
Tray 1 (tests for years 2-7), Tray 2 (tests for years 
8-12), Tray 3 (tesis for years 14-18) ; $27.50 per tray; 
$3 per 100 record cards ; $3 per manual ; per carrying 
Case; postage extra; (30-60) minutes; Russell Gray- 
don Leiter; Psychological Service Center Press, * 


REFERENCES 
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ter J 1:136-8 D "49. * (PA 2 11792) 
x Mirrnews, facr, AND Bircu, Jack W, “The Leiter In- 
асов Performance, До A du ested Тан Чог 
'sychologica! esting о peech an caring Clinic Cases,” 
J Speech & Hearing Disorders 14:318-21 D "49. * (PA 25: 
= Brssent, Trent E. “A Note on the Validity of the Lei- 
ter a телап Scale." J Consult Psychol 14:234 
"so, 135 
он MIL. E. A Study of the Performance оў Se. 
lected Groups of Five-Year-Olds on the Leiter International 
Performança Scale. Doctor's thesis, University of Iowa (Iowa 
ity, , e ‚ 
P^ Bind, 1950. R., AND Васин, Jack W. “The Leiter Inter- 
national Performance Scale as an Aid in the Рат 


і 1 i Scale 
tients Derived by the Leiter International Derim а 
University of Pittsburgh, (Pittsburgh, Pa.), 1951. 


Gwen Е. AnNorp, Assistant Professor of Edu- 
cation, University of Wisconsin, Madison, Wis- 
consin. 

This nonverbal mental age scale for measur- 
ing intelligence, published in тодо, was used 
comparatively little until the last five years be- 
cause of its unavailability during the war years. 
The scale consists of 68 standardized items, four 
at each age level, ranging in difficulty from the 
2-year to the 18-year level. The author suggests 
that its greatest usefulness is for children d 
tween the ages of 5 and 12. It is unfortunate tha 
the standardization data have not yet been pub- 
lished. However, the author states that the qu 
ardization and location of items in the scale fn Д 
lowed the procedure used by Тегтап in t à 
Revised Stanford-Binet Scale. He further state: 
that while the items in this scale are in no у 
adaptations or modifications of the Binet peo 
they may be considered of equal value at еа И 
year level despite the difference in the basic a 
sumptions regarding the indices of медиа 
While Terman believes that the amount 0 tal 
formation a child acquires through incide 
learning is an index of brightness, Leiter BH 
his scale on the principle that the ability to i 
with new situations is a truer indication О » 
telligence. In both scales the mental age rhe 
total number of months of credit eui 
intelligence quotient is the ratio MA/CA. any- 
uses a chronological age of 156 months d e 
one thirteen years and older. This in itse 
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fines the upper limits of the test's usefulness. 

The scale utilizes the common test technique 
of matching, ranging from simple pairing of like 
colors and objects to the more complex relation- 
ships of designs, genus, analogies, etc. The in- 
structions are given in pantomime and, since the 
materials are unique, the influence of coaching 
and previous experience are minimized. The 
items at the lower levels are sufficiently easy and 
the technique of administration is such as to 
provide opportunities for learning to take place 
during the testing. 

The advantages of this test as a clinical in- 
strument are many. It is useful with children 
who have hearing, speech, and multiple handi- 
caps, as well as with non-English speaking chil- 
dren. It is extremely interesting to children. 
While manipulation of materials is at a mini- 
mum, it provides activity for the young child 
which fosters attention. The minimal recording 
allows the examiner to observe the child's ap- 
proach to problem solving and his emotional re- 
actions while working. These observations are 
frequently more significant than the score itself. 
Timed items are negligible, thus making it a 
power rather than a speed test. It has some 
value in diagnosing brain damage. It can be 
adapted for use with the multiply handicapped. 

The greatest practical disadvantage of this 
scale is the amount of material which must be 
handled. The cost is also high. 

The level of difficulty of tests at years two 
and three seems to be too high for these ages. In 
the experience of the reviewer, the total score 
in the preschool ranges of the test averages IO 
IQ points lower when compared with other 
scales. ; 

The all-or-none aspect of the scoring, while it 
lends itself to greater objectivity in scoring, 
penalizes the child who may comprehend the na- 
ture of the task but who for other reasons does 
not accurately complete the total task. A partial 
score would be more definitive. With only four 
items at each age level, a failure exacts great 
penalty and lends itself to misinterpretation of 
the overall estimate of a child's ability by the non- 
technically trained person. 

From observations made during test adminis- 
tration the reviewer questions the validity of the 
following items in the scale. Year V, Test 1, 
Genus: Many children match the items on the 
basis of color rather than genus. While this may 
have been intended as part of the discriminative 
value of the test, it frequently fails in this pur- 


pose. Year V, Test 3, Clothing: The pictures of 
clothing are so outmoded and disproportionate 
that many children of this age fail, not because of 
lack of comprehension of the task, but because 
the pictured items are misleading. Y ear VI, Test 
2, Pattern Completion: This is most baffling to 
children of this age level. The author, too, has 
noticed this. In general, the six year tests seem 
to have a steep incline in difficulty level. Year 
X, Test 1, Recognition of Footprints: This item 
seems rather inappropriate for our culture and 
does not seem to have any discriminative value. 
Year XII, Test 2, Similarities: The use of real 
objects and materials and pictured objects to be 
matched with pictured objects is unduly confus- 
ing and does not conform with the other items 
in the scale since this technique is not otherwise 
employed. Year XII, Test 3, Recognition of Fa- 
cial Expression: This was an unfortunate choice 
of item to measure intelligence since its unrelia- 
bility has been shown in other types of studies. 

Arthur also recognized the inadequate stand- 
ardization at the two and three year levels and 
in her revision of the scale has modified it so 
that a basal age of two years yields a mental 
age equivalent of two years six months. The 
writer has found in rechecking tests on this 
basis that the results are more nearly equivalent 
to other test results. Similarity of results does 
not of itself prove adequacy or validity of a test. 
However, there are clinical situations where the 
Leiter scale, to date, is the only administrable 
measure, and the examiner must make recom- 
mendations on the basis of it. In the hands of 
nonclinically trained persons the prediction of a 
subject’s potential could be unfortunate. 

The reviewer has used this scale for six years, 
both for experimental and diagnostic purposes. 
She feels that despite some of its inadequacies, 
it is a good clinical instrument. She prefers it 
for clinical diagnosis rather than as a test to be 
used widely in public school situations as a 
measure of general intelligence. 


J Consult Psychol 13:386 O '49. Laurance F. 
Shaffer. * Although the scale seems to be a 
stimulating and original contribution to the 
measurement of ability with nonlanguage tasks, 


full clinical acceptance still awaits the publica- 
tion of data concerning its standardization. 


[350] 
XLeiter-Partington Adult Performance Scale. 
Adults; 1938-50; individual; 3 tests (also listed sep- 
arately) ; revised manual ('so—reprinted from Psy- 
chol Service Center J 1:139-71 D 49) ; $30 per com- 
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plete set of testing materials including 100 record cards 
and 200 qualitative check lists for evaluating brain in- 
jury; $3 per 100 record cards; $3 per 100 qualitative 
Check lists; postage extra; (35-45) minutes; Russell 
Graydon Leiter and John E. Partington; Psychological 
Service Center Press, * 

а) PARTINGTON'S PATHWAYS TEST. 1938-49; for prices, 
see 355. 

b) THE LEITER ADAPTATION OF THE PAINTED CUBE TEST. 
1941-49; for prices, see 348. ’ 

С) THE LEITER ADAPTATION OF ARTHUR'S STENCIL DE- 
SIGN TEST, 1945-49; for prices, see 347. 


REFERENCES 
1. ADJUTANT GENERAL'S OrricE, CLASSIFICATION AND R 


TLACEMENT BRANCH, PERSONNEL RESEARCH Section. “The 
New pony Individual Test of General Ability.” Psychol B 
41:532-8 м4. * (PA 10:481) Ў б 

2. ARMITAGE, STEWART G. An Analysis of Certain Psycho- 
logical Tests Used for the Evaluation of Brain Injury. Ameri- 
can Psychological Association, Psychological Monographs, Vol. 
60, No. 1 hole No. 277. Washington, D.C.: the Association, 
Inc., 1946. Pp. iii, 48. Paper, * 

. Leiter, RUSSELL GRAYDON, AND Partincton, Joun Е. 
“Manual for the Leiter-Partington Adult Performance Scale.” 
Psychol Service Center J 1:139-71 D '49. * (РА 25:1382) 

4. WEAVER, HERBERT B.; WITH THE ASSISTANCE OF ROGER C. 
Brown AND Lovis Н. РоквмА+х. “The Leiter-Partington Adult 
Performance Scale at the College Level." Psychol Service Cen- 
ter J 2:182-8 S 'so, * 

Нако A. Derr, Coordinator of Educational 
Activities, The Training School, Vineland, New 
Jersey. 

DESCRIPTION OF TESTS. (a) Pathways, Path- 
ways I, with a practice and a test pathway, in- 
cludes circled numbers which are to be con- 
nected in order. Pathways II, also with a prac- 
tice and a test, includes numbers and alphabet 
to be connected alternately. ( b) Stencil De- 
signs. Test is composed of a practice design and 
nine test designs. Materials include I9 round 
stencils in 6 colors, adapted from Arthur. (c) 
Painted Cubes. The test has an innovation in 
the use of three printed cards picturing the piles 
of cubes to be made. Individual wooden cubes 
are painted red on one or more sides with the 
other sides white. Cards vary in number of cubes 
and tiers illustrated. 

GENERAL COMMENTS. The authors criticize 
both the Revised Stanford-Binet Scale and the 
Wechsler-Bellevue Intelligence Scale when used 
for adults. They claim the L-P was devised to 
fill certain vacuums and to satisfy necessary 
characteristics for adult testing. Wechsler is 
criticized for his use of conventional items. Yet 
both the cube and stencil tests are adaptations of 
older tests. 

The manual contains no real data on validity. 
Its main argument is based on the correlation 
with the Revised Stanford-Binet Scale, although 
admitting the S-B is not applicable to adults. 
Correlation between the tests is .78. The authors 
imply that for validity a test must "produce re- 
sults that are commensurate with expert judg- 
ment of the mental ability." No evidence is given 


even at the subjective level that this is obtained, 
The authors believe that the L-P is “as reliable 
and valid as the performance section of the 
Wechsler-Bellevue Scale.” No data concerning 
the relation to the Wechsler are described or 
referred to. 

Despite the lack of specific evidence to sub- 
stantiate claims, there seems to be some truth in 
the statements. Personal use by this reviewer 
plus an informal study have shown relatively 
high agreement for the majority of cases be- 
tween L-P and Wechsler full scale results, The 
relationship with the Wechsler performance 
score was still better. 

According to the authors, Pathways and 
Stencil Design offer best discrimination at 
mental levels of average and below (IQ's 65- 
110), while Painted Cubes is best at superior and 
above (1075 rro-about 145). The scale as а 
whole is interesting and easily administered to 
adults. Administration time is relatively short 
(about 45 minutes asa maximum). Norms range 
in MA from about 8-0 to 19-0 and in IQ from 
65 to 145 for subtests and total score. Times for 
completion of items are converted to raw scores 
and then into MA and ІО. Chronological age 
of 13-0 is used as the age of mental maturity 
(maximum divisor), i 

Norms are based on “256 unselected male 
World War II veterans between the ages of 19 
and 36 who had been given the 1937 Revision of 
the Stanford-Binet Scale." Approximate normal 
distribution on S-B scores is taken to imply 
adequacy of sample. Socio-economic, occupa- 
tional, and other background of the sample are 
not given. Even though “unselected,” the sam- 
ple seems entirely too small for developing yi 
quate norms. The authors freely caution ж 
the use for individuals not similar to the Зари 
ardization population. They also mention t : 
limitation of any single subtest as a Insane i 
general intelligence. It is doubtful that the t n 
subtests together are adequate to cover the pes 

The authors state that “two of the three s д 
[Pathways and Stencil Designs] composing t Я 
Scale have been found to be useful in evaluation 
of brain injury." While the considerations zd 
volved in the analysis of brain injury seem Ha 
sistent with other such lists and characteris 4 
there аге по data іп the manual to validate t 
above statement directly. One article is pa 
to which is used as the basis, even thous E 
arguments on the pathways are by implica 
only. 
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SUMMARY. The Leiter-Partington Adult Per- 
formance Scale is composed of three subtests, 
each complete in itself and interesting to adults. 
MA and IQ scores are given for each part and 
for the total score. The scale appears subjec- 
tively to produce satisfactory results, but relia- 
bility and validity are not completely substan- 
tiated by the data given. The norm population 
seems too small and too restricted to be compared 
with a general population. Accuracy is a pre- 
requisite, with credit based on a time score. 
Clinical and diagnostic values are claimed but 
with inadequate validation. 


HrnscuEL MANUEL, Professor of Educational 
Psychology, and Director, Testing and Guid- 
ance Bureau, University of Texas, Austin, 
Texas. 

The Leiter-Partington Adult Performance 
Scale consists of three subtests: Partington’s 
Pathways Test, the Leiter Adaptation of Ar- 
thur’s Stencil Design Test, and the Leiter Adap- 
tation of the Painted Cube Test. In the first of 
the subtests the subject marks with pencil two 
irregular pathways, one. indicated by circles 
bearing the numbers 1 to 25 and the other in- 
dicated by numbers and letters alternately. The 
second subtest requires the building up of various 
designs by superimposing circular pieces of col- 
ored paper, all of which, except the ones to be 
used as the foundation, have cut-out portions. 
The third subtest consists of assembling small 
cubes to make blocks with painted sides like the 
stimulus pictures. Directions are given by pan- 
tomime illustrating the procedure. 

Each subtest is scored separately on the basis 
of the time required for successful completion of 
each part. Maximum time limits beyond which 
no credit is allowed are prescribed for the sev- 
eral parts of the Stencil Design and Painted 
Cubes tests. Raw scores are converted into 
mental ages and intelligence quotients by refer- 
ence to tables provided in the manual. The IQ's 
are based upon a divisor of 13 in contrast with 
the 15 used in the Revised S tanford-Binet 5 cale. 
A total mental age or intelligence quotient 1s 
found by simple summation of the raw scores 
on the three subtests and reference to a con- 
version table. 

А version of the three tests was used as the 
performance portion of the Army Individual 
Test constructed for the measurement of gen- 
eral mental ability. Probably the most signifi- 
cant difference in the two series was in the use 


of verbal directions in the Army and the use of 
pantomime in the present series. The evaluation 
of the tests by the Personnel Research Section, 
AGO (1), is quite definite and unequivocal : The 
Army Individual Test “is valid and it is practi- 
cal * It contains both verbal and nonverbal ma- 
terials which contribute equally to the variance 
of the total score.” The intercorrelations of the 
three performance tests vary from .43 to .60, and 
the correlation of the total weighted perform- 
ance score with the score on the Army General 
Classification Test is .74. 

Intercorrelations reported in the manual for 
the Leiter-Partington scale are .45 for Pathways 
and Cubes, .50 for Stencils and Pathways, and 
.63 for Stencils and Cubes. The correlation of 
the total score with the Stanford-Binet is .78. 

Weaver used the Leiter-Partington scale with 
5o male and 50 female college sophomores and 
compared the results (with a few exceptions ) 
with the results of the American Council on 
Education Psychological Examination adminis- 
tered at the beginning of the freshman year 
(4). No significant sex differences on the per- 
formance tests were found. The Pathways Test 
gave a wide distribution of scores but lacked 
sufficient ceiling. The distributions on the other 
two tests were good. Correlations of the total 
Leiter-Partington score and the ACE were: L 
score, .18; Q score, .45; and total score, .24. 
Weaver thinks that the Leiter-Partington scale 
measures something ‘quite different from scho- 
lastic aptitude as measured by the ACE, and 
should prove a valuable supplement to tests of 
the verbal-symbolic type.” 

Armitage used the Army Version of Parting- 
ton’s Pathways under the name Trail Making 
and the Army version of the Leiter Adaptation 
of the Arthur Stencil Design Test under the 
name Shoulder Patches in comparing cases of 
brain injury with normal or, neurotic cases һау- 
ing no history of brain injury (2). Although 
there was considerable overlapping of the scores 
of the two groups mean scores on both tests were 
higher for the control group. The manual of the 
Leiter-Partington scale contains suggestions for 
obtaining “qualitative data" for use in the evalu- 
ation of brain injury. 

In the reviewer's opinion the clinician may 
add the Leiter-Partington Adult Performance 
Scale to his equipment with confidence that he 
will have a useful instrument. Its coverage is 
obviously limited, but this is true of many tests 
from which helpful information may be obtained. 
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It is probable that factor analysis would add in- 
terpretative information of considerable interest. 


J Consult Psychol 15:88 Е "5r. Laurance F. 
Shaffer. The L-P Performance Scale consists 
of the three nonverbal subtests of the Army Indi- 
vidual Test developed during World War II: 
the pathways test, a stencil design test employ- 
ing round patterns, and a painted cube assem- 
bly test. The reliability of the whole scale is not 
given, but its correlation with the Revised Stan- 
ford Binet is .78 in a group of 256 adult males. 
The intercorrelations of the three subtests range 
from .45 to .69. Tables of norms give adult MA's 
and IQ's for each subtest separately and for the 
battery. Since the materials are novel and do 
not seem childish, the scale offers a useful addi- 
tion to resources for the nonverbal examina- 
tion of adults. The manual discusses qualita- 
tive observations obtained from the tests that 
may have values in psychological diagnosis. 


[351] 
Minnesota Preschool Scale. Ages 1.5-6; 1932-40; 
3 scores: verbal, non-verbal, total ; individual; Forms 
А (32), B (732); manual (40) ; $14.50 per set of 
testing materials needed by examiner; $1.25 per 25 
record blanks; postage extra; $1 per specimen set, 
Postpaid; nontimed (10-30) minutes; Florence L. 
Goodenough, Katharine M. Maurer, and M. J. Van 
Wagenen; Educational Test Bureau, Educational Pub- 
lishers, Inc, * 
REFERENCES 

1-3. see 40:107. 

I-5. е! 7250. 

& Tenet) Huszgr STANLEY. A Study of Certain Mental 


Functions and Their Relation to Chan es in the Intelligence of 


Preschool Children. Doctor’s thesis, University of Iowa (Iowa 
City, Iowa), 1938. (Programs Announcing Candidates for 
Higher Degrees 1938.) 

7. DE Forest, RuTH, “A Study of the Prognostic Value of 
the Merrill-Palmer Scale of Mental Tests and the Minnesota 
Horse Scale." J Genetic Psychol 59:219-23 S '41. * (PA 
16:1231 


For a review by Beth L. Wellman, see 3:286; 
for reviews by Rachel Stutsman Ball, Nancy 
Bayley, 
1407; for related reviews, see 352 and 3:287-8. 
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[Re Minnesota Preschool Scale.] Maurer, KATH- 
ARINE M. Intellectual Status at Maturity as a Cri- 
terion for Selecting Items in Preschool Tests. 
University of Minnesota, Institute of Child Welfare 
Monograph Series, No. 2r. Minneapolis, Minn.: Uni- 
versity of Minnesota Press, I946. Pp. ix, 166. Litho- 
typed. $2.50. * (London: Oxford niversity Press. 
20s) (PA 21:286) 


El Sch J 48:167-9 N "47. Lee J. Cronbach. * 
introduces a basically different method of item 
selection—one which has much to commend it. 
Items are tested, not to see whether they have 
certain theoretically desirable characteristics 
when given, but to determine whether they pre- 


and Florence М. Teagarden, see 40: , 


dict ultimate mental ability. She takes advantage 
of data accumulated over the years by the Min- 
nesota Institute, which permit one to re-examine, 
at maturity, young people tested originally be- 
low the age of six. Maurer tested a superior, se- 
lected group of 226 persons at the age of sixteen 
or later with the Wells Revision of Army Alpha 
Test. Binet records were also used for fifty ado- 
lescents. The results are striking—even aston- 
ishing—in, the light of the complacency with 
which some items have been accepted for 
years as measures of "intelligence." Among 
Maurer's group nonverbal items were generally 
predictive, while many verbal items had little 
or no relation to standing at maturity. The au- 
thor points out that verbal items are peculiarly 
subject to special experience and even coaching 
by persons who wish children to show impres- 
sive "development." If the general-factor hy- 
pothesis is accepted—and Maurer's data sup- 
port this view—these nonverbal items are evi- 
dently a better measure of g at earlier ages than 
are many verbal tests. At later ages, verbal tests 
are heavily saturated with g, according to long- 
standing evidence. Among the items which have 
highly predictive value are imitative drawing, 
block-building, form discrimination, digit span, 
mutilated pictures, picture puzzles, vocabulary, 
and opposites. Since few of these tests depart 
in any way from the standard conception of 
mental ability, the import of this study is to point 
out certain types of items for discarding, rather 
than to draw attention to new forms. А sand 
ard preschool test, believed to contain only es 
items, was the original basis of the study. 1e 
twelve of the original twenty-nine item-forms 
were found to be inadequate predictors. Tt 18 : 
be hoped that this study may lead some mmi 
worker to invent superior item types to rep/ac 
those lost. * 

7 Ed (London) 80:700+ D ’48. C. М. Flem- 
ing. * a scholarly analysis * Greater prediri 
value is indicated for items such as imitativ 
drawing, block building, discrimination of ui 
memory for digits, knowledge of word oppos! of 
comprehension of directions, than for т 
the type of aesthetic comparisons, paper et 
or the naming of familiar objects. It is poss! 1 i 
agree that there is in this some evidence 0 » 
constant core of mental functioning, in the pé 
that what is commonly described as intelligen t 
is manifested in a candidate's adaptability А 
what, at the age in question, is something á ae 
novel situation. In view of the extent of the 
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parities covered by correlations even so high 
as +.7 or +.8, it seems, however, dangerous to 
claim that these findings give promise that we 
shall yet succeed in predicting terminal intellec- 
tual status from pre-school testing. This book, 
in its concentration on the search for stability 
and its lack of allowance for the influence of 
later experiences, comes near to being one of 
those of which it may fairly be said that to con- 
tinue such educational research for a means of 
effecting a neat and final classification of pupils 
at a specified age is to remain within the in- 
tellectual framework of the nineteenth century. 
The more recent emphasis on educability, rather 
than on fixity, and on flexibility rather than sta- 
bility, is the fruit of those more adequate studies 
of mental performing which offer not a single 
retesting after ten to sixteen years, but cumula- 
tive recording and annual measurements. From 
such continued observations of the details of 
personal growth, one is led to an admission of 
variability and to an awareness that each indi- 
vidual follows a pattern of his own in relation 
to the average of his group. Prediction at any 
age is hazardous, and it is dangerous to claim 
that what an individual does do to-day provides 
a full indication of what he can do at some future 
date. 

J Ed Res 41:314-5 D "47. Nancy Bayley. * 
The study as a whole is very carefully done, and 
the implications of the results and the theoretical 
discussion are very important additions to our 
knowledge of intelligence and its growth. It may 
be noted, however, that in its present state pre- 
diction from this shortened test remains very 
poor. The great value of this research is that it 
points the way to the type of test items which 
should be devised and added to those already 
known to be predictive, in the hope of develop- 
ing a test battery which will predict mature in- 
telligence at an early age. It is hoped that data 
now available from other longitudinal studies 
will be similarly analyzed. From such analyses 
we may eventually learn the extent to which 
the growth of intellectual factors is consistent 
in individual children. ] 

Occupational Psychol 22:217-8 О '48. Н. J. 
Eysenck. * A good deal of complacency has ac- 
companied the inculcation ofa belief in the “соп- 
stancy of the IQ" into educational psychologists. 
This belief is in no way justified. Correlations 
between test and retest on the Dinet are no- 
toriously low when several years have intervened 
between test and retest ; in a survey of published 


material the present reviewer found that after 
an interval of ten years Binet retests correlated 
only about .52 with original tests. A value of this 
kind represents an improvement over chance 
guessing of only 14 per cent. Clearly then in- 
telligence tests of the Binet type leave much to 
be desired in terms of predictive efficiency. Mat- 
ters are even worse when we turn to pre-school 
tests; here correlations of test with later retest 
are lower still, and may even become negative 
for very young age-groups. For this reason, psy- 
chologists have given up the use of pre-school 
tests for predictive purposes, although the prac- 
tice still lingers in some backwaters. * Detailed 
figures are given regarding the predictive value 
of each item in the scales, These should be of 
immense value to those interested in the use and 
interpretation of pre-school scales. One weak- 
ness in an otherwise scholarly and thorough 
study is the method of validation, in which pre- 
diction coefficients derived from part of the total 
group of subjects are applied to another part 
of the group. While obviously such validation 
is essential, the method used by Maurer is prob- 
ably the most inefficient which could have been 
devised under the circumstances, and indeed 
leaves the question at issue largely unsettled, 
Even in view of the natural limitations of the 
study in terms of number of subjects, its im- 
portance can hardly be overrated. The problems 
raised are of such importance to psychology that 
one can only hope that a large-scale study using 
thousands rather than hundreds of subjects will 
soon be inaugurated. And as one further point, 
one would like the same method used on tests 
other than pre-school tests; the problem of pre- 
diction is no less urgent with the older child than 
with the four-year old, and the application of 
similar methods should go far to dispel the 
present false optimism which still enshrouds so 
many educational psychologists, while substi- 
tuting a more firmly grounded factual belief, 
Psychol В 44:578-9 N 47. Quinn McNemar, 
* A grand total of about 700 biserial correlations, 
between Alpha scores and the Preschool Scale 
items, were computed on 180 cases subdivided 
into nine age groups according to age at time of 
their preschool test (N's varied from 9 to 33). 
The records of 46 cases (every fifth) were held 
for “validating” the “predictive” scale made up 
of the subtests selected on the basis of the item 
analysis. Two points lead the reviewer to ques- 
tion the accuracy of the statistical analysis. With 
sizable standard errors (up to .30), it is very 
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doubtful that the 400 r's under -40 (200 under 
:20) would all be on the positive side of zero; 
yet no negative biserial values are reported. 
About one-tenth of the 700 given standard er- 
rors, for biserials, are erroneously low. On the 
basis of the item biseral r’s, 17 of the 29 subtests 
of each form were retained as “predictive.” These 
17 represent about 75 % of the 224 scoring items 
per form. The preschool test blanks of the 46 
validating cases were rescored on these selected 
items, and the correlations of Alpha with selected 
tests were compared with the correlations 
yielded by all the preschool subtests. With r’s 
based on N’s of 3 to 8 per age group, the results 
are inconclusive : it cannot be said that the scores 
based on selected tests are any more predictive 
than scores based on all the tests. This lack of 
conclusiveness also holds for the findings based 
on a supplementary validation group with Stan- 
ford-Binet IQ's at ages 11 to 15 taken as meas- 
ures of near terminal intelligence. A better pro- 
cedure would have been to choose on the basis 
of the item analysis the best half of the items as 
predictive, with the remaining half as nonpre- 
dictive, then to compare the validity of the pre- 
dictive versus that of the nonpredictive tests, 
instead of the predictive versus the total. The 
difficulty of eliminating the age factor so as to 
have a validity coefficient based on all 46 cases 
could be overcome by rescoring a sufficient num- 
ber of typical Minnesota Preschool Test blanks 
as a basis for standard scores, at each age level, 
for the predictive items. These standard scores 
could then be correlated with later Alpha stand- 
ing, also expressed as standard scores by age 
level. This monograph represents an enormous 
amount of work on a very important, and likely 
fruitful, line of research. Far more cases will be 
required for checking the reasonable hypothesis 
that preschool scales contain “dead wood" as 
far as the prediction of terminal intelligence is 
concerned. 


For additional reviews, see 3:288. 


[353] 
Nebraska Test of Learning Aptitude for Young 
Deaf Children: A Non-Verbal Individual Scale 
Especially Devised for Deaf and Hard of Hear- 
ing Children from Four to Ten Years of Age. 
Ages 4-10; 1041; individual; 1 form; $52 per set of 
testing materials and 100 record forms; $1.75 per roo 
record forms; $1.75 per 100 completion-of-drawings 
blanks; $1.75 per manual ; postpaid; (50-60) minutes ; 
Marshall S. Hiskey ; distributed by Psychological Cor- 
poration. * 
REFERENCES 


Hs See 3 di. 
4. Tuomure, Eucene. A Comparative Study of the Results 
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of the Nebraska Test of Learning Aptitude for У, М 
Аа Aptitude Test и ) 
esi; 3 


Mivprep С. TEM PLIN, Associate Professor, In- 
stitute of Child Welfare, University of Minne- 
sota, Minneapolis, Minnesota. 

This test is constructed specifically for young 
deaf and hard of hearing children and has been 
standardized on such a group. The following cri- 
teria were among those against which the items 
in the preliminary scale were selected : (a) simi- 
larity to school tasks of the deaf child, (b) possi- 
bility of nonverbal measurement, (c) instruc- 
tions that can be pantomimed, (d) correlation 
with accepted criteria of intelligence and learn- 
ing ability, (e) objective scoring, and (f) non- 
timed scores. In an item analysis those items 
showing the greatest increase in percentage 
passing from age to age were retained. In the 
final scale there are 124 test items arranged ac- 
cording to difficulty under 11 types of measure- 
ment: memory for colored objects, bead string- 
ing, pictorial associations, block building, mem- 
ory for digits, completion of drawings, pictorial 
identification, paper folding, visual attention 
span, puzzle blocks, and pictorial analogies. The 
instructions for each item are pantomimed, no 
item is a “speed test,” and practice items are 
presented for the various types of measures. 

The split half reliability of the test, corrected 
by the Spearman-Brown formula, is .963. The 
validity rests largely on the method of selection 
of items for the preliminary scale and on the 
item analysis for the selection of items for e 
final scale. No attempt was made to establis 
validity through correlation with other er 
since “there is no existing test which woul 
have been an acceptable criterion.” The тенш 
between the total test score and each of the i 
test parts is reported in coefficients ranging п, 
63 to .84 for the age groups 4 to 7 and 8 to 

ears. 
t The results of the test are expressed as i 
learning age (LA) which is interpreted нй 
larly to the MA except that ће LA d 
the performance of the average deaf child. idi 
learning quotient (LQ), while presented, m 
be used cautiously. 

A short form of the test, including five i 
of measures which are most highly corre? in 
with the total test score, can be administer oup 
about 30 minutes. On the standardization £T he 
the correlations between the five parts an 
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total test are .94 for both the 4 to 7 and the 8 
to 10 age groups. A 

The test is attractive and interesting to chil- 
dren; since it was constructed and standardized 
for deaf children, it has a special value. 


[354] 
*Northwestern Intelligence Tests. Ages 4-12 
weeks, 13-36 weeks; 1949-51; a revision of Gilliland- 
Shotwell Intelligence Scale ('43) by A. R. Gilliland and 
‘Anna M. Shotwell; individual; 2 levels; 1 form; $2 
per 25 test record sheets for any one level; other test 
materials may be purchased from A. R. Gilliland, 
Northwestern University, Evanston, Ill. ; 40¢ per speci- 
men set of test record sheet and manual for any one 
level; postpaid; (20-30) minutes; A. R. Gilliland ; 
Houghton Mifflin Co. * 
a) TEST A. Ages 4-12 weeks; 1949. 
b) rest в. Ages 13-36 weeks; 1951. 


REFERENCES 
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595 Л '41. * (PA 15:5403, title only) 
2. SuorWELL, ANNA M., AND GILLILAND, А. R. “А Prelimi- 
nary Scale for the Measurement of the Mentality of Infants," 
Child Develop 14:167-77 5 '43. " (РА 18:908) А 
3. Ркоммохр, Mary D., AND GILLILAND, A. R. “The Vali- 
dation of the Gilliland-Shotwell Infant Intelligence Scale,” Ab- 
stract. Am Psychol SEO О '46. * (PA 21:624, title only) 

4. GILLILAND, А. R. “An Intell igence Test for Early In- 
DN уз Am Psychol 2:302 Ag '47. * (PA 21:4661, 
itle only 

s. Berks, Mary Dez DRUMMOND, New Data in the Develop- 
ment of the Gilliland-Shotwell Intelligence Scale. Master’s the- 
sis, Northwestern University (Evanston, In» 1948. 

6. GiLLILAND, А. R. “Environmental Influences on Infant 
ГУ ОГ "Test Scores." Abstract. Am Psychol 3:265 Jl '48. * 


A 22:5367, title only) à 
GiLLILAND, А. В. “The Measurement of the Mentality of 


7. р, 
Infants." Child Develop 19:155-8 S '48. * (РА 23:5, 19) 

8. Спллілкр, А. R, “Environmental Influence on Infant In- 
telligence Test Scores.” Harvard Ed R 19:142-6 su '49. * (PA 


24:3106) y 
9. GILLILAND, A. R. “Socio-Economic Status апа E as 


Factors in Infant "Intelligence Test Scores." Child Dev 22: 
271-3 D ’st. * 

Миравр C. TemrLIN, Associate Professor, In- 
stitute of Child Welfare, University of Minne- 
sota, Minneapolis, Minnesota. 

The до test items in the present revision of 
the Northwestern Infant Intelligence Test 
(NIIT) are based on a study of 276 complete 
infant tests. They were selected according to 
four criteria: (а) the test items have definite in- 
structions for giving and scoring, (b) they re- 
quire relatively simple equipment, (с) they 
measure adaptation to the physical and social 
environment rather than maturation, and (4 ) 
they show an increasing percentage of babies 
passing from week to week. The raw score for 
the test is the total number of items passed. An 
IQ can be computed for any age. ]t should be 
noted that the test items in the earlier forms 
were made up from items used by other infant 
test authors, primarily Gesell, from some modifi- 
cation of other infant test items, and from items 
devised by Gilliland and Shotwell after noting 
signs of development in comparing the behavior 
of younger and older babies. 


The odd-even reliability coefficients, corrected 
by the Spearman-Brown formula, range between 
.79 and .94 for each week between 4 and її. 
The coefficient for the total age range is .84. The 
validity of the test has been considered in sev- 
eral ways. The age-grade progress of the indi- 
vidual items and of the test as a whole is one of 
the criteria used in the construction of the test. 
The results of the tests of 72 inmates of the Lin- 
coln State School and Colony, Lincoln, Illinois 
are reported as approximate to expectation for 
various groups of feebleminded children. The 
mean difference between the IQ measured on 
the NIIT and the Cattell test or the Kuhlmann- 
Binet is 3.8 points for 29 children. Although no 
quantitative results are reported, “several of the 
normal infants who were tested by one of the 
earlier forms of the test have been given the 
Stanford-Binet Tests later. While the numbers 
are too small for statistical analysis, the results 
thus far have shown high predictive value for 
the early tests.” (7) It is probable that the real 
value of the NIIT rests upon the quantified evi- 
dence which will accumulate as the results of 
the many tests of the same children as infants 
are compared with their test results in later years. 

J Consult Psychol 13:385 О '49. Laurance F. 
Shaffer. Because of the need, especially of adop- 
tion agencies, for a test for very young infants, 
a scale of 40 items was developed drawn in part 
from the work of Gesell, Cattell and Buehler 
and in part from original observations. The ob- 
servations emphasize responses showing adap- 
tation to the physical and social environment 
rather than growth or maturation. * There is 
some evidence of good agreement with other in- 
fant tests. 


[355] 
*Partington’s Pathways Test. Adults (especially 
with IQ's between 65 and 110) ; 1938-50; formerly 
called Distributed Attention; а subtest of Leiter- 
Partington Adult Performance Scale (see 350); in- 
dividual; can be given in pantomime; 1 form, '49; 
revised manual ('50) ; $9 per set of pads for 100 exam- 
inations; $3 per 100 record cards; $3 per 100 qualita- 
tive check lists for evaluating brain iud od 
manual; manual and record cards same as for citer- 
Partington Adult PA nd Scale; postage extra; 
(3-5) minutes; John E. Partington and Russell Gray- 
don Leiter ; Psychological Service Center Press. * 
REFERENCES 

1, ADJUTANT GENERAL'S Orrice, CLASSIFICATION AND Re- 
PLACEMENT BRANCH, PERSONNEL Research Section. “The 
New Army Individual Test of General Ability.” Psychol B 
41:532-8 О '44. * (PA 19:581) 


Grorce K. Bennett, President, The Psycho- 
logical Corporation, New York, New York. 
When this test was originally developed it was 
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called "Distributed Attention." A slightly modi- 
fied version used by the U.S. Army was called 
"Trail Making." The present test differs from 
its predecessors in having two parts, each of 
which is prefaced by both a demonstration and a 
practice test. The task consists of drawing a 
pencil line to intercept sequentially each of 25 
labeled circles printed on an 8% by 11 inch page. 
In Part I the circles are numbered; in Part II 
the series runs: 1, A, 2, B, and so on. The ex- 
aminer's demonstration upon a miniature prac- 
tice test is followed by at least one practice run, 
and if necessary several, on the fore-exercise 
to insure the subject's familiarity with the 
process, Score points are awarded in a roughly 
linear relationship to speed on each of tlie two 
real trials. On Part I, the maximum of 20 points 
is earned if the route is traced in under 40 sec- 
onds while only 2 points are granted to a sub- 


ject who requires 1 10 seconds or more. On Part | 


II the corresponding limits are 45 seconds and 
145 seconds. No credit is given on either part 
if an error is made and left without prompt and 
Spontaneous correction. 

Although this test is available for separate 
use, it is one of the three parts of the Leiter- 
Partington Adult Performance Scale, the man- 
ual for which serves also for each of the com- 
ponent tests. This manual clearly indicates that 
the Pathways test is regarded by the authors as 
à test of general intelligence suitable for use with 
adults with IQ's of 60 to 140. Table III of the 
manual gives MA and IQ equivalents for Scores. 
This designation appears to have two founda- 
tions. The first is that the test meets the authors' 
requirements of yielding speed scores, contain- 
ing inherently motivating material, and not re- 
sembling childish activities. The second is that 
when the test was administered to "256 unse- 
lected male World War II veterans between the 
ages of 19 and 36" who had also taken the 
Stanford-Binet, a correlation of .68 was ob- 
tained. It should be noted that this was the same 
group as that from which the time zones for 
point values were established. As a consequence 
the coefficient of .68 is probably an overestimate. 
Through the courtesy of the publisher an as yet 
unpublished article by H. Weaver entitled “The 
Leiter-Partington Adult Performance Scale at 
the College Level” has been made available in 
manuscript. Correlation with the American 
Council on Education Psychological Examina- 
tion L score is reported as .12; and with Q score, 
as .62. The correlation with total is not given. 
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Subjects were an unspecified number (less than - 
100) of University of Cincinnati sophomores - 
who had been tested on ACE while freshmen, 
The distribution of Pathways test scores is se 
verely skewed. i 
A qualitative checklist permits the examiner 
to make a record of his observations of the sub- 
ject's planning, method, and reaction to errors, - 
This is supplemented by a series of questions to 
be asked aíter testing is completed. The manual 
reports that the checklist and inquiry have been 
designed with reference to the findings of Stew- 
art G. Armitage in relation to brain-injured sub- 
jects. No data are offered in support, although 
journal references are given. | 
The reviewer feels that it would be highly — 
unfortunate if the Partington's Pathways Test, 
on the basis of present evidence, were to be | 
taken seriously as a measure of intelligence. The 
performance itself is a simple tracing ope tion 
of very brief duration in which a sort re | 
ceptual-motor speed is the controlling ї. 
Although tables of MA and IQ equivalents " | 
given, these are based upon a single groupo | 
veteran counseling candidates. No report of 5 
liability is given and, unless the Weaver study 
is so regarded, there is no validation Ro 
secon 1 group. It would appear to be highly ; 
ardous to use the results of this test in its vs 
stage of development as a basis for any Tn 
affecting an individual. Later experini Hai 
may confirm or contradict the faith of the ps 
lishers which at present would seem to 
largely on uncritical optimism. 
7 Consult Psychol 13:386 О 49. Lori d 
Shaffer. The Pathways Test is a perfo objet 
task using printed forms. In Part Wie S 
draws a line to irregularly placed cir es n 
bered 1, 2, 3, etc., in sequence, and in 
to similar circles in the order I, 
etc. Although an > of .68 with the 193 
Binet is reported for 256 adult males, ee 
is not intended to be a comprehensive po 
of intelligence. The author regards v M Per 
ure of a complex of speed, eye-hand c Norms 
tion, alertness and distributed аео Me 
are given in terms of MA and IQ, for a 


only. 
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*Porteus Maze Test, Vineland Revis 
and vers 1914-50; 2 scores: qualitative, qu 3-12, 14 
individual; І maze for each of the Te Por 
Test and Tntelligen с Кы below 
Test a ntelligence € Й 
The Mase Test and Mental Differences & 


O ONC ae И ТК е 7 Dg 


457 ] 


below) and Qualitative Performance in the Maze Test 
(’42—see 33 below) ; $1 per 100 of any one test; $4.10 
per manual; specimen set not available; postpaid; non- 
timed (15-60) minutes; Stanley D. Porteus; Psycho- 
logical Corporation. * 
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C. M. LourrIT, Assistant to the Provost, Uni- 
versity of Illinois, Urbana, Illinois. , 

Since its original publication in 1915 the 
Porteus Maze Test has been widely used by it- 
self and as part of the Arthur Point Scale of Per- 
formance Tests. Based upon the extent of use 
alone, the instrument has apparently been found 
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acceptable by clinical psychologists. However, 
there are still questions unanswered as to the 
meaning, reliability, and validity of the Maze 
Test performance. 

Originally Porteus (1915) offered the Maze 
as a test supplementary to the Stanford-Binet 
Scale which would sample such nonverbal as- 
pects of ability as prudence and forethought. In 
subsequent publications the tendency has been 
to emphasize the usefulness of the Maze in re- 
vealing "planning and foresight" in dealing with 
an essentially simple concrete situation. Corre- 
lations with the Binet of the order of .60 indi- 
cate appreciable communality between the abili- 
ties required on the two tests. Maze perform- 
ance also requires abilities in common with other 
types of performance tests. Finally, there appears 
to be a third requirement, probably adequately 
expressed in "planning and foresight,” for suc- 
cessful Maze performance. This type of analysis 
of the Maze Test rests in large measure on logic 
and intuition—there has been a surprising lack 
of rigorous experimental analysis of a test which 
is so widely used. 

The nature of the Maze Test makes use of 
the usual measures of reliability questionable. 
Administration of the graded series of Mazes, 
designated by year levels, always begins at year 
V or below and proceeds until a specified cri- 
terion of failure is met. An element in success 
is ability to profit from experience, i.e., a learn- 
ing factor. Because of this, test-retest correla- 
tions may be expected to be high because of 
learning and not because of consistency of test 
measurement. Split half methods of estimating 
reliability are not applicable at all because scores 
on parts of the total series completed are mean- 
ingless. That the Maze Test is reliable in a sta- 
tistical sense has been intuitively accepted, but 
not adequately demonstrated. 

Consideration of validity immediately raises 
the question of a criterion. From earlier discus- 
sion it is evident that performance on Binet-type 
or performance tests are not adequate criteria 
for validating the Maze as a measure of “plan- 
ning and foresight.” There have been no attempts 
of which the reviewer is aware to devise special 
situations involving these traits against which 
validity might be measured. The most promising 
evidence comes from studies contrasting the per- 
formance of delinquents and nondelinquents, ad- 
justed and maladjusted feebleminded and de- 
linquents, pre- and post-lobotomy patients, and 
Various social groups. Evidence from such stud- 
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ies is consistent in showing that the less well 
adjusted have poorer Maze performance than 
the better adjusted groups. In spite of consist. 
ency in the findings they are open to serious 
methodological questions because "adjusted" 
and "maladjusted" have usually not been cares 
fully defined. x 

The Maze Tests were standardized on an age 
level basis. The Maze age score (and its derived 
Maze IQ) is based upon a success-failure per- 
formance on the series, This all-or-none 
of scoring takes no account of qualitative differ- 
ences in subjects’ performance. Everyone who 
has used the Maze Tests to any extent recog- 
nizes that subjects earning the same Maze age 
scores may exhibit wide variation in the details 
of their performance. In 1942 Porteus published 
a study of results with a method of scoring per- 
formance in terms of quality of line and accuracy 
of successful drawing. This scoring method 
showed significant difference between delin- 
quent and nondelinquent children and between 
criminal and noncriminal adults. This qualitative 
scoring adds to the clinical usefulness of the test, 
but more extensive investigation is needed. 

In summary it may be said that the Porteus 
Maze Tests have been widely used and therefore 
have apparently found usefulness in psychologi- 
cal clinics. Nevertheless, there has not been ade- 
quate study of the test to establish it on a ЕП 
technical base. For those of us who have bo 
the test meaningful intuitively, there would : 
much value in knowing that adequate inve 
tion had established its reliability, validity, an 
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Grapvs С. SCHWESINGER, Senior Clinical Psy 
chologist, California Y outh Authority, V entut 
California. 

The Maze Test began between 1912 and ye 
when Porteus contributed to the. ша 
testing movement by offering this test 0 " $ 
dence and foresight.” His first papers were g Uf 
lished simultaneously in England and gir 
in 1915, with subsequent generalized re p 
appearing in 1919, 1924, 1933, and the p^ pi. 
cent in 1950. The last publication, The hice 
Mase Test and Intelligence, carries a bi pu. 
phy of 91 titles, attesting to the ipte ‘he 
which the Maze Test has undergone tiga 
hands of its author and many other en 
tors. During these 35 years, the Maze Ро паг 
and methods of scoring went through some 
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changes, but the design of the test is essent? f 
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the same. The tests have been republished and 
standardized in several foreign languages. Re- 
cently an alternative procedure for adults has 
been arranged by Peters. A second series stand- 
ardized against the original series, with the new 
one to be scored so as to diminish or eliminate 
practice effect, is now being developed by Por- 
teus. É 
Methods of validation have progressively 
changed from correlation with scores from other 
mental tests through judgments and ratings of 
educational, industrial, and social sufficiency, 
quality of adjustment in the community, adapta- 
bility of the test in widely differing environ- 
ments, to the most recent and dramatic checking 
of Maze Test results against the effects of psy- 
chosurgery. From this new approach more sig- 
nificant research on Maze Test validity has been 
done within the last 10 years than in the previous 
25, with new vistas open to what is now recog- 
nized as a highly sensitive testing instrument. 
Correlation coefficients between Maze Test 
and innumerable well known tests of intelligence 
have all been positive, suggesting a high degree 
of “g” saturation. In Porteus' own opinion, this 
reflects the common factor, planning capacity, 
present in all tests of intelligence. Males do 
better than females in nearly all environments, 
this being attributed to the fact that environment 
puts less stress on males than on females. The 
Maze Test also has the distinction of being able 
to tap emotional and temperamental factors 
which do not affect performance in other tests 
to the same degree. à 
Maze Tests show a fairly high correlation with 
Stanford-Binet for subjects relatively homoge- 
neous as to education and training. In nonschool 
ec. however, with heterogeneous adults, 
e correlation is not so very high. This latter 
moderate correlation shows that when scholastic 
aptitude is ruled out, the two tests measure dif- 
ferent things. Burt found that the nonlinguistic 
mazes proved a better measure of practical in- 
telligence than did the Binet-Simon which car- 
ried advantages for the verbally glib. But when 
the Maze Tests were applied to criminals, de- 
linquents, and socially maladjusted subjects 
their scores ran lower, making it even clearer 
that the Maze tested temperamental qualities 
such as prudence and emotional control as well 
as mental capacity. By 1941 this realization led 
to Porteus' devising a supplementary method of 
scoring responses on the Maze Test. In addition 
to obtaining a quantitative measure of mental 
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ability (for which the units of measurement аге 
scaled by years and half-years of age with a test 
age or test quotient being obtained) the psy- 
chologist can now simultaneously derive a qual- 
itative measure of temperament and emotional 
control (scored by error points, with a critical 
score of 29 for normal males, and 31 for females, 
with the average Q score for delinquents 50 for 
males and 54 for females). The total weighted 
error scores range from О to 150 and more 
points. Responses on this qualitative scale of 
errors include: cutting corners, crossing lines, 
lifting pencil, making wavy lines, going in the 
wrong direction, and being penalized for mis- 
takes on the tests for years V and VI in the 
regular series. It should be repeated that the 
qualitative measure does not serve to replace 
the quantitative measure but to supplement it. 

As a high Q (i.e., qualitative) score marks an 
inferior effort, with the reverse being true for 
the test age score on the quantitative scale, nega- 
tive correlations are to be expected to indicate 
corresponding relationships. For delinquent 
girls, = —.40; for criminals, r — —.37, showing 
that the more intelligent individuals tend to be 
more exact and careful, but not so much so as to 
indicate that intelligence is an important factor 
in quality of Maze performance. Critical scores 
obtained for these various groups show rela- 
tively small overlapping. Objective scoring on 
both qualitative and quantitative scales must still 
be accompanied by careful observation and note- 
taking during the clinical interview. 

Social status is reflected in Maze Test re- 
spogse, with children in higher social grades do- 
ing better than those in the lower grades, both 
being an expression of greater prudence, fore- 
thought, and planning capacity. The Maze Test 
has been widely used in studies of national-racial 
differences as they are independent of language 
disabilities, cultural disadvantages, and educa- 
tional inequalities, points which invalidate find- 
ings of mental comparisons obtained by other 
standard tests, especially if the performance of 
the white race is made the standard of compari- 
son. 

With two quite differently discriminating 
scales in one test instrument, the Porteus Maze 
Test is packed with value for any clinic reper- 
toire of tests. With the new and startling checks 
on validity derived through use of the Maze 
Tests on lobotomized and topectomized patients 
the Mazes have shown their superiority over 
other intelligence tests in correlating with that 
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core of the intelligence process known as plan- 
ning capacity. With the nonlanguage character 
of the Mazes, and with the universality of their 
interest and appeal, the Maze Test is still one of 
the very few which can be employed in many 
widely varying environments and different lin- 
guistic groups. With the additional light they 
throw on mental reactions in mentally retarded 
children, already tested by other instruments, 
their use before any final diagnosis of mental 
deficiency is made is strongly indicated; and 
certainly so before any commitments to institu- 
tions for the feebleminded are recommended. 
The Maze Test should be routine in any psycho- 
logical clinic dealing with children, for it is a 
very useful and very discriminating instrument, 


For related reviews, see 357, 


A 38:B453, and 
36:B2ro. 


[357] 


*[Re Porteus Maze Test] Porreus, SrANLEY D. 
The Porteus Maze Test and Intelligence. Palo 
Alto, Calif.: Pacific Books, 1950. Pp. vi, 194. $4.00. 
(PA 25: 1105) 


Ат J Psychol 64:322 Ар "51. Madison Bent- 
ley. The author of a clever and widely employed 
instrument of examination explains "why a test 
devised in 1914 should be written about at length 
in 1950." Although Porteus allows the main 
sanction to rest upon its recent usefulness in 
modifications of "intelligence" through lobot- 
omy and other surgical insults to the brain, 
wrought for diagnosis, relief and recovery, he 
might have laid weight also upon the later ad- 
vances in the conception and determination of 
intelligence and in the author's long-continued 
researches at Vineland and in Honolulu. 

J Consult Psychol 14:417-8 О '50. Laurance 
F. Shaffer. As the author says in his Preface, 
“1914 seems a very long time ago" when the 
Porteus Maze Tests were first devised. Since 
that time, however, they have been the subject 
of continued research, much of it in the past 
decade. Although there were books about the 
maze test in 1919, 1924, and 1933, the author 
has no need to apologize for still another. Por- 
teus reports the origin and development of the 
test, and the.results of studies in many areas: 
feeblemindedness, industrial applications, racial 
differences, delinquency, brain surgery, and or- 
ganic brain defects. The data are stated clearly 
and fairly, but the comments often depart from 
cold science toward a personal ànd autobio- 
graphical flavor with more than a trace of 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 460 


contentiousness. A complete manual for the 
administration and scoring of the test, and for 
quantitative and qualitative interpretation, is in- 
cluded. Regret must be expressed that, in de- 
termining the age-score ratio or test quotient, 
Porteus continues to tinker with the chronologi- 
cal age in order to adjust to variations in test 
difficulty at different age levels. It would haye 
been far more rational to provide a conversion 
table to turn test ages into true performance 
ages. Small shortcomings, however, do not con- 
ceal the value of the maze test. It is one of the 
great original contributions to psychometrics of 
the first half of the century. The present volume 
replaces all others as a sufficient manual for the 
test, which should continue in wide use. 

J Ed Psychol 41:502-3 D ^50. Lee J. Cron- 
bach, * The book is more than a manual for the 
test, although it includes instructions for testing, 
scoring guides, and similar clinical helps. Por- 
teus discusses use of the test in clinical diagno- 
sis, and the implications of changes in per- 
formance following lobotomy. He presents a 
qualitative score which has been found to differ- 
entiate delinquents from control groups. Both of 
these will have interest for some practicing psy- 
chologists, although it appears that the test is 
best treated as an opportunity for observing the 
subject rather than by any objective scoring of 
the product. Two chapters on differences ner 
racial groups collect much material pre 4 
published in scattered sources, and express t 
author's view that such differences on the Maze 
reflect constitutional factors. The book is most 


of all a personal history, recounting the long | 


period when Porteus' contributions were, in s 
opinion, over-shadowed by the Башын 
This retelling of our tribal conflicts is not w d 
out interest, but Porteus' anecdotes, au MT 
from early sources, and retracing of the pat д 
which we have modified our interpretation a 
the Binet test seem lacking in present-day si 
vance. His advocacy of the Maze Test as а Caa 
cal tool is well-warranted, but he seems mn 
tinguish inadequately between what ui The 
measures and “planning” as a general tra! A 
strongly-felt opinions Porteus drops are d ШЕ 
intriguing and stimulating, amusing, eta ў 
rageous. The book is a lively memento 0 
chological pioneer. ef. 
Psychol B 48:529-30 N ^51. D. A. aero 
The thesis of this book is that the Aedes of 
Test is an indispensable member of a Be Wr 
tests for the evaluation of intelligence. * + 0 
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considers several definitions of intelligence and 
finds in most of them, although he has to strain 
a little now and then to do so, the common ele- 
ment of planning. He comes, then, to his own 
definition of intelligence as “Capacity for mak- 
ing planned responses to an increasing range of 
relevant stimuli.” While such tests as the Binet 
or the Wechsler are important for measuring, 
in part, this planning capacity, they are not ade- 
quate for the entire task—missing the mark es- 
pecially with respect to “certain capacities or 
traits of a practical nature essential to social 
efficiency.” Citations in this connection are made 
of several accounts of instances in which per- 
sons of low IQ’s have not exhibited feeble- 
mindedness in the sense of being unable to get 
along acceptably in their social and economic en- 
vironments. The Maze is most helpful in the 
identification of such individuals. The Maze, 
too, has exceptional value in the estimation of 
differences of intelligence among racial groups, 
which must always be evaluated in light of the 
problems and the environment of the racial group 
studied. The Qualitative Score, which has been 
used since 1942, has been shown to separate 
from the normal those with tendencies towards 
delinquency. The reviewer finds it very difficult 
to evaluate, on the basis of the information given 
in the manual, a performance according to this 
qualitative score. Wavy lines are supposed to 
be significant, but there is no good description 
of “wavy.” Examples are given, but these do 
not show what could be accepted in contrast 
with what could not. Evidence for the above 
claims has been presented before. The new, and 
for Porteus the final, proof of the value of the 
Maze comes through brain surgery. The Maze 
appears to be sensitive to changes in the capacity 
for planning as commonly observed in lobot- 
omized patients but which are not revealed by 
such tests as the Binet or the Wechsler. The re- 
sults of several studies are given. The Maze 
“has the best claims to being considered a 
‘frontal brain’ test.” Although Porteus is sharply 
critical of the limitations of widely-used tests of 
intelligence and of the statements of test users, 
he states clearly that these tests must always 
be used along with the Maze. One detects a 
rather plaintive note permeating the book. This 
is understandable in one who for 35 years has 
been working assiduously with a test of his own 
making, has seen it used in many studies in many 
parts of the world, but who still has not seen 
it accepted as having the primary value he be- 


lieves it to possess. Porteus is somewhat repe- 
titious and there are a few instances of careless 
proofreading. The book is, however, interesting 
and challenging. Allowing a little for the au- 
thor’s bias, those engaged in clinical practice 
should read it. 
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Revised Stanford-Binet Scale. Ages 2 and over; 
1037; а revision of Stanford-Binet Scale (16); indi- 
vidual; Forms L, M; $16 (oss. 7d.) per complete set 
of testing materials for any one form; $2.2 (85. 2d.) 
per set of printed card material for any one form; $2.80 
(10s. 10d.) per 25 record booklets for any one form; $1 
(ss. od.) per 25 abbreviated record forms for any one 
form; $3.75 (12s. 6d.) per manual, Measuring Intelli- 
gence; $2 per abbreviated directions for administering ; 
pone; (30-90) minutes; Lewis M. Terman and 

aud A. Merrill; Houghton Mifflin Co. ( English dis- 
tributor: George G. Harrap and Co., Ltd.) * 
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Bovp К. McCaunrzss, Director, Iowa Child 
Welfare Research Station, State University of 
Iowa, Iowa City, Iowa. 

It may safely be predicted that the average 
clinical psychologist of the reviewer's genera- 
tion has given hundreds of Stanford-Binets, 
both Forms L and M. He could probably recite 
the test while asleep. Since the literature on the 
tests is tremendous, this review is quite frankly 
impressionistic and summative. One or two 
trends (particularly in clinical practice) which 
have emerged in the past few years are pointed 
out. Ш 

Though universally used, the Stanford-Binet 
is annoying both to the tester and, as standardly 
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administered, to the subject, The academic and 
verbal bias of the items, the uneven distribution 
of "pure" verbal, reality testing, and perform. 
ance items from one year level to another (with 
the consequent tendency to double basal ages 
and ceilings) are disadvantages. The time te. 
quired to administer the test is unpredictable, 
The standard procedure of leaving the subject 
with a string of at least six successive failures is 
distressing to all concerned. The academic na- 
ture of the materials and questions is embar- 
rassing when the tester is confronted with a 
"dead end kid" or dull adult. Because of the 
nature of some of the items it is almost as dis- 
concerting to test the only child, the orphan, or 
the extremely superior preschool child. The test 
is extremely weak for adults and much of its 
equipment out of date for small children. Its 
fairness to the urban Eastern, the rural, the aca- 
demically retarded, or {һе minority group child 
is questionable. The Stanford-Binet is so verbally 
weighted that a performance scale must be ad- 
ministered in addition for valid assessmentaCet- 
tain responses, as to the proverbs, are very hard 
to peg as far as correctness is concerned. The 
MA scale is cumbersome. The organization “ 
the text does not give comparable behavior for 
comparison of one IQ or CA level with another, : 
as does a point scale. | 

Some of these disadvantages, particularly 
leaving a child with a flat sense of failure, may 
be eliminated by serial testing (tests of oe 
at different levels given together) which A 
not seem seriously to affect the pretii Я 
ciency of the test. This trend seems to 
healthy one. , 

Оле advantage of the Stanford-Binet po 
for example, the Wechsler Intelligence id 
for Children is that “we know what we pem 
ing." This advantage is not inherent (а И 
its relatively good standardization popu ari 
and the tremendous amount of effort, ker 
ence, and intelligence which went into E es 
certainly are major factors) but lies in Di 
mendous amount of research which has a 
into the scale and its prediction coem o 
These coefficients (mainly for academic "a i 
ment) are at the present time, however, Mans 
ter for groups than are the group jara rebel 
children (e.g., the reading retardate ort up e 
against school) who are penalized by a da Stan- 
are also penalized by the verbal, academi 
ford-Binet. 


í wide 
However, the Stanford-Binet presents 4 
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range of situations in which a child may be 
placed. Much is learned of his personality, stress 
resistance, power to concentrate, etc., that no 
group test can give. Аз an instrument, it lends 
itself well to “projective” use, a real advantage 
for the clinician. 

In general, the Stanford-Binet plus a per- 
formance test, in the hands of a good clinician, 
is a combination which is remarkably predictive 
and informative. However, if the Wechsler In- 
telligence Scale for Children, which is more in- 
teresting and up to date and is a point scale with 
all the advantages thereof, lives up to some of 
the tentative indications of its academic predic- 
tion efficiency, the Revised Stanford-Binet Scale 
is likely to decline sharply in popularity. The 
reviewer, however, will continue to use the test 
routinely, if for no other reason than the great 
body of research which buttresses it. Forms L 
and M present real advantages (although M, 
with its lack of the vocabulary, so rich in pro- 
jective material, seems to the reviewer weaker) 
in routine, repeated service, and research test- 
ing. For the clinician, it may be advisable to add 
the vocabulary to the Form M. 


For reviews by Francis N. Maxfield, J. W. 
M. Rothney, and F. L. Wells, see 38:1062; for 
excerpts from reviews, see 40:1420; for related 
reviews, see 3:293-4, 40:B1093, and 38:B 497. 


[359] 
*Stencil Design Test. Ages 4 or 6 to superior adults ; 
1944-47; individual; 2 forms, '44—listed below; man- 
uals (47) ; $3.50 per set of testing materials and 25 
record sheets for any one form; 9of per 25 record 
sheets for any one form; 35€ per manual for any one 
form; postpaid; (20-25) minutes; Grace Arthur; Psy- 
chological Corporation. * 
а) STENCIL DESIGN TEST I. Ages 4 to superior adults; a 
subtest of Arthur Point Scale of Performance Tests, 
Revised Form 11 (see 335). / 
b) STENCIL DESIGN TEST П. Ages 6 to superior adults ; 
tentative norms. 
REFERENCES 

т. ARTHUR, GRACE. “А Non-Verbal Test of Logical Think- 
ing." J Consult Psychol 8:33-4 Ja-F '44. * (PA 18:2286) 

2. BOULGER, CATHERINE, AND ARTHUR, GRACE. "An Un: 
Tgbjebsd Design Test." J Consult Psychol 8:31-2 Ja-F '44- 

3. ker cdd Russert Grayvon. “Тһе Leiter Adaptation of 
Arthur's Stencil Design Test." Psychol Service Center J 


1:62-8+ 5 '49. * (PA 24:2920) 2 
S osos" Nest, “А Qualitative Check List for the. Clinical 


of ie Later Mappe NB Sorte Center T 
1169-70 S '49. * (PA deitate) 
BENJAMIN BALINSKY, Assistant Professor of 
Psychology, The City College, New York, New 
York. 

The Stencil Design Test I is part of the Re- 
vised Form II of the Arthur Point Scale of Per- 


formance Tests and may be used by itself as a 
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separate. Stencil Design Test II is not included 
in the Scale but is also administered as a sepa- 
rate, Both tests are individually administered. 
In devising the Revised Form II of the Point 
Scale, the Stencil Design Test was developed as 
an alternate to the Kohs Block Design Test be- 
cause the latter showed practice effect upon repe- 
tition. 

The Stencil Design Tests are challenging and 
provocative of sustained interest for both chil- 
dren and adults. They require the reproduction 
of designs by means of colored stencils. Test I 
consists of 20 designs and 18 colored stencils 
used to reproduce the designs. Test II also con- 
sists of 20 designs but has 24 colored stencils with 
which to reproduce them. Test II is more diffi- 
cult than Test I and the tests are not equivalent 
forms. 

Both Test I and Test II involve form, color, 
and depth (building stencils one over another). 
In Test I the colors serve as clues but in Test 
II the designs are in black and white while the 
stencils are either yellow or blue. Although re- 
versals of color are not considered as errors 
(black and white may be represented by either 
yellow on blue or blue on yellow), the colors 
must necessarily be treated in an abstract man- 
пег. The examinee is not guided by the colors 
as in Test I. This additional factor may account 
in part for the greater difficulty of Test II over 
Test I. 

Neither test can be said to be well standard- 
ized. The norms for Test I are based altogether 
on 968 cases for age groups four to fifteen. For 
Test II the norms are described as tentative. 
They are based on only 421 pupils all of whom 
had experience working on Test I before they 
did Test II. 

Scores for both tests are in terms of chrono- 
logical age in years. For younger children no 
provision is made for months although it is well 
known that growth in the younger ages is too 
rapid to be precisely measured by so wide a unit 
аз а year. In the manual for Test I, IQ's are cal- 
culated on the basis of the score made on Test I. 
In the light of the incomplete standardization, 
this should not be done. Also a test employing a 
single type of measure cannot be considered. to 
ascertain the IQ in the way that a test utilizing 
several kinds of measures does. If the IQ is to 
be used at all, it must be considered only as a 
means of making comparisons between results 
on the Stencil Design Test and other tests and 
this quite roughly. 
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The scoring method can probably be improved. 
The score is the number of designs completed 


within the time limit of four minutes each. The. 


test author indicates inclusion of time credits if 
they should prove to increase the discriminate 
value of the test. This reviewer would also like 
to see a recheck of the order of presentation of 
the cards. They were placed in order of diffi- 
culty but it seems that some are out of order. 

The reliabilities of the Stencil Design Tests 
have apparently not been calculated. Revised 
Form II of the Point Scale of which Stencil 
Design Test I is part has been compared with 
Form I of the Point Scale to give a rather crude 
measure of reliability. 

As for the validity of the tests, evidence is 
available that the Stencil Design Test I dis- 
tinguishes the highest and lowest quarters of 
certain classes in mechanical drafting and archi- 
tectural drafting. Other single cases are cited to 
show the value of the test. Test II is assumed to 
have similar validity. 

It would be important to design more valida- 
tion studies and also to determine more clearly 
what the tests measure. Factor analysis would be 
useful in regard to the latter. It could be that the 
tests are measuring different functions at differ- 
ent age groups. 

There is a mechanical defect that can be easily 
remedied. The stencils are too thin, are hard to 
grasp and pick up, and bend easily. 

The time the test takes is supposed to average 
between 20 and 25 minutes. But the time would 
depend upon whom you are testing. If you hap- 
pen to have selected groups, the average may be 
less or more than 25 minutes, However, the 
possibilities for qualitative observations of be- 
havior of the examinee often make any extra 

ime seem worthwhile, The Design tests would 


_ also be useful for testing the deaf and those whose 


t 


command of English is limited by foreign birth 
or incomplete education. 

The Stencil Design Tests have, on the whole, 
excellent potential. The material is interesting, 
the tests apparently have validity in measuring 
abilities that go into drafting, and they provide 
media for observing behavior. With further 
and more complete standardization, the tests 
should prove very worthwhile. 

S o» 


For a review by James M. Anderson, see 
3:295. # 
[360] 


` "Visual Retention Test for Clinical Use. Ages 8 
S 


» 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


TN 
[46 


$2 per man. 
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b) MULTIPLE CHOICE TYPE. 1950; 1 form (i 

test items from. Forms A and B of Отау Саса a 
data on reliability and validity in manual (or data pre- 
sented elsewhere by the author, See 3); test free to 
qualified users; (5) minutes ; the Author 

of Psychology, State University of Iowa, Iowa City, 
Towa. k A 


and over; 1946-50; individual; 2 editions : 

ual (746) ; postpaid; Arthur I’. Benton, * 

а) DRAWING ТҮРЕ. 1946; Forms A, B; no 

ited other than manual; (4) minutes ; 
rporation. 
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Designed for clinical examination of patients, 
this test was developed as a brief technique to 
supplement auditory-vocal digit span tests in 
studying immediate memory. The manual lists 
several advantages of the test : it is brief (4 min- 
utes); the sensorimotor components (visual- 
motor) involved are different from those of 
auditory-vocal digit span tests; the materialto , 
be retained is of a nonsymbolic nature (abstract 
designs rather than numbers, letters, words, of 
pictures) ; the test is less “interpersonal” than 
auditory-vocal digit span tests; the degree of 
difficulty is such that normal persons rarely do 
poorly ; and equivalent forms are available. Nor- 
mative data are given in the manual for 34 nor- 
mal subjects and 126 patients in five ар) 
groups; and in an article (2) for 245 n 
children. Two forms are available with 7 car Я 
each. These provide alternates for tes 
situations, frequently helpful in following bi 
course of recovery or of deterioration in patien 
suffering from brain pathology. " 

In using any brief test such as this, one o | 
decide whether brevity is worth the narrow? 
of range of response of the subjects. In the Mie 
nal setting of the test, the premium was uh 
its cost in range because of the rapidity REN 
in screening patients in a Naval nop ii 
Further, Benton indicates the need for stu ae 
age and sex differences, neither of which he 
yet been reported in the literature by him 
others. Norms of adult performance wou 3d 
useful from a sample less highly y ae idi 
naval personnel, and it is hoped that P the 
thor’s post-World War II experiences W! 


EE 
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test will provide published information on per- 
formance of other adult samples. The children's 
norms are tentative, and it is from clinical ob- 
servation rather than statistical analysis that 
several diagnostic conclusions are drawn. These 
trends, however, may be significant in diagnosis ; 
they indicate that mentally defective children 
perform on the test in accordance with their 
mental age, and mentally superior children in 
accordance with chronological age. Brain in- 
jured children and "a proportion" of children 
with reading disabilities (intelligence within 
average range, as were the brain injured) make 
significantly defective performances. Such trends 
could be extremely valuable for diagnosis, par- 
ticularly if further data showed them to be sta- 
tistically significant and if they were found in 
other comparable tests. Cross comparisons of 
visual-motor tests constitute a needed area of 
research in the undeveloped field of psychodiag- 
nosis in organic pathology. 
LI 


Јоѕерн NEWMAN, Clinical Psychologist, Vet- 
erans Administration Hospital, Canandaigua, 
New York. 

The Visual Retention Test for Clinical Use 
is a performance test of immediate memory de- 
signed to supplement the auditory-vocal digit 
span test. The test consists of seven cards bear- 
ing geometrical designs roughly graded in diffi- 
culty, ranging from one simple figure to three 
complex figures on one card. The subject is re- 
quired to reproduce the design or designs after 
he has looked at the card for 10 seconds, Each 
drawing is scored as correct or incorrect, the 
maximum score being seven. Two equivalent 
forms exist. 

The test has been applied to children and 
normative data are provided. A multiple choice 
type of the Visual Retention Test has also ap- 
peared. This form is suitable for use with sub- 
jects who have motor deficits of such a nature 
as to preclude or impair significantly the ability 
to draw. 

The test was standardized on 160 cases, most 
of them patients in a naval hospital, all male but 
five. This number is rather small for an age 
spread of 17 to 51 years. The only additional 
data we have on this group concern аре, IQ, and 
diagnosis. There are no data on education, 50С10- 
economic status, or occupation, data which 
would seem to be necessary information con- 
cerning any standardization group. We woul 
also like to know the influence of age upon test 
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performance, if any. A close correlation between 
intelligence level and performance on the Visual 
Retention Test is reported but no coefficient is 
given. For a test of this type, a rather low corre- 
lation with intelligence would seem desirable. 

A reliability coefficient of .71 is reported for 
the drawing type of the Visual Retention Test. 
The method of calculation is not given. The 
multiple choice type has a reliability coefficient 
of .76 by the split half method. These coefficients 
are rather low for tests purporting to differenti- 
ate among individuals. The somewhat higher co- 
efficient reported for the multiple choice type is 
probably a reflection of the fact that this test is 
longer than the drawing type. 

Benton informs us that the test is relatively 
insensitive to emotional influences. It is difficult 
to conceive of any human production, particu- 
larly one such as this, to be free of emotional in- 
fluence, Anastasi and Foley in studies of draw- 
ings by various groups of adults have found that 
reproduction of stimuli reflect disturbances in 
perception, in intellectual ability, and in emo- 
tional balance. 

By limiting his scoring technique to correct 
or incorrect, Benton places restrictions upon the 
sensitivity of his instrument in addition to the 
theoretical point whether such dichotomous 
grouping adequately describes any human ex- 
pression. This fact comes out clearly both with 
the multiple choice type and with the drawing 
form where, in 50 per cent of the patients tested, 
impairment was not ‘demonstrated. To explain 
this failure, Benton finds it necessary to take a 
rather difficult theoretical position—that the test 
“failed” only if one assumes that behavioral im- 
pairment consequent to cerebral injury is al- 
ways of a general nature. He holds that variety 
not unity of behavioral defects prevails, Without 
entering into the relative merits of this position, 
it hardly seems that varied expression alone 
would account for so great a discrepancy. 

The Visual Retention Test would have greater 
sensitivity if the scoring technique were ех- 
panded. This can be seen by comparison to very 
similar instruments described by Sharp * and by 
Kendall and Graham.’ These investigators found . 
that their instruments differentiated among 


$ А ов 
various diagnostic groups TW 

А; А. “Тһе Diagnostic Significance | f a Visual 
Mn brewing Test.” " ‘Abn e HAT Im d 441517727 
Q "49. * - 


2 Renda |, patara S. and Graham, Frances K, “Further 


izati f th emory-for-Designs Test on Children 
rE Consult Psychol 12:349-54 5-0 '48. "(PA 


23:1773) 4 


360 


The Visual Retention Test for Clinical Use 
has some usefulness as part of a battery of tests 
to investigate memory defect. In the last analy- 
sis one can take the position of Benton himself 
that “по single test method is likely to achieve 
very impressive efficiency” in differentiating im- 
paired and unimpaired patients. However, this 
should not prevent us from sharpening our tools, 


М/плллм SCHOFIELD, Associate Professor, De- 
partments of Psychiatry and Psychology, Uni- 
versity of Minnesota, Minneapolis, Minnesota. 

This little exercise was devised to fill a gap 
in the battery of functions generally included in 
the clinical examination of patients. Specifically, 
it was designed to round out the evaluation of 
memory as customarily based on auditory digit 
span. The author sought to design a test which 
would have the brevity and administrative fa- 
cility of the digit span test while involving dis- 
tinctly different functions. Accordingly, his 
“memory for designs” test involves vision on the 
sensory side and drawing in the motor aspect. 
Furthermore, since the digits of the auditory 
test are “symbolic,” it was desired that the ma- 
terial of the visual test be nonsymbolic. This 
latter aim seems to have been fairly well accom- 
plishéd through the use of simple, abstract geo- 
metrical designs (squares, circles, etc.) for the 
retention material. It seems unjustified, how- 
ever, to assume without reservation that digits 
are always “symbolic” and geometric designs 
“nonsymbolic.” In discussing auditory digit 
span, the manual points out that this is not a 
pure measure of memory function but appears 
to tap factors of attention as much as those of 
retention. Similar comments regarding the “psy- 
chology” of the Visual Retention Test would be 
appropriate but do not appear. 

The directions for administration are not suf- 
ficiently explicit and detailed ; consequently, ad- 
ministration probably proceeds in a far less 
standardized fashion than is desirable. Directions 
to the subject are only suggested and not di- 
rectly stated in quotation form. Thus, some ex- 
aminers might instruct subjects to draw “ex- 
actly” what they have seen ; others might follow 
the tenor of the manual and tell the subject to 
“draw what he has seen.” Marked differences 
in performances might stem from such variation 
in instruction, such differences being more a 
reflection of personality than of differences in 
retentive ability. The subject is not instructed 
to study each card for the full 10-second exposure 
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unless he begins to draw the first design before 
IO seconds have elapsed. With perverse exact. 
ness, the manual then gives a direct quotation 
to be given the subject in explaining the neces. 
sity of looking at the card for the full exposure, 
Finally, the manual states: “The patient's per- 
formances may be praised." This Seems a par- 
ticularly ill advised permissiveness regarding 
the potentially exhortatory behavior of an ex. 
aminer giving a test in which attention, concen- 
tration, and effort are so important to successful 
performance. 

Normative data are presented in the form of 
distributions of scores on both forms of the test 


for 155 males and 5 females (!), most of whom - 


were patients in a Naval hospital. Of this num- 
ber, 34 were diagnosed as “normal.” The distri- 
butions are given in terms of five levels of in- 
telligence, "superior" through “moron” as es- 
tablished by the Wechsler-Bellevue IQ's of the 
subjects. Such normative data encourage one 
to think of this exercise as a test of general in- 
telligence; the author further encourages this 
with a misleading statement that : “a close corre- 
lation between intelligence level and performance 
on the visual retention test is evident." The 
median scores of the "average" and "superior 


groups are identical and differ by only a single | | 


point, on a 7-point scale, from the median score 
of the "dull average group"! It is difficult to 
see how a test with such an obviously limited 
range of difficulty can be expected to correlate 
closely with general intellectual level. No relia- 
bility data are reported. The two forms appear 
to be well equated. Validity, aside from те; 
ship to Wechsler-Bellevue IQ’s, is given only 
by informal reference to clinical cases. А 
In view of the author's expressed purpose á 
providing a supplement to the auditory de 
span test and his implication, with two case г d 
ports, that disparities between auditory p 
visual retention have clinical import, it is Ч 
fortunate that he does not report the su 
relationships between these two functions T а 
norm group, since auditory digit span Wo we 
readily available from his Wechsler een 
tions. In a sample of 16 patients with ex s 
ganic pathology and positive evidence О Й 
tellectual impairment, nine (56 per oe d Ji 
scores below the average on the Visual A i 
tion Test. In combination with EET E 
span, these data helped to establish the ге "s й 
generality or specificity and gross magn! 
of any retention defect. 
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The idea underlying the Visual Retention 
Test is good. As it stands, it provides a quick 
and convenient technique for clinical screening 
of gross defect in visual retention. It would be 
greatly improved by better standardization, with 
particular attention to more explicit directions 
{ог administration; increase in the range of 
difficulty; better normative data, especially in 
terms of age and sex variables ; and careful vali- 
dation against good clinical criteria. Even then, 
it should not be used as a brief examination for 
general intelligence. 

Persons interested in this type of instrument 
should review the work of Sharp + and of Graham 
and Kendall.? 


For an excerpt from a review, see 3:297. 
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*Wechsler-Bellevue Intelligence Scale. Ages 10- 
70; 1939-47 ; 3 scores: verbal, performance, total ; in- 
dividual; 2 forms; $1.50 per 25 record blanks for any 
one form; postpaid; (40-60) minutes; David Wech- 
sler; Psychological Corporation. * NEA 

a) FORM I. 1939-47 ; $16.50 per set of test materials and 
25 record blanks (/47) ; $3.05 рег manual, The Meas- 
urement of Adult Intelligence, Third Edition (44). 

b) ковм п. 1946-47; $18 per set of testing materials, 
25 record blanks ('47), and manual ('46); $2.25 per 
manual. 
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gan. 

Since its introduction in 1939, the Wechsler- 
Bellevue Intelligence Scale has been awarded a 
widespread clinical usage rivaled by few other 
psychological instruments. Primarily, the test 
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was welcomed as providing a much needed adult 
intelligence scale; one that included a more ex- 
tensive sampling of adult age levels than was 
accounted for in the Revised Stanford-Binet 
Scale, and one that replaced the method of main- 
taining a fixed chronological age in the compu- 
tation of adult IQ's with a method designed to 
accommodate the expected decline in intelligence 
with age. у 
Additionally responsible for its rapid recog- 
nition was the appearance of a scale for measur- 
ing intelligence seemingly possessing theoreti- 
cal substance as well as practical applicability. 
According to Wechsler, the test presents the 
"opportunity of obtaining a measure of the in- 
dividual’s “global” intelligence (i.e., his overall 
level of efficiency in functioning as a whole or- 
ganism) along with the configuration of those 
elements, “not entirely independent but qualita- 
tively differentiable” from one another, that 
compose this: global entity. Wechsler contends 
that the ideal test of intelligence involves the 
measurement of some universal, common factor 
akin to Spearman's “g,” any number of group 
factors something in the nature of Alexander's 
“functional unities," and such concomitant non- 
intellective factors as motivation, persistence, etc. 
that enter into all human measurement. He sug- 
gests, a priori, that his test is so equipped. The 
eleven diversified subtests divided into a Verbal 
anda Performance scale and combined into a Full 
scale offer, on the face of it, precisely the kind 
of practical situation adaptable to this rather 
enigmatic theoretical context. But aside from 
such controversy as may exist over the merits 
and shortcomings of Wechsler’s theoretical posi- 
tion, numerous factor analyses have demon- 
strated that theory and practice certainly have 
not been reconciled in the Wechsler-Bellevue. 
Clinical psychologists were particularly en- 
thusiastic over the diagnostic potentialities in- 
herent in its differential yet ostensibly homoge- 
neous composition. On the basis of a small num- 
ber of cases, Wechsler compiled several tables 
illustrating the typical test patterns of a few 
gross diagnostic categories. These were based 
upon an analysis of subtest scores by the method 
of counting or integrating signs (i.e., the degree 
of deviation of individual subtest scores from the 
mean subtest score). The method was to yield 
a system of pathognomonics based upon intra- 
test patterns, a coneept not far removed from 
the notion of scatter analysis on the Stanford- 
Binet. Close examination of the author’s system 


reveals inconsistencies in the technique per se. 
For subtests within the range of normal intelli- 
gence the number of weighted score points con- 
stituting a “significant” deviation from the mean 
subtest score is less than the size of the stand- 
ard deviation of the weighted score conversion 
scale. Since what might be termed scatter ordi- 
narily varies with the magnitude of the 10, what 
is and what is not “significant” becomes rather 
arbitrary and tends to confound the very mean- 
ing of the pathognomonics so derived. While in 
all fairness to Wechsler it should be pointed out 
that he has cautioned against the type of diag- 
nosis accomplished by comparing individual test 
results against the tables of signs, the inclusion 
of these tables in his book could hardly be said 
to discourage such practices. 

Judging from the research literature, the di- 
agnostic significance of subtest score configura- 
tions has proved to be a favorite bone of con- 
tention. Beginning with the claims set forth by 
Wechsler, psychological journals have been del- 
uged with an interminable flow of studies sup- 
porting, expanding, and rej ecting the diagnostic 
implications of pattern analysis. Most of the 
“typical patterns,” “indices,” and "signs" have 
not proved reliable with either groups Ог indi- 
viduals. In many instances, test patterns have 
not been found sufficiently diagnostic to permit 
their use in differentiating between schizo- 
phrenic and normal subjects (Garfield, 161 ) 
and one investigator termed the clinical applica- 
tion of one of the more popular indices "danger- 
ous" (Webb, 21). As for the problem of making 
the even finer discriminations necessary for the 
diagnosis of nosological subtypes, Brecher (82) 
found that Wechsler-Bellevue signs were able 
to make differential diagnosis between five types 
of schizophrenia only 13 per cent better than 
chance. In general, the whole question of dis- 
order-resistant and disorder-susceptible intellec- 
tual functions has not yet been satisfactorily 
resolved and the method of deviation from the 
mean of the subtest scores is a statistical con- 
cept whose clinical application lacks logical геш- 
forcement. Specifically in the case of 
Wechsler-Bellevue, it is not by any means cer- 
tain that the reliability of the subtests could sup” 
port a system of pathognomonics even if the 
concept were demonstrably valid. Tt would not 
be unfair to say that whatever enthusiasm was 
initially generated by the pathognomonic poss! 
bilities of this test has been cénsiderably dimme 
in recent years. 
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Wechsler offers estimates of the reliability of 
the scales on the basis of two studies. The first 
consists of intersubtest correlations in a method 
analogous to the split half technique. Eight sub- 
tests were selected and divided into two groups 
of four. The reliability coefficient so obtained is 
given as .90, after "correction for attenuation." 
The subtests in each group seem to have been 
selected in favor of obtaining optimal correla- 
tion with each other. Object Assembly, which 
correlates lowest with the other subtests of the 
scales, was omitted from the study. If subtests 
which had shown low correlation with each other 
were used, the derived coefficient would have 
been lower. As a matter of fact, there is some 
question as to whether this method really con- 
stitutes a valid reliability study in a test de- 
signed to measure specific intellectual functions 
as well as general intelligence. If the test had 
been divided in half by items rather than by sub- 
tests, the results would be more relevant. It fol- 
lows that the fairly high correlation between 
Verbal and Performance IQ's, also offered as 
evidence of reliability, must be regarded as 
equally unsuitable. 

A second study employing test-retest at vary- 
ing intervals with 52 subjects yielded a reliability 
coefficient of .94 for the Full scale. No reliability 
coefficients for the subtests are given, but 
Derner, Aborn, and Canter (270) report average 
. test-retest coefficients ranging from .62 to .88 
for 158 normal subjects retested at well con- 
trolled intervals. 

In discussing the validity of his instrument, 
Wechsler acknowledges all the usual difficulties 
inherent in the general problem of validating an 
intelligence test. He comments that the validity 
of an intelligence scale cannot be established 
directly and that indirect methods by way of cor- 
relations with such usual criteria as other meas- 
ures of intelligence, teacher ratings, and psy- 
chiatric recommendations are, in turn, notably 
unsatisfactory. Having thus deplored these cri- 
teria, he proceeds to argue that their pertinence 
in the appraisal of any new test cannot be denied 
(p. 129). He furnishes evidence of agreement 
between Wechsler-Bellevue IQ's and the scores 
of several other tests, teacher ratings, and case 
stüdy data, Aside from such validation, the au- 
thor concludes that the instrument has been put 
to the pragmatic test and proved itself to be a 
good" one. Strange as it may seem, in view of 
the ambiguities involved in arguing from both 
sides of the fence, the latter claim appears to be 
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the most sensible and the one most substantiated 
in clinical experience. 

The directions for administering many of the 
subtests bear clarification; the procedures for 
administration, specificity. For example, if a 
subject exceeds the time limit in completing Ob- 
ject Assembly does he receive credit for what he 
has done within the time limit? In Block De- 
sign, does “pausing to make clear to the subject 
how two half designs can be put together to make 
a solid patch of color” (p. 184) mean that the 
examiner is to explain it verbally? In Compre- 
hension and Similarities the test manual calls for 
a reduction in score on those items where a sub- 
ject “contaminates” his response. In practice, 
decisions on “contamination” are frequently 
difficult to make. Sometimes a subject will give 
inferior responses in an effort to elaborate his 
first and best response. Is this to lower his IQ? 
Many clinicians have difficulty in accepting the 
scoring rationale of these two subtests altogether. 
The proposition that a subject will always give 
the "best" response of which he is capable spon- 
taneously is not a very sound one. In view of the 
fact that Wechsler implies the examiner may 
not always seek to improve a subject's response 
by querying (pp. 171, 181; footnotes pp. 172, 
193) these subtest scores may often reflect what 
the subject has done, not what he can do. The 
ability to think “abstractly” or in the most gen- 
eral terms may be reasonable evidence of high 
intelligence, but it is too subject to educational 
and environmental influences to be measured 
in a perfunctory manner. 

Recent years have seen increasing criticism 
leveled at the Wechsler-Bellevue. The norma- 
tive population, sampling techniques, theoretical 
basis, diagnostic utility, and the structure of the 
test itself have been carefully picked apart and 
found wanting. The effect has been destructive, 
but the value of this test has been considerably 
enhanced by a recognition of what it is not and 
what it cannot do. Handled with judgment and 
a cognizance of its limitations, this test, as an 
instrument for measuring adult intelligence, is 
so superior to anything else available for that 
purpose that it remains a highly important psy- 
chological tool. 


WiLLIAM D. ALTUS, Associate Professor of 
Psychology and Chairman of the Department, 
Santa Barbara College, U niversity of California, 
Santa Barbara, California. 

Before making any comments on this widely 
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used intelligence test, the writer would like to 
present his credentials. He has administered the 
Wechsler, taught it, and has done research with 
it for a period of 12 years. Time does not, how- 
ever, confer infallibility, despite the subjective 
feeling tone of the one who makes pronounce- 
ments. 

The Wechsler tests, Forms I and II, lack an 
adequate ceiling for the quite bright. Probably 
the top five per cent, surely the top two or three 
per cent, is not adequately tested, Since clinical 
practice deals much less frequently with the very 
able, probably this point is not so adverse as it 
might otherwise sound. 

The validation studies on test patterning, 
which Wechsler stresses in his manual, have been 
like the fable of the mountain in labor: Only a 
mouse-size bit of verified. data remains. In the 
writer's opinion, the efficient little digit symbol 
subtest has shown an unspecified amount of 
overlap with what J. McV. Hunt calls “mental 
deficit" though its general responsiveness to 
most types of deviant behavior leaves it differ- 
entially undiagnostic. It also seems likely that 
the juvenile offender and the adult criminal do 
relatively better on performance subtests than 
on verbal ones, in about two cases out of three, 
say. It may well be that lack of attendance at 
school, or application therein, affects the verbal 
scores of this particular group—if it is incon- 
trovertibly shown that this pattern is a genuine 
one, as the writer feels it to be. Factor analytic 
studies of the subtest intercorrelations of clini- 
cal groups at various age levels, matched with 
normal controls might lead the way out of the 
statistical morass into which the pattern valida- 
tion studies have fallen. 

In the writer's opinion, the object assembly 
subtest could very profitably be left out of the 
test altogether. Its very shortness precluded 
, either marked validity or reliability. It is some- 
what cumbersome to administer and, for its 
length, somewhat time consuming. If this test 
were dropped from the scale, the arithmetic sub- 
test could be appreciably lengthened, thus in- 
creasing the reliability of this excellent short 
test. This would leave a total of six verbal sub- 
tests (vocabulary to be included as a regular 
part of the scale) and only four performance 
subtests. Our civilization awards its prizes much 
more frequently to those who are able in dealing 
with verbal concepts; in that sense, a 6 to 4 
weighting of verbal and performance tests should 
increase the validity of the scale, which pres- 
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ently confers equal weights to the two types of 
test materials. 

The writer doubts that the skewness Wechsler 
finds in his distribution of IQ’s is any closer to 
"reality" (if general ability ever proves to be 
completely commensurate) than is the normal 
distribution curve hypothesized by Terman and 
others. It may well be, however, that there is 
greater asymmetry at the lower than at the upper 
part of the IQ curve, i.e., more dull subjects in 
certain IQ ranges than will be found at a cor- 
responding place on the upper end of the curve. 
Birth injuries, disease processes, and post-natal 
trauma could readily account for the differen- 
tial. 

To summarize: (a) the Wechsler needs more 
“top” ; (b) a greater emphasis on verbal subtests 
seems desirable; (c) for whatever reason, the 
“pattern” studies of the Wechsler in terms of 
clinical diagnosis, while not completely futile, 
have proved unrewarding. 


For a review by Robert I. Watson, see 3:298; 
for a review by F. L. Wells and excerpts from 
reviews, see 40:1429; for related reviews, see 
362, 3:299-301 and 40:Br121—-2. 


[362] 
*[Re Wechsler-Bellevue Intelligence Scale.] 
Plumb IQ Slide Rule for use with the Wechsler- 
Bellevue Intelligence Scale. Ages 15 and over; 
1948; $1 per copy, postpaid; С. R. Plumb; Psychologi- 
cal Corporation. * 


J Consult Psychol 13:148 Ap '49. Laurance 
F. Shaffer. * If large numbers of Wechsler's are 
handled, the use of the device is perhaps more 
rapid than are the conventional tables. 


[363] 

*Wechsler Intelligence Scale for Children. Age 
5-15; 1949; a downward extension of Wechsler- Ва. 
vue Intelligence Scale; also called WISC; 12 ila E 
[information, comprehension, arithmetic, sim! SERO ба 
vocabulary, digit span (optional), picture comp n 
picture arrangement, block design, object азе |. 
mazes (optional), coding] yielding 3 scores: bee 
performance, total; individual; 1 form; $22 per d 
plete set of testing materials, manual, and 25 Г 
forms; $2 per 25 record forms; $1.20 per 25 
Maze Tests, e Е subtest bed 
in place of Coding; $3.50 per carryim ea 
ordered with complete set); postpaid; (40-60) ту 
utes; David Wechsler; Psychological Corporation: 
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James M. Anperson, Assistant Professor of 
Psychology, Occidental College, Los Angeles, 
California. 

This test was standardized on 2,200 white 
American children, 100 boys and 100 girls at 
each of the тт age levels from 5 to 15. The chil- 
dren were selected on the basis of age (within 
1% months of their midyear), rural-urban resi- 
dence, father’s occupation, geographic area, and 
the presence or absence of suspected feeble- 
mindednéss. The sample used was fairly repre- 
sentative of the 1940 census with respect to the 
above criteria. The selection of cases shows that 
much greater attention was paid to the necessity 
of securing a representative sample than іп 
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Wechsler's earlier work,? with which the author 
assumes that the user will be familiar. 

The construction of an alternate form of a 
test should be an integral part of the original 
work of constructing and standardizing a test 
and not an afterthought if the two instruments 
are to be truly parallel. Since there is no mention 
of a proposed alternate form for WISC, we can 
suppose that such a form has not been planned. 
The publication of this scale seems unwise be- 
fore an alternate form is available. 

In reviewing the Wechsler-Bellevue Intelli- 
gence Scale in The Third M ental Measurements 
Yearbook, Watson wrote, “The directions for 
Forms I and II are deceptively alike, but close 
scrutiny will show a considerable number of 
minor changes in wording. Most of these changes 
exhibit at least a greater surface clarity, but for 
the sake of these changes one must pay the price 
of learning two sets of directions differing so 
slightly and yet so pointedly that a certain 
amount of confusion and irritation inevitably 
results.’ These same comments apply to the 
WISC. Only now, for those who use all three 
instruments, the confusion is doubly com- 
pounded. A little thought to this problem would 
have added considerably to the insurance that 
the examiner would “follow the directions in the 
manual exactly.” Such confusion detracts con- 
siderably from the ease of administration on the 
part of examiners experienced with the adult 
forms. 

Not only do the directions of the WISC differ 
from those for the Wechsler-Bellevue, Form II, 
but they also differ within the WISC itself “for 
Subjects under 8, and older suspected mental ~ 
defectives" and "for Subjects 8 and older, not 
suspected mental defectives." These double di- 
rections hold for Similarities, Picture Arrange- 
ment, Block Design, Coding, and Mazes. In ad- 
dition, the test material itself is entirely different 
for these two groups on Coding. For the older 
subjects the Digit Symbol test of the Wechsler- 
Bellevue, Form II is used while for the younger 
subjects a different test is employed. 

Tt is well known that the alteration of instruc- 
tions or procedure changes the nature of the task 
required of the subject by some unknown 
amount. This is the primary reason for insistence 
upon strict adherence to standardized direc- 
tions. The fact that the changes in directions in 
the WISC are intentional does not vitiate the 


1 Wechsler, David. The Measurement of Adult Intelligence, 
Third Edition, Baltimore, Md.: Williams & Wilkins Co., 1944. 
Pp. vii, 258. * (PA 19:815) 
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objection to such changes. There is sufficient 
uncontrollable error inherent in every instru- 
ment without confounding it in some unknown 
degree with error that could be controlled. 

The more mathematically facile reader will 
probably enjoy trying to reconcile Table IX in 
the WISC with the corresponding table of clini- 
cal ratings for IQ equivalents on page 8 of the 
manual for Wechsler-Bellevue, Form II. In his 
earlier work (58), Wechsler comments upon 
Terman’s IQ classifications of intelligence, in 
part, “The first thing that strikes us is that the 
limiting second cipher of each new class is a 
zero, Thus the category borderline begins at 
70 IQ, the category dull-normal at 80 IQ, the 
category average at 90 IO, and so on. It is diffi- 
cult to conceive of any statistical procedure 
which would give such neat correspondence. * 
the actual chance of each of them beginning with 
zero is only..one in 10,000,000." "However, 
when we examine Table IX in the WISC man- 
ual, we find that borderline begins at 70 IO, dull 
normal at 80 IQ, average at 9o IQ, and so on. 

On page 4 of the manual we find, "the mean 
(100) and standard deviation (15) which have 
been chosen will give IO's which, on the whole, 
are fairly close numerically to IQ’s of other well- 
standardized tests like the Stanford-Binet. This 
will make comparison with IQ’s of other scales, 
at least at the numerical level, a not too unreason- 
able procedure.” Despite a reminder to make 
individual interpretations in terms of each test, 
this statement may encourage such comparisons. 

Also in the manual we find that, “The WISC 
consists of twelve tests.” However, the norms 
are based upon results from то tests. “It is per- 
missible to give all tests ; indeed in clinical situa- 
tions, their inclusion is strongly advised because 
of the qualitative and diagnostic data they add.” 
This practice should be discouraged rather than 
encouraged. Admittedly, more tests give more 
data but what the data mean is anybody’s guess. 
Wechsler attempts no interpretation and gives 
no hint as to the “diagnostic” value of this data. 
If the tests were omitted in establishing the IQ 
tables because of low correlations with the other 
tests, the practice of assigning IQ’s based upon 
the results of 12 tests from norms based upon 
то tests is highly suspect. Including inferior 
material and prorating to find an IQ would, in 
part, result in increased variability, decreased 
reliability, and an invalidation of the published 
correlations based upon 5 verbal and 5 perform- 
ance tests. It would be more judicious to employ 


the group of tests upon which the norms are 
based and to prorate when fewer tests are used. 

Coefficients of reliability and standard errors 
of measurement are reported for ages 74%, 1014, 
13%. The same measures for the other ages, 
particularly at the lower levels, would be of in- 
terest and importance to the serious user of this 
test. It is hoped that these will appear in the 
publication which Wechsler promises will appear 
at a later date. The reliability for Coding (called 
Digit Symbol previously) is based upon the cor- 
relation of results on two different tests, the 
nonidentity of which is indicated by the reported 
r of .60 (incorrectly given as a coefficient of re- 
liability). These comments hold also for the 
Digit Span test where the results on digits for- 
ward were correlated with the results on digits 
backward. It should also be pointed out that 
corrected odd-even reliability (the method used 
here for the most part) in general tends to be 
higher than reliability based upon the method of 
parallel forms. 

Wechsler, in criticizing the Stanford-Binet in 
his book (58), suggests that tests intended for 
children are unsuited for use with adults. The 
rationale by which his material, originally in- 
tended for adults, is now suitable for children 
is conspicuous by its absence. Although adults 
do not object to the serial presentation of such 
items as the repetition of digits forward and 
backward, children seem to tire of these items 
rapidly. The serial presentation of items 1s 11 
marked contrast to the variation of tasks on the 
Stanford-Binet which seems to be an important 
factor in eliciting the best efforts of the child. | 

There is considerable doubt in this reviewer $ 
mind as to the applicability of this scale for the 
age range for which it is intended. Aside from 
objections on rational grounds, inspection of the 
tables of Scaled Score Equivalents discloses that 
zero raw scores are the equivalent of as much as 
5 units of scaled score. Since the standard devia- 
tion of each test is set at 3 units in the scaling 
procedure, this is an amazing value to assign to 
no performance on a test and apparently re 
sults from the particular statistical approach em- 
ployed. Inspection of the table for age 50 
through 5-3 shows that a zero real performance 
on all tests yields scaled scores which when prO" 
rated and converted give a Verbal IQ of 57 r 
Performance IQ of 55, and a Full Scale 100 
52. (Note that the Full Scale IQ is lower than 
either the Verbal or Performance IQ's.) 


il 
A zero scaled score cannot be earned unt 
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the age 6-8 through 6-11 and here only on one 
test. Not until we reach the norms for 12-0 
through 12-3 do we find that credit is no longer 
given for zero performance. Nothing could more 
clearly indicate the inapplicability of this ma- 
terial for the younger age groups. One must go 
up to year 7, for instance, before a zero score on 
vocabulary yields a zero scaled score. 

While the computational procedure for arriv- 
ing at an IQ for the Stanford-Binet makes it 
sufficiently easy to arrive quickly at an IQ below 
30 (the lowest tabled for the Stanford-Binet), 
many psychometrists may despair of calculating 
WISC IQ's below those tabled. The limits of 
the tabled IQ's are: Verbal 45-155, Perform- 
ance 44-156, Full Scale 46-154. The possibility 
of having scaled scores reach zero before the IQ's 
do was undoubtedly one factor in not extending 
the table further. 

Since the standardization was done on chil- 
dren within 114 months of their midyear (ex- 
cepting the feeble-minded cases where a more 
generous criterion applied), it is difficult to see 
how Wechsler can arrive at norms for every 

-month period and be consistent with his earlier 
objection to interpolation and extrapolation of 
scores and his insistence that a person's results 
be compared with norms based upon results of 
testing persons of his own age. Another ques- 
tionable procedure is that of disregarding the 
days in computing a child's CA, e.g., а child 
whose age is 5 years, 3 months, and 28 days is 
given a CA of 5-3. This procedure was not em- 
ployed in standardizing the test—reason enough 
for questioning its adoption as part of the stand- 
ard procedure in using the test. One wonders 
about the efficiency of a scale so coarse that al- 
most a month's difference in age is not reflected 
in the IQ, particularly at the lower ages. 

The manual is of a convenient size and easy 
to handle. However, the use of bold-face type in 
the directions instead of the customary italics 
makes the reading of directions to the subject 
difficult. The scoring sheet gives adequate space 
for recording answers. Except for one arith- 
metic card, as far as this reviewer can tell, all 
of the material of Wechsler-Bellevue, Form II 
(exclusive of that in the manual) is used in the 
WISC. Had the material been published on sepa- 
rate cards as originally, instead of in spiral bind- 
ings, issuance of a master kit for the Wechsler- 
Bellevue, Form II and the WISC would not 
only be feasible but would effect a considerable 
savings for the purchaser. Separate cards would 


make the necessary shuffling an easy matter. 
(Some easier material has been added and could 
be issued separately to supplement the kits of 
those who already own the Wechsler-Bellevue, 
Form II.) 

In summary, despite a conscientious job of 
trying to secure a representative sample for 
standardization, the inherent weaknesses of this 
test make it an unsatisfactory instrument which, 
in the field of testing children, cannot compare 
with the more adequate Stanford-Binet. 


Harotp A. Dep, Coordinator of Educational 
Activities, The Training School, Vineland, New 
Jersey. 

Wechsler has followed his adult scale with this 
scale for children. It is based on the same phi- 
losophy of global intelligence and the inade- 
quacy of the MA concept as the adult scale. The 
Wechsler Intelligence Scale for Children 
(WISC as it is usually called) is an extension of 
Form II of the adult scale, with additions and 
adaptations to permit use. with children down 
through 5 years. It is useful for children from 
5 through 15 years and better than the adult 
scale for the top of this range. 

The manual warns against assuming that 
knowledge on the adult scale is transferable, 
WISC has different directions and different 
scoring standards. In addition it is pointed out 
that the same test items or materials may have 
different meanings for adults and children. 

The standardization sample is much more 
adequate than for the adult scale. It included 
100 boys and 100 girls at each age level (5 
through 15), with distribution as to area of the 
country, urban-rural proportion, and parental 
occupations being based on 1940 USS. census. 
Only white children were included in the sample. 

The manual contains data on intercorrela- 
tions, as well as on reliability coefficients and 
standard errors, for all subtests and totals. Un- 
like the adult scale, raw scores for subtests on 
WISC are used to enter age tables from which 
Scaled Scores are obtained. One table converts 
all total Scaled Scores to IQ's. Scaled Scores 
have been derived so that a mean Scaled Score 
of 10 and a standard deviation of 3 are obtained 
at each age and for each subtest. Mean IQ is 100 
and standard deviation is set at 15 1Q points. 
Thus, IQ's do not vary from age to age unless 


actual test performance varies with respect to 


peers. А 
Primary advantages include the following: 
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(a) WISC is modern in construction and stand- 
ardization. (b) Test is easily administered. (c) 
It requires less time than most scales giving as 
complete results. (d) Materials are interesting to 
children. (e) Spiral manual is easily handled and 
contains clear directions and tables. (f) There 
are good norms derived from good statistical 
bases. (g) Both Verbal and Performance IQ's 
as well as Full Scale IQ are available, allowing 
differentiation in these basic areas of intelli- 
gence. (Л) IQ’s are directly comparable for 
various ages. (7) WISC provides better than 
most tests for clinical observations of subject's 
behavior. (7) It has future possibilities for clini- 
cal diagnosis from both quantitative and quali- 
tative points. 

Primary disadvantages include the following : 
(a) Evidence indicates the test may be too diffi- 
cult for lower age brackets to be comparable 
with other tests (especially Stanford-Binet). 
(b) MA’s desired by many clinicians are not di- 
rectly considered (Wechsler gives data for ob- 
taining age data in recent article, see 20). (c) 
WISC is not effective for subjects at either end 
of the intelligence distribution (outside of IQ 
range 45-155). (d) Scoring of certain verbal 
items includes considerable subjectivity. (e) 
Few data are available for children in terms of 
other tests (validity claimed seems based on 
acceptance of adult scales). (f) Implied values 
from interpretations of subtests are not substan- 
. tiated in the manual by directions or explana- 
tions for interpretation (only basic statistical 
or percentile interpretation is discussed). 


Boyp R. McCanptess, Director, Iowa Child 
Welfare Research Station, Iowa C: ity, Iowa. 

The WISC, as it seems to have been popu- 
larly labeled within the first few months of its 
publication, is sufficiently similar to the adult 
Wechsler-Bellevue to give the clinician a com- 
fortable feeling of security as he approaches it. 
It is a 12-subtest point scale, divided as in the 
adult form into verbal and performance batteries, 
which can be summed and translated into a total 
IQ. The tests are similar to the adult Wechsler 
with certain exceptions. 

The deviation IQ is used. (Children's scores 
are based on norms from other children of their 
age level) This will be disconcerting to the 
Stanford-Binet habitue, since it tends to limit 
the occurrence of extremely high and extremely 
low IQ's. 


The manual, which was the only source of in- 


formation about the test known to the writer 
until mid-1950, when Seashore (17) began pub- 
lishing supplementary material, is efficiently 
arranged and is self-contained. It gives certain 
rather sparse information on the sample (geo- 
graphic, urban-rural, father's occupation data as 
compared with the 1940 U.S. Census) and ta- 
bles of intercorrelations, reliabilities, and stand- 
ard errors of measurement for the subtests and 
for the Verbal, Performance, and Full Scale 
IQ's. 

During the first year of the test's life, the user 
was left not knowing the test's predictive value 
for other criteria. The most glaring deficiencies 
have been lack of published relationships with 
the classic Revised Stanford-Binet Scale and 
lack of knowledge of the test's predictive effi- 
ciency for school success. Such prediction, per- 
haps regrettably, must be known for a scale cov- 
ering, as the WISC does, the ages of 5 through 
IS years. 

However, from mid-1950 on, Seashore, Wes- 
man and Doppelt (6) and Seashore (17) have 
published more detailed data with reference to 
the standardization sample. A good job of stand- 
ardization has been done. One hundred boys 
and тоо girls at each age level, 5 through 15 
years, form the sample, 2,200 in all. Of these, 55 
were nonrepresentationally selected (in terms 
of census data) from institutions for the men- 
tally defective or special classes. Only white 
children were used. In general, the geographical 
selection and occupational selection of subjects 
is good. The reviewer questions the urban-rural 
sampling (skewed towards urban and small 
town “rural”). For some reason or another, the 
girls in the sample tend to test slightly lower 
than boys. Performance-verbal differences, 
mean IQ’s and variability by age, etc. are re- 
markably even. И 

From early 1951 on, certain relationship and 
validational studies have begun to appear, al- 
though only a few have been published at the 
time of writing this review. Sloan and Schneider 
(18) reporting on 40 "familial or undifferen- 
tiated" mentally defective boys and girls, report 
т?з of .79 and .76 with the Arthur Point Scale 
of Performance Tests, Form I, and the Stan- 
ford-Binet Form L respectively for the WISC 
Full Scale; .83 and .64 for WISC Performance 
Scale; and .47 and .75 for the WISC Verbal 
Scale. Considering the small number of cases 
these are not particularly unsatisfactory. Mean 
IQ's and standard deviations for the Arthur 
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Point Scale of Performance Tests, Form I, the 
Stanford-Binet Form L, and the WISC Full, 
Verbal, and Performance Scales were 64.9, 12.1; 
56.3, 4.8; 59.7, 6.2; 64.6, 12.7; and 58.3, 9.5. 

Stacey and Levin (то), reporting on 70 sub- 
normals, find the Stanford-Binet Form L and 
the WISC Full, Verbal, and Performance mean 
IQ's and standard deviations to be 65.2, 7.21; 
66.1, 8.5; 66.6, 7.0; and 71.6, 10.9, respectively. 
WISC Full Scale correlated .68 with the Stan- 
ford-Binet; WISC Verbal Scale correlated 
.69. 
Cohen and Collier,' for from 49 to 53 normal 
6, 7, and 8 year olds, report mean Arthur Per- 
formance Form II, Stanford-Binet Form L, and 
WISC Full, Verbal and Performance IQ's and 
standard deviations of 94.7, 16.4; 104.8, 15.1; 
99.8, 14.6; 98.5, 14.3; and 101, 14.5. In their 
study, WISC Full, Verbal, and Performance 
IQ's correlate .85, .82, and .80 respectively with 
the Form L Stanford-Binet; and show etas of 
80, .77 and .81 with the Arthur Performance 
Form II. 

Magdsick and Blitz? report for an N of 64 
first grade children an overall correlation of .53 
for WISC Full IQ and teachers' marks, and of 
.76 with reading grades. The Stanford-Binet 
used in the same study shows a correlation with 
reading grades of .75. 

To summarize, evidence begins to appear that 
the WISC is a pretty good test, though it pos- 
sesses certain disadvantages. Among the major 
ones are: insufficient data about what it predicts, 
temptation to do elaborate pattern analyses on 
subjects’ scores, some urban-rural inequity in 
the standardization sample, no Negro children 
in the sample, and a dubious mentally defective 
sample with consequent variable results with de- 
fectives. ; 

Its positive virtues are its **up-to-dateness," 
the fact that an examiner can predict the time 
it will take to give a test, its real interest value 
for children, the fact that all children go through 
a comparable battery of all the available tests, 
plus the tentative indication that it can do the 
job of academic prediction so necessary for a 
test for the ages 5 through 15. The standardiza- 
tion sample is basically a good one, and the 
manual relatively compact. Some of the ambigu- 
ani, open, Bertram Юр and Coleg. Мару Children to Some 


Other Intelligence Tests." Paper read before 1951 Midwestern 
sychological Association. | » 

‚ 2 Magdsick, Winifred K., and Blitz, Leroy L. “An Evalua- 

tion of the Wechsler Intelligence Scale for Children at the 
First Grade Level." Paper read before 1951 Midwestern Psy- 

chological Association. 
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ities of the adult Wechsler about assignment of 
weighted responses are present, and the appen- 
dix of acceptable responses is less complete than 
for the Stanford-Binet. 

But at this point, the WISC seems a good and 
useful addition to the clinical armamentarium. 
Its final place depends, of course, on research. 


J Consult Psychol 13:453-4 D '49. Laurance 
F. Shaffer. * The standardization is a radical 
departure from that of previous intelligence tests 
for children. The MA is discontinued entirely. 
Raw scores for the subtests are transmuted into 
standard scores by separate tables for each year 
level, and then into deviation IQ's with a mean 
of 100 and a standard deviation of 15. While a 
standardization that comparés each child with 
others of his own age has long been desired, the 
lack of any means for expressing intelligence as 
a growth function will seem a shortcoming to 
many. The norming of the test on 2200 boys 
and girls of wide geographic distribution was 
carefully done and fully reported. On the other 
hand, evidences of the validity of the test as 
a whole, its subtests, or its items, are conspicu- 
ously missing from the manual. The value of the 
test for predicting common-sense criteria such 
as school progress or other evidences of adjust- 
ment remains to be established by future re- 
search. With its verbal and performance IQ's 
obtainable from one uniformly standardized 
scale, and its interesting possibilities for sub- 
test analysis, the WISC undoubtedly will be 
used widely by clinicians, and will be evaluated 
by a wealth of studies. 
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Wechsler Memory Scale. Adults; 1045; individual ; 
Forms 1, 2; $2.50 per 50 record forms and set of cards 
for subtest on visual reproduction; 50¢ per manual (re- 
printed from J Psychol 19:87-95 Ја '45); боё per 
specimen set; David Wechsler and Calvin P. Stone 
(Form 2 only) ; Psychological Corporation. * 
REFERENCES 
TEM arn P. "Characteristic Losses and Gains in 
Scores on the Meer ats Scales as. Applica on Ps, енй 
i Bı uring, an ter a Series 0: ectro-Convul- 
siye Shocks.” Abstract "Am Psychol 1:245 Jl '46. * (PA 20: 
d xe Enn FEN C. An Evaluation and Comparison of the 
Wechsler Memory Scale and the Hunt-Minnesota, Test for Or- 
СЕ Brain Damage. Master's thesis, Fordham University (New 


a N. Y), 8. 
en MT J. The Use of the Wechsler Memory Scale 
in Differential Diagnosis. Master's thesis, Fordham University 


k, .), 1940. 

7. Соне, Jaco. “Wechsler Memory Scale Performance of 

Psychoneurotic, Organic, and Schizophrenic Groups." J Con- 

n ЖОЕ 14:371-5 O 'so. * (РА 
„ Howarp, ALVIN К. 

Memory Scale With Selected 

tients." J Consult Psychol 14:376 88 O 'so. * (PA 25: 730) 
9. Watson, RonERT I. The Clinical Method in Psychology, vp. 


349-53. New York: Harper & Brothers, 1951. Pp. xii, 779. * 
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Ivan Norman MzxsH, Assistant Professor of 
Medical Psychology, Washington University 
Medical School, St. Louis, Missouri. 

This memory scale was developed as “a rapid, 
simple, and practical memory examination... 
for clinical purposes." The seven subtests are 
designed to sample personal and current infor- 
mation, immediate orientation to time and place, 
mental control, immediate recall of logical ma- 
terial, memory span for digits, reproduction of 
simple geometric figures from memory, and 
three-trial learning of paired words. The manual 
indicates such advantages as brevity, standardi- 
zation on 200 normal adults, allowance for 
memory variation with age, and comparability 
of memory quotients with intelligent quotients, 
making possible the study of memory impairment 
as compared with deficits in other intellectual 
functions. 

The availability of a memory quotient (MQ) 
corrected for age and thereby comparable to a 
subject’s intelligence quotient is a significant di- 
agnostic aid. However, the procedure for calcu- 
lating MQ’s is based upon the norm group, age 
25-50, a range which limits the use of the con- 
version table even though age corrections 
through 64 are given in the table. Extrapolation 
above age 50 may be faulty ; the study of memory 
and of other intellectual performances among 
the aged has been both limited and controversial, 
principally because of the size and lack of com- 
parability of the samples studied. Wechsler cau- 
tions that, because of the empirical method of 
deriving them, the memory quotient equivalents 
provided in his manual are only approximate. 
Interpretation of MQ’s in psychological practice 
should include this caution. A useful check may 
be provided by comparing the IQ-MQ devia- 
tion with the Deterioration Index of the 
Wechsler-Bellevue Intelligence Scale from 
which the IQ is computed. 

Repeated testing to evaluate the degree and 
rate of impairment or of recovery of function is 
frequently of diagnostic value. Alternate forms 
are needed therefore, and Stone, Girdner, and 
Albrecht (2) have introduced an alternate form 
of the Wechsler Memory Scale. This second 
form is based on data from 87 subjects who were 
tested with both forms. Although there was little 
change in total score from one form to the other, 
the subtests had little differentiating value 
among the samples of student nurses, hospital 
patients, and college students, Thus far reports 


by other workers have not been published so 
that it is not known to what extent Form 2 is 
equivalent to the original form. A lack of pub- 
lished reports on its use also characterizes the 
Wechsler Memory Scale itself, although com- 
munication with a number of clinical psycholo- 
gists confirms the reviewer’s high opinion of it 
as a technique useful in the studying of deficit 
in certain areas of memory functioning. Re- 
sponses of subjects to both the Wechsler-Belle- 
vue Intelligence Scale and the Wechsler Mem- 
ory Scale provide diagnostically rich data for 
psychological study. 


JosepH NEWMAN, Clinical Psychologist, Vet- 
erans Administration Hospital, Canandaigua, 
New York. 

This scale was devised to provide a simple and 
rapid measure of memory function. The direc- 
tions are clear and the scoring criteria are ade- 
quate except for the test of Visual Reproduction 
(memory for designs) where scoring examples 
might profitably be added to the manual, obvi- 
ating the necessity to go back to the original 
sources of the designs in the case of neophytes. 
The administration takes, on the average, 15 
minutes. The total raw score is corrected for 
age, and the corrected score is translated into a 
Memory Quotient (MQ) equivalent. The MQ 
was designed to be directly comparable to the 
Wechsler-Bellevue intelligence quotient, and 
this is the most useful feature of the Wechsler 
Memory Scale. 

The test was standardized on “approximately 
200 normal subjects, ages 25 to 50, both men 
and women,” an inadequate number for so 
wide an age range. Further data include only 
Wechsler-Bellevue IQ. We would want to have 
further data on education, occupation, and socio- 
economic status. Another serious shortcoming 1$ 
the lack of reliability data for either form. An 
estimated reliability coefficient of about .83 for 
clinical populations, based on the correlation be- 
tween Form II and the Revised Alpha Exami- 
nation Form 8, is presented. There is question, 
too, of the equivalence of Forms I and II. In 
devising Form II, Stone and his associates as- 
sembled "similar items previously equated for 
difficulty in other researches." The authors 
recognize the need for further experimenta 
work to determine the actual comparability of 
the two forms. Data concerning the effects O 
practice are also needed. These deficiencies place 
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limitations on the usefulness of the Wechsler 
Memory Scale. 

The test is designed for use with a clinical pop- 
ulation. It makes possible the comparison of 
memory efficiency with other aspects of intellec- 
tual function as measured by the Wechsler- 
Bellevue. Presumably, one should be able to 
distinguish between cases in which there is a 
specific memory defect, frequently found in 
conjunction with organic brain disease, and 
cases in which memory impairment is one aspect 
of a generalized intellectual inefficiency, such as 
may occur in a functional psychosis. In other 
words, performance on the Wechsler Memory 
Scale may provide diagnostic information. 
Wechsler thinks the scale is “useful in detecting 
special memory defects in individuals with spe- 
cific organic brain injuries.” Available research 
does not bear out this contention. Cohen studied 
the differentiating power of the scale among 
groups of “psychoneurotics,” “organics,” and 
“schizophrenics,” and did not obtain any signifi- 
cant results. Howard, on the other hand, found 
that the scale will discriminate between matched 

1Cohen, Jacob. “Wechsler Memory Scale Performance. fc 


Psychoneurotic, Organic, and Schizophrenic Groups. 
sult Psychol 14:371-8 О 'so. * (PA 25:4654) 


groups when the brain damage is of a gross na- 
ture.? 

The value of the Wechsler Memory Scale аз а 
diagnostic instrument has not been established ; 
the situation as to memory functioning in gen- 
eral is still as Klebanoff ° found it—inconsistent 
and contradictory. This limitation diagnostically 
is shared to a greater or lesser extent by all psy- 
chometric instruments. The scale’s greatest use- 
fulness is as an indicator of memory efficiency in 
comparison to the Wechsler-Bellevue. It can 
provide "clinical insights" as to personality re- 
actions which may aid in the diagnostic process. 
As Rabin and Guertin have remarked in their 
review of the Wechsler-Bellevue, almost any 
test employed by a skillful clinician and insight- 
ful worker may prove an “aid in differential di- 
agnosis.” * 


For a review by Kate Levine Kogan, see 
3:302. ; 


2Howard, Alvin R. “Diagnostic Value of the Wechsler 
Memory Scale With Selected Groups of Institutionalized Pa- 
tients." J Consult Psychol 14:376-80 О 'so. * (PA 2524730) 

3 Klebanoff, Seymour G. “Psychological Changes in Organic 
Brain Lesions and Ablations.”” Psychol B 42:585-623 N '45. * 
(PA 20:649) ї ^ i 

4 Rabin, Albert I., and Guertin, Wilson Н. “Research With 
the Wechsler-Bellevue Test: 1945-1950." Psychol B 48:211-48 
Ар 'sr. 
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"Ability to do Quantitative Thinking: Iowa 

ests of Educational Development, Test 4. 
Grades 9-13; 1942-51 (first published as a separate 
in 1951); ra complete battery, see 17; Form Y-2 
(’s1—same as 1949 edition) ; manual (^51) ; general 
manual (751); $3.75 рег 25; separate answer pads or 
answer sheets must be used; $1.05 per 25 answer pads ; 
3 per тоо IBM answer sheets; 50 per scoring stencil ; 
2.50 per 25 first semester ('46) or second semester 
C48) profiles for any one of grades 9-121 $ 
self-interpreting profiles for students (5I); 
School summary report (44) ; 4€ per pupil score sheet 
(48) ; 50¢ per specimen set; cash orders postpaid; 65 
(70) minutes; edited by E. F. Lindquist ; Paul Blom- 
mers; Science Research Associates, Inc. * 


[366] 
*California Mathematics Test. Grades 9-14; 1933- 
50; a revision of Progressive Mathematics Tests (see 
40: 1459); a subtest of California Achievement Tests 
(see 2) ; 3 scores: mathematics reasoning, mathematics 
fundamentals, total; IBM ; Forms AA (50), BB ('50), 
CC (0); manual (^50) ; $1.50 per 25; 35¢ per speci- 
men set, postpaid; separate answer sheets may 
used; 46 per IBM answer sheet; 7¢ per Scoreze an- 
swer sheet; 60€ per stencil for machine scoring of an- 
swer sheets; 20€ per stencil for hand scoring of answer 
sheets; postage extra; 68(78) minutes; Ernest W, 
Tiegs and Willis W. Clark; California Test Bureau. 
REFERENCES ded 

. "Computational Weaknesses о ollege 
ubi Wi, yb EM o) ей 20:367-82 Ap '4s. * (Р. К 
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*College Entrance Examination Board Achieve- 
ment Test in Advanced Mathematics. Candidates 
for college entrance; 1936-51; formerly called Col- 
lege Entrance Examination Board Comprehensive 
Mathematics Test and College Entrance Examination 
Board Mathematics Attainment Test, Gamma Exam- 
ination; available only -in College Entrance Examina- 
tion Board Admissions Testing Program (see 526); 
60(70) minutes; prepared by College Entrance Exam- 
ination Board Committee of Examiners in Mathematics 
in cooperation with the Staff of Educational Testing 
Service: 1951 membership: Raymond D. Douglass 

Chairman), Edwin C. Douglas, Maurice L. Hartung, 

rthur F. Leary, and Abba V. Newton; program ad- 
ministered by Educational Testing Service for the Col- 
lege Entrance Examination Board. * 

REFERENCES 

т. PLUMLEE, Lynnette B. “Comparison of Problem Types in 
the Comprehensive Mathematics Test: Multiple-Choice Type 
Found Valid." Col Bd R 1:17+ f '47. * (РА 22:1880) 

2. RrEGEL, ELeanor J. “Evolution of Math Tests.” Col Bd R 
8:95-8-- My '49. (РА 23:4998) ч 

з. Јонмѕом, А. PEMBERTON. “College Board Mathematical 
Tests and the rM Inventory Predict Scholastic Suc- 
cess in Colleges of Engineering." Abstract. Am Psychol 5:353 
Л "5o. * (PA 25:1279, title only) 

4. ScHULTZ, DoucLas С. “The Comparability of Scores From 


Three Mathematics Tests of the College Entrance Examination 
Board.” Psychometrika 15:369-84 D ’so. * (PA 25:6415) 


PauL І; DresseL, Head, Board of Examiners, 
Michigan State College, East Lansing, Michi- 
gan. 

As stated in the 1950-51 Bulletin of Informa- 
tion of the College Entrance Examination Board, 
the purpose of this test is “to measure the ade- 
quacy of the candidate's basic preparation for 
advanced mathematics courses and for study in 
the physical sciences in college. The test is de- 
signed to appraise the candidate's ability to use 
what he has learned rather than his ability merely 
to repeat formulas or operations." 

As is implied by the statement of purpose and 
the title, the advanced examination covers solid 
geometry, advanced algebra, and plane trigo- 


nometry. The number of items assigned to each 


of these subjects varies somewhat, with trigo- 
nometry being given most weight (20 items in 
2 forms, 18 items in 2 forms), algebra second 
most (17 items in 2 forms, 18 items in 2 forms), 
and solid geometry least (15 items in 2 forms, 12 
items in 2 forms). In all cases the total number 
of items is 50. Generally speaking, the test items 
must be considered rather traditional in nature. 
Despite the stated attempt to measure "ability to 
use....rather than ability merely to repeat formu- 
las" only a few items (6 were counted on one 
form) seem to involve any very original “twist.” 
Others are typical of the more thought-provok- 
ing but nevertheless rather common textbook 
ercises. However, real ingenuity is evident in 
phrasing these items and in selecting numerical 
values so that they require knowledge and 
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thought with a minimum of arithmetical involve- 
ment. The verbal element is also minimized, but 
the reviewer has been assured that this aspect is 
adequately covered in the quantitative reason- 
ing section of the Scholastic Aptitude Test, 
which is taken by all candidates. 

For this examination the items are written by 
the test development staff of the Educational 
Testing Service and carefully reviewed by a 
committee of carefully selected examiners. The 
procedures for developing the examination in- 
dicate that all items are given a preliminary try- 
out and that difficulty and discrimination data 
based on this tryout are utilized in assembling 
the examination. The examination reflects the 
indications that a great deal of care is taken in 
making it. Even the keys provided for the four 
examinations reviewed were apparently checked 
by three people. 

Instructions for the students and the examiner 
are very clear and again reflect the extensive 
experience of the College Board and of ETS in 
such matters. The tests are printed by the offset 
process and are reasonably readable although 
some pages are printed so lightly as to be slightly 
troublesome, especially if contrasted with an ad- 
jacent page printed in the deepest black. There 
is also occasional variation in the size of print 
from one page to another, probably occasioned 
by a different reduction ratio in photography. 
Such matters distract somewhat from appeat- 
ance but probably do not materially affect stu- 
dent performance. Responses to the items are 
presented in either vertical or horizontal fashion 
depending on the length of answers. Such уапа- 
tions as 


АВ А А В 
D..E By „С бр 
D E E 


although consistent, are somewhat distracting: 
It would seem preferable to use a completely 
vertical listing in all cases wherein three or four 
lines must be used for the responses. 

The relatively recent introduction of the one 
hour test apparently has resulted in a lack 0 
validity data. A study made as a basis for decid- 
ing whether the one-hour test would be satisfac- 
tory is reported in College Board Review (2). 
This showed that in five different colleges the 
composite score on a one-hour mathematics 
achievement test and a one-hour aptitude (quan- 
titative reasoning) test correlated from .51 © 
.65 with first term mathematics grades. If suc 
correlations continue to hold—and there is 10 


485 | TESTS & REVIEWS: MATHEMATICS 


368 


reason to suspect otherwise—the statistical va- 
lidity of the combination must be considered 
satisfactory. Reliability coefficients computed by 
the Kuder-Richardson formula for four forms 
of the test varied from .83 to .88. This reviewer 
would consider these good, although K-R values 
in the low .90's are not unattainable. Biserial 
r's for the items average close to .50 on all tests, 
which is very good indeed. 

From 33.1 per cent to 39.5 per cent of a siz- 
able sample of students taking various forms of 
the test finished them. The test, therefore, must 
be regarded as being a combination of speed and 
power, a situation which would no doubt be in 
accord with the thinking of most college mathe- 
matics teachers and consistent with their own 
testing and grading practices. 

The distribution of raw scores shows that the 
range of scores is 40 points or better, with the 
mean score varying from 20.25 to 27.56. This 
excellent spread, together with the reported 
means, indicates that the test difficulty is well 
adjusted to the groups for which the test is in- 
tended. Standard scores are used for reporting, 
and these are so adjusted by means of a small 
number of repeat items (10 to 12 so far as the 
reviewer could check it) that standard scores 
on various forms are comparable. This is a highly 
commendable and very necessary feature of the 
test since candidates for admission may take 
different forms of the test and at varying times 
in the year. 

In summary, it appears to this reviewer that 
the CEEB Test in Advanced Mathematics is 
very carefully done. The fact that a new form is 
prepared for each administration means that sta- 
tistics on the reliability and validity must always 
be recomputed and cannot be available until 
some time after the administration date ; but such 
data as have been available indicate that sta- 
tistical studies are adequate and the results en- 
tirely favorable, The examination is well adapted 
to its stated purpose. Its strongly traditional na- 
ture reflects the general acceptance of the fact 
that the more advanced high school mathematics 
Courses are primarily preparation for further 
work in mathematics or science rather than of 
general education value. No doubt the quantita- 
tive reasoning test covers the latter outcomes to 
some extent, but one might still reasonably ех- 
pect a few items testing the student’s developing 
Concept of the nature of mathematics and о 
mathematical proof. Recognition of relevancy OF 


irrelevancy of information to а problem, the de- 


tection or recognition of hypotheses, and his- 
torical background are likewise regrettably 
missing from the test. The absence of such ma- 
terials may in part reflect the difficulties of test- 
ing such matters satisfactorily by objective 
means; more probably it emphasizes their ab- 
sence in mathematics teaching—a situation 
which the test serves to perpetuate. 


[ 368] 
College Entrance Examination Board Achieve- 
ment Test in Intermediate Mathematics. Candi- 
dates for college entrance; 1936-51; formerly called 
College Entrance Examination Board Mathematics 
Attainment Test, Beta Examination; available only in 
College Entrance Examination Board Admissions 
Testing Program (see 526); 60(70) minutes; pre- 
pared by College Entrance Examination Board Com- 
mittee of Examiners in Mathematics in cooperation 
with the Staff of Educational Testing Service: ns 
membership: Raymond D. Douglass (Chairman), Ed- 
win C. Douglas, Maurice L. Hartung, Arthur F. Leary, 
and Abba V. Newton; program administered by Edu- 
cational Testing Service for the College Entrance Ex- 
amination Board. * 
REFERENCES 

т, Rrecer, ELEANOR J. "Evolution of Math Tests.” Col Bd R 

8:95-8+ My 40 (PA 23:4! 


2. ScnurTZ, DoucLas G. р е Comparability of Scores From 
Three Mathematics Tests of the College Entrance Examination 
Board.” Psychometrika 15:369-84 D '50. * (PA 25:6415) 
PAuL J. Browns, Associate Professor of Ed- 
ucation, State University of Iowa, Iowa City, 
Iowa. [Review of Forms YAC, YAC2, ZAC1, 
and ZAC.] 

This test is one of the options of the College 
Entrance Examination Board's program of 
achievement tests and, as such, may become 
one test in a variable battery designed to predict 
college success. Considered as a single test, it 
is intended primarily for use in predicting suc- 
cess in college mathematics. On the theory that 
past achievement in mathematics provides the 
best basis for such a prediction, this test is de- 
signed to measure achievement in the college 
preparatory type of high school mathematics 
courses, covering a year of plane geometry and 
one and one-half to two years of algebra. 

The setting up of test specifications was the 
responsibility of a five-man committee of mathe- 
maticians, three of whom represented the var- 
ious types of collegiate institutions, one the in- 
dependent preparatory school, and the other the 
public high school. This committee was assisted 
by a technical consultant from the mathematics 
section of the Test Development Department 
of the Educational Testing Service which pre- 
pared the actual test items and had at its disposal 
the facilities of this department as well as those 
of the Statistical Analysis and Research depart- 
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ments of the same organization. All items were 
reviewed by each committee member, assembled 
into a pretest which was tried out on a sample 
group, and subjected to a statistical item analy- 
sis. 

The test consists of 50 five-response multiple 
choice type items. With few exceptions the item 
stem is presented in question form. No use is 
made of a “none of these" type of. response. It 
may be that the use of such a response would not 
add to the predictive power of this test. This is 
a matter which can only be settled experi- 
mentally. However, in the case of a mathematics 
test, where it is almost impossible to anticipate, 
let alone include as distractors, the many differ- 
ent wrong responses various candidates might 
obtain, it would appear that the use of a "none 
of these" response might result in an improve- 
ment in the predictive power of the test. 

Four forms of this test (YAC, YAC2, ZAC1, 
and ZAC) were studied in the preparation of this 
review. In each case approximately 40 per cent 
of the items dealt with plane geometry, and the 
remaining 60 per cent were divided about equally 
between first and second year algebra. While a 
few items of the Y forms recurred in the Z forms, 
the two Y forms and the two Z forms are the 
best examples of pairs of tests built to the same 
specifications and yet containing independent 
samples of item content that have come to this 
reviewer's attention. 

The items generally are of excellent quality. 
No cumbersome, time-consuming computations 
or algebraic manipulations are involved. There 
is no paraphrasing of the stereotyped textbook 
exercise. It is true that a fairly extensive knowl- 
edge of mathematical facts and techniques is re- 
quired. In plane geometry the gamut runs from 
the simple relationships between the angles 
formed when two parallels are cut by a trans- 
versal to the relationships between the segments 
of intersecting chords of a circle. In algebra it 
extends from the simple task of reading a value 
from a graph to the solution of a literal equation 
in which the unknown appears with fractional 
exponents. But at least equal in importance to 
a knowledge of mathematical facts and tech- 
niques is the ability to select from that knowl- 
edge those facts, techniques, or principles ap- 
propriate to the handling of a novel problem 
situation and to apply the selection correctly to 
the solution of the problem. The ability to reason, 
to think mathematically, thus becomes an im- 
portant aspect of the test score. Certainly the 


test is difficult, but not inappropriately so for the 
select group and special purpose for which it is 
intended. 

It is regrettable that no information regarding 
the predictive power and reliability of the par- 
ticular test forms under review was made ayail- 
able to this reviewer. It is the opinion of this re- 
viewer that the forms of this test are adequate in 
both respects. It should be understood that this 
opinion is based on evidence available with re- 
gard to previous mathematics tests of the College 
Entrance Examination Board and is not based* 
on experimental evidence derived directly for 
the test forms here reviewed. 


[369] 

*Cooperative Mathematics Pre-Test for College 
Students. College entrants; 1936-48; IBM; Forms X 
(47), Y (48) ; Forms A, B out of print; no data on 
reliability and validity; no specific manual; general 
Cooperative manual (’51); Form X norms C47), 
Form Y norms ('48); $2 per 25; 50€ per specimen set, 
postpaid; separate answer sheets may be used; 8of per 
25 IBM answer sheets; 156 per scoring stencil; cash 
orders postpaid; 40(45) minutes; prepared by the 
Committee on Tests of the Mathematical Association 
of America; Cooperative Test Division, Educational 
Testing Service. * 
E. P. STARKE, Professor of Mathematics, Rut- 
gers University, New Brunswick, New Jersey. 

This test was designed by the Committee on 
Tests of the Mathematical Association of Amer- 
ica “to furnish a supplementary means of check- 
ing on classification in appropriate mathematics 
courses” according to the publisher's catalog. 
Forms X and Y are adapted from the experi- 
mental Forms А and B. Forty items, to be com- 
pleted in 40 minutes, provide a sampling of ele- 
mentary and intermediate algebra and geometry 
(limited to simple mensuration). For each item 
one correct and four incorrect answers are pro- 
vided. Norms for entering college freshmen аге 
available. 1 4 

Memory of simple concepts and skills is teste 
superficially, with little attention to the under- 
lying, and more significant, thought processes. 
The student who solves a problem in a eem 
way merely because it looks like one for vu 
that is the remembered solution will still make 
a presentable score on this test. This reviewer 
feels that there probably is a poor correlation 
between scores on this test and subsequent m. 
formance in college mathematics courses. ©” 
test can be used to eliminate those who are st 
prepared for college science and mathematics 5 
it will be of little use for predicting success 
more advanced work. 
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The optional answers printed for each ques- 
tion are frequently poorly selected. Some an- 
swers which would result from common errors 
are not included while others are included that 
would be chosen only by the wildest guessers. 
Too many of the problems can be worked back- 
wards from the answers and thus do not test the 
intended skill. A number of the items should be 
provided with the choice, “None of these an- 
swers.” In general, the items are carefully 
worded and unambiguous, although Item 9 of 
Form X misses its purpose: “If a/b = 3/2 and 
b/c — 2/7, what does a/c equal?" The correct 
answer is obtained by equating a and c to 3 and 7, 
respectively, with no knowledge of operations 
with fractions. 


For reviews by M. W. Richardson and S. S. 
Wilks of Forms A and B, see 38:1073. 


terms-concepts, application, appreciation, total; IBM; 
(’50—same as test copyrighted in 1940), RO 


RO), Vernon Price (X), and Bernice Orshansky 
(5i Cooperative Test Division, Educational Testing 
егүісе. * 


REFERENCES 
à See 40:1433. 
. See 3:305. AK 

4 Henri, БА P., лхо DiVzsrA, FRANCIS J. "An Evaluation 
of Five Factors for Predicting the Success of Students Entering 
Ше New Уо State College of Agriciltupe; Ed & Psycho 

eas 8:389-95 au '48. * 23:4412 р р 

5. СУНЫ ates тё Cooperative Mathematics Test 
Program (Mathematics for Grades 7, 8, and 9, Elementary . - 
gebra, Intermediate Algebra, Plane Geometry, Solid Geome гу, 

rigonometry, and Secondary School Mathematics) : Baie i Ба 
and Correlations With School Marks," рр. 89-95. (Р. 255 7 
In 1950 Achievement Testing Program m Independent 5с! fod 
and Supplementary Studies. Educational Records Bulletin, No. 
54. New York: Educational Records Bureau, July 1950. Pp. xi 
119. Paper, lithotyped. * 


For a review by M. L. Hartung of Forms Q 
and RO, see 3:305; for reviews by Richard M. 
Drake, Judson W. Foust, and С. M. Ruch of 
Form P, see 40:1433- 


[371] 

XDavis Test of Functional Competence in Math- 
ematics: Evaluation and Adjustment Series. 
Grades 9-13; 1951-52, CI950-51; IBM; Forms A 

C51), BM (^52); manual (51); $2,50 per 25; sepa- 
rate qood sheets must be m Ж, pedes ША 
swer sheets; postage extra; 35 Per sps ji 
paid; Bo CAD iM Dodd J. Davis; World Book 
о. 


[372] 
XFunctional Evaluation in Mathematics. Grades 
4-6, 7-9; 1952; IBM for Tests 1 and 4; 2 levels; 3 
tests; Form A; 50€ per manual; postage extra; 756 
per specimen set, postpaid; 25(30) minutes per test; 
edited by William A. Brownell; Ben A. Sueltz; Edu- 
cational Test Bureau, Educational Publishers, Inc. * 
а) TESTS I AND 4, QUANTITATIVE UNDERSTANDING. 
IBM; $1.90 per 25 of any one level; separate answer 
sheets may be used ; 8sé per 25 answer sheets; 50¢ per 
stencil for scoring answer sheets. 
b) TESTS 2 AND 5, PROBLEM SOLVING. $1 per 25 of any 
one level. 
C) TESTS 3 AND Ó, BASIC COMPUTATIONS. $1 per 25 of 
any one level. 


Cuartes S. Ross, Associate Editor, Silver 
Burdett Company, New York, New York. 

Three separate 25-minute tests, Quantitative 
Understanding, Problem Solving, and Basic 
Computations, are provided at both the ele- 
mentary and upper level on the thesis that a 
single test cannot measure adequately the wide 
range of abilities found in a modern program of 
arithmetic and mathematics. 

These new tests make two major contribu- 
tions. First, they give attention to concepts, 
principles, relationships, judgments, and infer- 
ences which should be the outcomes of a rich 
and meaningful arithmetic program. Second, 
their mathematical content is similar in scope and 
emphasis to the content of most modern arith- 
metic programs. It is the lack of these qualities 
in many widely used standardized tests that 
makes the Functional Evaluation in Mathe- 
matics tests particularly significant. 

The two tests of quantitative understanding 
(Test 1, grades 4-6, and Test 4, grades 7-9) are 
ingenious in determining ability to use mathe- 
matical reasoning. A liberal use of diagrams and 
illustrations makes it possible to get at ideas 
which could not be approached economically 
through words alone. These tests are particu- 
larly effective in their treatment of the decimal 
number system, common fractions, and relation- 
ships. They are marred, however, by several un- 
realistic or confusing items. For example, the 
hour hand of an illustrated clock is beyond the 
center point of the 3, while the minute hand is 
nearer 10 than 11, making it possible in the mul- 
tiple choice answers to choose the time 3:52 
rather than the intended choice, 2:53. 

The two tests of problem solving (Test 2, 
grades 4-6, and Test 5, grades 7-9) consist of 
three pages of problems each. АП the problems 
on one page are concerned with a single situa- 
tion. This results in so exhausting the possibili- 
ties of the situation that some of the problems 
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are forced. Moreover, there seems to be a lack of 
concern for reality in many problems, leading 
to possible serious misconceptions on the part 
of the pupils. For example, one page concerns 
a trip by a class to а dairy, which on further in- 
spection seems to be a dairy farm. The cars car- 
rying the pupils drove at 20 miles per hour into 
the country. The 315-acre farm (why not 320?) 
supported only 35 head of cattle. One cow gave 
7 gallons of milk each day. Every day the farmer 
filled 28 crates of 24 quart bottles each, appar- 
ently unconcerned about pasteurization. The 
weight of milk is given as 2 lb. per quart, with- 
out qualification. Possibly some of these items 
could be justified in isolation or as exceptional 
cases, but the effect of the page of problems is 
unreal to the extent of creating a totally false 
picture of either a dairy or a dairy farm. It seems 
reasonable to expect all instructional material, 
including arithmetic tests, to be concerned about 
the total education of the child as well as his 
education in the subject matter field. 

The two tests of computation (Test 3, grades 
4-6, and Test 6, grades 7-9) are constructed 
carefully with a fair distribution of facts and 
types of computational difficulty. 

The 12-page manual gives directions for ad- 
ministering and scoring the tests and interpret- 
ing test results. Norms based upon results of 
about 1,000 cases at each grade level are pro- 
vided. In addition, the manual explains the ra- 
tionale underlying the organization and content 
of the tests and the steps followed in their devel- 
opment and furnishes evidence of satisfactory 
validity and reliability. 

On the whole, and despite faulty execution in 
several respects, these tests fill a need which has 
become acute in recent years. School systems 
which provide a modern arithmetic program 
have been forced to evaluate this program by 
using tests that are inadequate in content or in- 
appropriate because of emphasis. The Functional 
Evaluation in Mathematics tests will appeal to 
Schools which want to measure the total effec- 
tiveness of a meaningful treatment of mathemat- 
ics. 

In interpreting test results, the test user 
should realize that “the reliabilities of the sepa- 
rate tests do not warrant attaching significance 
to small differences.” This means in practice that 
for most pupils one cannot say, for example, “Не 
is above the norm in quantitative understanding 
but below the norm in computation.” In general, 
the results of all three tests should be lumped 
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or averaged for the individual before a compari- 
son with the norm is made. Furthermore, the 
user should understand that the tests are de- 
signed as power tests rather than speed tests, 
The time allotment of 25 minutes per test seems 
inadequate, particularly for the tests on quanti- 
tative understanding. The user may wish to in- 
crease this time allotment. 


[373] 

*General Mathematics III: Achievement Exam- 
inations for Secondary Schools. High school; 
1951; I form; no data on reliability and validity; no 
manual; Minnesota norms (median and quartile devia- . 
tion) available; similar norms for other regions by 
special arrangement with publisher; 7€ per test, post- 
age extra; 60(65) minutes; edited by Walter W. 
Cook; prepared by a curriculum committee of high 
school teachers for use in the Minnesota State Board 
Achievement Examinations Program; Educational 
Test Bureau, Educational Publishers, Inc. * 


[374] 

*Graduate Record Examinations Advanced 
Mathematics Test. Senior year college through 
graduate school and candidates for graduate school; 
1939-51; available only in Graduate Record Examina- 
tions programs (see 527); 180(220) minutes; pre- 
pared by the Advanced Mathematics Test Committee 
appointed by Educational Testing Service: S. S. Wilks 
(Chairman), A. A. Albert, C. B. Allendoerfer, S. 5. 
Cairns, and J. R. Kline; Educational Testing Serv- 
ice. * 


[375] 

*Mathematics: Every Pupil Test. Grades 78: 
1930-51; new form usually published each April mi 
December ; form April 1951; no data on reliability an 
validity; no manual; norms ('51); 216€ per test; I 
per answer key; postpaid; 40(45) minutes; Ohio 
Scholarship Tests, Ohio State Department of Educa- 
tion. * 


[376] 

*Mathematics Test 1. Ages 12-13; 1951-52; 3 
scores: mechanical arithmetic-elementary geometry, 
problem arithmetic-statistics-symbols-simple geometry- 
graph reading, total; 1 form, ’51; no data on validity; 
no norms for part scores; manual ['52] ; 115s. per 1005 
155. рег 12; Ls. 4d. per single copy; postage extra; 0 
(65) minutes; I. Macfarlane Smith; National КОШ 
dation for Educational Research in England ап 
Wales. * 


[377] У 
*Rogers Achievement Test in Mathematics tor 
Technical and Industrial Schools. Grades 8-12 am d 
adults; 1942-46; 1 form, '42; no data on reliability БРА 
validity ; manual [46] ; $1.50 per 25; 35 рег specim a 
set; postpaid; 30(40) minutes; William J. Morgan 
and William W. Rogers; Psychological Corporation. 


[378] jon 
*Snader General Mathematics Test: Evaluation 
and Adjustment Series. Grades 9-13; DL 
cI950-51; IBM; Forms AM (1), BM (52); et 
ual (751); $2.50 per 25; separate answer sheets ae 
be used; 80¢ per 25 IBM answer sheets; postage ied; 
tra; 35 per specimen set, postpaid; 40(50) minute": 
Daniel W. Snader; World Book Co. * 
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PauL J. Вгоммевѕ, Associate Professor of Ed- 
ucation, State University of Iowa, Iowa City, 
Towa. 

Ninth grade general mathematics courses are 
usually of two types : those intended for the non- 
mathematically inclined, which emphasize prac- 
tical applications of mathematics (mainly arith- 
metic) and those intended for all students, which 
draw their content from the fields of arithmetic, 
algebra, geometry, and numerical trigonometry. 
The Snader General Mathematics Test is in- 
tended for ninth grade students who have com- 
pleted a course of the latter type. 

To achieve curricular validity, the content 
specifications were based on a study of widely 
used textbooks and representative courses of 
study. Consideration was also given to recom- 
mendations of the National Council of Teachers 
of Mathematics and the Commission on Post- 
War Plans. 

The items are of the multiple choice type. Five 
responses, one of which is usually “none of the 
above,” are employed throughout. The use of a 
“none of the above” response is commendable, 
but it should be noted that in some instances it 
is not employed in conformity with accepted 
principles in that it appears as a correct response 
for some rather computationally complex exer- 
cises. The 130 items (65 in each form) were 
selected from a pool of 330 after an unusually 
extensive trial (1,400 students in 13 schools in 
6 states) on the basis of difficulty and discrim- 
inatory power or item validity. The mean of the 
item validity indexes is reported as .37 for each 
form. In general, the items selected appear to 
be too difficult, The mean value of the difficulty 
indexes is approximately 37 per cent and less 
than one fourth are 50 per cent or higher. It is 
the opinion of the reviewer that this is due at 
least in part to computational slips resulting 
from the needless complexity of the numerical 
values employed in many of the exercises. 

The score, the number right, may be con- 
veniently obtained from the separate answer 
Sheets either by means of the conventional mask 
Or by machine. The use of a separate answer 
Sheet results in the further advantage of a non- 
consumable booklet. The score may be converted 
into a normalized staridard score. The percentile 
ranks of the standard scores for the standardiza- 
tion sample are given in the manual. In compari- 
son with many tests of this type, the standardiza- 
tion sample appears to be quite extensive (2,190 
Students in 35 schools from 22 states). 


The manual is more complete and far more 
honest in its claims than is customary. One item 
not ordinarily reported is a table of item diffi- 
culty values. By means of this table teachers may 
compare the success of their own classes on any 
given item with that of the trial sample. 

The test is heavily loaded (42 per cent) with 
arithmetic items. In fact, about one-third of the 
items deal with arithmetic skills taught in the 
lower grades. Granting the maintenance of such 
skills to be an objective of the high school math- 
ematics program, this degree of attention in a 
40-minute test of general mathematics seems 
inordinately high. On the other hand, the area 
of numerical trigonometry as applied to indirect 
measurement, which is treated in every general 
mathematics text known to the reviewer, is 
allotted but a single item in each form ; this item, 
moreover, is extremely superficial, requiring 
only that the student remember that a certain 
angle is known as an angle of elevation. The me- 
chanics of elementary algebra with emphasis on 
the solution of simple equations and the use of 
formulas is fairly well covered (25 per cent) 
and the items for the most part are quite well 
constructed. The area of informal geometry re- 
ceives about the same degree of attention (23 
per cent) but the exercises are somewhat poorer 
in quality. The test lacks further balance due to 
the relatively few problems (15 per cent) re- 
quiring some reasoning ability. The emphasis is 
definitely on mechanics and memory. Moreover, 
the problems that do involve some reasoning are 
made superficially difficult by the complexity of 
the computations involved. Nevertheless, it is 
clear that more than the usual care has been 
taken in the development of this test, which is 
among the better tests of its type known to the 
reviewer. 


Howarp F. FEHR, Professor of Mathematics 
and Head of the Department of the Teaching of 
Mathematics, Teachers College, C olumbia Uni- 
versity, New York, New York. 

In the United States the term general mathe- 
matics is applied to courses of such widely dif- 
ferent objectives and content as highly abstract 
college preparatory mathematics or remedial 
arithmetic. Most commonly the term is applied 
to ninth year mathematics courses for students 
of average ability who are not preparing for 
college. In this test the term applies to a course 
which is offered to all students in the ninth year 
and includes arithmetic, informal geometry, basic 
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algebra, graphical representation, and numerical 
trigonometry. It is not designed particularly for 
nonmathematicall inclined students. In any 
school the objectives and textbook used must be 
carefully compared with the items on this test 
before selecting it as a valid instrument for 
measuring student achievement. 

The objectives measured by the test are not 
specifically stated. By reviewing the test items, 
the sources of objectives, and the content areas, 
it is easy to outline the basic knowledge meas- 
ured by the test. The arithmetic (approximately 
42 per cent of the items) includes the basic op- 
erations with integers, fractions, decimal frac- 
tions, percentages, and social applications; the 
informal geometry (approximately 25 per cent) 
includes angle relations, area, volume, recogni- 
tion of form, and similar figures ; the algebra (ap- 
proximately 33 per cent) includes simple opera- 
tions, linear equations, formulas, and variation. 
Despite the author's assertion, there is no item 
on numerical trigonometry or on computation 
with approximate data. 

The test has two equivalent forms, each con- 
taining 65 items. These are apportioned into 
three parts: Part A with 28 items on skills, Part 
B with 28 items on facts and relations, and Part 
C with 9 items on reasoning or problem-solving. 
The items are multiple choice with 5 selections. 
All but 12 of the 65 items have a “none of the 
above" as a fifth choice. In many of the items it 
is difficult to discover how the author selected 
the incorrect choices, since many common pupil 
errors do not lead to any of the submitted 
choices. 

The raw scores (number of correct responses) 
are converted into standard scores with an arith- 
metic mean of 98 and a standard deviation of 
12.5. These standard scores conform to the IQ's 
obtained on the Terman-McNemar Test of 
Mental Ability which has the same mean and 
standard deviation in "the general mathematics 
test standardization population." One is thus 
supposed to be able to tell whether a student's 
achievement in mathematics is at or below his 
mental ability, but this is a very dubious pro- 
cedure for this test. Percentile norms are based 
on 2,190 pupils in 35 schools. The split half re- 
liability coefficient was .80 for one population 
and .84 for another. This is low if one cares to 
use his test results for prediction purposes, and 
it again indicates the difficulty in constructing 
а test in this area of knowledge. 
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The test items are well written, the figures 
drawn clearly, and the arrangement of exercises 
is open and neat so that no mechanical difficulties . 
enter into taking the test. 

It is difficult to state the educational signifi- 
cance of the test. When the objectives of a course 
are the same as those measured by the test, it 
is a reasonably good instrument for evaluating 
individual achievement. This agreement would 
happen in relatively very few schools. In the 
case of individual or school achievement, the 
test should certainly be supplemented by other 
evaluations made by the classroom teacher. The 
test cannot possibly serve as a diagnostic device, 
and its predictive value of future success is lim- 
ited by its low reliability. 

In summary, the test is designed for a special 
kind of general mathematics course given for all 
pupils in the ninth grade. The results of the test 
in any given school must be judged in compari- 
son with the objectives of the test. The test con- 
tains about 85 per cent skill and fact items and 
I5 per cent reasoning items. This test cannot be 
compared with other tests since there are no 
standard tests designed to measure the same ob- 
jectives. The test can be used to measure achieve- 
ment ; but until it is more widely used and more 
information is gathered, its guidance and predic- 
tive values are small, and it must be used with 
the greatest, caution for these purposes. 


[379] 
*A Test of General Proficiency in the Field of 
Mathematics: Cooperative General Achieve- 
ment Tests, Revised Series, Test III. Grades 10- 
12 and college entrants; 1940-51; 3 scores: terms ang 
concepts, comprehension and interpretation, mid 
IBM; Forms X (47), Y (48), Z C51) ; Forms ОКТ 
out of print; no norms for part scores; no specific man 
ual; general battery folder (’51) ; general Cooperative 
manual ('51); norms ['41]; $2.50 per 25; Sof per 
specimen set, postpaid; separate answer sheets may j 
used; 80€ per 25 IBM answer sheets; 15¢ per so 
cil for scoring answer sheets; cash orders posipa Y 
40(45) minutes; Paul J, Burke (X) and Bernice 2 
shansky (Y, Z); Cooperative Test Division, Educ 
tional Testing Service. * 
REFERENCES  , n 
т. DeGoover, Mervin Henry. Validation of the Cooperative 


; 1 d Lag die 
General Achievement Test in Mathematics at the Universi e 
Utah. Master's thesis, University of Utah (Salt Lake ity 


Utah), 1948. stability of 
ОВ А С. “Predictability °° 
Success їп Mathematics. J Ba Res 441148-0 О "so. * (PA 25: 
5030. 

For a review of the complete battery by Р Ай 
І. Dressel, see 5; for a review by John F. pl 
dolph of Forms S and T, see 3:316; jor a revi? 
by John V. McQuitty of the complete battery, 


see 3:3. 
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[380] 

Ability for Algebra: Fife Tests of Ability, Test 
3. Entrants to secondary schools (England and Scot- 
land); 1947; for complete battery, see 750; I form 
[47]; no data on reliability; no data on validity in 
manual (for data presented elsewhere by the author, 
see 1-2) ; manual [47]; 5s. 6d. per 25; 4d. per single 
copy ; 4d. per manual; specimen set not available; post- 
age extra; 30(35) minutes; Frank M. Earle; Univer- 
sity of London Press Ltd. * 


REFERENCES 

т. Елкі, Frank M. Tests of Ability for. Secondary School 
Courses. Publications of the Scottish Council for Research in 
ete 10. London: University of London Press, 1936. 

2. EARLE, FRANK M. Reconstruction in the Secondary School. 
London: University of London Press, 1944. Pp. 188. A 

b Waste кы pd (oe re of a Y for Al- 
gebra Shown by Secondary School Entrants. Bachelor's thesis, 
Glasgow University (Glasgow, Scotland), 1949. Abstract: Brit 
J Ed Psychol 20:65-6 F ’50, * (PA 25:570, etie only) 
WirLIAM G. Morrex korr, Research Associate, 
Educational Testing Service, Princeton, New 
Jersey. 

This test is one of four which grew out of an 
investigation carried on between 1931 and 1935 
under the sponsorship of the Scottish Council for 
Research in Education. F. M. Earle, subcom- 
mittee chairman, had as his object “to devise 
tests that would correlate satisfactorily with 
some suitable criterion of success in school sub- 
jects and which could safely be used by com- 
paratively inexperienced individuals" (r). 

The test is printed clearly, and the format 
seems satisfactory. A misprint occurs in Item 
47. All of the items are keyed correctly. One 
overall score is obtained. 

Impressive follow-up studies have been re- 
ported by Earle (т). In one study, the mathe- 
matical work of 221 students, who two years be- 
iore had taken the test as first year students, 
was used as a criterion. Of the 126 scoring above 
45 on the test, 95 per cent were doing satisfactory 
work; of the 95 scoring below 45, only 13 per 
cent were doing satisfactory work. In another 
study, the second year work of 172 pupils, who 
had been tested the year before, was used as a 
criterion. Of the 64 scoring above 58, 91 per 
cent were doing satisfactorily ; for the 108 scor- 
ing under 58, the work of only 11 per cent was 
satisfactory. 

While the first section of the test is a typical 
number series test, the items of the second sec- 
tion are rather different from those contained 
in the formulas or problems parts of other alge- 
bra aptitude tests, such as the C alifornia Algebra 


Aptitude Test. Rather heterogeneous in nature, 
many of the items of this second section were 
found by the reviewer to be quite refreshing in 
expression and challenging in task set. However, 
several problems toward the end of the section 
seem to involve a great deal of algebraic so- 
phistication. Also, the norms data tend to con- 
firm the reviewer's suspicion that many of the 
later items may be rather difficult, the upper 
quartile score for twelve-year-old boys being 
only about half the maximum score on the test. 

The manual of instructions, though brief, in- 
cludes directions for administering and scoring, 
norms for children attending secondary schools 
in England and (separately) in Scotland, and 
а section on "interpreting а score." Americans, 
accustomed to separate answer sheets and easy- 
to-use scoring devices, will find the scoring ar- 
rangements for the Fife test cumbersome. The 
section on interpreting a score, however, 1s 
presented in terms of the probability of success 
or failure at certain scliool levels for a given 
score on the test—a very desirable feature from 
this reviewer's standpoint. No reliability data 
are given in the manual. Such information 
should, of course, be provided. It would be still 
more desirable to have the standard error of 
measurement at various score levels. 

SUMMARY. This aptitude test grew out of a 
lengthy research program and has shown good 
validity in Scotland and England. Its content 18 
interesting and challenging, and the guide to in- 
terpretation of a score is one that deserves to 
be used more widely. On the negative side, the 
reviewer must point to the cumbersomeness of 
the scoring and the possibility that the test might 
not be so appropriate for predicting success in 
algebra courses in America as in the United 
Kingdom. The lack of normative data based on 
results obtained by American school children 
should also be mentioned. 


For a review by I. Macfarlane Smith of the 
complete battery, see 713; for a review by James 
Mazwell of the complete battery, see 3:8. 


[381] 
xXAdvanced Algebra: Achievement Examinations 
for Secondary Schools. High school; 1951; 1 form; 
no data on reliability and validity; no manua ; Minne- 
sota norms (median and quartile deviation) available ; 
similar norms for other regions by special arrangement 
with publisher ; 7€ per test, postage extra; 60(65) min- 
utes; edited by Walter W. Cook; prepared by a cur- 
riculum committee of high school teachers for use in 
the Minnesota State Board Achievement Examinations 
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Program; Educational Test Bureau, Educational Pub- 
lishers, Inc. * 
[382] 
*Advanced Algebra Test: State High School 
Tests for Indiana. 3 semesters high school ; 1934-50 ; 
Form B ['50]; Form A out of print; mimeographed ; 
no data on reliability and validity ; no manual; norms 
[50]; 4%@ per test; 156 per specimen set; postpaid; 
40(45) minutes; Paul C. Baker; State High School 
Testing Service for Indiana, Purdue University. * 
[383] 
*Algebra Prognosis Test. High school and college; 
1949; 6 scores: arithmetic computation, arithmetic 
problems, number relationships, number series, alge- 
braic formulae, total; 1 form; manual ['49] ; $1.75 per 
25, postage extra; 356 per specimen set, postpaid; 35 
(45) minutes; Corydon L. Rich; C. A. Gregory Co. * 


PauL J. Віоммевѕ, Associate Professor of 
Education, State University of Iowa, Iowa City, 
Iowa. 

The purpose of this test is to predict per- 
formance in ninth grade algebra and in freshman 
college algebra. The test consists of five sub- 
tests. The exercises of the first deal with the 
fundamental operations of arithmetic computa- 
tion applied to whole numbers, common and 
decimal fractions, denominate numbers and per- 
centages. The exercises of the second consist of 
arithmetic problems requiring some reasoning 
ability. Those of the third and fourth subtests 
are the conventional type of number relationship 
and number series exercises. Those of the fifth 
require the substitution of numerical values in 
formulas. All are separately timed and are in- 
tended to contain more exercises than the pupils 
can complete so that rate of work is one aspect 
of each subtest score. 

The exercises of all subtests except the third 
are of the free response type. The test booklets 
are consumable. The scoring, which could be 
made more convenient, is by strip key. All sub- 
test scores are based on the number right and 
are combined by addition into a composite which 
is the score recommended for use in making the 
prognosis. 

The manual accompanying the test is sub- 
standard. Only one half of a page is devoted to 
discussing the test and its uses. The vital ques- 
tion of validity is dealt with as follows: “The 
validity as compared to actual grades has varied 
from .60 to .90, with an over-all score of .78. 
The test has been found to be very helpful in 
forecasting a pupil's work and an excellent guide 
for sectioning where numbers warrant more 
than one section." How the various coefficients 
were combined to produce this “over-all score" ; 
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whether adjustments for attenuation were made ; 
how many groups were involved; what these 
groups were like; how the criterion grades were 
established; and how variation in achievement 
within sections, formed with the guidance of this 
test, compared with variation among such sec- 
tions are representative questions left unan- 
swered. 

The discussion of reliability is equally unsatis- 
factory. The manual simply states that, “This 
test has a coefficient of reliability of .93 as given 
to over 800 students entering the ninth grade in 
high schools and the freshman year in college." 
Without question this coefficient is an outright 
misrepresentation of any useful conception of 
this test's reliability. The simple fact that it is 
based on a sample taken from such extremely 
different populations as ninth graders and col- 
lege freshmen is sufficient to make it meaning- 
less. Moreover, no information is given as to 
how the coefficient was obtained. Consider- 
ing that only one form of the test is available, 
it is likely that the odds-evens split test pro- 
cedure was employed. If this is true, the coef- 
ficient .93 is further invalidated, since speed is 
unquestionably an important element of the 
score. 

The manual contains no information regard- 
ing the development of the test. If this test repre- 
sents the latest thought in the measurement 0 
algebra aptitude, one can only conclude that no 
progress has been made in this area in the last 
two decades, for in 1930 J. Murray Lee pub- 
lished such a test (Lee Test of Algebraic Abil- 
ity) which, except for the fact that it contained 
no subtest on arithmetic computation, is an al- 
most exact parallel of the test under review. 
Moreover, it would be difficult to say in which 
of these two tests the exercises were superior 11 
quality. д 

What meager information is available regard- 
ing the validity and reliability of the Algebra 
Prognosis Test together, with its similarity to 
existing tests, indicates that it is no better than 
its predecessors as a predictor of success in ninth 
grade algebra. It is just another such test and as 
such. represents no improvement or contribu- 
tion. Moreover, it should be noted that authors 
of other tests of this kind make no claims T€ 
garding their usefulness as predictors of success 
in freshman college algebra. It would appeat 
that a test for which such a claim is made should 
involve more extensive use of the content 0 
high school algebra, 
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WirLiAM G. MorrEwKorr, Research Associ- 
ate, Educational Testing Service, Princeton, 
Меш Jersey. 

There are 85 items in this test, apportioned 
among the five subtests as follows: (a) Compu- 
tation in Arithmetic, 21; (b) Problems in 
Arithmetic, I4; (c) Number Relationships, 10; 
(d) Number Series, 16; and (e) Algebraic 
Formulae, 24. The printing is clear and the gen- 
eral format is satisfactory. The key, provided on 
a separate sheet, has guide lines for folding so 
that the responses on the key can be made to 
line up with those written by the student in the 
test booklet. There are, unfortunately, two errors 
in the key, for Item 19 in subtest 1 and for Item 
24 in subtest 5. 

The test is intended to indicate how well a 
student will do in algebra. The half-page of ma- 
terial on “Description and Use of Algebra Prog- 
nosis Test" states that students with scores above 
114 (119 is perfect) are “capable of doing su- 
perior work," whereas those with scores below 
29 are “hopeless.” 

There is good reason to believe that all the 
subtests are speeded. The directions state, “You 
are not expected to finish any one test.” The 
number of problems per minute for the various 
subtests is 2.1, 3.5, 3-3, 2.3, and 3.0. The time 
limit for the last part, Algebraic Formulae, 
seems especially inadequate for any large pro- 
portion of students to finish. Such a high rate 
of speeding seems undesirable in a prognosis 
test, for it is likely to place a great premium on 
previous acquaintanceship with the kind of ma- 
terial involved. 

The first two subtests (Computation in Arith- 
metic and Problems in Arithmetic) probably are 
based on the thesis that ability in arithmetic is 
a good indicator of ability in algebra. The next 
two subtests (Number Relationships and Num- 
ber Series) are evidently numerical reasoning 
measures, The fifth subtest, Algebraic Formulae, 
seems to this reviewer to be an out-and-out 
measure of learning which tests ability to sub- 
stitute in formulas and to manipulate algebraic 
expressions. Like the problems part of the Cali- 
fornia Algebra Aptitude Test, this subtest seems 
to rely for its prognostic value on the belief that 
a student can surely learn this type of task if he 
already knows how. 

Since the publisher’s statements regarding re- 
liability and validity are brief, they may readily 
be quoted: “This test has a coefficient of relia- 
bility of .93 as given to over 800 students enter- 


ing the ninth grade in high schools and the fresh- 
man year in colleges. The validity as compared 
to actual grades has varied from .бо to .90, with 
an over-all score of .78.” The inadequacy of 
these statements is striking. The method of 
measuring reliability is not stated; if the tests 
are indeed speeded, and the value of .93 is, say, 
derived from correlating odd and even numbered 
items, the value quoted may be spuriously high. 
Furthermore, a better description of the group 
should be given, for a mixture of high school 
and college students (as the “over 800” are said 
to be) is likely to be rather heterogeneous. A 
reliability coefficient from a group varying 
greatly in ability, like this mixture, is likely to 
be much higher than one computed for a single 
class in a given school. Since the test calls for 
the obtaining of part scores, information should 
also be provided as to the reliability of these 
scores. It would be even better if standard errors 
of measurement were provided for both total and 
part scores. 

Likewise the inadequacy of the validity state- 
ment is clear. The circumstances under which 
the validities from .60 to .90 were obtained are 
not stated. Furthermore, a validity of .90 against 
“actual grades” is a value this reviewer finds 
rather hard to believe. In view of the scanty evi- 
dence on validity, one wonders whether any stu- 
dent who unhappily scores below 29 should be 
labeled as “hopeless,” as the test author would 
have us interpret his score. Also, if part scores 
are to be obtained, the validity of these should 
be described. 

The entire statement on norms is the follow- 
ing sentence : “The mean is 72 and the standard 
deviation is 17.” The reviewer doubts the use- 
fulness of these items to the average classroom 
teacher. The author does not even describe the 
nature or the size of the group on which these 
statistics are based, nor state when the testing 
was done. 

suMMARY. All of the subtests of this test ap- 
pear to be highly speeded. Items of the final, 
heavily weighted, subtest require a good deal of 
ability in the substitution in formulas and ma- 
nipulation of algebraic expressions—presumably 
the sort of thing to be taught in the course 
in algebra. The pamphlet accompanying the 
test gives only a brief, unsatisfactory state- 
ment regarding reliability and validity and 
for normative data provides only the mean 
and standard deviation for an undescribed 


group. 
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tions, the item on Form B of each test is very 
similar in principle to the corresponding item on 
Form A. The items are arranged in approximate 
order of difficulty. 

The items on both tests are of the short answer 
variety and are carefully chosen to avoid com- 
plicated algebraic expressions and tedious arith- 
metical computation. The 36 items in Part I, 
Test 1, constitute a good sampling of formal 
items requiring the exercise of such mechanical 
skills as fundamental operations with monomials 
and polynomials, simplification, special products, 
factoring binomials, difference of squares, sim- 
ple quadratics, variation, solution and evaluation 
of equations and formulas, and the construction 
and reading of line and bar graphs. Twelve of the 
18 conventional verbal problems in Part II re- 
quire the expression of relationships in symbolic 
language; the remainder call for both an equa- 
tion and its solution. 

Part I, Test 2, contains 20 items, of which 18 
are formal equations to be solved algebraically 
and 2 require the construction of a straight line 
and a parabola. Included in the wide range of 
skills necessary for the correct solution of these 
items are the fundamental operational axioms; 
the solution of literal equations, formulas, in- 
complete and complete quadratics, the radical 
equation, and simultaneous equations of the 
first degree; and the use of the quadratic for- 
mula, Part II consists of 25 standard verbal 
problems, in each of which the student is re- 
quired to set up and solve an equation. Each 
equation differs in some respect from all the 
others. The majority of the items can be an- 
swered by setting up a simple equation of the 
first degree ; in five items the use of the quadratic 
is involved. 

Administration of the tests is simple and 
straightforward. Directions are stated clearly 
and concisely, Ample space is provided in the 
booklets for computation, and the examiner is 
permitted to use his judgment in accepting cor- 
rect answers not included in the answer keys. 
Percentile ranks are tabled for total and part 
scores on each test. Information regarding the 
nature and selection of the standardization 
groups is meager. Test 1 was standardized on 
598 students completing the first semester’s work 
in high school algebra. Norms for Test 2, based 
on substantially larger numbers, are available 
for public school students completing the first 
year's work in high school and for private school 
students at the end of the first and second years 


of high school. The public school students re- 
sided in five cities. Either the sample of public 
School students was not representative or Test 
2 is too difficult, because the median scores for 
this group on Part II and the total test are 6 and 

16 while the possible scores obtainable are 50 
and 72, respectively. The adequacy of the norms 
in schools where problem solving receives the 
emphasis it merits is questionable. The extent 
of the agreement between scores on Test т and 
teachers' marks of 20 years ago is indicated by 
correlations of .68 and .72 for two groups of 
students; reliability coefficients based on the 
administration of both forms of Test 1 to these 
groups are .89 and .94. The odd-even reliability 
of Test 2, with the Spearman-Brown formula 
used, is .92 for an unspecified group. There is 
little information regarding the validity of Test 
2. 

Test 1 appears to bea suitable test for a course 
where the main emphasis is placed on conven- 
tional problem solving and on the ability to em- 
ploy mechanical and manipulative skills in fa- 
miliar situations. Test 2 gives due prominence 
to the use of the equation, but the examiner may 
wish to use as a final examination in first year 
algebra some test which measures a wider range 
of objectives and which lends itself to more 
meaningful interpretation. 


For reviews by L. B. Kinney and S. S. Wilks, 
$ее 40:1436. 


[387] 
*Cooperative Algebra Test: Elementary Alge- 
bra Through Quadratics. High school; 1932-5!; 
IBM; Forms S (42), T rote printings bear a 
1950 copyright), Y (48), Z (о); Forms 1932-37, 
N-R out of print; no data on validity; no specific 
manual; general Cooperative manual ('51); norms 
[37]; $2.50 per 25; 50¢ per specimen set, postpaid; 
separate answer sheets may be used; 80¢ per 25 
answer sheets; 15¢ per stencil for scoring answer 
sheets; cash orders postpaid; 40(45) minutes; L. 
Siceloff (S), Leone E. Chesire (S), Margaret P. Ми 
tin (S, T), Marion Е. Shaycoft (T), Robert S. Lank- 
ton (Y), and Bernice Orshansky (Z); Cooperative 
Test Division, Educational Testing Service. * 


REFERENCES 

1. pz, Házorp L. A Statistical Analysis of an Elementa) 
ME eur iere thenia, Omo ысы a , 
1940. . бо. tracts о] asters "Ses...., 1941, D. 30. 

iea STEWART, Naout; Bryan, MixrAM M.; AND BURKE, Les 
. "Correction for Guessing in the Scoring of Pretests: Ta 
pon Item Difficulty and Item Validity Indices, . Hier 
The Seventh Yearbook of the National Council on gamin "КЕ, 
Used in Education, 1949-50, Fairmont, W.Va.: the ae 
Fairmont State College, 1950. Рр. v, 55, xi. Paper, m 
raphed. ^ 
1 i TOWNSEND, AGATHA. “The Cooperative Mathematics To 
Program (Mathematics for Grades 7, 8, and 9, Elementary гу, 
ebra, Intermediate Algebra, Plane Geometry, Solid coma iea 
Trigonometry, and Secondary School Mathematics) : Relia on 
and Correlations With School Marks," pp. 89-95. (PA S fools 
In 1950 Achievement Testing Program in Independent $ No. 
and Supplementary Studies. Educational Records Bulletin, 
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54. New York: Educational Records B: , Рр. xiii 
$6 Paper, отер © ФА as: CNN July 1950. Pp. xiii, 
.. 4, Rositpa, M., Sister. "Is an 1.Q. an Index t. Algebra Abil- 
ity?" J Ed Res 44:391-93 Ja 's1. * (PA 25 18258) "i ` 


STANLEY CLARK, Associate Professor of Edu- 
cation, University of Saskatchewan, Saskatoon, 
Saskatchewan, Canada, [Reviews of Forms R, 
S, T, Y, and Z.] 

This test is designed to provide a measure of 
achievement at the end of the first year’s course 
in algebra. Standard procedures for administer- 
ing and directions for scoring and for interpret- 
ing the results are given in the general manual. 
Specific instructions, printed on the cover of 
the test booklet, are clear and adequate. No in- 
formation regarding its construction is supplied 
with the test. According to the publisher’s cata- 
log, the development of the Cooperative achieve- 
ment tests follows currently accepted practices 
with respect to curricular validation, prelimi- 
nary tryout, item analysis, and so on. 

With few exceptions multiple choice items 
are used throughout the five available forms of 
the test. The items are arranged, to some degree, 
in order of difficulty in each of the three parts of 
the test. Parts I and III of Forms R and S 
measure the manipulative skills; Part II con- 
tains verbal problems. Forms T, Y, and Z are 
somewhat similar in pattern: usage and under- 
standing of the mechanics of algebra are tested 
by the items in Part I, while Parts II and III 
consist of verbal problems. In the last three 
forms of the test more success has been achieved 
in devising items requiring understanding and 
in formulating a greater variety of problems. 

The following wide range of problems is cov- 
ered in the test: (a) Mechanical and manipu- 
lative skills (usage and to some extent under- 
standing) which include operations with mono- 
mials, fractional (Forms R, S only) and integral 
exponents, negative exponents (R, S) ; quad- 
ratic surds; simplification of algebraic expres- 
sions; signed numbers, parentheses, algebraic 
fractions; evaluation of expressions by substi- 
tution; special products: two binomials (S, Y, 
Z), square of binomial (R, S, Y); factoring : 
difference between two squares, complete quad- 
ratic (S, T, Y, Z), removal of common term 
(R, S, T, Y); solution of equations: funda- 
mental operational axioms, literal equations and 
formulas, roots of quadratic (S, T, Y, Z), sub- 
stitution, simultaneous equations; square root: 
estimated square root of a number (Y, Z), 
square root of an algebraic number ; quadratic 
formula (Z); and trigonometrical ratios (T). 


(b) Verbal problems which include: expression 
of relationships in symbolic language; setting 
up equations or formulas from stated conditions ; 
solutions of problems involving ratio and pro- 
portion, variation, time rates, mensuration, and 
so on; problems involving trigonometrical ra- 
tios (T, Y, Z); and problems involving the 
Pythagorean theorem (Y, Z). (c) Graphs which 
include reading statistical graphs; and func- 
tional graphs : given one coordinate, to find the 
other (R, S, T), solution of simultaneous equa- 
tions, identification of graph from equation (R, 
S, T, Z), functional dependence (Y), identifica- 
tion of graph given the relationship between the 
variables (Y). 

Some items may be correctly answered on the 
basis of some principle other than that intended. 
Item ro, Part III, Form Z, may be answered 
without any knowledge of the theorem of Py- 
thagoras. Where scale drawings are employed, 
particular care must be taken in constructing the 
items. The required angle in Item 1r, Part II, 
Form T, is obviously thirty degrees or less. The 
unknown base of the triangle in Item 10, Part 
III, Form Y, is exactly twice the stated alti- 
tude of goo feet. Both items can be solved with- 
out using the trigonometrical ratios. 

The test is easily scored. A fan key is pro- 
vided for scoring when the answers are placed 
in the booklets; a perforated key is available 
for hand scoring the answer sheets. Scores are 
corrected for guessing. Tables on the fan key 
and separate answer sheet convert raw scores to 
scaled score equivalents. The latter may be con- 
verted to percentile ranks. Reliability of the 
forms is stated in terms of the standard error 
of measurement. Norms are available for public 
secondary schools of the East, Middle West, 
West and South. The establishment of these 
norms predates the construction of the currently 
available forms of this test. Their applicability 
is apparently based upon the sequential equating 
of the various forms as they have appeared. 

This test should serve as a valuable tool in 
the school’s measurement and evaluation pro- 
gram. The abilities involved in solving problems 
for which no answer is provided and in check- 
ing the results against the stated facts are im- 
portant objectives which require a different ap- 
proach in their measurement. 


For a review by W. C. Brenke of Forms R 
and S, see 3:321; for reviews by Harl R. Doug- 
lass and Harold Fawcett of Form P, see 40: 
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1438; for reviews by William Betz and Helen 
Walker of Form 1937 and by S. S. Wilks of 
Form N, see 38:880. 


[388] 

*Cooperative Intermediate Algebra Test: Quad- 
ratics and Beyond. High school, 1933-51; IBM; 
Forms S ('50—same as test copyrighted in 1942), T 
(’43—some printings bear а 1951 copyright), Y (^48), 
Z ('5o—same as test copyrighted in 1949) ; Forms 
1933-37, N-R out of print; no data on validity ; no spe- 
cific manual; general Cooperative manual ('51); 
norms ['37]; $2.25 per 25; 50€ per specimen set, post- 
aid; separate answer sheets may be used; 80€ per 25 
вм answer sheets; 156 per stencil for scoring answer 
sheets; cash orders postpaid; 40(45) minutes ; John A. 
Long (S), L. P. Siceloff (S), Leone E. Chesire (S), 
Margaret P. Martin (T), Marion F. Shaycoft (T), 
M. Isobel Blyth (Y), and Bernice Orshansky (7); 
Cooperative Test Division, Educational Testing Serv- 
ice. 


REFERENCES 


EO онаи А. Р. The Prediction of Scholastic Achievement 
for Freshman Engineering Students of Purdue University, Pur- 


5020) 
2, Jounson, A. P. The Relationship of Test Scores to Scho- 
Engineering Freshmen Entering Pur- 
ғ, 1939. Doctor's thesis, Purdue Uni- 


[een Intermediate Algebra, Plane Geometry, Solid Geometry, 


and Correlations With School Marks," pp. 89-95. (PA 25:567) 
chools 


54. New 
119. Paper, lithotyped. 

Lucien B. Kinney, Professor of Education, 
Stanford. University, Stanford, California. 

This test is intended primarily as an evalua- 
tion of end-of-course achievement in intermedi- 
ate algebra. Two sets of percentile norms are 
included—one based on the results of 4,000 pu- 
pils in sixty schools at the end of grade 11 after 
one and one half years of study of algebra, the 
other based on the results of 2,000 pupils in 
thirty schools in grade 11 after two years of 
study of algebra. 

The content covers materials included in the 
typical second course in algebra, including 
quadratics and beyond. It is intended to provide 
for functional rather than formal testing of the 
more frequently emphasized skills and abilities. 
The test items are exclusively of the multiple 
choice type, in order to provide for machine 
scoring. This places emphasis on ability to 
recognize, rather than to produce, the correct 
answer. Inspection of items suggests that in 
some, but not in all, an actual solution of the 
exercise would be more economical than simply 
a checking of the various alternatives. In Forms 
T through Z this factor is taken care of to some 
extent by having “none of the above,” or an 


equivalent, included occasionally as one choice, 
Information is needed, however, as to the degree 
to which the test actually measures ability to 
produce the correct answer. 

The several forms of the test show two differ- 
ent patterns of organization. Forms R and § 
each consist of three parts, of which Part II is 
made up of verbal problems, and Parts I and 
III are devoted to manipulative exercises. The 
time limit of ten minutes allowed for the solu- 
tion of 12 verbal problems in Part II places a 
premium on speed. Forms T, X, Y, and Z are 
set up in two parts, with verbal problems com- 
prising a portion of Part II. In Form T, for 
example, 18 of the 23 exercises in Part II are 
verbal problems. Fifteen minutes is allowed for 
this part, placing similar emphasis on speed. 

Forms T through Z give more attention to 
the functional understanding of algebra, and 
less to the manipulative abilities, than do the 
earlier forms. The emphasis on interpretation of 
graphs and formulas, and on the language of 
variation, in these later forms is commendable. 
Not so commendable, however, is the failure to 
keep the information in statistical graphs up-to- 
date. The data in the graph on rate-of-climb for 
airplanes was probably obsolete when first used 
in 1943 (Form T, Part II, Items 6 and 7) ; it 
certainly was in 1949 (Form Z, Part II, Items 
16 and 17). d 

The test is well designed for administrative 
convenience, Testing time of 40 minutes is re- 
quired for each form, thus allowing for adminis- 
tration during a single class period. The manual 
provided gives detailed directions for adminis- 
tering tests of this type. Special attention is given 
to the problem of administering tests to large 
groups, giving directions to the students, pre- 
paring the materials for testing, and handling 
various common emergencies. Directions for 
scoring include the use of scoring keys and the 
scoring stencils which are supplied for each 
form. The plan of operation for scoring large 
numbers of tests is covered in detail. The com- 
puting of scaled scores and the use of percentile 
norms are adequately covered, and the use of 
various norms is clearly explained. 

In general, this is a good test, designed under 
the direction of competent authors and expert 
test technicians. The limitations of the test аге 
those imposed by the use of the multiple choice 
questions and the strict time allowances which 
place at least as heavy emphasis on speed as оп 
mathematical understanding. 
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E. P. STARKE, Professor of M. athematics, Rut- 
gers University, New Brunswick, New Jersey. 
[Review of Forms R, S, T, Y, and Z.] 

This is an objective type test comprising 64 
to 68 multiple choice items (56 in Forms R and 
S) to be completed in 40 minutes. Virtually all 
the topics usually associated with intermediate 
algebra are covered to some degree, with con- 
siderable emphasis upon the ability to translate 
verbally stated problems into their mathematical 
equivalents. Forms T, Y, and Z include ques- 
tions in numerical trigonometry, while an appre- 
ciation of the function concept is touched upon 
in Forms Y and Z. In a review in The Third 
Mental Measurements Yearbook, L. B. Plum- 
lee pointed out that “the majority of items [in 
Forms R, S, T], both formal and verbal, are 
typical of those found in algebra textbooks and 
test memory of certain concepts and processes, 
rather than the use of these concepts and proc- 
esses in solving problems with novel settings.” 
It should be noticed that Forms Y and Z show 
definite improvement in overcoming this defect. 
Very little, however, has yet been done to over- 
come the most fundamental objection to the 
multiple choice type of test ; namely, that it fails 
to measure the student's power of analysis, his 
comprehension of logical principles, or even the 
accuracy of his work. The short time allowed 
per item and the necessity of presenting exactly 
five responses to each make this fault a hard one 
to correct. 

For a number of items in Forms T, Y, and Z 
one of the options presented is “None of the 
above” or “The answer is not given." This is a 
real improvement over the earlier forms since, 
when one of these options is used, it becomes 
difficult for the student to work the problem 
backwards or to hit upon the answer by mere 
elimination of obviously far-fetched alternatives. 
More care should be taken, however, to insure 
that the results of the most common errors are 
included among the alternatives, whether “None 
of the above" is one of them or not. For if neither 
it nor the results of the most common errors be 
included, then many students who make these 
common errors, by not finding their results 
listed, will have received an effective hint that 
they should try again. On the other hand, if 
“None of the above” is listed but the correct an- 
swer is not, then all students who obtain wrong 
answers that are not listed will automatically be 
counted right. Both situations tend to lessen the 


validity of the test. 


Considerable care has been shown in the 
preparation of the tests. They are well printed 
and arranged, except that fractional exponents 
in Forms S and Z are too small. Adequate di- 
rections for administering and scoring the tests 
are provided. Norms determined from test re- 
sults of 4,000 12-grade-system students and 
2,000 11-grade-system students are available. 

The tests are fairly free from objectionable 
wording, although the following should be noted. 
In Form Y, 1-29, “The expression has no fac- 
tors" would better read “The expression can- 
not be factored.” In Form Z, I-4, “If equation 
ig subtracted from equation... the resulting 
equation is" could be improved. In Form Z, 11-8 
is definitely wrong, the cited function being a 
step-function which is not equal to any of the 
choices given. Some students might be confused 
by the two graphs on one plot (Forms T and 
Z). There will be some quarrel with including, 
as concepts of intermediate algebra to be tested 
here, “slope” and “y-intercept” (Forms Y and 
Z), the factor theorem (Forms Y and Z), the 
minimum point of a vertical parabola (Form 
Z), and "antilogarithm" (Form 2): 

As has been previously noted, Form T, par- 
ticularly Part II, requires considerably more 
time than has been allowed. Forms Y and Z are 
somewhat better in this regard. 


For a review by L. B. Plumlee of Forms R, 
S, and. T, see 3:322; for reviews by Albert A. 
Bennett and Earle R. Hedrick of Form P, see 
40:1439; for reviews by J. O. Hassler and S. S. 
Wilks of Form 1937, see 38 :88т. 


[389] 
XElementary Algebra: Achievement Examina- 
tions for Secondary Schools. High school; 1951; 
1 form ; no data on reliability and validity ; no manual ; 
Minnesota norms (median and quartile deviation) 
available; similar norms for other regions by special 
arrangement with publisher; 7% per test, postage ex- 
tra; 60(65) minutes; edited by Walter W. Cook; pre- 
pared by a curriculum committee of high school teach- 
ers for use in the Minnesota State Board Achievement 
Examinations Program; Educational Test Bureau, 
Educational Publishers, Inc. * 


[390] 

*Elementary Algebra: Every Pupil Test. High 
school; 1929-51; new form usually published each 
April and December ; form December 1951; no data on 
reliability and validity; no manual; norms ('51) ; 2 
per test; 16 per answer key ; Hur ne ; 40(45) min- 
utes; Ohio Scholarship Tests, hio State Department 
of Education. * 


[391] 
*First Year Algebra Test: State High School 
Tests for Indiana. 1, 2 semesters high school; 1933- 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 500 


392 


50; 2 levels ; mimeographed ; no data on reliability and 
validity; no manual; no norms; 416€ per test; 15¢ per 
specimen set; postpaid; 40(45) minutes; Paul C. 
Baker; State High School Testing Service for Indi- 
ana, Purdue University. * 

а) FIRST SEMESTER. Form B [50]; Form А out of 


print. 
b) SECOND SEMESTER. Form О [’50]; Form N out of 


print, 


[392] 
*First Year Algebra: 20th Century Test. 1, 2 
semesters high school; 1949; 2 levels; Form A; no 
data on reliability and validity; no norms—author rec- 
ommends the use of local norms; 10 or more copies, 
5¢ each, postage extra; 35¢ per specimen set, postpaid ; 
40(45) minutes; [Ardis Sanders]; Benton Review 
Publishing Co, Inc. * 
D TEST I. I semester high school. 
b) TEST п. 2 semesters high school. 


[393] 

Iowa Algebra Aptitude Test, Revised Edition. 
High School; 1929-42; 5 scores: arithmetic, abstract 
computation, numerical series, dependence and varia- 
tion, total; IBM; 1 form, '42; $1.35 per 25; 25€ per 
manual ('42); 35€ per specimen set, postpaid; sepa- 
rate answer sheets may be used; 36 per IBM answer 
sheet; 20€ per set of stencils for machine scoring of 
answer sheets; postage extra; 35(40) minutes; Harry 
A. Greene and Alva H. Piper; Bureau of Educational 
Research and Service, State University of Iowa. * 


REFERENCES 


1. See 40:1441. 
2-3. See 3:327. 


Hanorp GULLIKSEN, Professor of Psychology, 
Princeton University; and Research Advisor, 
Educational Testing Service; Princeton, New 
Jersey. 

This test is composed of four separately 
timed subtests, yielding four subscores and a 
total score. All of the items are four-alternative 
multiple choice. No information is given in the 
manual regarding the percentage of examinees 
completing the later items in each part. How- 
ever, the test seems speeded judging by the num- 
ber of items and time limits for each of the parts 
and by the fact that this reviewer exceeded the 
time limit on two of the four parts. 

The quality of printing and general format 
are satisfactory, but the type size used for frac- 
tions could profitably be made somewhat larger. 
All problems are well stated. The key is correct. 
Correct answer “not given” is one of the four 
alternatives for every problem in the first three 
sections and is the appropriate answer in a rea- 
sonable number of items. Such a practice seems 
excellent to this reviewer. 

The reliability of the test is given as .87 for the 
revised edition, .88 and .92 for the old edition. 
The probable error of measurement and the 
standard deviation are given for each group. It 


is unusual to find such complete information 
given in a test manual. 

Two validity coefficients are given (.66 with 
algebra grades and .76 with the Columbia Re- 
search Bureau Algebra Test), as well as inter- 
correlations and validities for two other uni- 
dentified "prognostic tests" which had a lower 
validity. Unfortunately, no information is given 
on the mean, variance or error of measurement 
so that approximate validities for other groups 
cannot be estimated. 

Percentile norms are presented for the total 
test and each of the four subtests. One table is 
based on over four thousand eighth grade stu- 
dents taking the test at the end of the year. An- 
other table is based on over five thousand begin- 
ning high school students tested in September. 
Norms are also given separately for boys and 
girls. 

The illustrative example given in the manual 
demonstrates how to interpret part scores de- 
spite the fact that no information is given on 
their reliability or validity. To this reviewer it 
seems more appropriate to caution the test user 
against such a practice. The manual also sug- 
gests a cutting score: "it is probable that stu- 
dents ranking below the twenty-fifth percentile 
will find the subject so difficult that they should 
be excluded or diverted into a course in simpli- 
fied mathematics." The evidence which would 
justify such a statement is not presented. It 
seems very unfortunate that a manual so com- 
plete in most other respects does not include a 
scatter diagram of algebra grades against test 
scores or give information for specified groups 
on the probability of success in certain courses 
at different aptitude test score levels. The man- 
ual suggests briefly that “it may be desirable for 
the teacher to establish this lower critical limit 
experimentally at least once in his own schools.” 
But the teacher is given no suggestions regard- 
ing an appropriate way to do this. 

An unpublished preliminary experiment done 
in 1929 is reported briefly in the manual. The 
study compared a group test of mental ability, 
an arithmetic survey test, a verbal problem 
analysis test in arithmetic, a silent reading test, 
and an experimental edition of the Jowa Al- 
gebra Aptitude Test. It is reported that the 
arithmetic survey and reading tests constituted 
“the best two-test combination.” It is also indi- 
cated that the experimental form was revised 
as a result of this study. However, there is no 
indication of the nature of these revisions, an 


şor | 


no evidence is presented regarding the validity 
of the revised test as compared with the "best 
two-test combination." 

To summarize, this test is well constructed and 
should be seriously considered and tried out 
where it is desirable to predict success in first 
year algebra. Since it is a speeded test, work 
still remains to be done on the comparative value 
of speed versus power tests for predicting suc- 
cess in algebra. In view of the finding cited in 
the unpublished study referred to above, the use 
of an arithmetic achievement test and a reading 
test should also be considered, especially where 
a coordinated testing program serving several 
purposes can be planned. 


Emma Spaney, Assistant Professor of Psy- 
chology, Queens College, Flushing, New York. 

The problems and exercises in this test are of 
the 4-choice type, having only one right answer, 
and containing the option “not given” or “can- 
not tell” for each item. In this way, it is possible 
for each student to mark an answer to any item 
which he attempts. 

Part 1 of the test consists of 30 arithmetic examples 

covering addition, subtraction, multiplication, and divi- 
sion of whole numbers, fractions, and decimals. 
The format of this part is somewhat confusing, 
since, in a few cases, items are indented from the 
left-hand margin in order to leave space for 
computation of preceding items (5, 11, 16, 19, 
20, 30). Dotted lines run from the item across 
the page to the choices, and sometimes interfere 
with the computation of preceding items (16, 
22, 23). In this reviewer’s opinion, it would be 
better to set the items in two columns and place 
the choices under each question, leaving space 
around the items for computation. 

Part 2 consists of 25 verbal FIM involving the 
use of simple arithmetical ап algebraic procedures 
with abstract values. 

These items would be considerably improved by 
using boldface type for the “abstract values, 
to differentiate them from the text. 


Р i jally designed number- 
art 3 consists of 40 specially pei ed upon to 


Here, again, the format needs improvement. 
Some of the number series contain exponents 
Which are so microscopic in size that it is diffi- 
cult to distinguish a 3 from an 8 or, indeed, to 
see them at all unless one has excellent visual 


acuity. 
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Part 4 is composed of specially designed exercises 

which call upon the pupil to identify the effect on one 
value in an equation which results from the variation 
(increase or decrease in value) of another value, This 
test, though brief [ro items] 1з a valuable addition to 
the battery. 
No evidence to support this statement appears 
in the manual, although statistical results deal- 
ing with the validity of the total score on the 
test are given. 

The manual reports correlations with teach- 
ers' grades in algebra at the end of the first 
semester for 105 students who also took two 
other prognostic tests in algebra and a standard- 
ized achievement test. The Iowa test correlated 
.76 with the achievement test and .66 with teach- 
ers’ grades. Both of these correlations indicate 
significant relationships between total test scores 
and achievement. 

Using the Kuder-Richardson “foot-rule” 
formula for reliability, the probable error of a 
score has been estimated as 3.33 score points, 
based on a sample of 336 unselected eighth grade 
pupils prior to beginning the study of first year 
algebra. On the basis of this probable error, one 
can deduce, through the use of normal curve 
statistics, that there are 95 chances in 100 that 
a pupils true score on this test is within approx- 
imately 10 points of his obtained score in either 
direction. On the basis of a maximum possible 
score of 105 points on the total test, this is suffi- 
ciently close for purposes of sectioning students. 
Reference to the percentile norms based on 4,379 
eighth graders tested in May, and on 5,786 pros- 
pective algebra students tested in September, in- 
dicates that a 10-point raw score difference in 
total score would reflect a difference in per- 
centiles of roughly то to 15 percentile points at 
the probability level under consideration. This is 
adequate for purposes of sectioning the students 
at the beginning of the semester. 

Directions for administering and scoring the 
tests and for interpreting the results are 
clear. 

In the opinion of this reviewer, the authors 
have demonstrated usability of the total score 
for sectioning purposes. The format could be 
improved considerably, however. Any use of the 
part scores for diagnostic purposes should be 
considered on the basis of validity and reliabil- 
ity figures for the parts. It is suggested that such 
a study not be done until the format has been 
revised, since extraneous factors may be meas- 
ured in the present form possibly distorting 
part-score results, although not seriously af- 
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fecting statistics based on the longer total test 
score. It is debatable whether percentiles should 
be reported for the separate parts in the absence 
of validity and reliability data, since this might 
give misleading information for an individual 
student. 


For a review by David Segel, see 3:327; for 
reviews by Richard M. Drake and M. W. Rich- 
ardson of an earlier edition, see 40:1441. 


[394] 

*Lankton First- Year Algebra Test: Evaluation 
and Adjustment Series. Grades 9-13; 1951-52, 
c1950-51; IBM; Forms AM (51), BM (752) ; manual 
(51); $2.15 per 25; separate answer sheets must be 
used; 80€ per 25 IBM answer sheets; postage extra; 
35€ per specimen set, postpaid; 40(50) minutes; Rob- 
ert Lankton; World Book Co. * 


REFERENCES 
т, LANKTON, Rosert STANLEY. Evaluation sf Achievement 


and Comparisons of Achievement in First Year ebra of Pub- 
lic High School Students Grouped According to Their Mathemati- 
cal Backgrownds and Interests. Doctor's thesis, University of 
Michigan (Ann Arbor, Mich.), 1951. Abstract: Microfilm Ab- 
stracts 11:584-s no 3 ’51. (PA 26:1679) 

STANLEY CLARK, Associate Professor of Educa- 
tion, University of Saskatchewan, Saskatoon, 
Saskatchewan, Canada. 

The primary purpose of this test is “to meas- 
ure the extent to which students have achieved 
the important objectives of a high school course 
in first-year algebra." The content of the test 
and the objectives to be measured were carefully 
selected after an extensive study of commonly 
used textbooks, representative courses of 
study, and relevant publications of national or- 
ganizations. The objectives are outlined in the 
manual under two main headings: meaning— 
understanding the language of algebra, and skill 
—using the language of algebra. About 80 per 
cent of the items were designed to measure the 
objectives included in both categories ; the other 
20 per cent measure primarily the objectives in- 
cluded under the first heading. During the pre- 
liminary tryout teachers were invited to criti- 
cise the test with respect to content, item form, 
administrative directions, and so on. Pertinent 
suggestions received were taken into consider- 
ation when the final forms were developed. 

The test is described as being essentially a 
power test and contains 55 5-response multiple 
choice items. The two forms have been carefully 
balanced with respect to content, coverage of 
objectives, item difficulty, and item validity. 
The preliminary edition, consisting of three 
forms of 75 items each, was administered to ap- 
proximately two thousand students in 20 high- 
schools from 7 states. Subsequently, indices of 


difficulty and validity were computed for each 
item ; the difficulty indices for the items used in 
the final forms are tabled in the manual. The 
items on each form are arranged very closely to 
their order of difficulty. 

During the standardization program, the test 
was administered to over three thousand stu- 
dents in 57 schools scattered through 22 states. 
The median CA of the students in the normative 
group was 15 years I month, and their median 
10 on the Terman-McNemar Test of Mental 
Ability was 106. This information is furnished 
in the manual to enable the prospective users to 
compare their students with the standardization 
group. A table on the answer sheet provides nor- 
malized standard score equivalents for the raw 
test scores, facilitating the uniform interpreta- 
tion of all tests in the Evaluation and Adjust- 
ment Series. Percentile norms are provided in 
the manual for use in interpreting results at the 
end of the course in first' year algebra. 

Corrected split half reliability coefficients in- 
dicate that the reliability of this test is moder- 
ately high. Coefficients of .87 and .84 were found 
for groups of 134 and 155 respectively. The cor- 
relation between the two forms of the test ad- 
ministered less than one week apart to a group 
of ОО students was .81. 

The manual contains much instructive infor- 
mation regarding the construction, administra- 
tion, scoring, and uses of the test. The material is 
sound and readable. Directions for giving the 
test are simple, clear, and complete. A perfor- 
ated stencil is available for hand scoring. 

There are discrepancies in some of the items. 
In Item 46, Form A, responses g and h are 
equivalent. In Item 3, Form B, the expression 
“gain or loss” is used in the stem. Either “gain 
or “loss” should be omitted because choices 4 
(—9 yd.) and е (9 yd.) are ruled out according 
to the existing wording of the item. The use of 
“none of the above” in such questions as Item 
45, Form B, is not feasible because one of the 
numerical answers must meet the conditions of 
being “most nearly” the desired ratio. In Form 
B, the expression “none of the above” is used 14 
times as a distracter. In no case is it the correct 
answer. 

The teacher of first year algebra who is look- 
ing for a well constructed test will do well to 
consider this test in the light of the objectives of 
his course. As the manual suggests, if some of 
the local objectives do not receive the desired 
emphasis on the test, teacher made measurement 
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devices can be employed to secure the supple- 
mentary information. 
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xLarson-Greene Unit Tests in First-Year Al- 
gebra. High school; 1947; 6 tests in 1 booklet; Forms 
X, Y; $3.50 per 25; separate answer sheets must be 
used; 40€ per 25 answer sheets for any one test; 50¢ 
per complete set of scoring stencils; тоё per scoring 
stencil for any one test; 25¢ per manual; postage ex- 
tra; 50@ per specimen set, postpaid; Robert E. Larson 
and Harry A. Greene; Bureau of Educational Re- 
search and Service, State University of Iowa. * 
a) TEST I. 5 scores: literal notation, simple equations 
and formulas, simple graphs, directed numbers, total ; 
40(45) minutes. 
b) TEST п. 3 scores: fundamental operations, equations 
of the first degree (one unknown), total; 38(45) min- 
utes. 
C) TEST III. 3 scores : equations of the first degree (two 
unknowns), special products and factoring, total; 39 
(45) minutes. 
d) TEST IV. 3 Scores: 
total; 36(40) minutes. 7 
е) TEST V. 3 scores: variation, indirect measurement, 
total; 38(45) minutes. 
f) TEST уг. 3 scores: powers-roots-radicals, quadratic 
equations, total; 39(45) minutes. $ 


[396] 
*Orleans Algebra Prognosis Test, Revised Edi- 
tion. High school ; 1928-51; 1 form, ’50; manual (51); 
$2.10 per 25, postage extra; 356 per.specimen set, post- 
paid; 39(44) minutes; Joseph B. Orleans; World 
Book Co. * 


fractions, fractional equations, 


REFERENCES 


1-4. See 40:1444. 4 “4 - P 
s. SEAGOE, May V. “Prediction of Achievement in Foreign 
Languages.” J Appl Psychol 22:632-40 D '38. * (PA 13:2725) 


Hanorp GULLIKSEN, Professor of Psychology, 
Princeton University; and Research Advisor, 
Educational Testing Service; Princeton, New 
Jersey. 

This test consists of 11 subtests, 9 of which 
are each preceded by a separately timed “Jesson” 
which the student must carefully study in order 
to work the problems. The total testing time of 
39 minutes is divided into 20 periods of from І 
to 3 minutes each. A total of 17 minutes is al- 
lotted for *study" and 22 minutes for working 
problems. Since 76 problems are to be worked 
in this time, there is a certain premium on speed 
of performance. The overall working time on the 
preceding form (1928) of the test was 81 min- 
utes for 162 items. It is interesting to note that 
in assembling the new (1951) form, items were 
selected primarily from the first halves of the 
various tests of the old form. Only 20 items were 
deleted from the first halves of the tests that 
were shortened, while 37 items were deleted 
from the second halves. It may be that items af- 
fected by speeding were not as good discrimi- 
nators as those not so affected. f 

The format is good and the working space 
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adequate ; however, the fractions and exponents 
are not in all cases clearly legible for persons 
with subnormal vision. The directions for ad- 
ministering are clear and the key is correct. 

A stepped-up split half reliability of .92 is 
cited in the manual. Since the test is speeded, 
this reliability may well be spuriously high, 
while the standard error of measurement may be 
spuriously low. Thus, no satisfactory evidence 
on reliability of the test is presented. 

The evidence on validity is better. The manual 
reports correlations of .59 and .60 between 
scores on the revised edition of the test and 
scores on the Seattle Algebra Test administered 
after a half year of study. The manual also cites 
"evidence of the validity of the original Orleans 
Algebra Prognosis Test. * The correlation be- 
tween the prognosis and achievement test scores 
was .71.” This statement is curious since the 
1932 manual states that “the multiple correlation 
between the prognosis test and the achievement 
test scores was .71." This figure is for 250 stu- 
dents in New York City tested in 1927. One 
wonders if these same validity data are being 
cited 23 years later without mentioning the date 
or the fact that a multiple correlation is being 
cited. 

The manual gives an expectancy chart with 
detailed directions for making such charts. This 
seems like a very good idea that might be more 
widely used in test manuals. Percentile norms 
for total test score are also given. No norms or 
any method for interpreting part scores is pre- 
sented. This is an excellent idea since the part 
scores are probably not reliable enough for such 
use. 

In interpreting scores, the manual states that 
“students whose prognosis scores are below 25 
will have great difficulty in achieving satisfac- 
tory grades in algebra." (A score of 25 corre- 
sponds to a percentile of 18.) If one accepts à 
score of 16 or more on the Seattle Algebra Test 
as satisfactory, then the expectancy chart shown 
is evidence for the use of such a cutting score. 

In summary, this test is well worth consider- 
ing as an algebra prognosis test. However, the 
fact that it is speeded with no evidence for effec- 
tiveness of speeding is unfortunate. The direc- 
tions for establishing local norms are good and 
evidence for validity is satisfactory. Evidence 
on reliability leaves much to be desired. 


Emma ЅрАмЕҮ, Assistant Professor of Psy- 
chology, Queens C ollege, Flushing, New York. 
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The problems in this test are all of the comple- 
tion type, with spaces indicated for the student 
to record his answers. 

COMPOSITION AND FORMAT. There are II 
parts to the test, 10 of which consist of both a 
lesson and a test, covering basic elements in- 
volved in learning algebra successfully : “(1) an 
appreciation of the use of symbols (letters) to 
represent numbers, (2) the ability to substitute 
values for these symbols, (3) the ability to rep- 
resent quantities by means of symbols and to 
use them, (4) the ability to express relationships 
by means of symbols, and (5) the combination of 
all of the above in solving problems.” The format 
is good. Boldface type and italics have been used 
to good advantage, and the spacing is designed 
to allow for computation. Directions to the stu- 
dent are clear and concise. 

vALIDITY. Correlations between prognosis 
test scores and scores on a standardized achieve- 
ment test were .60 and .59 for 278 beginning al- 
gebra students in one school, and 94 in another 
in the fall of 1950. The point is well taken in the 
manual that these correlations are conservative, 
since several students were dropped before the 
end of the semester, thus curtailing the range of 
talent. 

NORMS AND THE INTERPRETATION OF RE- 
SULTS. Percentile norms for total scores only are 
given, based on 1937 beginning algebra students 
in the ninth grades of six New York City high 
schools tested in the fall of 1950. The technique 
for determining a critical score to separate the 
probable failures from the probable successes on 
the basis of an achievement test as criterion is 
very well explained and illustrated with the data 
from the group on whom reliability statistics 
were computed. A particular school can thus set 
up its own critical or cutting scores on the basis 
of data locally collected. A table showing proba- 
bility of success corresponding to total score in 
three categories (high, average, poor risk) is 
provided as a guide for the teacher in interpret- 
ing a particular score. Although the basis for 
this categorization is not clearly stated, the 
points of demarcation correspond to the 16th 
and 84th percentiles approximately, indicating 
that on the assumption of normality of total score, 
the two extreme categories contain students who 
deviate from the mean by at least one standard 
deviation, while the middle category contains 
students who deviate from the mean by less than 
one standard deviation. Before this table can be 
adopted for use in a particular situation, the 


local procedure for failing students should be 
carefully considered in relation to the 16 per cent 
placed in the category of “likely to fail.” It is pos- 
sible that an individual teacher will wish to re- 
vise this in terms of local practice. 

ADMINISTRATION AND SCORING. Directions 
for administration and scoring are clear and 
concise. The scoring key would be improved, 
however, if the subtest answers were arranged 
so that a fan key could be made. At the present 
time, the scorer must refold the key each time 
it is used, which makes handling awkward and 
slows down the scoring process. 

In this reviewer's opinion, the test and man- 
ual are very carefully worked out and clearly 
presented. Evidences of validity and reliability . 
are sufficient to warrant using the test for sec- 
tioning and for work with individual students. 


For a review by S. S. Wilks of the original 
edition, see 40:1444. 
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*Seattle Algebra Test: Evaluation and Adjust- 
ment Series. High school; 1951; IBM; Form АМ; 
$1.75 per 25; separate answer sheets must be used; 

per 25 IBM answer sheets; postage extra; 356 per 
specimen set, postpaid; 40(50) minutes; Harold B. 
Jeffery, Earl E. Kirschner, Philip Stucky, John К. 
Rushing, David B. Scott, and Otie P. Van Orsdall; 
World Book Co. * 
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*A.C.E.R. Arithmetic Tests. Grades 3-8; 1931-51; 
6 parts; Forms C [’46], D [’51]; Forms A, B out of 
print; manual ['51]; 6s. 6d. per то sets of all parts; 
45. per IO sets of parts 1-4; 3s. 6d. per set of Scoring 
stencils for parts 1-4; 3d. per key for part 5; 3d. per 
key for part 6; 5s. per manual; 9s. 6d. per specimen 
set; cash orders postpaid within Australia; Australian 
Council for Educational Research. * 
@) PART I, ADDITION. Is. per 10; 4(10) minutes. 
b) PART II, SUBTRACTION. Is. per 10; 4(10) minutes. 
C) PART III, MULTIPLICATION. Is, per I0; 4(10) minutes. 
d) PART IV, DIVISION. Is. per 10; 4(10) minutes. 
€) PART V, ARITHMETICAL PROCESSES. Is. 6d. per I0; 20 
(30) minutes. 
f) PART VI, TERMS AND RELATIONS. Is. 6d. per 10; 20 
(30) minutes. 


[399] 

XA.C.E.R. Number Test. Ages 13.5 and over; 1942; 
I score: simple addition and multiplication; 1 form; 
no data on reliability and validity ; 2s. 7d. per 10; 15. 
per set of scoring keys; 6d. per manual; 15, 100. рег 
specimen set; cash orders postpaid within Australia; 
10(20) minutes; Australian Council for Educationa 
Research. * 


Lesie M. Haynes, Head, Department of Ap- 


plied Psychology, Sydney Technical College, 
Sydney, New South Wales, Australia. 
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The Australian Council for Educational Re- 
search has pioneered the field in Australia in 
test construction and particularly in the con- 
struction of tests of attainment in reading and 
arithmetic. The experience gained over the last 
two decades has been well used in the A.C.E.R. 
Number Test, which aims to assess the ability of 
subjects in the age range 13 years 6 months to 
adulthood in the fundamental arithmetical op- 
eration. 

The test consists of two subtests: 180 items in 
addition and 300 items in multiplication. The 
items are well graded and the whole is an ex- 
cellent example of test construction. The test, 
which is set up in clear type and in one booklet, 
is supported by an adequate manual and an easily 
manipulated marking key. 

Norms are supplied for boys and girls sepa- 
rately, adults, male and female university stu- 
dents, and male technical college students. The 
norms for children are more satisfactory than 
those supplied for the other groups ; although the 
figures for children are given for only one year 
and in only one Australian state. Moreover, 
there is no indication as to whether a correction 
was made for the select population which re- 
mained after the compulsory school-leaving age 
of 15 years was reached. 

The adult norms were "estimated by the 
Royal Australian Air Force Vocational Guid- 
ance Section from data collected by them." Al- 
though such a procedure is permissible as a 
beginning, it is to be hoped that the ACER has 
been collecting data to enable it to give further 
information about adults. It might well be that 
the "figures" which "suggest a drop in the mean 
score after leaving school," are due either to 
insufficient data or to an unfortunate compari- 
son with the select population at the 15 year old 
level. 

The data for university population was taken 
from one Australian university over a period 
from 1943 to 1947. Norms for the whole first 
year population and by faculties are available. 
Some of the faculty norms are not reliable be- 
cause of the small populations. Validity coeffi- 
cients are available for students taking courses 
in engineering (N = 74, r = .26), physical sci- 
ence (N = 70, r — 73) and pure mathematics 
(М = 70, r = .35)- 

The norms for technical college students 
might be more confidently received if they were 
derived from data from all Australian technical 
colleges. 


The A.C.E.R. Number Test aims to meet a 
long felt need in educational guidance, voca- 
tional guidance, and "vocational" selection. This 
need is only partly provided for, and it will re- 
main so until more complete norms, further vali- 
dation data, and alternative forms of the test 
are available. 


[400] 
XArithmetic Essentials Test. Grades 3-4, 5-6, 7-9; 
1949-51; 3 levels; Forms А (^49), B (^51) ; no data 
on reliability and validity; no manual; no norms ; $1.80 
рег 30; 10Ф per specimen set; postpaid; nontimed (40- 
50) minutes; James T. Shea; Steck Co. * 


Foster E. GROSSNICKLE, Professor of Mathe- 
matics, New Jersey State Teachers College, Jer- 
sey City, New Jersey. 

Each of the three tests in this series consists 
of 50 items. Two of the tests have 40 examples 
and 10 verbal problems; the third test has 39 
examples and 11 verbal problems. There are no 
norms, A class analysis blank accompanies each 
test. The instructions on this blank direct the 
teacher to keep a record of each pupil's response 
to each item in the test. This analysis enables the 
teacher to find the type of example or verbal 
problem each pupil misses. 

Since the primary purpose of the test is to 
find the point at which a pupil has difficulty in a 
given process, the reviewer assumes that the 
function of the test is diagnostic. If this assump- 
tion is correct, the test has certain defects which 
make it an unreliable test. A diagnostic test must 
have at least three samples of a given type in 
order to provide a reliable diagnosis. The test 
under consideration does not meet this require- 
ment. Furthermore, many of the known types 
of examples in the different processes are not 
sampled. 

There is an evident lack of consistency in la- 
beling the answers to some of the problems in- 
volving measurement. In the test for grades 5 
and 6, for example, one problem involves yards, 
and the answer on the key calls for a labeled 
answer in yards; the next problem involves 
weight expressed in pounds, and the answer on 
the key is 9, and not 9 pounds. There are other 
similar inconsistencies dealing with this phase 
of work. There are no directions for scoring the 
test. If a pupil omits a label, but gives the cor- 
rect numerical value for the answer, the teacher 
does not know whether to mark the answer right 
or wrong. 

On the whole, the types of examples offered 
are commendable. This is especially true of those 
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types dealing with common fractions. The de- 
nominators of fractions to be added or subtracted 
are representative of the kind found in prob- 
lems which have social significance. On the other 
hand, it is unfortunate that in the addition and 
subtraction of decimal fractions so many ex- 
amples have been set up with ragged decimals. 
Examples involving ragged decimals, in which 
the numbers to be added or subtracted are ex- 
pressed in different degrees of accuracy, have 
no social significance. As long as standard tests 
perpetuate material of this kind, the teacher feels 
obligated to teach topics which should be elimi- 
nated from the curriculum. 

It is difficult to appraise this series of tests 
since it is not possible to know the specific func- 
tion each test is to serve. The tests do not include 
enough samples to be diagnostic. No norms are 
given; therefore the tests cannot be used for 
survey purposes. The reviewer does feel, how- 
ever, that the classroom teacher may use the 
tests to advantage for finding the comparative 
achievement of pupils within the class group. 


Cuartes S. Ross, Associate Editor, Silver 
Burdett Company, New York, New York. ә 

Each of the three tests consists of 50 items, 40 
of which are computational examples divided 
among the various processes to be tested. The 
remaining ІО items in each test are verbal prob- 
lems. Working space is provided in the test 
booklets. 

The author emphasizes the importance of 
analyzing the test to find the specific difficulties 
of the individual, and of following up the analy- 
sis with individual remedial work. A “class 
analysis” blank is provided. 

If the test is used for specific diagnosis of dif- 
ficulties in computation, the results will be dis- 
appointing merely because there are not enough 
items to diagnose accurately. specific difficulties 
in the various processes. For example, in the 
test for grades 5-6, only three of the 40 compu- 
tational items are on long division with two- 
figure divisors. In each of the three items the 
pupil will get the right quotient figure the first 
time if he follows the normal procedure in esti- 
mation. So the teacher will not know from this 
test whether the pupil can supply the right quo- 
tient figure when his first choice is wrong. 

The tests appear to be made up of excep- 
tionally simple examples, as though the objec- 
tives of the arithmetic curriculum were con- 
sciously lowered through the medium of the test. 
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For instance, in the test for grades 5-6 the most 
difficult example in addition of common frac- 
tions is 8% + 714. This does not represent the 
range of achievement to be expected in addition 
of fractions at the end of the sixth grade. 

The tests make use of “ragged” columns of 
decimals. For instance, 16.0107 appears in the 
same addition example as 9.8. This does not 
happen in real life. If a measurement were car- 
ried out to ten-thousandths, it would never be 
added to a measurement carried out only to 
tenths. It would first be rounded to 16.0, so 
that no item would be less accurate than any 
other, and the sum would pretend to no greater 
degree of accuracy than it actually represented. 

The 10 verbal problems in each test resemble 
the formal and unmotivated problems found in 
the textbooks of an earlier era. However, the 
author has used reasonable data. For example, 
the price of steak is $1.00 a pound, not $.48 as 
is true in some of the older tests. 

These tests supply no data on reliability and 
validity, and furnish no norms. There is no 
manual. Many purchasers of tests will be con- 
scious of these omissions. 

Basically, the tests not only limit themselves 
through the failures previously mentioned, but 
misinterpret the objectives of a modern arith- 
metic program. Achievement in arithmetic can- 
not be well measured, as these tests attempt to 
measure it, through responses to a few compu- 
tational examples and formal problems alone. 
The true “Arithmetic Essentials" are more sig- 
nificant than this. They are understandings and 
skills which enable the individual to arrive at 
accurate judgments through the use of accurate 
computation. These tests do not measure the 
total effectiveness of a good arithmetic program. 


[401] 


*Arithmetic: Every Pupil Test. Grades 3, 4, 5-6; 
1930-51; new form usually published each April and 
December ; 3 levels; form April 1951; no data on relia- 
bility and validity; no manual; norms (’51) ; 214€ per 
test; 1¢ per answer key; postpaid; 40(45) minutes; 
prepared by Emma Wiggins, Catharine Ingersoll, and 
Hazel Johnson under the direction of W. G. William- 
son; Ohio Scholarship Tests, Ohio State Department 
of Education. * 


[402] 

*Arithmetic Fundamentals Test: State High 
School Tests for Indiana. High school; 1944-45; 
Forms A ['44], B [44]; mimeographed; no data on 
reliability and validity; no manual; norms [45]; 47? 
per test; 15¢ per specimen set; postpaid; 40(45) min- 
utes; Doyle T. French, and Albert R. Mahin; State 
High School Testing Service for Indiana, Purdue Uni- 
versity. 
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[403] 
*Arithmetic Reasoning Test. Clerical applicants 
and high school ; 1948; Forms A, B; no data on valid- 
ity; manual [48]; $2 per 25, postage extra; o¢ per 
specimen set, postpaid; 10(13) minutes; Jay i Otis 
and David J. Chesler; Personnel Research Institute, 
Western Reserve University. * 


[404] 

*Arithmetic: Seven Plus Assessment: Northum- 
berland Series. Ages 7-8; 1951; for complete bat- 
tery, see 24; 1 form ['51]; no data on reliability ; 
manual ['51]; 7s. 6d. per 25; 6d. per single copy; Is. 
per manual; 2s. 6d. per specimen set (includes the 
other 2 tests in the series) ; postage extra; (100) min- 
е „© M. Lambert; University of London Press 
Ad. 


For a review by Stanley D. Nisbet of the com- 
plete battery, see 24. ^ 


[405] 

*Arithmetic Test, Fundamental Operations: Do- 
minion Tests. Grades 4-8; 1934-48; Forms А (734), 
B (734); revised manual (49); revised norms ('48) ; 
$1 рег 25; 10€ per scoring key; 256 per manual; 556 
per specimen set; postage extra; 35 (45) minutes; pre- 
pared by the Department of Educational Research, 
Ontario College of Education, University of Toronto; 
distributed by Vocational Guidance Centre. * 


For a review by C. L. Thiele, see 3:332. 


[406 ] ws 
Arithmetic Test (Fundamentals and Reason- 
ing): Municipal Tests: National Achievement 
Tests. Grades 3-6, 6-8; 1938-39; a subtest of Munici- 
pal Battery (see 20) ; 6 scores: computation, number 
comparisons, comparisons, roblem analysis, problems, 
total; 2 levels; Forms А (738), B (39); no data on 
reliability and validity and no description of normative 
population in manuals; no norms for part scores; man- 
uals (38) ; $1.75 per 25 of any one level; 35€ per speci- 
men set of any one level; postage extra; 60(70) min- 
utes; Robert К. Speer and Samuel Smith; Acorn Pub- 
lishing Co. * 
Foster E. GROSSNICKLE, Professor of Mathe- 
matics, New Jersey State Teachers College, 
Jersey City, New Jersey. { 
"These tests are designed to measure achieve- 

ment in the computational phase of number, 
ability to solve verbal problems, and understand- 
ing of number relationships or comparisons. The 
usual pattern is followed for testing the first two 
traits mentioned. Three-choice test items are 
used to measure the understanding of number 
comparisons or relationships. The technique 
used may be illustrated by the following : 

a 20—6=10 +4 

v. 20— 6 = more than 16 

k, 20— 6 = more than 12 + 6 

The part of the test used to measure under- 

standing of number relationships and number 
concepts is easily scored by the use of a code 
word. The code word for one form of the test 1s 


“answer.” A list of responses to each question 
on the test is lettered. The correct response 18 
indicated by one of the letters of the code word. 
This method makes it possible to score all of the 
test, with the exception of those facts dealing 
with computation and problem solving, without 
the use of a key. 

Norms are supplied for the complete test at 
three different levels depending upon the pupil's 
IQ. At grade 6.1, the norm for a pupil with an 
IQ below 9o is 35; the norm for a pupil with an 
IQ from 90 to 135 is 43; and the norm for a 
pupil with an IQ above 109 is 50. A pupil's effi- 
ciency status for his particular level of IQ is 
determined from his score. From the illustration 
given, it is seen that the norm for the slow mov- 
ing pupil in the beginning sixth grade is 15 less 
than the norm for the superior pupil at this grade 
level. It is to be regretted that the makers of the 
test have not furnished experimental evidence 
to justify these norms. 

The test is commendable because it attempts 
to measure some other ability besides that in- 
volved in computation and problem solving. 
That part of the test which measures relation- 
ships is excellent. There is complete agreement 
among teachers that arithmetic should be mean- 
ingful to the pupil. A pupil who understands 
arithmetic is able to discover relationships among 
quantities. One part of the test measures the 
pupil’s ability to read a verbal problem intelli- 
gently. This part of the test directs the pupil 


- to identify the data given and to recognize the 


question asked in the given problem. Low read- 
ing ability is one of the chief causes of unsatis- 
factory achievement in problem solving. A test 
which helps to locate deficiency in reading verbal 
problems should be useful to the teacher of arith- 
metic. 

Although the tests are good, there are certain 
minor defects in their structure which should be 
improved. A test should perpetuate the type of 
subject matter which has social significance. An 
example of the type, 19.86 + 5 + 8.102, has no 
social significance and conveys the incorrect 
mathematical concept to add unlike quantities of 
this kind. A similar example is given in sub- 
traction of decimal fractions. Unfortunately, 
these are the only two examples given in addi- 
tion and subtraction of decimal fractions on one 
form of the test for grades 3-6. 

Tt is an accepted principle among textbooks 
in arithmetic that abbreviations for measures 
should be expressed in the singular form. Thus, 
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the abbreviation for yards is yd. and not yds. 
This test has not been consistent in its use of 
abbreviations of measures. One example is ex- 
pressed as “22 gals. 3 quarts." Both quantities 
should be expressed in full or both should be 
abbreviated. In the latter case the quantity 
should be written as 22 gal. 3 qt. 

Certain parts of the test need revision in order 
that various prices stated in problems will be in 
agreement with the prices in use today. It is no 
longer possible to purchase butter at 33 cents a 
pound. A property tax of $1.60 per $100 assessed 
valuation does not agree with the inflated rates 
today. 

The test has some minor defects but, on the 
whole, it is one of the best standard tests in arith- 
metic available. The test measures both achieve- 
ment in such traits as computational arithmetic 
and problem solving and in understanding of 
number relationships. There is a need, however, 
for the authors to supply data dealing with the 
validity and reliability of the test. 


Cuartes S. Ross, Associate Editor, Silver 
Burdett Company, New York, New York. 

The authors of these tests have attempted to 
measure not only computational skill and prob- 
lem solving ability, but also understanding and 
ability to use mathematical reasoning. Any at- 
tempt to broaden the base of arithmetic tests is 
a move in the right direction. However, the at- 
tempt to measure understanding does not neces- 
sarily mean success in doing it, and the degree 
of success which these tests will have in this 
respect is open to question. 

The most interesting sections of the test are 
Part II, “Number Comparisons,” and Part ПІ, 
"Comparisons." Many of the items іп these 
sections appear to be aimed at grasp of mathe- 
matical principle, but are so complicated that 
failure with the item reveals nothing of a spe- 
cific nature about the pupil’s arithmetical diffi- 
culties. 

Part II is made up of multiple choice items 
such as the following: 

c. 64+ 5 = 16 x 616 

a. 616 = .5 = more than 614 -- 

n. 616 + .5 = more than 1% + s. 
The authors appear to be testing in this item the 
understanding that there are more .5's in a num- 
ber than there are 5's in it. But if so, why do they 
complicate the situation in the statement of the 
correct choice, the third one, by changing the 
6% to 134? A failure to select this choice as the 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 508 


correct one might be due to several factors other 
than failure to understand the principle involved. 

Part III is made up of verbal statements of 
situations followed by multiple choice alterna- 
tives. Some of these verbal statements are ex- 
tremely complicated. When they are compared 
to the verbal problems which follow in Parts IV 
and V, they seem to belong to an entirely differ- 
ent test designed to measure a far greater degree 
of maturity and skill. 

Part IV of each test is called Problem Analy- 
sis. This section is a mere application to con- 
ventional problems of the outmoded problem 
solving formula: What does the problem tell 
you? What does the question ask? What must 
you do to find the answer? Part IV will con- 
tribute little to discovering the pupil's ability to 
analyze problems, since this ability depends on 
understanding the relationships between the 
numbers within the problem rather than upon a 
rote application of a simple series of questions. 

In addition to these weaknesses, the test for 
grades 3-6 is inappropriate for use in many 
schools because the pupils are expected to use 
skills which they have not yet acquired. In Part 
I of the test, the third grade child in a modern 
arithmetic learning situation could be expected 
at the most to solve only three of the 35 items, 
merely because he has not yet encountered simi- 
lar items in his study of arithmetic. Further- 
more, the authors appear to expect some under- 
standing of per cent by the end of grade 6, 
although many modern curriculums do not intro- 
duce this topic until grade 7. 

The prospective user of the tests will doubt- 
less question the lack of data on reliability and 
validity, and the failure of the publisher to supply 
background information as to the method of 
preparation and scope of experimental use of the 
items. Although the apparent objectives and 
ingenuity shown in some of the items are praise- 
worthy, there are too many faults in the exectt- 
tion of the tests to warrant unquestioning ac- 
ceptance of them. 


y 


[407] ) 
*Arithmetical Reasoning Test. College and adults; 
1941-42; Forms A (41), B('41) ; preliminary mania 
(42); 95€ рег 25; 50@ per specimen set; cash orders 
postpaid; 40(45) minutes; Alfred J. Cardall; Scienc 
Research Associates, Inc. * 
\\пллАм L. Scnaar, Associate Professor of 
Education, Brooklyn College, Brooklyn, New 
York. 


The major purpose of this test is to predict 
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success in certain types of occupations that re- 
quire skill in solving arithmetical problems. The 
test purports to measure the quantitative aspect 
of problem solving ability, that is, the kind of 
intelligence that is essential for accountants, stat- 
isticians, engineers and research workers. It 
attempts to test the reasoning elements rather 
than the computational skills, discriminating be- 
tween the type of mind that can see various parts 
of a problem in relationship to each other and 
that which gets hopelessly lost in computational 
activities. The material, based upon an examina- 
tion of the kinds of problems occurring most fre- 
quently in business activities, includes the follow- 
ing general types of problems, in order of im- 
portance: (а) computation of deviations from 
the list price; (b) computation of interest on 
loans, notes, etc. ; (c) figuring margins of profit ; 
(d) payroll computations; (е) average com- 
modity prices, various uses of percentage, finan- 
cial statements, and foreign exchange conver- 
sions. 

Certain specific weaknesses are to be noted: 
(a) the problem material used does not adhere 
very closely to the findings of the reported survey 
of arithmetical problems occurring in business; 
(b) the problems are stated in stereotyped, tra- 
ditional textbook style; (c) many of the prob- 
lems are unrealistic from a business point of 
view; (d) some of the “problems” do not really 
call for problem solving thinking if the meanings 
of the terms are known ; (e) a few of the prob- 
lems at the end of the test could be more aptly 
described as puzzles; (f) there is an overem- 
phasis upon the concept of proportion, reminis- 
cent of the continued proportions (chain rule) 
in medieval textbooks ; (g) the basis of the pub- 
lished norms is hardly a large enough sam- 
ple; (A) the "unusual predictive significance" 
claimed above means very little, since nothing is 
said concerning the nature of the "carefully con- 
trolled employee ratings" used. Furthermore, 
this reviewer disagrees with the contention that 
using algebra to solve certain arithmetical prob- 
lems is necessarily an indication of lack of rea- 
soning, unless the algebraic procedure is used 
mechanically. 

While this test may measure “the reasoning 
element” to some extent, it also seems to measure 
to a considerable degree, if not fundamental 
computational skills, then a certain facility with 
numbers as such—that is, ready recognition of 
multiples, factors, equivalent fractions and the 
like, However, such facility, over and above 


reasoning ability, is also quite useful to account- 
ants and engineers, a consideration which may 
contribute to the presumably good predictive 
significance of the test. 


[408] 
*Basic Arithmetic Skills: Iowa Every-Pupil 
Tests of Basic Skills, Test D, New Edition. 
Grades 3-5, 5-9; 1940-47 ; for complete battery, see 15; 
4 scores: fundamental knowledge, fundamental opera- 
tions, problems, total; IBM for grades 5-9; 2 levels; 
Forms L (до), М C41), N (742), O (743), Р (44— 
Advanced Battery only); manual ('45) ; battery man- 
ual (47) ; 33€ per specimen set of any one level; post- 
age extra; H. F. Spitzer in collaboration with Ernest 
Horn, Maude McBroom, H. A. Greene, and E. F. Lind- 
quist; Houghton Mifflin Co. * 
а) ELEMENTARY BATTERY. Grades 3-5; $1.60 per 25; 
57(65) minutes. 
b) ADVANCED BATTERY. Grades 5-9; IBM for Forms о 
and Р; $1.75 per 25; separate answer sheets may be 
used with Forms О and Р; 63€ per 25, IBM answer 
sheets; 40€ per set of stencils for machine scoring of 
answer sheets; 63 or 68(80) minutes. 

For reviews by William A. Brownell and Le- 
roy Н. Schnell, see 3:334; for reviews by Mir- 
iam M. Bryan and Anton Thompson of the com- 
plete battery, see 15; for reviews by Frederic L. 
Ayer, Gustav J. Froehlich, and Ralph С. Preston 
of the complete battery, see 3:10; for reviews by 
William A. Brownell, J. Murray Lee, and 
Charles W. Odell of the 1937 form of the com- 
plete battery, see 38:872. 


[ 409] 

*A Brief Survey of Arithmetic Skills. Grades 5- 
12; 1947-51; Forms A [47], В [47] ; mimeographe 
manual [51] ; norms (47) ; 5€ per test; 256 per speci- 
men set; postage extra; 10(15) minutes; Árthur E. 
Traxler; Educational Records Bureau. * 

Wikram A. Browne, Dean, School of Edu- 
cation, University of California, Berkeley, Cal- 
ifornia. ў 

Each form of this test comprises 44 exercises 
arranged in 4 cycles of 11 exercises each, as fol- 
lows: 4 examples with whole numbers (one in 
each of the fundamental operations), 4 with 
fractions (again, one in each operation), 2 with 
decimals (placing the decimal point in a multi- 
plication and in a division answer), and I word 
problem. The time limit of 10 minutes is said to 
be long enough for even slow pupils to go through 
at least one of 4 cycles. Reliability is reported 
as being about .80, having been calculated by à 
number of recognized procedures. 

The manual states that the purpose of the test 
is “to identify the pupils whose mastery of basic 
arithmetic skills is so inadequate that they will 
be greatly handicapped in all their work in math- 
ematics.” The author recommends that pupils 
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making low scores be further studied by other 
tests and by other methods to diagnose their 
weaknesses before giving them remedial instruc- 
tion. 

So long as the test is employed in the way 
proposed, it should be a useful instrument. The 
author has done what he can to prevent misuse 
by his clear explanations in the manual and by 
réstricting test norms to percentile ranks for 
each grade. He has explicitly warned teachers 
and administrators against administering the 
test for the purposes of diagnosis and against 
measuring individual achievement in terms of 
grade standing. 


Henry Van ENcEN, Professor of Mathematics 
and Head of the Department, Iowa State Teach- 
ers College, Cedar Falls, Iowa. 

Each form of this test consists of four cycles 
of 11 problems each. Each cycle consists of one 
problem of each of the following types: addi- 
tion, subtraction, multiplication, division, addi- 
tion of fractions, subtraction of fractions, multi- 
plication of fractions, division of fractions, plac- 
ing the decimal point in multiplication of decimal 
fractions, placing the decimal point in division 
of decimal fractions, and the solution of a prob- 
lem stated in sentence form. The test was con- 
structed to make available an instrument for 
evaluating the skill aspects of arithmetic which 
"can be given in a few minutes in order to obtain 
a quick, overall measure of the computational 
skill of an individual or a class group." The time 
limits on the test are short—ro minutes. This 
time limit is "sufficient to allow even the slow 
pupils to go through at least one of the four cy- 
сеа: 

The reviewer would criticise the similarity of 
problems in the four cycles. For example, in 
Form A the four problems in long division are: 
(a) 486 + 18, (b) 648 + 27, (c) 2304 + 36, and 
(d) 7728 + 84. These examples certainly do not 
differ much in type. They do not contain the 
usual "tough spots" in long division. Why ask 
the child to work four problems so alike in a 
Io-minute test? The same situation prevails in 
the other processes covered in the test. 

The whole “educational climate" surrounding 
a test of this nature is distasteful to the reviewer, 
In the first place, the manual makes a point of 
“fundamental tools” and “indispensable skills” 
in arithmetic. It then goes on to point out that 
teachers do not want to spend as much as one 
class period to collect information on the compu- 


tational skills of their class. If one could be sure 
that this urge to test in a brief period were moti- 
vated by a desire to have time to develop funda- 
mental mathematical concepts, then the situa- 
tion would be somewhat better. However, in 
view of the fact that this test, as many others 
are, is eoncerned only with the mechanical fea- 
tures of mathematics (often confused with math- 
ematics by the uninformed), it would seem that 
too many school men are interested in evaluating 
mechanics in order to teach more mechanics. 

In the second place, this test is distasteful to 
the reviewer because it is a simple one which 
every teacher should be able to make. Of course, 
she cannot collect the data and treat data as is 
usually done by test constructors. However, 
when one is measuring computational skill only 
it would seem that the heavy statistical treatment 
of the data is not at all necessary. Certainly, 
teachers working together in a given system can 
develop a simple test of computational skills and 
interpret the results in the light of local condi- 
tions. Such a procedure would certainly be more 
desirable than comparing results obtained in a 
skills test from systems where local conditions 
vary, as well as the philosophy which serves as 
a basis for the curriculum. 


[410] 
*[Brueckner Diagnostic Arithmetic Tests] 
Grades 4-8 or 5-8; 1926-43; 4 scores: addition, sub- 
traction, multiplication, division; 3 tests; no data on 
reliability and validity; no norms; postage extra; 40f 
per specimen set of any one test, postpaid ; Ж, 
Brueckner, Н. W, Distad (Fractions), and Abbie 
Chestek (Fractions); Educational Test Bureau, Edu- 
cational Publishers, Inc. * 
@) BRUECKNER DIAGNOSTIC TEST IN DECIMALS. Grades 
5-8; 1926-42; 1 form ('42—same as test copyrighted 
in 1926) ; $1.25 рег 25; 25¢ per manual (29); non- 
timed in part (60-65) minutes in 2 administrations. 
b) BRUECKNER DIAGNOSTIC TEST IN FRACTIONS. Grades 
5-8; 1926-43; 1 form ('42—same as test copyrighted 
in 1930) ; second revised manual ('43) ; $1.50 per 25; 
356 per manual ; 50% per 25 individual diagnostic sheets 
(26); nontimed (120-140) minutes in 4 administra- 
tions. 
C) BRUECKNER DIAGNOSTIC TEST IN WHOLE NUMBERS. 
Grades 4-8; 1926-42; 1 form ('42—same as test bye 
righted in 1929) ; $1.50 per 25; 356 рег manual (7 ^ 
50€ per 25 individual diagnostic sheets ('29) ; nontime 
(30) minutes in 4 administrations. 


Hersert F. Sprrzer, Associate Professor of 
Education, State University of Iowa, Iowa City, 
Iowa. 

BRUECKNER DIAGNOSTIC TEST IN DECIMALS. 
This test consists of four pages of exercises and 
examples covering the writing of decimals (ex- 
pressing decimals as fractions and fractions а5 
decimals), and addition, subtraction, multipli- 
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cation, and division of decimals. It may be used 
with individuals or for class diagnosis за combi- 
nation class and individual summary sheet is 
provided. The author recommends that the test 
be divided into two parts, and the parts given on 
successive days. Separation of the test into two 
booklets would facilitate this. For individual di- 
agnosis, the author recommends that pupils be 
asked to work aloud at times. As an aid to thor- 
ough diagnosis, 112 types of difficulties are listed 
and then reclassified under 18 headings in the 
accompanying manual. 

If one is content to limit diagnosis to compu- 
tation, the test gives fair coverage. However, 
four of the 12 addition examples arid seven of 
the 17 subtraction examples are concerned with 
ragged decimals, a phase of decimals which is 
either not taught or not emphasized today. Cer- 


tain aspects of the content are also questionable. . 


For example, the pupil is asked to change .5 to 
a common fraction but is not told to change the 
common fraction to the simplest form. The an- 
swer key does not give credit for 549 as an an- 
swer. Furthermore, it would seem that a group 
diagnostic test should not depend entirely on the 
pupil's doing all the computation for every ex- 
ample. For instance, to determine whether or 
not a pupil can place the decimal point in prod- 
ucts, the multiplication is not required. 

BRUECKNER DIAGNOSTIC TEST IN FRACTIONS. 
This test consists of four pages of examples, one 
each devoted to addition, subtraction, multipli- 
cation, and division of fractions. An individual 
and a class diagnostic record sheet and a 36- 
page manual are supplied. 

Since the test is devoted entirely to computa- 
tion in fractions, it would be more appropriate 
if the word “computation” were used in the title. 
The test has no items dealing with the basic 
fraction concepts or uses of fractions. The diag- 
nosis of computational ability is apparently to 
be determined without any direct attempt to get 
at the understanding of the process. Ability to 
get the correct answer by the one method taught 
in most textbooks seems to be the only factor 
considered. 

With regard to remedial work, the point of 
view stated in the manual is as follows: “When 
difficulties are discovered, the remedial work is 
usually obvious.” To implement diagnosis and 
remedial work, the major part of the manual 
(29 pages) consists of an analysis of difficulties 
in subtraction, multiplication, and division of 
fractions and an analysis of the examples in the 
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test. These analyses are complete and furnish a 
good reference for any one concerned with analy- 
sis of types. 

The test may have value for diagnosis in the 
limited area of fractions with which it deals, but, 
because of the limited aspect of fractions cov- 
ered, the test falls short of being a satisfactory 
instrument for diagnosis in fractions. 

BRUECKNER DIAGNOSTIC TEST IN WHOLE NUM- 
BERS. This test consists of five pages of examples, 
one page each for addition, subtraction, and mul- 
tiplication, and two pages for division of whole 
numbers. A class summary sheet and a sheet 
for record of individual diagnosis are supplied. 
The latter lists 69 specific difficulties. 

The major portion of the 24-page manual is 
devoted to a description of common difficulties 
in each of the four fundamental processes, to- 
gether with suggestions for remedial work to 
overcome the difficulties. While the treatment 
of difficulties is comprehensive, failure to use 
examples from the test discourage easy identifi- 
cation of the specific difficulties which pupils may 
experience. The remedial suggestions are almost 
entirely of this type: “Drill on Minneapolis sub- 
traction drill and study cards 8, 9, 10, 12, and 
13.” They indicate, as does the content of the 
test, emphasis on a type of “narrow” arithmetic 
that is concerned with but one type of compu- 
tation without much reference to understand- 
ing. 
Tf the only concern with regard to ability to 
work with the fundamental processes is accuracy 
of computation by the one method which adults 
consider best, this test may serve a useful pur- 
pose. However, those instructors who are asso- 
ciated with the better arithmetic programs of 
today will find little of interest in this test. 


[411] 


*California Arithmetic Test. Grades 1-4.5, 4-6, 
7-9, 9-14; 1933-50; а revision of Progressive Arith- 
metic Tests фе 40: 1459); а subtest of California 
Achievement Tests (see 2); 3 scores: arithmetic rea- 
soning, arithmetic fundamentals, total ; IBM for grades 
4-14; 4 levels; Forms AA (^50), BB (50), CC (5 
DD (so); manuals (^50); postage extra; 350 per 
specimen set of any one level, ostpaid ; Ernest W. 
Tiegs and Willis W. Clark; California Test Bureau. * 
а) PRIMARY. Grades 1-4.5; $1.50 per 25; 50(60) min- 


tes. 
b) Grades 4-6; $1.50 per 25; separate 


b) ELEMENTARY. 
pi sheets may be used; 4€ per IBM answer sheet; 


¢ per Scoreze answer sheet; 60¢ per stencil for ma- 
Ghine scoring of answer sheets ; 20@ per stencil for hand 
scoring of answer sheets; 60(70) minutes. 


с) INTERMEDIATE, Grades 7-9; prices same as for Ele- 


; 75(85) minutes. 
ped ЗА California Mathematics Test (366). 
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Ковевт L. Burcu, Associate Editor of Ele- 
mentary School Textbooks, Ginn and Company, 
Boston, Massachusetts. 

The format of this revision of the Progressive 
Arithmetic Tests has been challengingly de- 
signed by Raymond Loewy Associates. The tests 
and manuals, printed in green and green tint, 
are attractive, well planned, spacious, and easy 
to read. The new answer sheet, Scoreze, which 
includes in one unit an answer sheet of the IBM 
type, a self-correcting answer page with each 
item identified, diagnostic profiles, a diagnostic 
analysis chart, norms, and charts for translating 
scores, is ingenious and useful. The contents of 
the tests and manuals are, however, little changed 
from former Progressive Arithmetic Tests. 

At each of the three levels—primary, ele- 
mentary, and intermediate—the tests have a 
section measuring skill with number facts and 
operations that provides an adequate survey of 
skills in fundamentals, but, because of time limi- 
tations, does not have sufficiently detailed cov- 
erage for diagnostic purposes. At the elementary 
and intermediate levels, the pupil performs the 
computation each time and then selects one of 
five responses, the last of which is always 
“None.” 

The test at each level also has a “reasoning” 
section. Actually, the problem subtest is the only 
one involving reasoning, for the items concerned 
with number concepts, signs, rules, and vocabu- 
lary are more properly classed as information. 
The inclusion of items other than problem solv- 
ing represents an attempt to broaden the base of 
measurement. However, the content of recent 
arithmetic courses of study and textbooks is 
much more broadly conceived than is indicated 
in the California Arithmetic Test. The next edi- 
tion might well emphasize the more compre- 
hensive measurement of concepts, rationaliza- 
tions, principles, judgments, and so on, that are 
encompassed in the now. widely accepted and 
widely presented arithmetic program. In the 
meantime, it would be highly desirable if the 
publisher would provide a supplementary page 
with items measuring achievement in these rela- 
tively untouched areas. This supplement is 
needed so that test users will not be led to be- 
lieve that computation and problem solving are 
the only important aspects of the arithmetic 
program. The supplement will also provide the 
publisher with data that are needed as guides to 
the eventual incorporation of such items in fu- 
ture editions of the tests. 


The manuals are simply organized and help- 
ful. They do, however, overstress the use of the 
tests for diagnostic purposes. While cautionary 
statements are occasionally included concerning 
the limitations of such tests for diagnostic pur- 
poses, they are not nearly prominent enough nor 
so complete as they should be. The directions for 
administering and for scoring the test are good, 
and the grade, age, and percentile norms are 
adequately presented. 

In summary, the California Arithmetic Test, 
with four carefully equated forms at each of three 
levels, provides a good instrument for surveying 
performance in fundamental operations, prob- 
lem solving, and certain aspects of information. 
The tests are easily administered and scored, and 
the comparison of individual scores with norms 
can be done in a minimum of time. The diagnos- 
tic features of the test are not so valuable as 
they may seem, but they have some value for 
teachers aware of the diagnostic limitations of 
tests that survey widely. 


For reviews by Warren G. Findley, Alvin W. 
Schindler, and J. Harlan Shores of the complete 
battery, see 2; for reviews by C. L. Thiele and 
Harry Grove Wheat of an earlier edition, see 
40:1459; for reviews by C. W. Odell and Hugh 
B. W ood of an earlier edition of the total battery, 
see 40:1193; for a review by William A. Brown- 
ell of an earlier edition, see 38:893; for a review 
by D. Welty Lefever of an earlier edition of the 
total battery, see 38:876. 


[412] 

*Cotswold Measurement of Ability I: Arithme- 
tic. Ages 10-11; 1947-51; 4 forms (labeled Series I 
[47], 2 ['48], з ['49], 4 ['51]) ; manual ['47] ; оз. per 
20; 504. per single copy; 15. per manua! ; postage ех- 
tra; 40(50) minutes; C. M. Fleming and J. W. Jenkins 
(manual); Robert Gibson & Sons (Glasgow), Ltd. * 
W. L. Sumner, Senior Lecturer in Education, 
University of Nottingham, Nottingham, Eng- 
land. 

This test is part of a test battery designed for 
use with British children of 11 years of age who 
are passing from primary school to secondary 
school education. The two other tests in the bat- 
tery are those in English attainment and in verbal 
mental ability. The tests are in wide use in Eng- 
land for selecting children who are deemed suit- 
able for secondary grammar school education. 
The tests could be used in other English-speak- 
ing countries, but only after a certain amount 
of translation of peculiarly English words and 
idioms had been effected. 
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The test is based upon careful analysis of the 
difficulties involved in addition, subtraction, 
multiplication, and division at the primary school 
level (і.е., the 8-11 year-old groups in English 
schools). The test material 15 so arranged as to 
provide balanced testing of most of the 390 basic 
arithmetical combinations, the simplest of meas- 
ures in common use, and the reading and rea- 
soning involved in the solution of simple prob- 
lems. Items are arranged in order of difficulty. 
Practice material is provided on the back cover 
of the test. ' 

The manual gives information about the con- 
struction and standardization of the test and 
instructions for administering and scoring the 
test and interpreting test scores. According to 
the manual, validity was secured by contrasting 
the responses from the top and bottom thirds of 
a control group of children. The correctness of 
each response was determined after pupils" re- 
plies were analyzed, and ambiguous items dis- 
carded. The reliability coefficient for Series I, 
estimated by the split half method of correlating 
responses to odd and even items, is reported as 
96; and the intercorrelation of Series І with 
Series 2 as .88. A table of percentiles and stand- 
ardised scores is provided, the age range being 
from 10.1 to 11.10 years. 

The test is quite suitable for the purpose for 
which it was devised. The test items are short, 
and yet each item demands a creative process of 
reasoning and not the mere recognition and un- 
derlining of right answers. Errors of sampling 
have been reduced by the well-spread coverage 
of the processes to be tested and by the large 
number of questions set. There is a correct bal- 
ance between simple “mechanical” arithmetic 
and problems based on this elementary work, 
The validation and grading as regards difficulty 
has been effected in a thorough manner. The ad- 
ministration of the test calls for care in timing 
each section and in the giving of instructions to 
turn the test booklet upside-down for the later 
sections of it. The marking is objective. 

The authors suggest that while this test may 
be used separately, research evidence at present 
available indicates that the battery of tests pro- 
vides more adequate and reliable sampling of 
the present performance of pupils who need 
educational guidance or for whom cumulative 
record cards are kept. 

[ 413] 


Diagnostic Chart for Fundamental Processes in 
Arithmetic. Grades 2-8; 1925; individual; 1 form; 
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no data on reliability; no norms; $3 per 50 sets of pu- 
pil’s work sheet and teacher's diagnostic chart, postage 
extra; 30¢ per specimen set, postpaid; (20) minutes; 
С. Т. Buswell and Lenore John; Public School Pub- 
lishing Co. * 

Lro J. BRUECKNER, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. 

This chart consists of four sets of examples, 
each set illustrating one of the four processes of 
addition, subtraction, multiplication, and di- 
vision of whole numbers and ranging from easy 
to difficult examples. The numbers of types of 
examples included in the different parts are 23, 
22, 22, and 21, respectively. The inclusion of 
two examples of each type adds to the reliability 
of the results. 

The diagnostic chart is primarily a teaching 
device and is not intended for measurement or 
classification. Hence no standard scores or time 
standards are provided by the authors. The chart 
is a diagnostic instrument which enables the 
teacher to study in detail the methods of work of 
pupils whose progress in arithmetic is unsatis- 
factory. As the pupil works the examples on a 
work sheet, he is required to “do all of his think- 
ing aloud,” so that the teacher can make a note 
of the thought processes he uses. The manual 
suggests that the teacher should feel free to use 
“the technique of skilled questioning” whenever 
there is any doubt about the thought processes 
the pupil is using. On a teacher’s diagnostic chart 
the examiner keeps a detailed record of the ob- 
served reactions and responses of the pupil. To 
assist the teacher in making an analysis of the 
findings, the blank contains for each process a 
list of “habits” which forms the basis of an in- 
ventory of the kinds of errors, incorrect re- 
sponses, faulty methods of work, and immature 

rocedures that were observed. These lists of 
so called “habits” are based on the results of the 
preliminary studies undertaken at the time the 
test was developed in 1925. The results of this 
analysis should give the teacher a clear under- 
standing of the processes by which the pupil does 
his work and of the ways in which effective 
teaching may help him. 

Ordinarily only one process is to be diagnosed 
at a sitting. The authors state that “a period of 
fifteen or twenty minutes” is sufficient for a 
skilled examiner to determine the characteristic 
work habits of most pupils. 

Tt is evident that the Buswell-John chart pro- 
vides no direct evaluation of such important as- 
pects of the learning of arithmetic as rate of 
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work, accuracy of work, the level to which any 
of the abilities are developed, or of the extent 
of the pupil's understanding of the process. How- 
ever, it provides an excellent plan for studying 
the thought processes of pupils. 

A number of modern textbooks include simi- 
lar diagnostic tests of the operations which can 
be used in much the same way that the Buswell- 
John chart is used. The education of teachers 
should include experiences which will acquaint 
them with diagnostic procedures and the kinds 
of faulty habits that are so frequently discov- 
ered in the work of slow-learning pupils. The 
former should be used informally whenever there 
is evidence of learning difficulty. 

The contents of the manual would be greatly 
improved if there were given some discussion of 
the causes of inefficient habits of work. A dis- 
cussion of the corrective and remedial measures 
used to overcome typical faults would also be of 
value. Some indication of the relative degree of 
cruciality of the various habits of work would 
add significance to the diagnosis. 


For reviews by H. E. Benz and Foster E. 
Grossnickle, see 40:1456. 


[414] 
*Graded Arithmetic-Mathematics Test. Ages 7- 
21; 1949; I form; 5s. per 25; 3d. per single copy ; od. 
per manual; postage extra; 20(25) minutes; P. E. 
Vernon; University of London Press Ltd. * 


[415] 
*Los Angeles Diagnostic Tests: Fundamentals 
of Arithmetic. Grades 2-8; 1925-47; Forms 1 (25), 
2 (25), 3 (26), 4 (26) ; no data on validity; manual 
(47); $1.25 per 25, postage extra; 25¢ per specimen 
set, postpaid; 40(45) minutes; Caroline Armstrong 
and Willis W. Clark; California Test Bureau. * 


[416] 

*Metropolitan Achievement Tests [Arithme- 
tic]. Grades 3-4, 5-6, 7-9.5; 1933-49; a subtest of 
Metropolitan Achievement Tests (see 18) ; 3 scores: 
arithmetic fundamentals, arithmetic problems, total ; 
ques ; Forms R C46), S C47), Т (49) ; manual 

48) ; directions for administering ('47) ; S135 per 2 
of any one level; 80¢ per manual; postage extra; 35 
per specimen set of any one level, postpaid; Richard D. 
Allen, Harold Н. Bixler, William L. Connor, Fred- 
erick В. Graham, and Gertrude Н. Hildreth; World 
ан Co. * 
а) ELEMENTARY ARITHMETIC TEST. Grades 3-4; 1937- 
47; Forms R, S only; 65(75) minutes. Я s7 
b) INTERMEDIATE ARITHMETIC TEST. Grades 5-6; 1933- 
oH 80(90) minutes. 
C) ADVANCED ARITHMETIC TEST. Grades .5; I 
49; 80(90) minutes. ase] 


Ковевт L. Burcu, Associate Editor of Ele- 


` mentary School Textbooks, Ginn and Company, 
Boston, Massachusetts. 


The current edition of the Metropolitan 


Achievement Tests and the accompanying ma- 
terials represent the kind of careful test prepa- 
ration that The Mental Measurements Year- 
books have tried to encourage. The analysis of 
items, equating of forms, establishment of norms, 
and determination of reliability and validity have 
been neatly designed and scientifically executed, 
The story of the construction of the tests, plus 
a wealth of material for helping users to under- 
stand the nature, values, and limitations of such 
achievement tests, are presented in a manual 
that is unique in its comprehensiveness. 

This review deals specifically with the arith- 
metic tests in the Metropolitan batteries which 
are published as separate tests. Each test covers 
a relatively narrow range of grade levels, an 
important fact in achievement tests that must 
necessarily sample widely within a rather short 
time. The items are well chosen to survey 
achievement within the relatively confined areas 
of computation and problem solving. This limi- 
tation in coverage must be recognized, for these 
arithmetic tests thus become valid only for 
measuring that portion of the typical program 
which pertains to computation and problem solv- 
ing. They fail to measure achievement in such 
aspects as number concepts, rationalizations, 
and generalizations. 

Perhaps the achievement of pupils in these 
newer areas is as yet so low that the inclusion of 
items to evaluate the level of understanding 
would affect seriously the reliability of these 
tests. If so, the least that should be done is to 
provide a supplementary page with items that 
are constructed to evaluate the portions of the 
typical program not now covered by the arith- 
metic tests. This supplementary material could 
be made part of an experiment that would serve 
two purposes: (а) test users would have test 
information pertaining to all of the elements of 
the arithmetic program, and (b) information 
would be gained that would enable the test au- 
thors to construct and incorporate items meas- 
uring newer aspects of arithmetic in a later re- 
vision of the test. Until some such procedure 1$ 
followed, test users will have to make provision 
within their own school systems for measure- 
ment of the portions of the teaching program 
that are not now included in the tests. 

The reliability coefficients, which vary from 
87 to .95, indicate a degree of dependability that 
is adequate for group testing. The standard er- 
rors of measurement which are usually smaller 
than 8 standard score units indicate a stability 
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sufficient for some individual measurement if 
proper caution is exercised. 

The format of the tests is traditional but well 
organized and clear. The standard score plan is 
good, and the use of grade equivalents and age 
equivalents as the basic norms is acceptable in 
view of the fact that percentile norms and modal- 
age grade norms are also available. A unique 
and important contribution is the development 
of special grade norms for parochial schools, 
segregated Negro schools, white schools in 
southern states, schools in New England, and 
schools in New York State. 

In sum, such good features as the careful con- 
struction and standardization, the development 
of sectional norms, the stressing of the need for 
fall testing, and the provision of a comprehensive 
and influential manual, all represent an attempt 
in developing this testing program to utilize the 
best of presently known principles. If the test 
user is willing to overlook the conservatism of 
the authors in limiting the content of the arith- 
metic tests basically to computation and to the 
solution of verbal problems, he will find that 
these are excellent tests which have been de- 
veloped in accordance with the best available in- 
formation in test construction. 


For reviews by Peter L. Spencer and Harry 
Grove Wheat of an earlier edition, see 40: 
1458.1; for reviews by Foster E. Grossnickle 
and Guy М. Wilson of an earlier edition, see 
38:892; for reviews by Warren С. Findley of 
the complete battery, see 18; for reviews by 
E. V. Pullias and Hugh В. Wood of an earlier 
edition of the complete battery, see 40:1189; for 

reviews by Jack W. Dunlap, Richard Ledger- 
wood, and Charles W. Odell of an earlier edition 
of the complete battery, see 38:874. 


[417] 

Number Fact Check Sheet. Grades 5-8; 1046-47; 
IBM; Forms A (46), B (746) ; manual (’47) ; $r per 
25 IBM test-answer sheets ; боё per set of scoring sten- 
cils; postage extra; 25¢ per specimen set, postpaid ; 
nontimed (25-30) minutes; Roy Cochrane; California 
Test Bureau. * 
Мішам M. Bryan, Assistant Editor, Silver 
Burdett Company, New York, New York. 

This test is designed to measure the extent of 
mastery of the fundamental number facts. The 
test, organized in four sections, each covering one 
of the fundamental processes, is printed on an 
IBM test-answer sheet. Each of 390 facts is 
completed on the sheet, тоо of them (25 involv- 
ing each process) incorrectly. The pupil is asked 


to determine whether each fact is correctly or 
incorrectly completed, and to mark out all in- 
correct completions. The score for each section 
is obtained by subtracting from 25 (representing 
the number of incorrectly completed facts) the 
number of “errors left” plus the number of 
“rights blurred.” 

Of the combination of correctly and incor- 
rectly completed facts, the manual says this: 
“This kind of presentation is good. In business 
and engineering, figures have to be checked. It 
must be made certain that the first answer is 
right. In building a bridge, for instance, a wrong 
answer might make the bridge dangerously 
weak, so, of course, the answers are carefully 
checked.” The reviewer disagrees with this ra- 
tionalization. While the development of suffi- 
cient mastery of number facts to enable one to 
detect and correct computational errors is, of 
course, a desirable ultimate outcome, a more 
immediate purpose of the modern arithmetic 
program is the development of such complete 
mastery of the number facts as will reduce the 
likelihood of the errors occurring in the first 
place. The wrong approach to achieving this 
purpose would appear to be the negative one in 
which the pupil is forced to observe manufac- 
tured errors and so is encouraged to make in- 
correct associations that he may carry on into 
adulthood, thus placing the bridge in double 
jeopardy. 

But beyond this, there would seem to be some 
question as to the validity of the rationalization 
underlying the organization of the test as a 
whole. Confronting the pupil with a body of 
isolated number facts does not appeal to the 
reviewer as being the most efficient technique to 
use beyond the fourth grade, because there are 
so few practical situations in which the facts are 
met in isolation. Why, then, require the pupil 
to struggle with the manipulation of 390 sepa- 
rate facts when these might more efficiently 
be combined in simple examples: which would 
measure not only knowledge of the facts but 
ability to apply that knowledge in many prac- 
tical situations, and at the same time reduce the 
danger of poor number habits and attitudes 
developing as a result of sheer weariness and 
boredom? The futility of concentrating on iso- 
lated number facts is nowhere better illustrated 
than in the case of the zero facts. In real life the 
pupil never adds o to 4 to learn that he has four 
of something, or to multiply or divide О by 4 to 
find that he has “none” of “nothing.” Arith- 
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metic situations involving the zero require the 
pupil to add, subtract, multiply, and divide with 
zero in multiple-place numbers, never with zero 
in isolation. It is difficult to see why so much 
emphasis should be put on his learning to use 
it in isolation. By actual count, 54 (or 14.6 per 
cent) of the 390 facts covered in this test are 
unrealistic zero facts. From the point of view of 
mathematical utility, the multiplication facts 
about “1” also comprise an especially question- 
able group when presented in isolation. 

While the overall format of the test is simple 
and practical from the point of view of adminis- 
tering and scoring, there are some defects in the 
arrangement of the examples. In the subtraction 
section the answers are consistently placed one 
space to the left of where they should be, and the 
two-place answers in the addition section strad- 
dle the examples. Since proper placing of an- 
swers is an important factor in encouraging com- 
putational accuracy, the improper placement in 
this test sets a poor example for the pupil. АП 
100 multiplication facts are arranged horizon- 
tally. In modern arithmetic textbooks the more 
common vertical arrangement is stressed. 

The single-sheet manual of directions indi- 
cates that reliabilities of .93 for the fifth and 
sixth grades and .93 for the seventh and eighth 
grades have been determined for the test by the 
Kuder-Richardson formula. No further details 
are given. Tentative percentile norms for total 
score are provided for fifth and sixth and for 
seventh and eighth grade groups, the former 
representing 12 elementary schools and the latter 
two junior high schools. Since 100 per cent ac- 
curacy on the number facts for each pupil is the 
only acceptable goal from the fifth grade onward, 
the value of any percentile norms for a number 
facts test is questionable. With the differences 
that one may expect to find in individual achieve- 
ment in the separate processes, norms for total 
score only would seem to be of negative value. 

In summary, this test should be easy to ad- 
minister and easy to score. Beyond those attri- 
butes, the reviewer feels that the test has little 
to offer. 

[418] 
*Retail Arithmetic Worksample. Selection and 
training of retail employees; 1948; 1 form; prelimi- 
nary mimeographed manual; $1.25 per 25; 35¢ per 


specimen set; postage extra; 20(25) minutes; Carrie 
E. Boyle; Crowley, Milner and Co. * 


REFERENCES 


1. Capwett, Dora F. Psychological Tests for Retail Store 
Personnel. Pittsburgh, Pa.: Research Bureau for Retail Train- 
говорите of Pittsburgh, 1949. Pp. 48. Paper. * (РА 25: 


WirLIAM J. E. Crissy, Assistant Professor of 
Psychology, Queens College; and Lecturer on 
Applied Psychology, Fordham University; New 
York, New York. 

This test has considerable “face validity” as 
a measure of the essential arithmetic processes 
needed in such jobs as sales clerk, claims ad- 
juster, and buyer in retail stores. It is easily 
administered and the directions are clear and 
explicit. Working time is 20 minutes—not an 
unreasonable time expenditure in view of the 
on-the-job importance of what is measured. 

The manual contains reliability and validity 
data. The former are not too meaningful since 
only a stepped-up reliability coefficient (.92) is 
reported and no measure of dispersion is fur- 
nished. The validity data are fairly conclusive. 
Apparently significant relationships were found 
when such criterion measures as ratings given 
employees by instructors in system training 
classes and records of cash errors were used 
with various precisely described occupational 
groups. However, the statements and data 
furnished are difficult to evaluate, since no meas- 
ures of dispersion are reported on the criterion 
measures and only extremes of the distribution 
are employed in the statistical analyses. 

While the test was originally developed for 
use by one store, the manual states that several 
stores within the industry have now used it and 
have reported validity data in general agree- 
ment with those originally determined. Per- 
centile norms are provided for nine occupational 
groups. The correlations of the test scores with 
other measures indicate a considerable overlap 
between what is measured and general aptitude 
traits. For example, on a mixed applicant and 
employee sample, the correlation with intelli- 
gence as measured by the Wonderlic Personnel 
Test is .65; on another group, all salespeople, 
the correlation with intelligence as measured by 
an adaptability test is .54. 3 

In summary, the reviewer recommends this 
test as an apparently reliable and valid work- 
sample measure of the arithmetic processes used 
by retail store personnel. 


[419] i 
*Stanford Achievement Test [Arithmetic]; 
Grades 2-3, 4-6, 7-9; 1922-46; a subtest of S! tanfora 
Achievement Test (see 25) ; 3 scores: arithmetic m 
soning, arithmetic computation, total; IBM for ga io 
4-9; 3 levels; the Manual for Interpreting referred 0 
in the directions for administering has not been Ds 
lished ; postage extra; 35% per specimen set of any ОП" 
level of any one edition, postpaid; separate Lad: 
sheets may be used for grades 4-9; Truman L. Kelley, 
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Еі M. Ruch, апа Lewis M. Terman; World Book 

o. 

a) PRIMARY ARITHMETIC TEST. Grades 2-3; 1940-41; 

Forms D (40), E (40), F (41); directions for ad- 

ministering ('40); $1.25 per 25; 25(30) minutes. 

b) INTERMEDIATE ARITHMETIC TEST. Grades 4-6; 2 edi- 

tions. 

1) Hand Scoring Edition. 1940-43; Forms D ('40), 
E (40), Е Сат), С (42), Н (43) ; directions for 
administering (’40) ; $1.65 per 25; 50(55) minutes. 
2) Separate Answer Sheet Edition. 1041-46, C1940- 
46; Forms DM, (41), ЕМ (43), FM. (4 ) ; direc- 
tions for administering ('41) ; $2.30 per 25; 856 рег 
25 ІВМ answer sheets; 40€ per set of machine scor- 
ing stencils for any one form; 55 per 25 profile 
charts ('42) ; 60(65) minutes. 

с) ADVANCED ARITHMETIC TEST. Grades 7-9; 2 editions. 
1) Hand Scoring Edition. 1940- orms D (до), 
Е (40), Е (41), G (42), Н (43) ; directions for 
administering (’40) ; $1.65 per 25; 50(55) minutes. 
2) Separate Answer Sheet Edition. 1941-46, C1940- 
46; Forms DM (лт), EM (42), FM (46) ; direc- 
tions for administering ('41) ; $2.30 per 25; 85€ рег 
25 IBM answer sheets; 40€ per set of machine scor- 
ing stencils for any one form; 55€ per 25 profile 
charts (42); 60(65) minutes. 

REFERENCES 
т. Laney, ARTHUR R., Jr. “Validit; of Employment Tests for 


Gas-Appliance Service Personnel" Personnel Psychol 4:199— 


208 su '51. * (PA 26:1735) 


Rosznr L. Burcu, Associate Editor of Elemen- 
tary School Textbooks, Ginn and Company, 
Boston, Massachusetts. F 

In their inception, the Stanford arithmetic 
tests represented the best in the then recognized 
principles of test construction. They have been 
surpassed by more recently developed achieve- 
ment tests but are still acceptable to survey 
achievement in problem solving and in computa- 
tion in schools which need a minimum of helpful 
supplementary materials. The test user must be 
cautious in employing the norms, since they may 
well be inaccurate for present-day schools. 

These Stanford tests need to be completely 
overhauled. The authors have a remarkable op- 
portunity to build an instrument that will be 
far ahead of any other such test on the market. 
The modal-age grade norms, which are basic in 
the present edition, should be continued; per- 
centile norms should be made available in addi- 
tion to the traditional age and grade norms, The 
“equated scores” can be improved by using a 
wider scale. The items in the test can be better 
grouped than at present. The grade placement of 
items needs to be carefully examined. A much 
more useful Manual for Interpreting than was 
earlier available for this series should be pro- 
vided. Most important, the authors need to ex- 
plore very carefully the possibility of trying to 
measure more of the objectives of the present- 
day program in arithmetic as revealed in current 


courses of study and arithmetic textbooks. If 
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this were attempted, the test would include items 
to measure concepts about whole numbers, deci- 
mals, fractions, and the number system. There 
would also be items to evaluate achievement in 
regard to the generalizations, relationships, 
judgments, and understandings about the oper- 
ations that occupy-an increasingly larger portion 
of the time in the modern arithmetic program. 

It is too easy for authors to excuse the omis- 
sion of items that are not computational or prob- 
lematic in nature with the argument that the 
tests should follow and not lead teaching trends. 
By the same token, tests should not retard cur- 
riculum trends, which, unless arithmetic achieve- 
ment tests broaden the areas measured to include 
the whole program covered by curriculum guides 
and arithmetic textbooks, is just what they do. 

A revision of the Stanford Achievement Test 
in Arithmetic—no matter how carefully and 
conscientiously conceived—that fails to broaden 
the base of measurement can be, at best, only a 
slight improvement over tests that are currently 
available. Conversely, if the tests are developed 
thoroughly and scientifically and if they do a 
good job of measuring arithmetic objectives in 
the broader sense, then they may well be unpar- 
alleled as instruments for measuring achieve- 
ment in arithmetic. 


For reviews by Paul R. Hanna and Claude 
E. Norcross and Vergil E. Herrick of the com- 
plete battery, see 25; jor reviews by Walter W. 
Cook and Ralph C. Preston of the complete bat- 
tery, see 3:18. 


[420] 
*Test of Mechanical Arithmetic. Ages 8-10; 1949- 
50; Forms ТА ['49], 1B ['49]; manual ['50] ; 14s. per 
100; 25. per 12; 3d. per single сору; od. per manual ; 
postage extra; 30(35) minutes; Miriam E. Highfield ; 
National Foundation for Educational Research in Eng- 
land and Wales. * 


[ 421] 

*Woody-McCall Mixed Fundamentals in Arith- 
metic, Grades 2-9; 1920-36; Forms I (720), II (20), 
III (23), IV. (23) ; no data on reliability and valid- 
ity; revised manual ('36) ; 90¢ per 100; 2 ¢ per speci- 
men set; postpaid; 20(25) minutes; Clifford Woody 
and William A. McCall; Bureau of Publications, 
Teachers College, Columbia University. * 


WiLiAM А. BrownEtt, Dean, School of Edu- 
cation, University of California, Berkeley, Cali- 
fornia. 

This test, published first in 1920, is now of in- 
terest more for historical than for practical or 
for research reasons. It was prepared in the éarly 
days of what has come to be known as “the meas- 
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urement movement," and for 10 or 15 years it 
was probably more widely used than any other 
arithmetic test of comparable organization and 
content. In 20 minutes it was supposed to evalu- 
ate both individual and class achievement in the 
subject, and to do so in a way that warranted 
classification of a child or a group anywhere in 
the grade range from 1.9 to 15.1, and in the age 
range from year 6.7 to 21.9. Differences of but 
two points in scores were many times equivalent 
to differences of a half grade (or more) in grade 
placement and of as much as a year and a half 
in age placement. 

The content of the test reflects fairly well the 
conception of arithmetic in vogue in the 1920's 
(save that it includes nothing on “problem solv- 
ing"). Skill in "the fundamentals" was the ma- 
jor objective in arithmetic; hence, the test was 
designed to assess proficiency in computation. 
Form I may be analyzed for illustrative purposes. 
The first example calls for the addition of two 
digits ; the last, for the addition of five numbers, 
one of which is a two-place number, and the re- 
maining four mixed decimals. Between these 
two examples are 33 others in mixed order. 
There are 10 examples in addition (six with 
whole numbers, none with fractions, and four 
with decimals, including money numbers), 7 in 
subtraction (four with whole numbers, two with 
fractions, and one with decimals), 11 in multi- 
plication (six with whole numbers, three with 
fractions, and two with decimals), and 7 in di- 
vision (four with whole numbers, two with frac- 
tions, and one with denominate numbers). Sum- 
marized in a different way, 20 of the 35 exam- 
ples require computation with whole numbers, 
7 with fractions, 7 with decimals, and 1 with 
denominate numbers. 

One of the addition examples involves 16 ad- 
dends, and another includes a mixed decimal of 
"six places. The subtrahend in one subtraction 
example is also a six-place mixed decimal. The 
next to last item in the test is .0963 X .084. 
These items, suited to the arithmetic program 
in 1920, appear in no modern test or text. Be- 
cause it measures skills carried far beyond the 
limits to which they are now taught, the test has 
long since been outmoded. But the test has other 
limitations of content, limitations that must have 
reduced its validity even in the days of its great- 
est popularity. For one thing, it contains little 
of the arithmetic taught in the higher grades. 
The fraction skills represented are few ; there is 
only one example with denominate numbers, and 


there are none with per cents and ratio. In a 
word, the sampling was unduly restricted, prob- 
ably in the attempt to produce a test that could 
be administered in a short period of time, 

As is implied ім the foregoing paragraphs, 
this test is of little usefulness nowadays. Compu- 
tational skills have been simplified, in the sense 
that instruction in each case stops at, or just a 
little beyond, the level of complexity required 
in adult life in the community. But that is not 
all. Computational skills as such are not re- 
garded as ends in themselves; rather, they are 
viewed as important to the extent that they are 
functional, to the degree that they make for in- . 
telligent adjustment to our highly quantitative 
culture. Moreover, there is growing recognition 
of noncomputational arithmetic—of the ability 
to "size up" quantitative situations, to approxi- 
mate, and to arrive at solutions “close enough" 
for practical purposes and without benefit of 
paper and pencil. 

Such changes in our conception of arithmetic 
call for corresponding changes in evaluation, If 
the products of today's schools are to behave 
intelligently in quantitative situations, they must 
(a) understand arithmetic as a special system of 
thinking and (b) have the disposition to use 
(and the habit of using) the arithmetic they 
know. By the same token, evaluation is being 
broadened to include basic mathematical con- 
cepts, principles, and generalizations ; resource- 
fulness and originality in solutions ; flexibility in 
thinking with numbers and about number rela- 
tionships; the tendency to employ arithmetic as 
it is learned, in a natural, confident, and useful 
manner, and so on. 

The test under review obviously provides no 
means of evaluating such learning outcomes, 
however well it may have served the needs for 
evaluation in 1920 or 1925. Scores on the test 
(for which there are age, grade, and T-score 
norms) do not measure achievement in the 
modern day program. Certainly they cannot be 
said to be useful for individual diagnosis, ОГ 
even for meaningful group diagnosis. At best the 
scores can be justifiably employed only for the 
crudest kinds of comparison, as, for example, in 
rating school units with respect to national 
norms. Even for this purpose, other more re- 
cently standardized tests with their more care- 
ful selection of content, with their more compre- 
hensive coverage, and with their longer working 
time and ampler working space, are to be prê- 
ferred. 
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[422] 
Columbia Research Bureau Plane Geometry 
Test. Grades 10-12; 1924-26; 2 booklets; the 2 book- 
lets together called “Augmented Test"; Forms A 
('26), В (26); manual ('26) ; supplementary manual 
(^26); $1.75 per 25, postage extra; 356 per specimen 
set, postpaid; Herbert E. Hawkes: and Ben D. Wood; 
World Book Co. * 
а) PARTS 1-11. 1924-26; 60(65) minutes. 
b) PARTS III-VI. 1926; an optional essay-type test; no 
data on reliability; 70(80) minutes. 
REFERENCES 

т. Unmsmock, RICHARD STEPHEN, “Construction of a Selec- 
tion Test for College Graduates." J General Psychol 41:153-93 
О "49. * (PA 24:4874) 


Суви. J. Ноут, Director, Bureau of Educa- 
tional Research; and THEopoRE E. KELLOGG, 
Instructor, University High School; University 
of Minnesota, Minneapolis, Minnesota. 

This test consists of Part 1, 65 items in 20 
minutes, and Part 2, 35 items in 40 minutes. 
The required time limits are not convenient for 
many present school schedules; the authors do 
not suggest the possibility of splitting the ad- 
ministration of the test. The items in Part 1 are 
true-false which are corrected for guessing, and 
the items for Part 2 are problem type questions 
requiring numerical or literal answers. Answers 
in Part 2 may be left in formula or equation 
form, thus avoiding unnecessary arithmetical 
computation in the test. About one third of the 
items in Part 1 require recognition of geometric 
facts with little deviation from the usual text- 
book statement. Other items in this part require 
inferences ranging from simple to complex. The 
items in Part 2 all require application of one or 
several geometric relationships to a specific 
problem. The content of the test is a good sample 
of the geometric relationships found in the usual 
text, No problems are of a practical application 
or “realistic” nature. The test as a whole em- 
phasizes deductive inference with little or no 
emphasis on the formal proof. 

Available with this test is an augmented test, 
consisting of Parts 3-6 which may be mimeo- 
graphed or put on the board and the required 
answers written by the students. This test is 
intended to serve as a check, to provide increased 
information and to verify standings on Parts 
1 and 2. Parts of the augmented test include the 
following : Part 3, Loci (15 minutes ) objectively 
scored; Part 4, Converses (5 minutes) objec- 
tively scored; Part 5, Definitions (5 minutes) 


nearly objectively scored; Part 6, Demonstra- 
tions (45 minutes) subjectively scored. 

Since this test is over 20 years old it is of in- 
terest in reviewing it, to compare and contrast 
it with the two other tests the reviewers have 
recently considered—the Cooperative Plane Ge- 
ometry Test and the Shaycoft Plane Geometry 
Test. Both the Cooperative and the Shaycoft 
tests have more clearly defined and adequate 
norm groups although both of these tests leave 
something to be desired in this respect. The 
norms for the Columbia test are based largely 
on college freshmen and hence have only limited 
usage for high school classes. Present day tests 
have not particularly improved in determining 
and reporting validity and reliability. All three 
tests depend largely upon face validity for the 
former quality and upon measures of internal 
consistency for reliability. More adequate atten- 
tion needs to be given to the particular student 
abilities assessed by specific items and subsec- 
tions of the tests. The format and typography of 
the Shaycoft and Cooperative tests are greatly 
improved as compared with the Columbia. On 
a continuum of amount of memorization and 
speed required the Columbia ranks highest, the 
Cooperative second, and the Shaycoft third. On 
a continuum similarily related to deductive in- 
ference the same order prevails among the de- 
mands of these three tests. f 

In constructing a plane geometry test the an- 
swers to certain basic questions are more or less 
clearly accepted by the author. Some of the most 
cogent of these questions revolve about the func- 
tions appropriately served—in the secondary 
school curriculum—by instruction in plane ge- 
ometry. A most sketchy outline of these func- 
tions would include the following considera- 
tions: (a) Is geometry instruction intended 
only for a select few—a mathematical intelli- 
gentsia? (b) Should the tenth grade course in 
geometry seek foremost that pupils develop their 
skills in devising deductive proofs of geometric 
propositions? (с) Should the subject matter of 
geometry primarily be considered a vehicle for 
aiding students to develop their abilities to reason 
logically in problem solving—deductively and 
inductively, directly and indirectly, utilizing in- 
tuition, estimation, measurement, scale drawing, 
analysis, synthesis and many other modes of 
attack? (d) Should instruction emphasize prac- 
tical geometric applications of the subject matter 
content in such areas as mapping, designing, 
triangular bracing, surveying, other indirect 
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measurement and relations of geometric figures 
to common activities and forms ? Certainly these 
questions are not sufficient for specifying the 
purposes of instruction, nor can one expect that 
an author’s answers to these must be all or none 
—either positive or negative. Rather, most 
teachers recognize advantages of partial accept- 
ance of each of these several purposes and of 
other goals more carefully stated elsewhere. The 
Cooperative and Columbia tests seem to be based 
largely upon an affirmative answer to the first 
two questions, and a negative answer to the third 
and fourth, while the Shaycoft test seems to be 
based upon negative answers to all four ques- 
tions. The reviewers feel geometry is necessarily 
selective but not in the degree indicated by the 
Columbia test, that there is some place for intui- 
tion and induction in geometry instruction, that 
application in real situations is of importance, 
and that geometry instruction has something to 
offer in addition to reasoning for its own sake. 
If one accepts these tenets, there is much to 
be desired in the tests reviewed. As pointed out 
earlier, one cannot test everything in a limited 
period of time; selection is necessary. Hence, by 
an author's selection and design of test situations 
he reveals his position regarding the purposes of 
geometry instruction. The authors of current 
tests have given too little attention to the abili- 
ties which students need in order to answer the 
items correctly. As a result, current tests place 
too great an emphasis on the memorization of 
subject matter content so that high achievement 
in geometry has come to be synonymous with a 
high stock of memorized factual details. Such 
tests, if used seriously by schools, will warp the 
curriculum and the instructional activities to 
exclude such things as practical applications and 
attention to problem solving. 

Thus, a series of tests in which the several stu- 
dent abilities developed in geometry instruction 
are evaluated is in order. If practical application 
is to be an important part of geometry teaching, 
then it should be an important part of testing ; 
and similarly for other aspects of instruction. It 
is also of importance to test over a range com- 
mensurate with the pupil abilities the teacher 
is likely to confront in his classroom. This is 
not a plea for making geometry less difficult, 
which the reviewers oppose, but a plea for 
realistic consideration of existing conditions 
in the classrooms of the better schools of to- 
day. 


For reviews by W. Elmer Lancaster and 
J. H. Minnick, see 40:1466. 


[423] 

*Cooperative Plane Geometry Test. High school; 
1932-51; IBM; Forms S (42), Т (’51-—same as test 
copyrighted in 1943), Y (48), 2 (^50) ; Forms 1932- 
37, N-R out of print; no data on validity; no specific 
manual; general Cooperative manual (51); norms 
[37]; $2.25 per 25; 50€ per specimen set, postpaid; 
separate answer sheets may be used; 80€ per 25 IBM 
answer sheets; 15€ per stencil for scoring answer 
sheets; cash orders postpaid ; 49045) minutes; John А. 
Long (S), L. P. Siceloff (S), Emma Spaney (S), 
Margaret P. Martin (S, T), H. Vernon Price (8), 
and Bernice Orshansky (Z); Cooperative Test Divi- 
sion, Educational Testing Service. * 


REFERENCES 
т. Townsend, AGATHA, "The Cooperative Mathematics Test’ 
Program (Mathematics for Grades 7, 8, and 9, Elementary Al o 
е, Intermediate Algebra, Plane Geometry, Solid Geometry, 
rigonometry, and Secondary School Mathematics): Reliabili- 
ties and Correlations With School Marks," pp. 89-95. (PA 25: 
67) In 1950 Achievement Testing Program in Independent 
Schools and Supplementary Studies, Educational Records Bulle- 
tin, No, s4. New York: Educational Records Bureau, July 1950. 
Pp. xiii, 119. Paper, lithotyped, * 


Суви. J. Ноут, Director, Bureau of Educational 
Research; and TuroporE E. KELLOGG, Instruc-* 


tor, University High School; University of 


Minnesota, Minneapolis, Minnesota. [Review of 
Forms Y and Z.] 

This test has a total working time of 40 min- 
utes. It is organized in three parts: I, (30 items: 
10 minutes) ; II, (20 items: 15 minutes) ; and 
III, (15 items: 15 minutes) : Students are asked 
to proceed to the next part after the designated 
time limits but may return to any part if less 
time than that allowed is required for others. 
There is some tendency for the test items to be 
more difficult or more time-consuming as the 
test proceeds, but items of various difficulty lev- 
els are scattered. 

The test emphasizes the traditional content of 
high school plane geometry and seems to be 
sufficiently challenging for even the most able 
student. Not more than 25 per cent of the items 
test solely the recognition of facts in straight- 
forward form, A vast majority of questions Te 
quire not only knowledge of facts and relation- 
ships but their intelligent use. The ability to 
prove theorems is tested by sampling the various 
skills and understandings required for рг00 
separately, but no one proof is followed through 
completely. ; 

The first 30 items are modified true-false, 1 
that the pupil is required to decide whether 
statements are always true, sometimes true, or 
never true. Several of these require considerable 
insight. The next 35 items are five-option multi- 
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ple choice, ranging from recognition and simple 
computation to selection of reasons. The scoring 
of all items in the test is corrected for guessing 
or chance success. 

The test definitely tends toward the difficult 
end of the continuum from easy to difficult. The 
student taking the test confronts an undue em- 
phasis on speed and may be considerably handi- 
capped if he does not have a large number of 
geometric facts at his fingertips or if he has a 
slow reading rate. A student scoring high on 
the test undoubtedly would possess considerable 
geometric skill. A low score, however, might not 
be an equally valid indication of lack of such skill 


since the student who “figures things out” would 


have considerable difficulty in meeting time lim- 
its. This characteristic of the test lowers its va- 
lidity as a power test. The reviewers feel that 
while facility in the use of geometric content is 
basic in determining the good student, speed is 
too large a determinant of a high score in this 
test. 


The several forms of the test are designed to- 


be directly comparable, variations in difficulty 
from form to form presumably being taken care 
of by the conversion of raw scores to Scaled 
Scores. However, an analysis of results obtained 
by 89 students in a sample administration leads 
the reviewers to question the equivalence of the 
Scaled Scores provided for the two most recent 
forms, Y and Z. In this administration signifi- 
cantly different mean Scaled Scores were ob- 
tained on the two forms by two equivalent 
groups, and there was likewise a significant dif- 
ference in variance of Scaled Scores by the two 
groups. The lower mean, Scaled Score was ob- 
tained on the form that careful examination 
shows to be the more difficult of the two forms, 
thus reinforcing the reviewers’ conviction that 
the Scaled Scores for at least one of these forms 
are faulty. 

The test largely ignores an experimental or 
inductive approach to geometric problems; 
whether or not such approaches should be con- 
sidered in a test of this type is a moot question. 
The reviewers feel there is much value in such 
approaches. Few or no items are couched in 
terms of realistic or practical situations. Nearly 
every widely used text gives considerable atten- 
tion to practical applications, and most teachers 
consider these important. The reviewers feel 
that failure to relate geometric ideas to real prob- 
lems is a definite weakness of this test. 


In fairness, it should be pointed out that a 
single test with such a limited number of items 
cannot be expected to test all the desirable in- 
structional objectives. The reviewers feel, how- 
ever, that test authors and publishers must pre- 
pare batteries of tests for assessing the most im- 
portant student abilities which geometry instruc- 
tion is aimed at developing. At present, teachers 
of geometry are faced with a number of published 
tests, such as those under review, testing a sort 
of nebulous general geometric achievement. Just 
what student abilities these tests measure and to 
what use the scores can be put are questions that 
must be answered before the validity of the tests 
will be known. 

The published information regarding the basis 
for determining: the content of the Cooperative 
Plane Geometry Test is totally inadequate. The 
teacher and school administrator are entitled to 
a clear-cut statement of the instructional objec- 
tives.appraised by any published test—and pub- 
lishers are obligated to provide this informa- 
tion. On the basis of such statements the pros- 
pective purchaser can decide the uses (if any) 
which he can make of the scores obtained from 
such tests. Authors and publishers who have 
previously developed favorable reputations have 
often rested on their laurels, marketing their 
tests on the basis of past reputation. Prospective 
purchasers of achievement examinations should 
objectively consider each examination on its own 
merits. 


For reviews by Harold P. Fawcett and C. O. 
Oakley of Forms R, S, and T, see 3:357; for a 
review by Leroy H. Schnell of Form P, see 
40:1467; for reviews by Charles C. Weidemann 
of Form 1937 and S. S. Wilks of Form N, see 
36:993. 

[424] 

*Geometry Attainment Test. Secondary schools ; 
1948; 1 form [48]; no data on reliability; manu 
[48] ; 5s. per 12; 6d. per single copy; Is. per manual; 
Is. 3d. per specimen set; postage extra; 90(95) min- 
utes; R. D. Walton; University of London Press Ltd. * 
I. MacrantANE SmitH, Officer-in-Charge, 
Tests Division, National Foundation for Educa- 
tional Research in England and Wales, London, 
England. 

This test is intended to measure attainment 
in geometry at any stage of a secondary school 
course after six months. The manual lists several 
uses to which the test may be put, briefly sum- 
marised as follows: to compare levels of attain- 
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ment, to diagnose weaknesses, to assist in re- 
vision, and to discover children with high geo- 
metrical ability but low verbal ability. 

Since the content of the test appears to be 
based on the geometry curriculum of the gram- 
mar school, it would seem to be intended for use 
with grammar school pupils. The content is cer- 
tainly not suitable for typical secondary modern 
school courses in England and Scotland. The 
manual states that the test has been standardised 
by application to 1,484 pupils in different types 
of secondary schools in England and Scotland. 
It is not made clear, however, whether this sam- 
ple was intended to be representative of all types 
of schools (grammar, technical, and secondary 
modern) or whether it consisted only of good, 
average, and poor grammar school pupils. 

There is a certain amount of ambiguity about 
the timing. For pupils who have studied geome- 
try for more than two sessions the time limit is 
90 minutes. For pupils in the first form it.is 30 
minutes, and for pupils in the second form it is 
60 minutes. The manual states, however, that, if 
necessary, first and second form pupils may be 
given a longer period than the times suggested. 
In view of this latitude in timing, the data on 
which the norms are based may be expected to 
show considerable variations from one school to 
another. 

‘ Norms are given in two ways: according to 
the length of time during which geometry has 
been a subject of study and according to age. The 
test is intended for the age range 12-0 to 16-5. 
Since the highest possible score is 100 and the 
mean score of the age range 16-0 to 16-5 is 43, 
the level of difficulty would appear to be most 
suitable for pupils in this age range. For pupils 
at the end of the fourth year, the 5oth percentile 
is 50. The original sample of 1,484 pupils was 
much too small to give reliable norms for such 
a wide age range and to take adequate account 
of possible variations in curriculum from one 
area to another. The manual states that the 
norms are possibly too low, owing to the dis- 
turbance of education during the Second World 
War, and it invites teachers to forward sum- 
maries of results so that more adequate norms 
can be prepared. In a private communication to 
the writer, however, the author states that data 
obtained more recently show that the standards 
have not altered since the war. 

The manual contains no information about 
validity or reliability, but the author states that 
the reliability coefficient has been found to be 


.87 by the split half method. The correlation with 
marks in ordinary school examinations in ge- 
ometry is usually about .7. This result is very 
satisfactory, especially since it was obtained 
from groups of pupils of a comparatively re- 
stricted range of ability. 

The following minor criticisms may be men- 
tioned: (a) In Item 32 it is not made clear that 
two answers are required. (b) Item 50 is vague, 
Intelligent pupils might consider that some of 
the possible answers are too obvious to be worth 
mentioning. (c) Items 6, 16, 37, 54, and 57 are 
not clear as to the extent variants of the stand- 
ard enunciations should be penalised. A 

In spite of the foregoing criticisms, the test 
is likely to be of considerable value to teachers 
for the purposes stated in the manual. The author 
is justified in claiming that it includes “most of 
the important results that are usually dealt with 
in a three years’ course of school geometry,” if 
it can be assumed that the course referred to is 
that normally taken in a grammar school. Intelli- 
gent pupils are likely to do well in the test, partly 
because it is more interesting to them than an 
ordinary examination and also because the ques- 
tions test deductive reasoning. Duller pupils will 
do worse in it than in an ordinary examination 
because there is less opportunity to reproduce 
material learned by rote. There is no reason to 
doubt the author’s claim that the test is of value 
for discovering pupils with good geometrical 
ability. 


W. L. Sumner, Senior Lecturer in Education, 
University of Nottingham, Nottingham, Eng- 
land. 

Within the limits of its scope this test fulfils 
its purpose admirably. It is intended for pupils 
of both sexes who have completed a three year 
course of school geometry in an English sec- 
ondary grammar or "public" school, but the 
questions are so graded that it can be given at 
any stage after six months’ work in the subject. 
A very complete manual of instructions is avail- 
able for use with the test. 

The test, which is influenced by the standard 
school geometries based on Euclid, has been de- 
veloped from a large number of trial questions. 
There is at least one short question on each 0 
the main results, instead of the few long ques 
tions of the ordinary type of examination where 
there is inadequate sampling of the pupil's 
knowledge. Diagrams have been used rather 
than written questions ; most of the answers 1- 
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volve little writing for the pupil and permit 
quick, objective marking. 

Certain conventions of signs and nomencla- 
ture have been observed throughout the test ; it 
is pleasing that these are becoming general in 
the teaching of geometry in Britain. The test, 
which covers much of the ground to the cyclic 
quadrilaterals of Euclid, is essentially practical, 
and is not concerned with the spatial and other 
factors of mental ability which contribute to 
geometrical skill. 


[425] 


*Geometry: Every Pupil Test. High school; 1929- 


51; new form usually published each April and Decem- 
ber; form April 1951; no data on reliability and valid- 
ity; no manual; norms ('51) ; 216€ per test; 1€ per an- 
swer key; postpaid; 40(45) minutes; Ohio Scholar- 
ship Tests, Ohio State Department of Education. * 


[426] 
X*Lane-Greene Unit Tests in Plane Geometry. 
High school; 1929-44; 6 tests in 1 booklet; Forms A 
C44), В C44) ; $3.50 per 25; separate answer sheets 
must be used; 40 per 25 answer sheets for any one 
test; 50¢ per complete set of scoring stencils; 10€ per 
scoring stencil for any one test; 25€ per manual; post- 
age extra; 50€ per specimen set, postpaid; Ruth O. 
Lane and Harry A. Greene; Bureau of Educational 
Research and Service, State University of Iowa. * 
а) TEST I, FUNDAMENTAL IDEAS OF GEOMETRY. 35(40) 
minutes. 
b) TEST II, PARALLEL LINES AND TRIANGLES. 5 scores: 
indicating what is to be proved, indicating what is 
given, proving theorems, miscellaneous, total; 38(45) 
minutes. 
C) TEST III, RECTILINEAR FIGURES. 5 scores: same as 
Test II; 38(45) minutes. 
d) TEST IV, THE CIRCLE. 5 scores: locus problems, con- 
struction problems, proving and applying theorems, 
miscellaneous, total; 38(45) minutes. 
€) TEST V, PROPORTION AND SIMILAR POLYGONS. 5 
scores: proportion, proportional lines, proving and ap- 
plying theorems, miscellaneous, total; 36(40) minutes. 
Í) TEST VI, AREAS OF POLYGONS. 5 scores: computation 
of lengths of lines and areas of polygons, comparison 
of areas, proving theorems, miscellaneous, total; 36 
(40) minutes. 


[427 ] 
*Orleans Geometry Prognosis Test, Revised 
Edition. High school; 1929-51; 1 form, '50; manual 
(’51) ; $2.40 per 25, postage extra; 356 per specimen 
set, postpaid; 39(44) minutes; Joseph B. Orleans; 
World Book Co. * 
REFERENCES 


1-3. See 40:1471, M 

4. (CIUS Mat P. The Prediction of Success in Plane Geom- 
etry, Master's thesis, Warns University (Detroit, Mich.), 1942. 
Pp. 81. (Bibiana gis of Master's Theses 1941-1942, 1943, Е E 

5. Unrsrock, RICHARD STEPHEN, “Construction of а elec- 
tion Test for College Graduates," J General Psychol 41:153-93 
О "49. * (PA 2414874) 


For reviews by Edward E. Cureton and 
Charles C. Weidemann, see 40:1471. 


[428] 
*Plane Geometry: Achievement Examinations 
for Secondary School. High school; 1951; 1 form; 
no data on reliability and validity; no manual; Minne- 


sota norms (median and quartile deviation) available; 
similar norms for other regions by special arrange- 
ment with publisher ; 76 per test, postage extra; 6o (65) 
minutes; edited by Walter W. Cook; prepared by a 
curriculum committee of high school teachers for use 
in the Minnesota State Board Achievement Examina- 
tions Program; Educational Test Bureau, Educational 
Publishers, Inc. * 


[429] 
*Plane Geometry Test: State High School Tests 
for Indiana. r, 2 semesters high school; 1933-52; 2 
levels ; mimeographed ; no data on reliability and valid- 
ity; no manual; 4166 per test; 15€ per specimen set; 
postpaid; 40(45) minutes; Philip Peak, Douglass 
Brown, Margaret Goodson (A), Pryce Noe (A), and 
Reino Takala (O); State High School Testing Serv- 
ice for Indiana, Purdue University. * 
а) FIRST SEMESTER. Form А o norms Psi. 
b) SECOND sEMESTER. Form О ['51]; Form N out of 
print; norms ['52]. 


[430] 
*Plane Geometry: 20th Century Test. 1, 2 semes- 
ters high school; 1949; 2 levels; Form А; no data on 
reliability and validity; no norms—author recommends 
the use of local norms ; 10 or more copies, 5€ each, post- 
age extra; 356 per specimen set, pns 40(45) min- 
Ше, Ж Benton Review Publishing , Inc, Fowler, 
n 


[431] 
*Schrammel-Reed Solid Geometry Test. High 
school; 1950; Forms A, В; $1.15 per 25; 20€ per speci- 
men set; postpaid; 50(55) minutes; H. E. Schrammel 
and Virginia M. Reed; Bureau of Educational Meas- 
urements, Kansas State Teachefs College of Empo- 
ria. 

[432] 
*Seattle Plane Geometry Test: Evaluation and 
Adjustment Series. High school; 1951; IBM; Form 
AM; $215 per 25; separate answer sheets must be 
used; 80¢ per 25 IBM answer sheets; postage extra; 
35€ per specimen set, postpaid; 40(50) minutes; Har- 
old B. Jeffery, S. L. Merriam, Clifton T. Smith, Roy 
[у Kellogg, and Richard E. Bennett; World Book 


[433] 

*Shaycoft Plane Geometry Test: Evaluation 
and Adjustment Series. Grades 10-13; 1951-52, 
c1950-51; IBM; Forms AM (51), BM ('52) ; man- 
ual (51); $2.15 per 25; separate answer sheets must 
be used; 80€ рег 25 IBM answer sheets; postage ех- 
tra; 35€ per specimen set, postpaid; 40(50) minutes; 
Marion F. Shaycoft; World Book Со, * 

Hanorp P. Fawcett, Professor of Education, 
Ohio State University, Columbus, Ohio. 

. The two forms, Am and Bm, of this test are 
so nearly equivalent that it will suffice to review 
only Form Am. “Part A is devoted primarily to 
the measurement of important basic concepts 
and facts of geometry, while Part B measures 
application and reasoning on the basis of in- 
formation," This quotation from the manual 
quite clearly delineates the character of the 43 
questions in Part A. Of the 17 questions in Part 
B, 7 yield a numerical answer computed from 
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given data and a memorized formula or geo- 
metric principle, which is more recall and compu- 
tation than "reasoning." There are several ques- 
tions in Part B which provide the student an 
opportunity to show evidence of careful, logical 
thinking. The last question, for example, con- 
tains excess data which must be intelligently ig- 
nored or at least set aside in the process of solu- 
tion. When this type of question is considered, 
the test has more than the 2 per cent given to 
geometric reasoning, as stated in the manual. 

It seems of dubious value to have questions 
with more than one correct answer on an objec- 
tive test. Questions 15 and 20 are of this type, 
due to the fact that congruent figures are similar 
and also equal. Moreover, since the key shows 
only one correct answer for each of these ques- 
tions, the norms are slightly affected. 

As for the “demonstrative” aspect of this test, 
the fact that only 2 per cent is given to questions 
on “analytic vs. synthetic proof ; indirect proof ; 
and converse of theorem" indicates how inade- 
quately this instrument tests the understandings 
associated with either inductive or deductive 
proof. There is no attempt to apply geometric 
reasoning to nonmathematical situations or to 
scientific, engineering, and everyday problems. 

The manual is quite complete in its statistical 
treatment of the test. An average coefficient of 
reliability of .82 was determined from three cor- 
rected split half reliability coefficients based on 
test results from groups of 146, 116, and 181 
students. A coefficient of reliability of .80 was 
determined by administering both forms to 161 
students in one community within an interval of 
less than one week. The authors, using the stand- 
ard error of measurement, state that “there are 
two chances in three that an individual’s score 
on the test does not differ by more than 5 points 
from his hypothetical ‘true’ score.” 

Three experimental forms of this test were 
administered to approximately 2,200 students. 
Difficulty and validity indices were computed 
for each of the items and, on the basis of this in- 
formation, items were selected to yield the two 
final forms of the test. The test was administered 
to 2,914 students in 54 schools in 24 different 
states for standardization in the spring of 1950. 
Raw scores on the test are converted to a scale 
of normalized standard scores. Percentile norms 
corresponding to standard scores are provided 
for use in interpreting the results of the test ad- 
ministered at the end of one year of instruction 
in plane geometry. 
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In summary, the Shaycoft test is a good, or- 
dinary, factual plane geometry test, with some 
questions involving reasoning in purely geo- 
metric situations and some using applications 
of formulas in computational problems. Even 
though the objectives and content of this test 
were determined by using the pronouncements 
of most of the important committees on mathe- 
matics education since 1923, it seems that little 
attention is given to the fundamental aspects of 
proof and to the social significance of geometry, 


Cyrit J. Ноут, Director, Bureau of Educational 
Research; and Turopore E. KELLOGG, Instruc- 
tor, University High School; University of 
Minnesota, Minneapolis, Minnesota. 

This 40-minute test consists of two parts, A 
(43 items) and B (17 items). After 20 minutes 
students are asked to proceed to Part B but are 
directed to return to Part A if time permits. 
According to the author, Part A measures 
primarily concepts and facts, while Part B meas- 
ures application and reasoning. This distinction 
is reasonably clear-cut, though the relative em- 
phasis is questionable (43: 17 in terms of items, 
or 20:20 in terms of time) : the reviewers con- 
sider application and reasoning important 
enough to warrant a much greater emphasis both 
in time and in number of items. 1 

The categories used by the author in describ 
ing the test content imply that the intended user 
is more interested in the percentage distribution 
of items according to subject matter categories 
than according to the pupil abilities tested. There 
are, no doubt, many individuals, schools, and 
other organizations whose evaluation of instruc- 
tion is based solely on the acquisition of subject 
matter with little or no concern for the abilities 
which pupils develop. However, for such а 
tough-minded subject matter view of geometry, 
the Shaycoft test seems to be too seriously wa- 
tered down. On the other hand, for the evalua- 
tion of general educational development, the au- 
thor has not given sufficient attention to meas- 
uring each of the several aspects of reasoning 
ability. Я 

About опе third of е test items require little 
more than the recognition of definitions an 
other statements as usually presented in geome 
try texts. Very few items demand moderately 
complex deductions or explanations from the 
student. The largest portion of the items require 
the student to consider relatively straightforwar 
problems or queries either intuitively or with 
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one- or two-step reasoning to a specific case from 
a definition, assumption, or theorem. Most of 
the items, furthermore, follow too closely the 
phraseology of the usual textbook. Some com- 
binations of general statements are required in 
solutions but nothing approaching the ability 
to prove a proposition is tested. Not more than 
five items are likely to require any computation 
and even that is exceedingly elementary. The 
above statements are not based upon student in- 
terviews but upon careful analyses by the re- 
viewers. It is recognized that a student’s attack 
upon any particular item is dependent upon his 
classroom activities and other experiences; a 
problem for one student may be a memory exer- 
cise for another. However, the mental processes 
described above seem the most likely for stu- 
dents in instructional situations within the ex- 
perience of the reviewers. 

In a number of instances the Shaycoft test 
utilizes propositions which are given as theorems 
or corollaries in most texts but which are omitted 
or treated as problems by many teachers. Thus 
the abilities tested by such items are likely to 
differ widely for students in different situations. 
Hence, it appears that a low percentage of cor- 
rect replies (difficulty, as given in the manual) 
to certain items in Part A may hinge upon this 
lack of uniformity in pupils’ experiences with 
related propositions. 

The first 10 items are of the unmodified true- 
false type and are not corrected for guessing. Of 
Items 11 through 30, 8 require the matching of 
figures with their names; 10, the completion of 
factual statements by selection from a short list 
of terms or phrases; and 2, the recognition of 
whether or not a statement or its converse is 
true. The last 30 items of the test are of the 
5-response multiple choice variety. 

The matching Items 11-30 have a number of 
serious technical inadequacies. The items in- 
corporated into a single matching exercise do 
not possess sufficient homogeneity for this form 
of testing. Out of nearly 200 pupil replies for 
Form Am the reviewers found that 40 per cent 
of the distractors in Items 11—30 were not chosen 
by any student. Such ineffectiveness of distrac- 
tors renders the probability of chance success 
great, Choices of the correct answers in several 
of these items can be made on the basis of very 
superficial. knowledge and understanding (e.g., 
Items 28 and 29). In some cases the test encour- 
ages uncritical considerations by defining a 
theorem as a "statement which is proved” (Item 


31), by requiring the student to assume lines 
not necessarily straight as straight (Item 57), 
and by crediting Pythagoras with the relation- 
ship among the squares of the sides of a right 
triangle (Item 33). 

Terms such as “hypothesis” (Item 32) and 
“analytic method” (Item 55) are perhaps terms 
which may be unfamiliar to many students who 
might be able to use the concepts effectively. 
There is some question as to whether the stu- 
dent of geometry should necessarily be familiar 
with these terms; the concepts are certainly im- 
portant. 

Since Form Bm was unavailable in quantity 
order at the time this review was written, there 
has been no opportunity to test the equivalence 
of Forms Am and Bm although study of Form 
Bm indicates that both cover approximately the 
same areas. The item difficulties reported in the 
manual show that the average percentage of 
correct responses to the items is 61 for Form 
Am and 60 for Form Bm. The reviewers’ check 
on these difficulty indices reveals that the items 
are as easy as, or easier than, the test manual 
indicates. This is especially of concern since 
the reported item difficulties would function to 
restrict variability in the upper end of the distri- 
bution of test scores and hence render unreliable 
discriminations among high ability students. 

The manual claims validation “according to 
rigid standards” and gives, as objective evidence, 
the average correlation between items and total 
test score, found by using the Flanagan table for 
approximating the item-test correlations based 
on the upper and lower 27 per cent of the “item- 
analysis population.” It is well known that such 
correlations give very little if any evidence of 
test or item validity. Only when the upper and 
lower groups are determined on the basis of 
some criterion known to be valid can high item- 
criterion correlations be considered evidence of 
item validity. Hence, as in the case of many 
published tests, the Shaycoft test must be con- 
sidered lacking in objective evidence of valid- 
ity. 

Few or no items—depending on one's point 
of view—are couched in terms of realistic or 
practical situations. Nearly every widely used 
text gives considerable attention to practical ap- 
plications and most teachers consider that the 
abilities required for such application are im- 
portant instructional outcomes. The reviewers 
feel that failure to relate geometric ideas to real 
problems is a definite weakness of this test. 
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[ 434] 
*Solid Geometry: Achievement Examinations 
for Secondary Schools. High school; 1951; 1 form; 
no data on reliability and validity; no manual; Minne- 
sota norms (median and quartile deviation) available; 
similar norms for other regions by special arrange- 
ment with publisher; 7¢ per test, postage extra; 60 
(65) minutes; edited by Walter W. Cook; prepared 
by a curriculum committee of high school teachers for 
use in the Minnesota State Board Achievement Exam- 
inations Program; Educational Test Bureau, Educa- 
tional Publishers, Inc. * 

[435] 
*Solid Geometry Test: State High School Tests 
for Indiana. High school; 1934-47; Form A ['46]; 
mimeographed; no data on reliability and validity; no 
manual; norms ['47]; 444¢ per test; 15€ per specimen 
set; postpaid; 40(45) minutes; Philip Peak, оша 
Brown, Margaret Goodson, and Pryce Noe; tate 
High School Testing Service for Indiana, Purdue Uni- 
versity. * 

[436] 
Survey Test in Plane Geometry. High school; 
1939; Forms A, B; manual [397] ; $1.25 per 25, post- 
age extra; 354 per specimen set, postpaid; 50(55) 


minutes; Harl R. Douglass, Richard M. Drake, and 


Virgil R. Walker; Educational Test Bureau, Educa- 
tional Publishers, Inc. * 

Harorp P. FAwckrT, Professor of Education, 
Ohio State University, Columbus, Ohio. 

This test attempts to evaluate both the ability 
to prove originals and the extent of factual 
knowledge of geometry. It has two forms, equiva- 
lent in all respects, with three types of questions. 
In Section A there are 12 questions, each of 
which asks the student to determine which of 
five given reasons furnishes a logical explana- 
tion for a given geometric statement. Only two 
of these questions employ a geometric figure as 
part of the given conditions. 

In Section B there are three short originals, 
each consisting of a figure and statements of 
what is given and what is to be proved. The 
Statements and reasons which constitute the 
proof must be chosen from two lists of I3 items, 
and the number of steps in each proof is speci- 
fied. The pamphlet of directions states that the 
examples in this section represent an attempt 
“Чо test the ability to prove an original with as 
few restrictions as possible upon the form for 
the answer." Although this type of question is a 
relief from the usual ones that ask for factual 
knowledge, it presents many obstacles to a truly 
original proof. There is sufficient evidence avail- 
able to indicate that rarely do any two indi- 
viduals think exactly alike on a given problem. 
The method of attack, the perception of what is 
important and relevant data, the recall of re- 
lated information, the rate of progress in solving, 


and the number of steps needed, all vary with 
the individual. To insist, as this test does, that 
the proof of an original be accomplished in a 
specified number of steps, no more and no less, 
ignores the fact that individuals differ and tends 
to hamper original, creative thought. 

Section C is primarily computational. The 14 
questions there utilize formulas and proportion- 
ality relations to obtain numerical answers. It 
is worth mentioning that the student is required 
to write in the answers to these questions: they 
are not to be chosen, perhaps by trial and error, 
from among several incorrect answers. This is 
a definite improvement over many tests of this 
sort. In this part of the test, however, where one 
might expect to see geometric applications of a 
broad nature, the authors have seen fit to remain 
wholly within the domain of geometry and have 
neglected the application of geometry not only in 
daily life but also in science and the trades. This 
is undoubtedly due to the fact that "the Minne- 
sota State Course of Study was used as a basis 
for determining the topics and the proportion 
of the tests to be devoted to each topic." 

The manual gives a coefficient of reliability of 
.92 as determined from a sampling of 200 high 
school geometry pupils; the method used in de- 
termining the coefficient is not reported. Simi- 
larly, the derivation of the table of norms that 
is provided is not given. 7 

In summary, it can be said that this is a good 
test of factual knowledge and computational 
skill in geometry. Although the authors have at- 
tempted to furnish a relatively unrestricted form 
in which students can show their ability in, and 
understanding of, proof, it is the opinion of the 
reviewer that the form used restricts original 
thinking to such an extent that it is doubtful 
that any real understanding of proof is meas- 
ured. 


[437] Үр 
*The Van Dyke Solid Geometry Test. НЫ 
school ; 1940; І form; no data on reliability and уа = 
ity; no manual; no description of normative, рор 
Чоп; $1.50 per 25, postage extra; 15¢ per specimen Гү 
postpaid; nontimed (90) minutes; L. A. Van Dyke; 
C. A. Gregory Co. * 


TRIGONOMETRY 


[438] Ese 
*Cooperative Plane Trigonometry Ten yc 
school and college; 1932-51; Forms 1932—39, * For ms 
titled Cooperative Trigonometry Test; IBNS ('so— 
O ('so—same as test copyrighted in 1938), 
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same as test copyrighted in 1944), Y ('48—some print- 
ings bear a 1950 copyright) ; Forms 1933-37, P out of 
print; no data on validity; no specific manual; general 
Cooperative manual ('51); high school norms ['37]; 
college norms ['42]; $2.25 per 25; 50€ per specimen 
set, postpaid; separate answer sheets may be used; 806 
per 25 IBM answer sheets; 15€ per stencil for scoring 
answer sheets; cash orders postpaid; 40(45) minutes; 
John А. Long (О, Y), L. P. Siceloff (О, Y), and Staff 
of the Cooperative Test Service (U); Cooperative 
Test Division, Educational Testing Service. * 


For a review by G. E. Hawkins of Form P, 
see 40:1474; for reviews by J. O. Hassler and 
S. S. Wilks of the 1937 edition, see 38:1074. 


[439] 
*Rasmussen Trigonometry Test. High school and 
college; 1940; Forms A, В; $1.05 per 25; 20€ per spec- 
imen set; postpaid; 40(45) minutes; edited by О. J. 
Peterson; Otho M. Rasmussen; Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria. i 


[440] 
xTrigonometry Test: State High School Tests 
for Indiana. High school; 1944-49; Form B ['40]; 
Form A out of print; mimeographed; no data on relia- 
bility and validity; no manual; norms ['47]; 4166 per 
test; 15¢ per specimen set; postpaid; 40(45) minutes ; 
M. W. Keller; State High School Testing Service for 
Indiana, Purdue University. * 


MISCELLANEOUS 


Reviews By Jean D. Amberson, James M. Anderson, Warren R. Baller, Reign Н. Bittner, Arthur 
H, Баи, Hester Chadderdon, Н. Harrison Clarke, Stuart A. Courtis, Robert L. Ebel, Albert 


Ellis, 


ertram Epstein, Leslie M. Haynes, Edith M. Huddleston, Clifford E. Jurgensen, Paul E. 


Kambly, M. Ray Karnes, Willard А. Kerr, Faith Madden, William J. Micheels, Victor H. Noll, 

Jacob S. Orleans, Worth J. Osburn, Willard W. Patty, James Н. Ricks, Jr., Harold F. Rothe, 

William L. Schaaf, Harold G. Seashore, Helen Shacter, Benjamin Shimberg, Hilda Taba, 
and Arthur E. Traxler. i 


AGRICULTURE 


[441] 

Farm Shop Tools: Reco; ition and Use: State 
High School Tests for Indiana. High school; 1942- 
43; Forms A ['43], B [43] ; mimeographed; no data 
on reliability and validity ; no manual; norms ['43] ; 5¢ 
per test; 13€ per specimen set; postpaid; 50(55) min- 
utes; Nelson M. Parkhurst; State igh School Test- 
ing Service for Indiana, Purdue University. * 


REFERENCES 


.1, PARKHURST, NxLso M. Construction, Evaluation, and Re- 
vision, of a Test on Farm Shop Tools. Master's thesis, Purdue 
University (Lafayette, Ind.), 1943. 


M. Ray Karnes, Associate Professor of Indus- 
trial Education, University of Illinois, Urbana, 
Illinois. 

This test purports, as the subtitle indicates, 
to measure the abilities of farm shop students 
to identify by name the tools appropriate for 
farm shop use and to recognize the particular 
function or functions for which each tool is de- 
signed. Each form of the test contains six parts 
as follows: Part A, comprising approximately 
two thirds of the total number of items in the 
test, consists of a series of matching exercises in 
which the student is required to match sketches 
of tools with their names. Part B includes a 
total of 12 items in two matching exercises and 
requires that the names of tools be matched with 


the jobs which are normally performed with 
these tools. Part C, consisting of 6 items, calls 
for the matching of tools with the kinds of 
tradesmen who use them. Part D is made up of 
6 items of one variation of the multiple choice 
type. In responding to each item the student 
designates the one tool which “does not belong” 
in the series of five with which it is listed. In 
Part E, the student selects the proper tool to be 
used to perform a particular job in responding 
to the six multiple choice items. Part F consists 
of 26 true-false items, the majority of which 
relate to tools and their uses. Form B is similarly 
constructed. 

Norms are based on 293 and 439 cases for 
Forms A and B respectively and are reported 
in T-score form. 

An analysis of the test and of the contents of 
several farm shop textbooks reveals that the 
test covers practically all of the hand tools and a 
majority of the machines treated in these texts. 
From the standpoint of the names of the tools 
and the major purpose for which each is de- 
signed, the test is comprehensive. However, one 
might very well raise the question as to where 
he could obtain a measure of the student's knowl- 
edge of the sizes in which various tools are avail- 
able, indices of quality, accessories and varia- 
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tions, proper care of tools, and the multiple uses 
commonly made of many of the tools. From this 
standpoint, the test is definitely restricted. 

Numerous inaccuracies appear in the body of 
the test and in the key. For example, ro of the 
intended correct responses in Form B involve 
the names of tools which are distinctly different 
from the names given these same tools by their 
manufacturers and distributors and by trades- 
men who use them. The analysis of farm shop 
texts reveals that these same inconsistencies 
appear in them. 

The sketches, especially of small hand tools, 
are not clear reproductions. Six items in Form 
B call for the student to discriminate among de- 
tails which are obscure in the sketches. 

The test items and the key are constructed in 
such a manner as to permit objective scoring. 
Directions and item construction are such that 
students should be able to interpret them ob- 
jectively. Stated norms indicate that the test is 
highly discriminating. 

The test can be severely criticized upon the 
basis of its mechanical features. Norms supplied 
are based upon 150 itenis in each form of the 
test, and 150 numbered items appear in both 
forms. The student actually makes only 134 re- 
sponses on Form A and 135 on Form B. Since 
the directions for administering the test indi- 
cate that the student will receive one point for 
each correct response, it is not clear how he can 
obtain a raw score greater than 134 and 135 and 
a T-score above 71 and 75 on Forms A and B 
respectively. 

The items as well as the alternate responses 
are designated numerically. This could be some- 
what confusing to students in responding to mul- 
tiple choice items and especially so in responding 
to the matching exercises. The use of letters to 
designate alternate responses should simplify the 
test from the standpoint both of the student and 
of the tester who analyzes results. 

While norms indicate that the test is suffi- 
ciently difficult to obtain a wide range of scores, 
several of the alternate response items, especially 
the matching, contain specific determiners and 
obvious answers. An example in the matching 
exercise is the inclusion of a sketch of one type 
of hammer, with only one type of hammer listed 
among the alternate responses. 


[442] 
*Graduate Record Examinations Advanced Agri- 
culture Test. Senior year college through graduate 
school; 1946-51; available ‘only in the Institutional 
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Testing Program of Graduate Record Examinations 
(see 527); 105(125) minutes; Educational Testing 
Service. * 


BUSINESS EDUCATION 


[443] 

*Bookkeeping: Achievement Examinations for 
Secondary Schools. High school; 1951; 1 form; no 
data on reliability and validity; no manual; Minnesota 
norms (median and quartile deviation) available; simu- 
lar norms for other regions by special arrangement 
with publisher; 7€ per test, postage extra; 60(65) 
minutes; edited by Walter W. Cook; prepared by a 
curriculum committee of high school teachers for use 
in the Minnesota State Board Achievement Examina- 
tions Program; Educational Test Bureau, Educational 
Publishers, Inc. * 


[444] 
*Bookkeeping I: Every Pupil Test. High school: 
1939-51; new form usually published each April; form 
April r951; no data on reliability and validity; no 
manual; norms ('51); 2%@ per test; 16 er answer 
key; postpaid; 40(45) minutes; Ohio Scholarship 
Tests, Ohio State Department of Education. * 


[445] 

*Bookkeeping Test: National Business En- 
trance Tests. Grades 12-16 and clerical applicants; 
1938-52; this series was entitled National Clerical 
Ability Tests from 1938-46 and United-NOMA Busi- 
ness Entrance Tests in 1947; for complete battery, see 
453; Form 1342 (749) ; no data on reliability; general 
series folder ('50) ; series manual ('49) ; norms (2); 
$5 per 25; $1 per test when scoring service and certifi- 
cates for acceptable work are desired (tests adminis- 
tered only at National Business Entrance Test Centers 
which may be established in any community) ; 50¢ per 
sample test; $2 per specimen set (includes all 6 tests 
in the series) ; postpaid; 120(130) minutes; prepared 
by the Joint Committee on Tests representing the 
United Business Educational Association and the Na- 
tional Office Management Association; Joint Commit- 
tee on Tests, * 


REFERENCES 
Same as for 453. 


For reviews by Harvey A. Andruss and Ray 
G. Price of the 1946 form, see 3:368; for a re- 
view by Paul S. Lomax of all 1946 tests in this 
series, see 3:396. 


[446] 

*Business Fundamentals and General Informa- 
tion Test: National Business Entrance Tes j^ 
Grades 12-16 and clerical applicants; 1938-52; Us 
series was entitled National Clerical Ability TEE 
from 1938-46 and United-NOMA Business Entrant 
Tests in 1047; for complete battery, see 453; 3 scores: 
business fundamentals, general information, bic : 
Form 1391 (48); no data on reliability; по norms 
for part scores; general series folder (’50); 9 
manual (49); norms ('s2); $5 per 25; $I рег je 
when scoring service and certificates for acceptab 

work are desired (tests administered only at Nanon Е 
Business Entrance Test Centers which may be Е 
lished in any community) ; available free when T $ 
tests in the series are ordered; 50€ per sample test 5); 
per specimen set (includes all 6 tests in the serm! 
postpaid; 75(85) minutes; prepared by the Joint 
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mittee on Tests representing the United Business Edu- 
cation Association and the National Office Manage- 
ment Association; Joint Committee on Tests. * 


REFERENCES 

Same as for 453. 

For reviews by Vera M. Amerson and C. C. 
Upshall of the 1946 form, see 3:369; for a re- 
view by Paul S. Lomax of all 1946 tests in this 
series, see 3:390. 

[447] 
XBusiness Relations and Occupations: Achieve- 
ment Examinations for Secondary Schools. High 
school; 1951; 1 form; no data on reliability and valid- 
ity; no manual ; Minnesota norms (median and quartile 
deviation) available; similar norms for other regions 
by special arrangement with publisher; 7€ per test, 
postage extra; 60(65) minutes; edited by Walter W. 
Cook; prepared by a curriculum committee of high 
school teachers for use in the Minnesota State Board 
Achievement Examinations Program; Educational 
Test Bureau, Educational Publishers, Inc. * 

[448] 
*Commercial Arithmetic Test: State High 
School Tests for Indiana. High school; 1944-51; 
Form А [’45]; mimeographed; no data on reliability 
and validity; no manual; norms ['51] ; 406 per test; 
156 per specimen set; postpaid; 40 (45) minutes ; Char- 
lotte Henderson and Philip Peak; State High School 
Testing Service for Indiana, Purdue University. * 

[449] 
*Cooperative Commercial Arithmetic Test. 1, 2 
semesters high school; 1944-51; 3 scores: numerical 
operations, problems, total; IBM; discontinued: Forms 
U, X; no data on reliability and validity ; no specific 
manual; general Cooperative manual (’51) ; norms 
[47]; 40(45) minutes; Cooperative Test Division, 
Educational Testing Service. * 
Bertram Epstein, Assistant Professor of Edu- 
cation, The City College, New York, New York. 

Form U (1944) contains a total of 80 items 
organized into two parts. Part I consists of 60 
computation exercises with a 15-minute work- 
ing time limit. These computations are of two 
types: (a) those involving the conversion, from 
one form to another, of common fractions, deci- 
mal fractions, and percentages, and (5) those 
involving the three cases of percentage. The 
items are arranged in alternate blocks of ro 
items of each type. Part II consists of 20 verbal 
problems involving common business situations 
in discount, interest, profits, taxes, etc., and cen- 
tering upon percentage calculations. A single 
sheet of percentile norms for each part and for 
the total score on the.test is provided. These 
norms are based on data for 693 pupils in nine 
' high schools at the end of one semester of study 
and for 376 pupils in nine high schools at the 
end of two semesters of study. 

Form X (1947) is of the same length and type 
as Form U. The only apparent difference, apart 
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from the specific numbers employed in the exer- 
cises, is the random arrangement of the two 
types of items in Part I instead of their alterna- 
tion in blocks of 1o. Again, a single sheet of per- 
centile norms for each part and for the total score 
is provided. These norms are based on data for 
154 pupils in two high schools at the end of one 
semester of study and for 92 pupils in three high 
schools at the end of two semesters of study. 

The single sheets of percentile norms com- 
prise the total of explanatory and supporting 
material provided. No information is given con- 
cerning the nature of the normative population 
beyond the numbers of pupils and schools in- 
volved and the number of semesters of commer- 
cial arithmetic studied ; there is no indication of 
the sex, age, background, type of school in- 
volved, etc. The normative data for Form X are 
peculiar. The median scores reported for the two 
parts of the test are identical for pupils complet- 
ing one and two semesters of study. Also, the 
median total score reported for pupils complet- 
ing one semester of study is actually one point 
higher than that reported for those completing 
two semesters. The norms also raise many ques- 
tions concerning the weighting of the parts and 
the meaning of the total score. For instance, 
Part II (the 20 verbal problems) constitutes 
only about one fifth of the total score yet is given 
five eighths of the total working time. 

There is no information given as to the sources 
and the validation of the test items. The items 
themselves seem to be in no way superior to 
the exercises already profusely found in any 
commercial arithmetic textbook. 

In each part of both forms pupils are instructed 
to find the correct answer “by working each 
problem." No further instructions to the pupil 
or to the examiner are given concerning the ma- 
terial to be used for this purpose. Neither the 
test booklet nor the answer sheet provides work- 
ing space. A testing situation in which high 
school pupils of the first and second years must 
manipulate an 8-page test pamphlet, a separate 
answer sheet, and several sheets of loose paper 
is indeed awkward. 

The format of the test is poor. The size of 
type used calls for greater spacing between lines 
of print and numbers. It would probably be even 
better to use a larger size of type with appro- 
priate spacing. 

It is surprising to find the Cooperative Test , 
Division imprint on a test with so many obvious 
faults. The most charitable comment that can be 
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made concerning this test is to call it “disap- 
pointing." 

[450] 
*Gilbert Business mE High AS DAE: 
paid; «o(45) sini: Mire D. Gilbert and Ole M. 
Rasmussen; Bureau of Educational Measurements, 
Kansas State Teachers College of Emporia. * 
WirLiAM L. Scnaar, Associate Professor of 
Education, Brooklyn College, Brooklyn, Меш 
York. 

On the whole, the test seems to fulfill its pur- 
pose rather well. Its chief merit is the full and 
well balanced coverage of topics : simple interest, 
cash discount, trade discount, profit and loss, 
commission, bank discount, averages, deprecia- 
tion, installment purchases, reconciliation of 
checking account, partnerships, stocks and 
bonds, property taxation, insurance, and men- 
suration are all represented. The verbal prob- 
lems are, in the main, realistic from a business 
point of view, and a few of them require thought- 
ful consideration. Among possible weaknesses, 
the following points are to be noted: (a) there 
might have been more items involving percent- 
age and ratio, two extremely important matters 
for business use; (b) there might have been 
fewer items on addition, subtraction and multi- 
plication with denominate numbers; and (c) 
it would seem that the time allowance (40 min- 
utes) is much too short for the 47 items. The 
first two are minor criticisms; the third is more 
pertinent, and is apparently justified by the low 
median scores reported. It seems unlikely that 
many pupils will finish the test; and since it is 
not intended to be a power test, its usefulness is 
somewhat impaired. It is also dubious whether 
the test will be very helpful in analyzing student 
and class weaknesses, as suggested by the au- 
thor. However, despite these shortcomings, the 
test is unquestionably a sound measure of 
achievement, usable at the close of any typical 
course in high school business arithmetic, and 
to be recommended for that purpose. 


[ 451] 

X*Hiett Simplified Shorthand Test (Gregg). 1-2 
semesters high school; 1951; IBM; Forms A, В; $175 
per 25; 356 per specimen set; separate answer sheets 
may be used; 85€ per 25 IBM answer sheets; 25€ per 
set of stencils for machine scoring of answer sheets; 
postpaid; (50) minutes; V. C. Hiett and H. E 
Schrammel; Bureau of Educational Measurements, 
Kansas State Teachers College of Emporia. * 


[452] 
*Machine Calculation Test: National Business 
. Entrance Tests. Grades 12-16 and clerical appli- 


cants; replaces Key-Driven Calculating Machine Abil- 
ity Test (see 40: 1487) ; 1941-52; this series was en- 
titled National Clerical Ability Tests from 1938-46 
and United-NOMA Business Entrance Tests in 1047; 
for complete battery, see 453; Form 1394 ('49); no 
data on reliability; general series folder (^50); series 
manual (740); norms ('52); $5 per 25; ўй рег test 
when scoring service and certificates for acceptable 
work are desired (tests administered only at National 
Business Entrance Test Centers which may be estab- 
lished in any community) ; 50 per sample test; $2 per 
specimen set (includes all 6 tests in the series); 120 
(130) minutes; prepared by the Joint Committee on 
"Tests representing the United Business Education As- 
sociation and the National Office Management Asso- 
ciation; Joint Committee on Tests, * 


REFERENCES 
Same as for 453. 


For a review by Elisabeth Fehrer of the 1946 
form, see 3:384; for a review by Paul S. Lomax 
of all 1946 tests in this series, see 3:396. 


[453] 

*National Business Entrance Tests. Grades 12-16 
and clerical applicants; 1938-52; this series was en- 
titled National Clerical Ability Tests from 1938-46 and. 
United-NOMA Business Entrance Tests in 1947; 6 
tests (also listed separately) ; no data on reliability; 
general series folder ('50); series manual ('49); 
norms ('52); $5 per 25 of any one test; $1 per test 
when scoring service and certificates for acceptable 
work are desired (tests administered only at National 
Business Entrance Test Centers which may be estab- 
lished in any community) ; the Business Fundamentals 
and Genaral, Information Test is available free when 
other tests in the series are ordered; soé per sample 
test; $2 per specimen set (includes all 6 tests in the 
series) ; postpaid; prepared by the Joint Committee on 
Tests representing the United Business Education As- 
sociation and the National Office Management Asso- 
ciation; Joint Committee on Tests. * 

а) BOOKKEEPING TEsT. Form 1392 ('49); 120(130) 
minutes, For reviews by Harvey A. Andruss and Kay 

G. Price, of the 1946 form, see 3:368. 

b) BUSINESS FUNDAMENTALS AND GENERAL INFORMA- 
TION TEST. 3 scores: business fundamentals, general in- 
formation, total; Form 1391 ('49) ; no norms for part 
scores; 75(85) minutes. For reviews by Vera 

MINOR and C. C. Upshall of the 1946 form, see 

3:309. 

C) GENERAL OFFICE CLERICAL TEST (INCLUDING FILING). 
Replaces Filing Test (see 3:379) ; Form 1393 (49); 
120(130) minutes, ' EM 
d) MACHINE CALCULATION TEST. Form 1304 ('49); 
120(130) minutes. For a review by Elisabeth Fehrer 
i the 1046 form, see 3:384. А 
©) STENOGRAPHIC TEST. Long Form 1395 ('49): 115 
(125) minutes; Short Form (зо): 60(70) minutes. 
For reviews by Ann Brewington and Elizabeth Fehrer 
of the 1946 Long Form, see 3:301. x ; 

) TYPEWRITING TEST, Formerly called Typing Гер 
Long Form 1396 ('49): 120(130) minutes; Shor 
Form ('50): 60(70) minutes. For reviews by E. б. 
Blackstone and Beatrice J. Dvorak of the 1946 Long 
Form, see 3:394. : 


REFERENCES 

1—9. See 40:1476. А; 

10. Nztsox, Јони Н, А Study of, Relationships Бейит, 
Achievement of Stenographers and Typists on the Nahon 
Business Entrance Tests and Their Performance in Begiy d 
Positions. Doctor's thesis, New York University (New po 
N.Y.), 1951. Abstract; Microfilm Abstracts 11:938-9 no 4 
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For а review by Paul S. Lomax о} the 1946 
series, see 3:396. 


[454] 
SRA Dictation Skills. Grades 9-12 and adults; 
1947 ; 2 scores: speed, accuracy; 2 parts; the Accuracy 
Album includes two 12-inch records (78 rpm) pre- 
senting I practice and 5 test letters; the Speed Album 
includes three r2-inch records (78 rpm) presenting 1 
practice and 8 test letters; preliminary manual; 49€ 
per Speed booklet and answer pad; 49€ per Accuracy 
booklet and answer pad; separate answer pads must be 
used ; $1.90 per 25 answer pads for any one part; $18.75 
per set of five records; $1 per specimen set; cash or- 
ders postpaid; (40) minutes; Marion W. Richardson 
and Ruth A. Pedersen; Science Research Associates, 
Inc. * 
REFERENCES 

т. Pepersen, RutH A. "The Development of Two Machine- 
Administered Scales of Stenographic Proficiency." Abstract. 4m 
Psychol 2:350-1 Ag '47. * (PA 21:4665, title only) 


Hanorp F. Котне, Personnel Director, Ameri- 
can Hospital Supply Corporation, Evanston, 
Illinois. 

This test may be compared with the Seashore- 
Bennett Stenographic Proficiency Tests (see 
455) since they are both designed for largely the 
same purpose. Both tests are standardized, re- 


corded worksamples. ПЕТ 
In the SRA test, the examinee takes dictation 
by any stenographic system and then inserts the 


answers to questions on the answer pad which . 


is inserted into the test booklet. The test con- 
sists of letters, printed in the test booklets, and 
in each letter some words are missing. The prob- 
lem is to supply the missing words on the answer 
sheets from the notes that were taken as the 
records were played. The missing words are es- 
sential to meaning of the letters and hence the 
test questions cannot be answered by guesswork 
alone. 

Neither the SRA nor the S-B test has a time 
limit. The SRA test is scored objectively, in con- 
trast to the semisubjective scoring of the S-B 
test. The number of errors on each letter is 
counted and this number is interpreted into 
either work standards or norms by using several 
tables that are in the manual. This is true for 
both the speed and also the accuracy (or diffi- 
culty) sections of the SRA test. 

The SRA manual describes in detail some of 
the research that was conducted in making this 
test. Word counts of the difficulty of typical busi- 
ness letters were made. Items in the trial form 
were analyzed, based on the responses of 205 
high school stenography students, to select items 
for the final form. Performances on the two sec- 
tions, speed and accuracy, were found to be only 
slightly related to each other (correlation of .27 


TESTS & REVIEWS: MISCELLANEOUS—BUSINESS EDUCATION 455 


for the 205 students). The reliability of the 
speed scale has been estimated at .80 but the 
manual does not specify upon what grounds this 
estimate is based. 

There are several important aspects in which 
this test is weak. The most glaring technical 
omission is the complete lack of validation data. 
The test is almost bound to be valid, since it is a 
worksample, but proof is still necessary. Norms 
based on a sample of employed stenographers, 
not just on the 205 students, would be helpful. 
More information on reliability is desirable, апа · 
particularly reliability based upon a sample of 
employed workers. 

From a practical point of view, this test suf- 
fers from being “unreal.” It tests only a part of 
a stenographer’s job. Stenographers generally 
expect to transcribe their notes into letters and 
not into answers on an answer sheet. Perhaps it 
is more valuable to test dictation ability and typ- 
ing ability separately, and not together as in 
the Seashore-Bennett tests, but the value of sepa- 
rate testing is not demonstrated here. 

The SRA test is shorter than the S-B tests, 
but SRA gives less information—that is, no in- 
formation about typing. The SRA test is scored 
more objectively but involves some rather com- 
plicated tables. In general, although the pre- 
liminary research and background work is im- 
pressive for the SRA test, the end result in its 
present form is not very imposing. 


[455] 
Seashore-Bennett Stenographic Proficiency 
Tests: A Standard Recorded ic ined 
Worksample. Adults; 1946; Forms B-1, B-2; no 
data on reliability and validity in manual (for data 
presented elsewhere by the authors, see т); 3 styles of 
vinylite recordings: standard style 78 rpm (four 12- 
inch records), transcription style 33% rpm (one 16- 
inch record), “LP” microgroove style 33% rpm (one 
12-inch record); each record set includes manual, 
script, and 100 summary charts; distribution is re- 
stricted to business firms; $19.50 рег standard record 
set; $17 per transcription record set; $13.50 per micro- 
groove record set; $2 per 100 summary charts; 35¢ per 
manual; postpaid; (50-60) minutes; Harold Seashore 
and George K. Bennett; Psychological Corporation. * 


REFERENCES К 
1. Ѕелѕнове, Haron G., AND Bennett, GEORGE К. “А Test 
of Stenography : Some Preliminary Results." Personnel Psychol 
1:197-209 su 48, * (PA 22:4709 


+ 


Harotp F. Котне, Personnel Director, Ameri- 
can Hospital Supply Corporation, Evanston, 
Illinois. 

This standardized worksample was designed 
for the use of personnel departments in select- 
ing, training, and upgrading employees. It is 
used most advantageously with other types of 
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tests and is not intended to predict all aspects of 
а stenographer's performance. - І 

The test consists of буе letters dictated at three 
levels of speed, plus a general introduction. The 
testee must take dictation and then transcribe 
the notes into “mailable” business letters. There 
is no fixed time limit, and the test may be stopped 
at any time if the testee is obviously not capable 
of continuing. 

The ultimate scoring criterion is “mailability,” 
and there is no single, objective scoring method. 
' The manual suggests a way of scoring.six fac- 
tors: (a) neatness and cleanness of typing, (b) 
arrangement of the letter, (c) quality of the 
stroke, (d) typing errors, erasures, etc., (е) 
errors of English, and (f) changes in wording 
and meaning. The script that accompanies each 
test set contains a printed copy of the directions 
that are dictated on the record, a standard copy 
of each letter, and several samples of “‘good” 
and “poor” transcriptions. У 

Although the scoring is semisubjective, а 
study by the test authors (r) shows that this 
test can be scored fairly consistently by persons 
with a little training. Two clerks, scoring the 
papers of 52 stenographers, had reader relia- 


bility coefficients ranging from .74 to .97 on the . 


individual letters. In the same study, scores on 
letters 1, 3, and 5 were compared with scores on 
letters 2 and 4, yielding corrected reliabilities of 
„Зо, .83, and .91 for groups numbering 39, 52, 
and gi respectively. A validity coefficient of .49 
was obtained with a combination of supervisors' 
ratings ona group of 52 employed stenographers. 
In addition, the test scores differentiated fairly 
well between this employed group and a group 
of 39 trainee stenographers. This is not exactly 
an overwhelming amount of validity data, but 
it does indicate that the test may be used with a 
reasonable expectation of valid results, at least 
pending further research. 

These tests are valuable in that they are easy 
to give, there are some research data to support 
them, and they appear to be testing something 
real and practical. In this latter connection they 
should be compared with the SRA Dictation 
Skills, which appears “unreal” to many testees. 

These tests suffer somewhat from a shortage 
of-norms which is in turn related to the semi- 
subjective method of scoring. It would be de- 
sirable to have more research results, especially 
on validity. 

In the reviewer’s opinion, this is a useful test 
and its strong points fairly well offset the weak 
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ones. The scoring method is not bad in actual 
industrial practice where an exact placement of 
each person’s abilities at one specific point in 
a continuum is not necessary in day-to-day work, 
Probably the most serious weakness is a practical 
one—the tests take too long. Industrial people 
would rather spend 10 minutes than an hour in 
testing. In some respects the use of records is a 
little inconvenient. It would probably be desir- 
able to have ediphone or dictaphone recordings, 


For a review by Ann Brewtngton, see 3:386, 


: [456] 

*Shorthand I and II: Every Pupil Test. 1, 2-3 
years high school; 1938-51; new form usually pub- 
lished each April; 2 levels; form April 1951; no data 
on reliability and validity; no manual; norms (71); 
210 per test; 16 per answer key; postpaid; 25(30) 
minutes; Ohio Scholarship Tests, Ohio State Depart- 
ment of Education. * 

R SHORTHAND I. I year high school. 

b) SHORTHAND п. 2-3 years high school; out of print. 


[457] 

*Simplified Shorthand Test: State High School 
Tests for Indiana. 1, 2, 3, 4 semesters high school; 
1950-51; 4 levels; Form B ['51]; lithotyped; no data 
on reliability and validity; no manual; norms ['51]; 
€ per test; 15¢ per specimen set; postpaid; 40(45) 
minutes; M. E. Studebaker, B. M. Swinford, Vernal 
H. Carmichael, Frances R. Botsford, and Russell Burk- 
hart; State High School Testing Service for Indiana, 
Purdue University. * 


[458] 

qutesographie Test: National Business Entrance 
Tests. Grades 12-16 and clerical applicants; 1938-52; 
this series was entitled National Clerical Ability Tests 
from 1938-46 and United-NOMA Business Entrance 
Tests in 1947; for complete battery, see 453; long, 
short editions; no data on reliability; general series 
folder ('50); series manual ('49); norms (^52); $5 
per 25; $1 per test when scoring service and certificates 
for acceptable work are desired (tests administered 
only at National Business Entrance Test Centers 
which may be established in any community) ; 50€ per 
sample test; $2 per specimen set (includes all 6 tests in 
the series); postpaid; Long Form 1395 (749): 115 
(125) minutes; Short Form ('50): 60(70) minutes; 
prepared by the Joint Committee on Tests representing 
the United Business Education Association and the 
National Office Management Association; Joint Com- 
mittee on Tests. * 


REFERENCES 
Same as for 453. 


For reviews by Ann Brewington and Eliza- 
beth Fehrer of the 1946 form, see 3:391 5 for а 
review by Paul S. Lomax of all 1946 tests in this 
series, see 3:396. 

E [459] 
XTest for Stenographic Skill. Applicants for steno- 
graphic positions; 1950; individual; 1 form; no ata 
on reliability and validity; no norms; $5 per test me 
ual ($5.50 if ordered on то day trial basis), postpaid; 
no other materials needed ; nontimed (10-20 minutes ; 
Edward N. Hay; Aptitude Test Service. * 
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Ким H. BrrrNER, Assistant Director of Per- 
sonnel Research, The Prudential Insurance 
Company of America, Newark, New Jersey. 

This test is designed to measure the ability 
of a stenographer to take dictation and imme- 
diately read back aloud from her shorthand notes 
the material dictated. Unlike most stenographic 
tests, the stenographer does not transcribe her 
notes on a typewriter. It is the author’s thesis 
that the ability to take dictation is one thing, the 
ability to type is another, and that the two should 
not be confused in a single test but should be 
measured separately, There is some question, 
however, whether these two skills combine ad- 
ditively to give a measure of a stenographer’s 
skill. 

The test, which must be administered indi- 
vidually to each applicant, consists of a warm-up 
letter and 12 letters, representing three levels of 
difficulty, which may be read to the applicant 
at 40, 60, 80, 90, 100, or 120 words per minute. 
Difficulty was established on the bases of the 
number of syllables per hundred words and the 
sentence length. There is some evidence that 
these are measures of reading ease; their rela- 
tion to ease of shorthand note taking is not pre- 
sented, but it is reasonable to expect them to 
be positively correlated. The reading rate is con- 
trolled by tables giving the time in which the 
reader should reach the end of each line, using 
a sweep second watch to check his reading pace. 
Practice is recommended to assure the retention 
of the appropriate reading speeds. One may 
start dictating at any of the difficulty levels and 
at any speed between 40 and 120 words per 
minute. The warm-up test is designed to indicate 
the appropriate beginning speed for each candi- 


date. Immediately after each letter is dictated it^ 


is read back while the examiner notes the errors 
made. No standard order of presenting the vari- 
ous letters or the number of letters to be dictated 
is specified. 

The objective of the test is not clearly stated 
in the manual, but presumably it is to determine 
the top levels of difficulty and speed at which the 
applicant can perform satisfactorily. Scores on 
each letter are to be given in terms of “Excel- 
lent,” “Good,” “Fair,” “Failure.” The adjective 
ratings depend on the number of errors and the 
ease with which the letter is read back. Since 
errors are not defined and rating the ease of read- 
ing back is highly subjective, the reliability of 
scoring is open to question. Norms in the usual 
sense are not provided. The only aid provided 


for interpreting the test results is the adjective 
rating scale, but no information is given about 
its derivation. 

No evidence of the reliability or validity of 
the test is given. Because of the many opportuni- 
ties for variability in administration and scor- 
ing, the test’s reliability should be determined. 
Evidence of the validity of the test—its relation 
to on-the-job performance—is especially needed. 


Сілғғокр E. Jurcensen, Personnel Director, 
Minneapolis Gas Company, Minneapolis, Min- 
nesota. 

The title of this test is not accurately descrip- 
tive; the test is one of shorthand skill, not of 
stenographic skill. An applicant can pass the 


test with flying colors without knowing how to 


insert a piece of paper into a typewriter. This 
objection is limited to the title of the test and is 
no criticism of the nature of the test. The re- 
viewer agrees with the test author that typing 
ability should be measured independently of : 
shorthand ability. 

This test is a spiral bound 20-page manual 
consisting of 13 test items (mostly letters) to- 
gether with information on the rationale of their 
development and instructions regarding their 
use. The test was developed for individual test- 
ing in situations not requiring complicated and 
time-consuming measures of dictation speed. 
The author has used various techniques to con- 
trol letter difficulty and dictation speedsand to 
keep the test simple and short. Dictated letters 
are read back rather than typed. This reduces 
testing time to a fraction of that otherwise re- 
quired. A warm-up letter of 150 words is dic- 
tated at a progressively increasing rate of speed. 
Proficiency on this letter indicates the speed at 
which the first test letter should be dictated. 
Thus no time is wasted in starting far above or 
below the examinee’s level of ability. 

The test proper consists of 12 items, most of 
which are letters. Each item consists of 10 lines 
of 10 words each. Not all items are intended to 
be used with each examinee. The test is stopped 
whenever the examinee is unable to re-read dic- 
tated material with satisfactory fluency and ac- 
curacy. The test time is largely a function of 
the examiner’s judgment (based on the warm-up 
letter) regarding the speed at which to start 
dictation. A vari-speed guide enables even a 
relatively inexperienced examiner to pace his 
speed quite accurately from 40 to 120 words 
per minute. 
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Test performance is scored by rating the ease 
and accuracy with which the examinee reads 
back her shorthand notes. The author recom- 
mends a four-point scale of excellent, good, fair, 
or failure ; each of these terms is briefly defined. 
The rating score is one based essentially on a 
practical judgment, i.e., does the examinee read 
back her notes in such way that a satisfactorily 
accurate letter could be typed from the notes? 
No provision is made for an overall score; each 
dictated item must be scored separately. No data 
are presented on agreement in scoring by differ- 
ent examiners. 

А careful reading of the manual together with 
an hour of practice should enable any compe- 
tent personnel clerk to administer and score the 
test adequately. The same is true for line super- 
visors. 

A major advantage of this test is the control 
of difficulty of dictated material by the number 
of syllables per hundred words. The first six let- 
ters have from 141 to 143 syllables each, the next 
three have 162 or 163 syllables, and the last three 
have 182 to 184 syllables. Based on Flesch’s 
readability formulas these three levels are classed 
as fairly easy, fairly difficult, and difficult. 

The author apparently believes that reliability 
and validity figures are irrelevant in a test of this 
type, for he presents no such data. Presumably 
such information is, or could be made, available 
inasmuch as development of the test started 
about 1935. The test appears to have been con- 
structed with care, and one may hazard the guess 
that the test would be valuable in employing 
stenographers. However, statistical data are 
lacking. 

In spite of the lack of validity and reliability 
data, the reviewer recommends the experimental 
tryout of this test in the employment of stenog- 
raphers in situations where simplicity and 
brevity are important in measuring shorthand 
skill. 


[460] 
Turse Shorthand Aptitude Test. Grades 8-10; 
1937-40; 8 scores: stroking, spelling, phonetic asso- 
ciation, symbol transcription, word discrimination, dic- 
tation, word sense, total; 1 form, '40; no norms for 
part scores; manual ('40); $2 per 25, postage extra; 
ў“ per specimen set, postpaid; (45-50) minutes; Paul 

Turse; World Book Co. * 
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. Barrett, Domoruv M. “Prediction of Achievement in 
T iti: nd Stenography in a Liberal A; » 
Tob]. Psychol D Ua * (PA 21:162. rts College, 


1 Apel Psychol 30:624-30 . 4) 
. Hosler Russert J. “. уана Testing in Shorthand,” 
J Bas Ed 22:25 My '47. * (PA 22:834) 

5. Selection and Training of Shorthand Students in Ontario 
Secondary Schools. A study conducted by The Shorthand Survey 
Committee of The Ontario Commercial Teachers’ Association 
and The Department of Educational Research, Ontario College 
of Education, University of Toronto. Toronto, Canada: Sir Isaac 
Pitman & Sons (Canada) Ltd., 1949. Pp. vii, 68. Paper, * 


Lestiz M. Haynes, Head, Department of Ap- 
plied Psychology, Sydney Technical College, 
Sydney, New South Wales, Australia. 
According to the author, the abilities neces- 
sary for success in shorthand fall into two 
groups, those that enable the student to write 
shorthand from dictation and those which en- 
able the student effectively to use the recorded 
shorthand matter. The author's further analysis 
of these abilities is fundamental to the whole 
test, for upon this the subtests are based. We are 
asked to accept the fact that “а good shorthand 
prognosis test should include measures of each 
of the following: manual dexterity; ability to 
write, carry matter in the mind, and listen for 
new matter simultaneously; ability to learn and 
combine abstract symbols; ability to associate 
the correct literal spelling of a word with its 
phonetic form; ability to discriminate between 
words having similar or identical shorthand out- 
lines; spelling ability; and ability to construct 
entire words from the incomplete shorthand out- 
lines." No attempt is made to justify these as- 
sumptions ; some justification would seem to be 
necessary in view of consistent research findings 
that symbol writing and spelling are the princi- 
pal abilities required for success in shorthand. 
The actual subtests are stroking (manual dex- 
terity), spelling, phonetic association, symbol 
transcription, word discrimination, dictation, 


“and word sense. 


The test has been thoroughly validated against 
various shorthand achievement tests and, if these 
are accepted as adequate criteria, the figures 
quoted in the manual justify the inclusion of 
most of the subtests. The author's stated aim 18 
to eliminate those unfit to study shorthand. The 
validation data seem to indicate that the test can 
achieve this. The stated aim is, however, not à 
satisfactory justification for failure to extend the 
validation of the test to the employment situa- 
tion itself. s 

The manual is a well prepared document 
aimed at bringing about the scientific use of this 
measuring instrument. The directions for ad- 
ministering are particularly well.set out, while 
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sound advice is offered in the section dealing 
with the interpretation and use of the test re- 
sults. 

In all, this test is well worth using in the 
selection of students for commercial courses in- 
volving the teaching of shorthand. For this pur- 
pose it is probably best used, as the author him- 
self indicates, with other guidance data, such as 
intelligence test results and teachers’ marks. It 
would also seem to have some value as a diag- 
nostic instrument to be used with students under- 
going training. : 


For a review by Philip Н. DuBois, see 3:393: 


[461] 
*Typewriting I: Every Pupil Test. High school; 
1938-51 ; new form usually ublished each April; form 
April 1951 ; no data on reliability and validity ; no man- 
ual; norms ('51); 2%¢ per test; 1¢ per answer key; 
postpaid; 45(50) minutes; Ohio Scholarship Tests, 
Ohio State Department of Education. * 


[462] 

*Typewriting Test: National Business Entrance 
Tests. Grades 12-16 and clerical applicants; 1938-52; 
formerly called Typing Test; this series was entitled 
National Clerical Ability Tests from 1938-46 and 
United-NOMA Business Entrance Tests in 1947; for 
complete battery, see 453; long, short editions; no data 
on reliability; general series folder (750) ; series man- 
ual (40); norms (^52); $5 per 25; $1 per test when 
scoring service and certificates for acceptable work are 
desired (tests administered only in National Business 
Entrance Test Centers which may be established in 
any community) ; 50¢ per sample test; $2 per specimen 
set (includes all 6 tests in the series) ; postpaid; Long 
Form 1396 (40): 120(130) minutes; Short Form 
('50) : 60 (70) minutes ; prepared by the Joint Commit- 
tee on Tests representing the United Business Educa- 
tion Association and the National Office Management 
Association; Joint Committee on Tests. * 


= REFERENCES 
Same as for 453. 


For reviews by E. G. Blackstone and Beatrice 


J. Dvorak, see 3:394; for a review by Paul 
S. Lomax of all 1946 tests in this series, see 


3:306. 


mimeo; 

manual ; 
set; postpaid; Frances R. Botsford 
hart; State High School Testing 


Purdue University. * К 

a) First semester, Form B ['451; Form А out of 

print; 30(35) minutes. 
‘SECOND OR FOURTH E 

second semester: 30(35) minutes; 


40 minutes. А 
с) ded Semester, Form B [45]; Form A out of 


print; 40(45) minutes. 


semesters. Form N [45] ; 
fourth semester: 
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COMPUTATIONAL AND 
SCORING DEVICES 


[464] 
*Age and І.О. Calculator. 1940-51; for calculating 
IQ's from MA’s between 3 and 32 years and CA's 5 
years and over; $r.50 per set of 12 [revised] charts 
[51], postage extra; California Test Bureau. * 


[465] 
X*The Delp 1.0. Computer. 1950; for computing 
IQ's between 30 and 180 from MA’s between 2.5 and 
16; $2 per computer, postpaid; H. A. Delp; Houghton 
Mifflin Co. * 


J Consult Psychol 14:333 Ag '50. Laurance 
F. Shaffer. A circular slide rule of plastic ma- 
terial, 6 inches in diameter, to aid the calcula- 
tion of CAs and IQs. One side is used to calcu- 
late months of CA since the last birthday, from 
the birth date and the testing date. The other 
side of the computer divides MA by CA to ob- 
tain IQ, with a special scale for CAs between 13 
and 16 for use with the Revised Stanford-Binet. 


[466] 
*Hankes’ Answer Sheets. Special answer sheets 
and scoring services available on three tests; scoring 
machine developed by E. J. Hankes and owned by 
Engineers Northwest; postage extra (air mail always 
used) ; Engineers Northwest. * 
а) VOCATIONAL INTEREST BLANK, Hankes’ Answer 
Sheets: 1-49 copies, 3¢ each; 50-99, 274€ each; 100- 
499, 2166 each; 5 , 246 each; 1000 and over, 2¢ 
each; 706 for scoring one Hankes Answer Sheet 
(Form M scored for 47 scales, Form W scored for 26 
scales) ; plus 10¢ for each order of 5 tests or less; 900 
for scoring one IBM answer sheet; 90€ for scoring 
directly from test booklet; psychologist subkeys (clini- 
al C. 9), experimental (’ ). idance ('49), indus- 
trial (49, V.A. clinical Cay, available only as a 
part of Hankes’ Scoring Service: 25¢ extra when done 
with regular scoring of Vocational Interest Blank, 50¢ 
when done alone with Hankes’ Answer Sheet, 7 
when done alone with IBM answer sheet or directly 
from test booklet; individual report forms furnished 
as part of test scoring. 
b) THE PERSONALITY INVENTORY. Hankes' Answer 
Sheets: $1 per 100; 20¢ for scoring one Hankes’ An- 
swer Sheet; 25¢ for scoring one IBM answer sheet; 
25¢ for scoring directly from test booklet; only raw 
scores reported. 
с) MINNESOTA MULTIPHASIC PERSONALITY INVENTORY. 
Hankes’ Answer Sheets: $1.85 per 50, $15 per 500; 
30€ for scoring one Hankes' Answer Sheet; 35¢ for 
scoring one IBM answer sheet; 13 scores; individual 
report forms furnished as part of test scoring. 
REFERENCES 

т. Ѕткомо, Е. K., Jn, AND Hankes, E. J. "A Note on the 
Hankes Test Scoring Machine.” J Appl Psychol 31:212-4 Ар 
"47. * (PA 21:2537) 


2. Kmrepr, Рипар Н. Differential Interest Patterns of Psy- 
Fer cried Doctor's thesis, University of Minnesota (Minneapo- 
18, їпп.), 1 


d ў I Psychol 3:482-8 O '49. * (PA 24:2965 
А он) WiLsUR L. *An IBM Card Profile for the Strong 
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MEE X Blank.” J Appl Psychol 34:415-6 D 'so. * 
( 5. Erbe C. Haro, AND Квтерт, PuiriP Н. “Modified Di- 
rections for Strong Vocational Interest Blank When Used ,With 
the Hankes Answer Sheet.” J Appl Psychol 35:169-71 Je ’51. 
(PA 26:986) 

[467] 


*The Multiscore Profile Form and Scoring 
Codes. 1951; a generalized profile answer sheet and 
scoring aids for use with Personality Inventory, Guil- 
ford-Martin Inventory of Factors GAMIN, Abridged 
Edition, Guilford-Martin Personnel Inventory, An In- 
ventory of Factors STDCR, Kuder Preference Record 
—Vocational, Kuder Preference Record—Personal, 
and Minnesota Multiphasic Personality Inventory; $4 
er 10 scoring codes for any one test; $2.50 per 25 
Multiscore Profile Forms; 50€ per specimen code for 
any one test; MEN. Morse P. Manson; Western 
Psychological Services. * 


[468] 
*SRA Self-Scorer. Grades 3-16; 1949; a teaching- 
testing device designed for use with teacher-constructed 
objective tests; four types of answer sheets (true-false, 
4-choice, true-false and 4-choice, and 5-choice) ; two 
patterns of correct responses may be used with a given 
type of answer sheet; 1—14 scorers, $1.50 each; 15-49, 
$1.25 each; 50-99, $1.10 each; 100 or more, $1 each; 
$1 per 25 answer sheets of any one type; $2 per speci- 
men set; cash orders postpaid; Maurice E. Troyer and 
гое W. Angell; Science Research Associates, 
nc. 
REFERENCES 


1. ANGELL, GEORGE W., AND TRO AURICE E. ew 
A G W. Troyer, M. E. “A N 
Self-Scoring Test Device for Improving Instruction.” Sch & 
Soc yeu Ja 31 '48. * (PA 22:2415) 

2. ANGELL, GEORGE W., “The Effect of Immediate Knowledge 
of Quiz Results on Final Examination Scores in Freshman 


Chemistry." J Ed Res $3913 Ја '49. * (PA 23:3913) 
3. Саѕѕтру, Viora Marte. The Effectiveness of Sei Teaching 
Devices in Facilitating Learning. Doctor's thesis, Ohio State 


University (Columbus, Ohio), 1949. (Abstracts of Dissertations 
QNO Quarter 1948-49, 1950, pp. 73-9.) (PA 24:6500, 
only. 


4. Jones, Howard L., anp Sawyer, Micuart О, “A New 
Бананц Instrument.” J Ed Res 42:381-5 Ја '49. * (PA 23: 
James M. Anverson, Assistant Professor of 
Psychology, Occidental College, Los Angeles, 
California, 

This device permits immediate knowledge of 
the correctness or incorrectness of a choice for 
so-called objective examinations and, as will be 
immediately appreciated, is a valuable aid to 
learning. 

The answer sheet is folded over a cardboard 
insert with holes punched in it corresponding to 
all the possible choices on the answer sheet. 
This is inserted into an open-faced frame. The 
student indicates his choice by punching through 
the answer sheet with his pencil (or a stylus). 
If a red dot appears his answer is correct. The 
student continues to punch until the correct an- 
swer is found. 

The SRA Self-Scorer is particularly useful 
when the teacher wants the examination to be 
a learning experience but does not want his file 
of examination items to be circulated. It has met 
with enthusiastic acceptance in several of the 


reviewer’s classes. There are, however, certain 
disadvantages. Outstanding among them are: 
(a) the initial cost of the material is high; (b) 
the built-in keys make retyping of existing ex- 
aminations necessary, (с) occasionally answer 
sheets do not register properly (this is a minor 
defect easily remedied by replacing the answer 
sheet), (d) the plastic face of the frame has a 
tendency to warp, thus making insertion of the 
answer sheets difficult at times, but this too is a 
minor inconvenience, and (е) storage and trans- 
portation present problems, especially if the 
examination room is any distance from the office 
or place of storage. 

The device may be sealed if it is necessary to 
administer tests under conditions of inadequate 
proctoring. 

All in all, this is an ingenious device which 
will help obviate many of the objections to the 
objective type examination. It is well worth the 
expenditure of money and occasional effort to 
facilitate the students’ learning and to bring 
about a more favorable "set" concerning “ех- 
ams" for most students. 


ARTHUR E. ТвАхгЕВ, Executive Director, Edu- 
cational Records Bureau, New York, New 
York. 

Ever sinc the first objective tests were de- 
vised, ways have been sought to reduce the labor 
of scoring and to increase the educative value 
of the test results. Strip keys, fan keys, punched 
stencils, scoring boards, and other familiar de- 
vices have for many years contributed to speed 
and accuracy of scoring. At intervals, experi- 
ments in improved scoring have been under- 
taken with more novel devices such as a pro- 
cedure for determining the number of correct 
answers by weight, a cash register device, the 
use of materials sensitive to different kinds of 
light rays, and the photoelectric cell. Thus far, 
the most successful invention for rapid, Jarge- 
scale scoring of objective tests is the Interna- 
tional Test Scoring Machine, produced by the 
International Business Machines Corporation. 

Most procedures for improved scoring of ob- 
jective tests have been aimed primarily at stand- 
ardized tests. Concurrently, there is a need to 
assist classroom teachers and others in develop- 
ing rapid, effective procedures for scoring in- 
formal tests of their own construction. In line 
with this need the SRA Self-Scorer was recently 
made available. 

The SRA Self-Scorer is designed to spread 
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the labor of scoring among all the individuals 
who take a test, and, at the same time, to serve 
as a teaching device. It consists of a combination 
answer sheet and key, a perforated mat, and a 
plastic frame. There are four types of answer 
sheets and two forms of each type. One type pro- 
vides for 300 true-false items; a second type al- 
lows space for 150 four-choice items; a third 
type includes 90 four-choice items and 120 true- 
false items ; the fourth type makes provision for 
150 five-choice items. Each answer sheet is de- 
signed in the form of a folder. Red dots printed 
inside the folder are arranged so that a red dot 
is aligned with the position of each correct re- 
sponse. 

The procedure in using the Self-Scorer is as 
follows. The test author decides which one of the 
four kinds of answer sheet he intends to use and 
which one of the two forms he will employ. He 


then drafts questions of the kind required by the. 


answer sheet chosen. He shuffles his questions 
and rearranges the order of the responses until 
his test agrees precisely with the key inherent 
in the answer sheet. After duplicating his test, he 
arranges a sufficient quantity of answer sheets 
in the perforated mats. He then administers the 
test after instructing the pupils to attempt each 
question and to keep trying a question until the 
correct answer is shown by the appearance of 
the red dot. The score is the number of punches 


made by the individual plus a penalty for omis- - 


sions. As in golf, the lower the score the better 
the record. 

The SRA Self-Scorer should be evaluated 
both as a testing technique and as a teaching de- 
vice. Much attention is given in the manual to 
the value of the Self-Scorer in the improvement 
of learning, but its effect upon the measurement 
situation is largely neglected. A question may 
appropriately be asked whether as good a test 
is obtained with this scoring procedure as with 
the ordinary procedure. In any event, the SRA 
Self-Scorer yields a somewhat different kind of 
measure, and users ought to be apprised of the 
nature of the difference. On the positive side, 
it may be pointed out that more intensive sam- 
pling of an individual's understanding of the 
questions in the test is obtained by this scoring 
method than is the case when the subject simply 
chooses one answer he thinks is correct. The 
variance is probably greater, and one may hy- 
pothesize that the reliability may be increased by 
this scoring procedure. On the other hand, the 
Self-Scorer apparently has some limitations 


from the standpoint of measurement. The test 
author is somewhat restricted in that he must 
make his test conform exactly to the answer 
sheet he chooses. Fewer questions can be pre- 
sented in an equivalent amount of time, for each 
individual must repeat his attempts at questions 
missed on the first attempt. Consequently, the 
test author is obliged either to sample less ex- 
tensively the area covered by the test or to allow 
more time for administration. The test items are 
weighted unequally in the score and this weight- 
ing is not according to judgment or evidence 
concerning the relative importance of the ques- 
tions but according to their difficulty for each 
individual. Weighting according to difficulty is 
not necessarily detrimental to measurement, but 
it can be a limitation unless the test author uses 
care to avoid introducing questions dealing with 
obscure and trivial points that may have escaped 
the attention of even the best students. 

Potential users of this Self-Scorer should un- 
derstand that their students may be ranked in 
very different order by this scoring procedure 
than by the usual one. If an extreme and im- 
probable example may be used for purposes of 
emphasis, let us assume that on a test of 150 five- 
choice items Student A answers 110 questions 
correctly but that the other forty questions are 
on an area he has not studied and that he guesses 
so badly on these that he uses all five choices 
on each one before he arrives at the correct 
answer. If he were scored on the basis of number 
right on the first attempt, he would obtain a 
score of 110, a respectable mark on a 150-item 
test of average difficulty. Marked on the Self- 
Scorer, his score would be 310. Assume further 
that Student B misses all but ten of the items on 
the first attempt but that he hits all these cor- 
rectly on his second attempt. His score accord- 
ing to the customary procedure would be 10, as 
compared with rro for Student A, but on the 
Self-Scorer procedure he would actually make a 
better record than Student A—290 as compared 
with 310. This is an artificial example which is 
unlikely to be duplicated in a real situation, but 
less marked shifts in the relative standing of 
students, as appraised by the two scoring pro- 
cedures, would no doubt occur frequently. 

The following questions are not taken up in 
the manual, but it is believed that answers to 
them would be helpful in evaluating the Self- 
Scorer: (a) What is the correlation between 
scores obtained by this scoring procedure and 
scores on the same test taken by the customary 
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procedure? (b) What is the reliability of scores 
yielded by the Self-Scorer as compared with the 
reliability of scores found in the usual way? (c) 
What is the relative amount of time required to 
administer the same multiple-choice test in the 
usual way and according to the procedure used 
with the Self-Scorer? (d) What is the relative 
validity of scores yielded by the two procedures 
as indicated by correlation with an outside cri- 
terion? 

Apart from the question of the measurement 
value of the SRA Self-Scorer, it is appropriately 
pointed out in the manual that the Self-Scorer 
has certain definite advantages for teachers and 
students. It apprises the student of his errors as 
he takes the test and makes a report of scores 
available immediately while interest in the ex- 
amination is at a high point. Thus, this device 
can be a useful teaching aid. 

While the instructional values of the Self- 
Scorer should make strong appeal to teachers in 
a variety of fields, there is one feature of this 
device which many teachers will no doubt regard 
critically. The procedure of test administration 
virtually forces students to guess, and even to 
guess repeatedly, on questions about which they 
are totally uninformed. Teachers often complain 
that students do quite enough guessing without 
encouragement, and such teachers are likely to 
look somewhat dubiously upon any device which 
makes guessing a requirement. 

The manual is attractive in format, well- 
organized, and well-written. Its tone, however, 
is scarcely one of impartially weighing the merits 
of the Self-Scorer. Portions of it read as though 
they might have been written by the publisher’s 
sales staff. A pictorial illustration on page 2 pre- 
sents in a somewhat misleading way a rather 
flimsy research finding. 

In summary, this reviewer is of the opin- 
ion that the instructional advantages of the SRA 
Self-Scorer outweigh its disadvantages. He is 
somewhat doubtful about its merit as a contri- 
bution to measurement. It is suggested that this 
device be used rather cautiously until further 
evidence of research is made available. 


[469] 
BENI Tweeddale I.Q. аа Tables. Chrono- 
logical ages 5-17; 1950; Is. 6d. per copy, postpaid; 
B. B. Mager ; Oliver & Boyd, Ltd. * (kon 


[470] 
xV.G.C. Answer Strips. Grades 3-16; 1950; a scor- 
+ ing device designed for use with teacher-constructed 
objective tests; two types of answer sheets (true-false 
and 5-choice); four patterns of correct responses may 


be used with a given type of answer sheet; 18¢ per 25 
answer sheets for any one type; 40€ per specimen set 
(including transparent plastic scoring stencil) of any 
one type; postage extra; Vocational Guidance Cen- 
tre. * 


ETIQUETTE 


[471] 

The Best Thing to Do: А Test of Knowledge of 
Social Standards. Grades 4-8; 1931; Forms A, B; 
$1.50 per 25; 10€ per specimen set; cash orders post- 
paid; nontimed (20-50) minutes; Frank E. Tomlin; 
Stanford University Press. * 

Heren Suacter, Consulting Psychologist; and 
Lecturer in Psychology, Northwestern Univer- 
sity; Chicago, Illinois. 

The purpose of this test is to discover 
knowledge of standards of social conduct among 
children in grades 4-8. There is frank acknowl- 
edgment that such social knowledge is not nec- 
essarily translated into social action, yet the test 
is presented as a measure of conduct as well as 
of attitudes. It is highly questionable whether 
this device performs such service. : 

The test is published in two forms, each con- 
sisting of 45 problems. A problem presents a 
situation followed by four possible reactions, 
the requirement being to indicate the best and 
the worst things to do under the circumstances 
described, or to indicate the best and worst rea- 
sons for doing it. Some situations are stated in 
ambiguous terminology. Interest in doing the 
test may lag before completing the reading of the 
180 possibilities involved in the 45 problems. 
The preadolescent working on the test may be- 
come irked at both the length of the task and the 
equivocal statements. Furthermore, in at least 
some of the situations, circumstances pertinent 
to the individual might alter the selection of а 
particular reply as “best.” \ 

Pupils taking the test are given opportunity 
to finish, although slow readers, it is warned, 
require 55 minutes. The average time is said to 
be 30 minutes. Scoring is by checking responses 
against a key. A table of norms was computed 
from responses of children living “in four sepa- 
rated communities." These are not further de- 
scribed, nor is there noted the number of chil- 
dren in each grade for which norms are stated. 
Norms for Form A are based on replies of 658 
children; for Form B there were 536 children. 
Scores are offered for thirds of the year of each 
grade (such a breakdown seems hardly prac 
ticable or necessary) ; it appears that from the 
time a child is ready to be promoted to fifth grade 
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to the time he is ready to be graduated from 8th 
grade, he gains 13 points of social awareness. 
There is no statement as to relative significance 
of the 45 items or as to which is learned early or 
later. 

Validity of the test is admittedly difficult to 
establish, and the figures presented in the manual 
are (again admittedly) not meaningful. Relia- 
bility coefficients were sought by correlating 
scores of the two forms of the test: in the case 
of one grade .88 is noted ; for a three-grade range 
.84 is reported. It is stated that the reliability of 
one form is higher than that found by correlat- 
ing two forms, presumably because of loss 
of interest after the prolonged attention re- 
quired. 

Just how efficient this device is in connection 
with character education programs, for which it 
is proposed, is not apparent from the data pro- 
vided in the manual. The literature has no re- 
port of any large scale use of the material. Its 
place in history (it is already two decades since 
its publication) is perhaps to mark the develop- 
ment in the awareness of psychologists and edu- 
cators of how the social and emotional health 
of children can be furthered more effect- 
ively through more dynamic approaches 
keyed to everyday use in understanding and 
behaving. 

The test might serve as stimulation for group 
discussion in a classroom. Its use as a means 
of discovering character and culture back- 
grounds is highly disputable. As a test of con- 
duct it would not be adequate. As a test of knowl- 
edge of social standards it would be limited in 
usefulness not only by the form and the length, 
but also because individual circumstances and 
differing situational factors would create con- 
fusion in arriving at a forthright "best" or 
“worst” response. 


[472] 
Furbay-Schrammel Social Comprehension Test. 
Grades 9-16; 1941-42; 15 scores: social calls, teas- 
receptions-parties, introductions, invitations, table eti- 
quette, dress and personal habits, public courtesies, cor- 
respondence, house guests, conversation, traveling, 
funerals, dances and balls, courtships-engagements- 
weddings, miscellaneous ; Form А (41); mimeo- 
graphed manual а) ; $1.70 per 25; 20€ per specimen 
set; postpaid; 80( 24 minutes; John H. Furbay and 
H. E. Schrammel; Bureau of ducational Measure- 
ments, Kansas State Teachers College of Emporia. * 


REFERENCES 
1. BARKER, CLARENCE, JR, A Comparative Study of the Social 
Comprehension oj кр s of College Freshmen. Master's 
thesis, Kansas State Teachers | ollege (Emporia, Kans.), 1941. 
Po (en Annotated Bibliography of Theses....1939-1944, 
1944, D. 5. 


James Н. Ricks, JR., Assistant Director, Test 
Division, The Psychological C orporation, New 
York, New York. 

This test is intended to measure the exam- 
inee's knowledge of the accepted rules of social 
conduct. The 330 questions are presented in 
seven pages of unfortunately small, though legi- 
ble, type. A split half reliability of .86 (220 male 
and female college freshmen) is reported for the 
total score. However, provision is made for no 
less than 15 part scores based on from 14 to 32 
items each, and no evidence as to the reliability 
of these part scores is offered; it hardly seems 
possible that such brief parts could have accept- 
able reliability. More than three fourths of the 
items are true-false. 

The manual discusses the test's validity solely 
in terms of internal consistency and of the ex- 
tent to which items differentiated students scor- 
ing high from those scoring low on the battery 
of entrance tests used in 1940 at Kansas State 
Teachers College of Emporia. The authority or 
authorities determining the right answers to the 
test are not cited ; presumably they were authors 
of standard books on etiquette. There is no indi- 
cation that any effort has been made to relate 
scores on the test to either past, present, or fu- 
ture social behavior of the person to whom the 
test has been given. 

Anyone who contemplates using this test 
should, of course, have clearly in mind the use 
he plans to make of the scores. If he is interested 
in making observations or predictions as to the 
social conduct of his examinee, the user's own 
intuition and common sense will have to assume 
the responsibility for such generalizations in 
the absence of any evidence on which to base 
conclusions regarding the significance of the test 
score. On the other hand, if he wishes only to 
appraise the subject's grasp of the contents of 
books on etiquette, as compared with that of 
students in small high schools and colleges in 
the Midwest, the total score may satisfy his 
needs. The best that can be said for the test is 
that it serves a limited purpose with limited 
efficiency. Even then, the Test of Etiquette: 
George Washington University Series is prob- 
ably preferable when this purpose needs to be 
served. 


[473] 
Test of Etiquette: George Washington Univer- 
sity Series. Grades 10-16; 1941-42; I form, '42; 
mimeographed manual (ат); $3 per 25; 30€ per speci- ` 
men set; postpaid; 50(55) minutes; Creelman Row- 
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land and Mitchell Dreese; Center for Psychological 
Service, George Washington University. * 

James Н. Ricks, Jr., Assistant Director, Test 
Division, The Psychological Corporation, New 
York, New York. 

As evidence of the adequacy of this test for 
measuring knowledge of correct social usages, 
the authors offer a list of eight books on eti- 
quette, published between 1936 and 1940, from 
which the items were drawn. In addition, the 
items comprising a preliminary form of the test 
were checked by “а jury of known social compe- 
tence.” No evidence is offered regarding the re- 
lationship of scores on the test to the way per- 
sons tested behave in actual social situations. In 
fact the authors make no claim that the test has 
validity in any behavioral sense; they are con- 
tent to say, “It is hoped that the use of the test 
will stimulate interest in correct social behavior.” 

As compared with the Furbay-Schrammel 
Social Comprehension Test, this test is equally 
reliable (.88), only two thirds as long, and is 
printed in very slightly larger though still 
crowded type. Most of the items are true-false. 
Norms are presented separately for the two sexes 
as well as for both sexes combined ; no informa- 
tion is given as to the number of cases on which 
the norms are based, but a somewhat greater 
variety of groups is reported than for the Furbay- 
Schrammel test. The test and the manual appear 
to have been unchanged since publication in 
1942. No scoring key is provided—the manual 
contains a list of correct answers with the help- 
ful suggestion that the test user may want to 
make a scoring key by copying these answers on 
strips of light cardboard. At least one of the 
answers given in the list is not the correct answer 
to the corresponding item. 

Like the Furbay-Schrammel test, this one 
contains some items which are obsolete, some for 
which the correct answers may vary according to 
local custom, and some dealing with situations 
normally encountered by only a small minority 
even of high school or college educated persons ; 
however, most of the items are unexceptionable. 
In a few items the authors seem to be attempting 
to test a real sense of courtesy or considerateness 
as well as a knowledge of the rule books. In gen- 
eral, this test accomplishes the same end as the 
Furbay-Schrammel in a slightly more efficient 
manner. It may serve to help motivate study of 
books on correct social practice, and its useful- 
- ness is probably defined by the extent to which 
such study makes for good social adjustment. 
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[474] 
Test of Knowledge of Social Usage. Grades 7-12; 
1933-42; I form, '42; manual (42); $5.50 per 100; 
25€ per specimen set; postpaid; nontimed (30-50) min- 
utes; Ruth Strang, Marion A. Brown, and Dorothy C. 
Stratton; Bureau of Publications, Teachers College, 
Columbia University. * 


REFERENCES 

1. STRANG, Котн. Pa Sch dy of Social Чафо їп Junior and 
Senior High Schools.” Sch & Soc 34:709-12 21 "a1. * (PA 
6:1642) 
HirpA ТАВА, Professor of Education, San Fran- 
cisco State College, San Francisco, California. 

For its limited purpose—namely, the measure- 
ment of knowledge of current social usage—the 
test appears to be quite adequate. Its sampling 
of areas seems sufficiently wide in scope. It cov- 
ers table manners, good taste in dress and ap- 
pearance, manners for host and guest, good form 
in walking with people, respect and considera- 
tion for others, behavior in a group, respect for 
property, and good manners at performances 
and games. Though the test is old, it shows few 
signs of obsolescence. A test made more recently 
might have included items on host and guest 
situations at television parties ; it might also have 
included more items on group situations other 
than parties, and on conduct in parliamentary 
procedure. One might question the de rigueur 
of form required in a few items, such as adoles- 
cents standing up in a public restaurant when 
their friends join them; but probably such re- 
quirements are not beside the point in the light 
of what is being taught in classes on good man- 
ners or in texts on etiquette. A few items seem а 
bit obvious for a true and false test. For that 
reason it is a pity this form was selectedefor 
convenience of scoring. A multiple answer test 
would have allowed more weighing of shadings 
in behavior. Handscoring only is indicated. The 
final score is obtained by subtracting wrong an- 
swers from right ones. J 

Quite valid procedures were followed in the 
making of the test. Current books оп etiquette 
were examined, and items most frequently men- 
tioned selected. Fifty teachers and personne 
workers rendered judgment on the selection. 
Three hundred high school pupils evaluated the 
items in the preliminary form, rating each item 
according to their estimate of its importance. ш 
addition, 1,614 students in junior and senior 
high schools took the preliminary form before 
revision, and 2,500 after revision ; and the ques- 
tions with which these students had difficulty 
were discussed with them in order to detect mis- 
conceptions and inadequacies. Changes in thé 
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1942 edition were based on the judgment of 40 
persons who had worked with adolescents. 

The test is reliable enough. A correlation of 
87, obtained froma retest of 46 pupils one month 
after the first test, is reported. 

An interesting additional use for the test is 
described in the manual. It is suggested that 
after knowledge has been tested, teachers use the 
test as a scale to appraise actual practice of eti- 
quette, by having students mark each statement 
according to whether the practice in case is fol- 
lowed seldom, usually, or always. 


HANDWRITING 


[475] 
Ayres Measuring Scale for Handwriting. Grades 
2-8; 1912-17; also called Ayres Handwriting Scale: 
Gettysburg Edition; 2 scores: quality, speed; 1 form, 
'17; original “Three Slant Edition” and adult hand- 
writing edition out of print; no data on validity; 24 
per scale, postpaid (only 1 copy needed); Leonard P. 
Ayres; Public School Publishing Co. * 
REFERENCES 

1. KING, ĪRVING, AND Јонмѕом, Harry. “The Writing Abili- 
ties of the Elementary and Grammar School Pupils of a City 
School System Measured by the Ayres Scale.” J Ed Psychol 


Psychol 5:589-95 iD 
„3. PINTNER, RUDOLF, “A Comparison of the Ayres and Thorn- 
dike Handwriting Scales.” J Ed Psychol 5:525-36 Noren 
. BREED, FREDERICK S., AND CULP, VERNON. *An Ау цен 
and Critique of the Ayres Handwriting Scale." Sch Soc 2: 


639-47 О '15. 
5. GRAY, & Truman. “The Training of Judgment in the Use 
of the Ayres Scale for Handwriting.” J Ed Psychol 6:85-98 F 


6. Manvet, Herscuer T. “The Use of an Objective Scale 
I 


for Grading Handwriting.” El Sch J 15:269-78 Ja '15. 
7. BREED, FREDERICK S. “The Comparative Accuracy of the 


Ayres Handwriting Scale, Gettysburg Edition.” El Sch J 18: 
458-65 Е ’18. * » 

8. Кооз, Leonard V. “The Determination of Ultimate Stand- 
ards of Quality in Handwriting in the Public School.” El Sch J 
18:423-46 F '18, * i 

RAXLER, ARTHUR E., AND ANDERSON, Hanmorp A. “Group 
Corrective Handwriting in the Junior High School—An Experi- 
ment." Sch R 41:675-84 N '33. * 


Worrn J. Оѕвовм, Professor of Remedial and 
Experimental Education, University of Wash- 
ington, Seattle, Washington. 

This is a 1917 revision of the original Ayres 
handwriting scale, first published in 1912. It was 
based on a study of the legibility of 1,578 samples 
of the handwriting of children of the upper ele- 
mentary grades of 40 school systems in 38 states. 
The degree of legibility of the samples was as- 
certained by a series of 15,780 accurately timed 
readings made by 10 paid investigators. The 
quality of each sample of writing was determined 
by its degree of legibility as shown by these 
readings, the assumption being that that writing 
is best for practical purposes which can be most 
easily read. The scale was designed as a meas- 
uring device rather than as a standard. 
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Since the aim was to construct an accurate 
measuring instrument, special care was exercised 
in controlling such factors as spelling, memoriza- 
tion of content, excessive motivation of the 
pupils, practice effect on the readers, and type of 
writing style. No account was taken of the per- 
sonal equation of the teacher who used the scale. 

In the original scale each paragraph had dif- 
ferent content. In the revision this feature was 
sacrificed. The content is the same throughout 
—the Gettysburg Address, which most of the 
pupils are likely to know by heart. To balance 
that, provisions are made in the revised scale for 
the measurement of the rate of reading. No in- 
formation is available concerning the reliability 
of the scores obtained by a single teacher. There 
is little or no provision for diagnosis. A teacher 
who finds that her pupils are writing poorly has 
no means of knowing why their writing is poor. 

In spite of the shortcomings noted, the scale 
in its present form seems to be the best means 
that we now have of measuring the quality of 
handwriting. 


[476] 
*Normal Handwriting Scale. Grades 4-8; 1947; 1 
form; no data on reliability and validity; no descrip- 
tion of normative population ; tentative norms; 15¢ per 
scale, postpaid (only 1 copy needed) ; (30-35) minutes ; 
‘Albert Grant; Zaner-Bloser Co. * 


[477] 
The Thorndike Scale for Handwriting of Chil- 
dren. Grades 2-8; 1910-12; 2 scores: s eed, quality ; 
1 form, ^12; 20€ per scale, postpaid (only 1 copy 
needed) ; E. L. Thorndike; Bureau of Publications, 
Teachers College, Columbia University. * 
REFERENCES 

т. THORNDIKE, Е. L. “Handwriting.” Teach Col Rec 11:83- 

175 Mr '10. * 


2. KELLEY, TRUMAN LEE. “Comparable Measures.” J Ed 
Psychol 5:589-95 Dire: * 4 
3. PINTNER, RUDOLF. “A Comparison of the Ayres and Thorn- 


dike Handwriting Scales." J Ed Psychol 51525-36 N '14. *. 
4. THORNDIKE, E. L. “Teachers Estimates of the Quality of 
Specimens of Handwriting.” Teach Col Rec 15:279-91 plus in- 


serts N "14. * 


Sruart A. Courtis, Professor Emeritus of 
Education, University of Michigan, Ann Arbor, 
Michigan. 

The Thorndike Scale for Hi andwriting of 
Children will always be of interest because it 
was the first scientifically calibrated instrument 
for the measurement of an educational product. 
It is, also, a fine example of the way principles 
derived from the scientific study of human be- 
havior may be applied to the solution of practical 
problems. 

Educational values are largely *intangibles." 
Some samples of handwriting are more pleasing 
and more legible than others. Thorndike's prob- 
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lem was to make a measuring instrument that 
would enable a teacher to express his judgment 
of differences in merit of handwritings in equal 
and consistent units of amount. 

Previously psychologists had studied the per- 
ception of small differences in man. Thorndike 
derived his concept of a unit amount of merit 
{тот their work. He defined a unit of general 
merit as the difference between two samples of 
handwriting that is recognized by 75 per cent 
of competent judges. This definition made it 
possible to select from samples of handwriting 
ranging from worst to best those whose differ- 
ence from the sample next below it was recog- 
nized by 75 per cent of handwriting experts. 
These selected samples make up the scale. 

Thorndike thus created an objective scale for 
measuring sensed differences in an intangible in 
units of amount that are equal in terms of the 
definition at all parts of the scale. His method 
has been used by others to measure other in- 
tangibles and is capable of very wide application. 

It is truly amazing, however, that in spite of 
the wide use of this scale, permanent contribu- 
tions to the improvement of the teaching of 
handwriting are conspicuous by their absence. 
Two reasons, among others, for this unproduc- 
tiveness and the consequent gradually waning 
interest in measurement will be discussed. 

Many teachers welcomed the scale and gave 
tests gladly. In general, they discovered only that 
their classes made scores just about standard 
for their grades, a little higher or a little lower 
in some cases. But the most disturbing aspect of 
the situation was the revelation of the enormous 
range of individual differences in every class in 
comparison with the small differences between 
class averages and the norms. Also it was most 
confusing to teachers that after a period of 
earnest teaching effort a retest showed few per- 
ceptible signs of any change in their classes. 

Ultimately, the general attitude of teachers 
crystallized into one best described by the word 
“resistant.” They were too wise to oppose meas- 
urement; they participated cooperatively in 
whatever measurement plans were made for 
them by their administrative superiors, but for 
their own work they quietly and effectively ig- 
nored measurement. To all comments on test- 
ing and test results, they developed a stock an- 
swer which effectively puts an end to discussion : 
“So what?” 

The mere use of a scale does not improve 
teaching. Neither do surveys or experimental 
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research. In general, few reports of measure- 
ment efforts will be found that help a teacher 
teach handwriting to Billy Smith any better with 
measurement than without. Misunderstanding 
of the nature of the scale is not Thorndike’s fault, 
The true function of the scale is to secure quanti- 
tative data which may be analyzed to reveal uni- 
versal laws and principles. Application of such 
contributions to the improvement of instruction 
calls for an entirely different activity. 

The second reason is probably both the cause 
and the explanation of the first. Education is 
concerned with the progress of individuals. 
Measurement of individuals inevitably creates 
a need for statistical methods of treating the re- 
sults but statisticians had readily available only 
mass statistics ; techniques for finding averages, 
trends, correlations, etc., which are not appro- 
priate for individual data. 

Thorndike himself set a bad example, statis- 
tically, in making his scale. Fullerton and Cat- 
tell’s work? was based on studies of many re- 
sponses by one individual to a given test situa- 
tion—individual data. Without experimental 
proof Thorndike assumed that one response by 
each of many individuals would give the same 
result—mass data. Also Thorndike adopted as 
a statistical expedient ? in the absence of knowl- 
edge, the assumption of normality of distribu- 
tion in educational data. These assumptions, 
which are false, delivered educational measure- 
ment directly into the power of statisticians. It 
has never been able to extricate itself from the 
web of logical deductions from false assumptions. 

To study individuals a teacher needs to meas- 
ure each one under a variety of stimuli, not once 
but often enough to secure reliable measures of 
his status. These measurements must be T€ 
peated at frequent intervals and cumulative 
records kept. Then individual teaching prescrip- 
tions to meet his revealed needs may be made 
intelligently. Each child's growth curve is unique. 
His scores may not be combined with other 
children's scores until corrected for that unique 
ness. у 

Such use of the Thorndike scale quickly 
brings to light problems whose solution must be 
attacked experimentally, not statistically. Among 
them are: (a) the cause and significance o 

1 Fullerton, С. S., and Cattell, J. McK. On the Perception 
datore Nos Pinas, Fe 

niversity 0! ennsylvania Press, 1892. M ‚у. am 
кус iL E OL еде 


York: Bureau of Publications, Teachers College, 
versity, 1926. Pp. xxvi, 616. * 
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variations in the handwriting scores of indi- 
viduals on repeated tests; (b) the reduction of 
such variable scores to valid equivalent scores 
under standard conditions; and (c) the trans- 
formation of subjective units of amount into 
linear objective units. Until these problems have 
been solved experimentally (they have not been, 
and cannot be, by mass statistics), it is impossi- 
ble for measurement in handwriting, or any other 
educational field, to be productive in terms of 
benefits to the child measured or to his teacher. 


HEALTH 


[478] 
xCollege Health Knowledge Test. College; 1950; 
12 scores: social and biological background, nutrition 
and diet, excretion and cleanliness, exercise and body 
mechanics, fatigue and rest, mental hygiene, reproduc- 
tion and heredity, prevention and control of disease, 
eye-ear-teeth hygiene, hygiene of environment, use of 
medical care, total; Form А ; no norms for part scores ; 
$2 per 25; 15¢ per specimen set; cash orders postpaid; 
nontimed (28-50) minutes; Terry H. Dearborn; Stan- 
ford University Press. 
REFERENCES 
1, DEARBORN, TERRY HAMILTON. The Measurement of Health 


Knowledge. Doctor’s thesis, Stanford University (Stanford, 
Calle), 1950. (Abstracts of Dissertations....1949-50, 1950, рр. 
311-3. 


H. Harrison CLARKE, Professor of Physical 
Education, and Director of Graduate Studies, 
Springfield College, S; pringfield, Massachusetts. 

"This test consists of 100 multiple choice items 
sampling the II areas listed above. Although 
constructed for use at the college level, the au- 
thor states that it may be used with informed 
adults and in senior high schools offering a full 
semester course in personal hygiene. 

As the title indicates, this test is definitely a 
knowledge test. No attempt is made to deal 
with attitudes toward health and health be- 
havior. It is true that proper and correct health 
knowledge is essential, but if this knowledge is 
not translated into appropriate conduct, the 
health teacher's effort is only partially effective. 
Health tests at the college level which attempt 
to evaluate attitude and behavior are the Byrd 
Health. Attitude Scale and Shaw and Troyer's 
Health Education Test. 'The latter test also eval- 
uates health knowledge. 

While it is interesting to know the student's 
position in relation to each of the 11 sections of 
the test, the validity of comparisons with maxi- 
mum scores for each section is not stated. The 
number of questions under each section varies 


from five for "hygiene or environmerit” to 15 
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for “nutrition and diet.” The unwary may be led 
to interpret such scores as measures of the, 
student’s knowledge in the section areas. Cer- 
tainly, until evidence is presented to support the 
contention, this practice would be unwarranted. 

Certain questions have misleading implica- 
tions. For example: “In state and federal hos- 
pitals, the majority of beds are occupied by 
patients suffering from (1) mental disorders, 
(2) tuberculosis, (3) cancer, (4) orthopedic 
defect, (5) venereal disease.” In the correct an- 
swer, “mental disorders,” the assumption, fre- 
quently made, that there are more patients with 
mental disorders than suffer from other condi- 
tions, is fallacious. The reason so many hospital 
beds are occupied by these patients is, rather, that 
they occupy them for so long a time, years in 
many cases. 

Certain questions, excellently stated, require 
a wide range of knowledge to answer. For ex- 
ample: “Digestion is (1) aided by relaxing after 
a meal, (2) completed in the stomach, (3) 
speeded by exercise before a meal, (4) improved 
by a brisk walk to ‘settle’ a meal, (5) normally 
completed in one hour.” To answer the question 
correctly, the student must have some knowl- 
edge of the digestive process and the effect of 
various practices upon this physiological process. 
In this way, such questions serve the purpose of 
two or more in bringing out the different aspects 
involved in the situation. 

From the standpoint of health knowledge only, 
this test has definite value. The author has uti- 
lized his resources in the approved manner. The 
test compares very well with other tests of a 
similar nature. A few fundamental questions 
relating to curricular validity, may, however, 
be raised for all health knowledge tests. Will 
100 multiple choice items sample adequately 
knowledge in 11 health areas? What comprises 
the proper health education curriculum? What 
number of questions are needed to cover ade- 
quately a defined area of knowledge? Should 
not health knowledge tests be more specific and 
complete in terms of areas of health knowledge 
included ? These questions should be answered in 
future research in this area. 


[479] 
*Cornell Medical Index—Health Questionnaire. 
Adults; 1949; separate forms for men and women; 
$2.50 per 50 of any one form, postage extra; specimen 
set not available; nontimed (10) minutes; Keeve Brod- 
man, Irving Lorge, Albert J. Erdmann, Jr. and Har- 
old G. Wolff; Cornell University Medical College (or- 
ders should be sent to Albert J. Erdmann, Jr. The 
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New York Hospital, 525 East 68th St., New York 21, 
N.Y.) * 

[ 480] 
X*[French Tests for Professional Courses in 
Knowledge of Sports]: Physical Education Ma- 
jor Examinations. College; 1943; 16 tests (badmin- 
ton, basketball, body mechanics, canoeing, 
hockey, folk dance, golf knowledge, recreational sports, 
rhythms, soccer, softball, stunts and tumbling, swim- 
ming, tennis, track, volley ball knowledge) ; 1 form; 
no data on reliability and validity (for data presented 
elsewhere by the author, see 2); no manual; 60€ per 
complete battery; 6¢ per copy of any one test; postage 
extra; 30(35) minutes per test; Esther French; De- 
partment of Physical Education for Women, State 
University of Iowa. * 


REFERENCES 


т. Frencu, Еѕтнев Louise, The Construction of Knowledge 
Tests in Selected Professional Courses in Physical Education. 


Doctor’s thesis, State University of Iowa (Iowa City, Iowa), 
1942. FRENCH, EsrHrm, "The Construction of Knowledge Tests 
in Selected Professional Courses in Physical Education.” Res Q 
14:406-24 D '43. * (PA 19:3487) 

Н. Harrison CLARKE, Professor of Physical 
Education, and Director of Graduate Studies, 
Springfield College, Springfield, Massachusetts. 

Each of the 16 tests consists of 22 to 33 multi- 
ple choice items. Diagrams are used frequently 
and effectively in the test items. The items on 
rules are limited to those essential for intelligent 
play. In the sports activities, emphasis is placed 
on knowledge of fundamental techniques and 
strategy of individual and team play. 

A few negative aspects should be mentioned. 
On the printed directions given on the cover 
page of each test, a time limit ts intimated but not 
stated. Here, too, the testee is cautioned to avoid 
guessing since wrong answers will be penalized, 
but no scoring method is given. In the test Body 
Mechanics, several negative questions are used, 
such as: “Which of the following is not an indi- 
cation of pronation ?" 

In general, the tests appear to be quite suit- 
able for use with women students majoring in 
physical education. Higher reliability for several 
of the tests, particularly those covering stunts 
and tumbling, softball, basketball, recreational 
sports, and body mechanics, would be desirable. 
Norms for each test based on some standard 


score procedure would be helpful, rather than - 


the mean scores now provided in the research 
report. Inasmuch as the curricular validity for 
the tests is based on professional courses pre- 
sented at the University of Iowa, the tests may 
be more applicable at that institution than else- 
where. 


[ 481] 
Gates-Strang Health Knowledge Tests. Grades 3- 
8, 7-12; 1925-37; 2 levels; manual [37]; $4.40 per 


100; 25€ per specimen set; postpaid; Arthur I. Gates 
and Ruth Strang; Bureau of Publications, Teachers 
College, Columbia University. * 

a) ELEMENTARY TESTS. Grades 3-8; Forms A (’37), 


B (37), C (37) ; 40(45) minutes. Я 
b) P aic TESTS. Grades 7-12; out of print: Forms 
D-F. 


REFERENCES 
т. GATES, ARTHUR I., AND STRANG, Коти. "A Test in Health 


Knowledge." Teach Col Кес 26:867—-80 Je '25. 

2. BENKERT, JOSEPH M. An Appraisal оў Health Knowledge 
in Certain Senior High Schools of Beaver County, Pennsylvania, 
Doctor’s thesis, University of Pittsburgh (Pittsburgh, Pa.), 
1941. (Abstracts of Theses....1941, 1942, рр. 33-42.) 

Paur E. Kama y, Professor of Education, Uni- 
versity of Oregon, Eugene, Oregon. 

In their manual the authors say, “Since in 
health education instruction is not standardized, 
valid national norms are not reasonably fixed.” 
This statement written in 1937 is probably 
equally accurate today. Not only is it diffi- 
cult to fix valid national norms, it is also 
equally difficult to construct a valid test. The 
items in these tests cover topics such as clean- 
liness, care of the body, foods, sanitation, rest, 
safety, prevention of disease, clothing, physiol- 
ogy, and first aid. Anyone interested in health 
tests should examine these tests item by item 
and decide whether they test what is being cur- 
rently taught. In this reviewer’s opinion a mod- 
ern health test should include some items cover- 
ing such topics as sulfa drugs, antibiotics, blood 
types, and atomic radiation. 

The manual includes some test results ob- 
tained in a large city system, a suburban school 
system, and rural schools in two counties. The 
number of cases on which the reported statis- 
tics were compiled is too small. This fact coupled 
with the lack of items on modern health develop- 
ments seriously limits the value of these tests. 


For an excerpt from a review, see 3:419; for 
reviews by Frederick Rand Rogers and A. H. 
Turney and an excerpt from a review, see 40: 
1500. 


[482] T 
*Health and Safety Education Test: National 
Achievement Tests. Grades 3-6; 1947; 5 звона 
good habits, cause and effect, facts, application of rules, 
total; Form A; $2 per 25; 356 per specimen set; 2034 
age extra; 40(45) minutes; edited by Clifford L. 
Brownell; Lester D. Crow and Loretta C. Ryan; 
Acorn Publishing Co. * 


[483] 4 
*Health Education and Hygiene: Every Рар" 
Test. Grades 7-9; 1935-51; new form usually puo 
lished each April and December ; form December 1957 $ 
no data on reliability and validity; no manual; nor 
(’51) ; 214€ per test; 16 per answer key; postpaid d 
(45) minutes; Ohio Scholarship Tests, Ohio State 
partment of Education. * 
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[484] 

XHealth Inventories. High school and college; 
1040—50; a revision of the instruments developed in 
1940 and described in Cooperation in General Educa- 
tion (see т below); 6 parts; I form. ’50; no data on 
reliability; no norms; manual (о); $2.25 per 25 of 
any one part; separate answer sheets must be used; 
75@ per 25 answer sheets for any one part; 25¢ per set 
of scoring stencils for any one part; cash orders post- 
paid; 50¢ per specimen set, postpaid; nontimed (50) 
minutes; developed by the Cooperative Study in Gen- 
eral Education (Ralph W. Tyler, Director) ; Cooper- 
ative Test Division, Educational Testing Service. * 
a) HEALTH ACTIVITIES. 
b) HEALTH INFORMATION. 
с) HEALTH INTERESTS. 
d) HEALTH ATTITUDES, 
е) ANALYZING HEALTH PROBLEMS. 
f) JUDGING SOURCES OF INFORMATION IN HEALTH PROB- 
LEMS. 

REFERENCES 


1. COOPERATIVE STUDY IN GENERAL EDUCATION, EXECUTIVE 
COMMITTEE, RALPH W. TYLER, DIRECTOR. Chap. 7, “The Major 


Project in Science,” pp. 142-57. In Cooperation in General Edu- 
cation. „Washington, .C.: American Council on Education, 1947. 
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BENJAMIN SHIMBERG, Assistant Chief, Ex- 
perimental and Evaluation Services Branch, Di- 
vision of Public H ealth Education, Public 
Health Service, Federal Security Agency, 
Washington, D.C. 

I. Backcrounp. The six health inventories 
listed above were developed in 1940 by a com- 
mittee of science instructors participating in the 
Cooperative Study in General Education spon- 
sored by the American Council on Education. 
They were conceived as tools to measure the ex- 
tent to which college science courses contribute 
to the more intelligent handling by students of 
their problems of personal and public health. 
They were also seen as a means of discovering 
health topics about which students were unin- 
formed or misinformed. The health problems 
were originally identified through analysis of 
existing information relating to the health of 
adolescents. Preliminary forms of the inventories 
were revised after giving them to students in 
four colleges. The revised inventories were used 
in the Cooperative Study until its termination in 
1944. 

Il. DESCRIPTION OF THE INVENTORIES. А. 
INVENTORY I, HEALTH ACTIVITIES. This inven- 
tory consists of 100 statements about health prac- 
tices related to personal appearance and hygiene, 
diet and nutrition, general operations of the 
body, and health hazards. The student first 
marks each item according to whether he en- 
gages in the practice “regularly,” “occasionally,” 
or “never.” The student then marks the first 70 
items as “sound,” “unsound,” or “doubtful” as 


to soundness in terms of science principles. 
Thirty items in this inventory are not scored. 

В. INVENTORY 2, HEALTH INFORMATION. In 
this inventory the student signifies his belief 
in the truth or falsity of each of 110 statements, 
using a five-point scale. The student indicates 
the degree of conviction he has in his response. 
The overall scoring key treats each statement as 
a true-false item, yielding subscores for each of 
the five subject matter areas. Separate keys are 
provided to determine on which items the stu- 
dent feels certain of his response and on which 
he feels that he is uncertain or uninformed. 

С. INVENTORY 3, HEALTH INTERESTS. This in- 
ventory was designed to determine whether a 
student is interested in learning about certain 
health topics and whether he wants to study these 
topics as part of his college program. 

D. INVENTORY 4, HEALTH ATTITUDES. This 
inventory attempts to measure student attitudes 
toward six premises or assumptions concerning 
the way people behave in various types of health 
situations. The six assumptions are quoted be- 
low: 

1. That there is one right way in health practices or 
problems. 

2. That qualified agencies should have a right to con- 
trol in health problems. 

3. That individuals have a гөрн to fulfill in 
maintaining at a maximum their own health as well as 
the health of others. 

4. That good, adequate, and buoyant health requires 
the greatest concern for bodily functions, for dangers 
from bacteria, etc. 


5. That personal appearance—rugged manly phy- 
sique—is the most important element of health. 


ape d 


6. That it is nature's purpose to maintain health at 

a maximum, especially if nature is “uncontaminated” 
by man’s activities. 
To determine whether the student subscribes to 
or rejects these assumptions, 60 statements are 
presented which presumably call for value judg- 
ments rather than judgments of fact. There is 
no indication on the test booklet that specific 
assumptions are being tested. Each statement 
seemingly stands on its own and may be an- 
swered “agree,” “disagree,” or “uncertain.” 
There are 10 statements scattered through the 
test which are intended to reflect the student’s 
attitude toward each of the assumptions. Two 
of the assumptions (2 and 3) are considered 
sound while four are considered unsound, The 
scoring gives credit for accepting items involv- 
ing “good” assumptions and for rejecting those 
reflecting “bad” assumptions. 

Е. INVENTORY 5, ANALYZING HEALTH PROB- 
tems. The three tests in this inventory attempt 
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to evaluate the student's ability to think criti- 
cally with respect to various types of health 
materials. The first set of questions involves the 
interpretation of data. The student is required 
to judge the extent to which each of 22 state- 
ments is justified on the basis of the data pre- 
sented. In the second test, which is based on a 
health advertisement and a magazine article 
about health, the student is asked to appraise the 
competence and integrity of the individuals or 
organizations involved. The final test asks the 
student to justify the conclusions stated in con- 
nection with four health problems. He may select 
from the statements given (a) those based on 
valid scientific principles; (b) those involving 
false analogy; (c) those assuming or restating 
the conclusion which was to be justified; and 
(d) those using true but irrelevant reasons. 

F. INVENTORY б, JUDGING SOURCES OF IN- 
FORMATION IN HEALTH PROBLEMS. The purpose 
of this inventory is to appraise the student's skill 
in evaluating the relevance and reliability of 
various sources of health information. Several 
health questions are presented. After each there 
is a list of from five to seven possible sources 
from which information might be obtained. The 
student evaluates each source as: (a) “А good 
place to go for valid information on the problem." 
(b) “The source may give some information... 
but is one you would not use if there were other 
sources available." (c) “The source is a poor one 
and you would not use it under ordinary circum- 
stances." Part II of the inventory lists I9 sources 
of health information and requires the student 
to indicate his use of and familiarity with each 
of these sources. The items of Part II are not 
„scored. 

ПІ. COMMENTS ABOUT THE INVENTORIES. 
A. COMMENTS ABOUT GENERAL CONTENT. The 
topics covered and problems presented in these 
inventories were drawn mainly from investiga- 
tions conducted in the 1930’s or earlier. No sub- 
stantial revisions have been made to bring the 
content into line with new developments in sci- 
ence and health. Some obsolete items have been 
removed, but the inventories have basically the 
same content they had in 1940. This factor may 
limit their usefulness in situations where com- 
prehensive up-to-date coverage is desired. In the 
Health Interests inventory for example, the stu- 
dent is not given the opportunity to indicate his 
desire to know about current health develop- 
ments such as the sulfa drugs and antibiotics, 
psychosomatic medicine, and new diagnostic 


techniques. At best, this inventory would now 
give an incomplete picture of the health topics 
which are of greatest interest to present day 
students. An obvious effort has been made to 
cast most of the tests into a functional frame- 
work, but some of the items do not seem related 
to the needs of college students. Some have a 
textbookish quality: “How are spermatozoa 
(sperm cells) and ova (eggs) produced?" (In- 
ventory 3, Item 108) is one of the interest items, 
In Inventory 5, for example, most of the prob- 
lems presented for analysis are essentially rea- 
soning problems involving health material. They 
are not the sort of problems students are likely 
to meet outside the classroom. 

B. COMMENTS ABOUT FACTUAL CONTENT. 
Since the inventories are concerned with health, 
it is only reasonable to inquire about the extent 
to which competent health specialists have re- 
viewed the items and answer keys. Neither the 
report of the original study nor the current test 
manual indicates that such a review was made 
at any time. The only critical review of the test 
items reported appears to have been by a group 
of college science instructors attending a work- 
shop at the University of Chicago in the summer 
of 1940. In personal correspondence with this 
reviewer, the publisher said, "No attempt was 
made to change the key....and thus it represents 
the authority of the original authors and not the 
authority of the Educational Testing Service." 
The test manual does caution users about the 
controversial nature of some items, but nowhere 
is it made clear that ETS assumes no responsi- 
bility for the accuracy of the answer keys. In the 
absence of any statement to the contrary, users 
would assume that ETS had checked the keys 
before republishing these tests. Further credence 
is given to the authority of these tests by promi- 
nently listing on the title page of each test, the 
names of the distinguished Executive Committee 
of the Cooperative Study in General Education. 
At the request of this reviewer a physician, 4 
nurse, two nutritionists, a health educator, an 
a proféssional writer on problems of public 
health each reviewed one or more of the inven- 
tories from the viewpoint of his specialty. Their 
comments and criticisms raise serious doubt a5 
to the value of these instruments. Below are some 
of the criticisms brought out in the comments 
of the professional consultants. 

Many health practices which have been key ed 
as "desirable" or “undesirable” cannot be an- 
swered without further qualification. For €x- 
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ample: (a) Health experts who took the test 
would not agree that one should regularly add 
additional salt (common salt or salt tablets) to 
the diet during the summer months (Inventory 
1, Item 22). The consensus was that the desira- 
bility of the practice would vary with such fac- 
tors as climate, type of activity, and health status 
of the individual. No valid generalization is pos- 
sible without taking these into account. (b) 
According to the key it is desirable to "ask a 
physician to check your blood pressure" (Inv. 
1, q. 48) regularly or occasionally. What about 
the person who gets a regular physical examina- 
tion? He has no occasion to ask for such a test, 
yet he would lose credit for answering "never." 
(c) Some of the questions asked are not gen- 
erally applicable. Students who do not engage 
in a given activity, or who have not faced a par- 
ticular health problem cannot answer meaning- 
fully in terms of "regularly," “occasionally,” or 
“never.” Take, for example, the question about 
the treatment of athlete's foot with drug store 
preparations (Inventory 1, Item 7). А student 
may mark "never" because he has never had the 
disease. On the other hand, a student who has 
had athlete's foot, and therefore had occasion to 
use some medication may reply "never" for an 
altogether different reason. 

The answers given to some of the items are 
not always based on the best current knowledge 
about the subject. For example, in Inventory 1, 
each of the 70 health practices is keyed as 
“sound” or “unsound” ; that is, substantiated or 
contradicted by scientific principles. Not one of 
the items is keyed as “doubtful.” It would in- 
deed be comforting if science had conclusively 
resolved all questions about the soundness of 
popular health practices. Unfortunately, though, 
the current status of many items is in doubt, 
either because no good evidence exists, or be- 
cause different investigators have reported con- 
flicting results. Below are a few of the items on 
which the consultants disagreed with the answer 
key. (a) Some statements are controversial: 
According to the key, a satisfactory method of 
removing unwanted hair is the use of an electro- 
lytic needle by an expert (Inventory 2, Item 7). 
Our consultants pointed out that some skin spe- 
cialists do not consider this method “satisfactory” 
since they fear it may result in tissue damage. 
Moreover, the term "expert" is ambiguous; it 
could mean a technician not under medical super- 
vision, (b) Some statements are not supported 
by existing evidence: The statement “Tooth de- 


cay can be lessened by brushing the teeth fre- 
quently and vigorously” (Inventory 2, Item 16) 
is keyed as “true.” Experts in dental health say 
that brushing the teeth within a few minutes 
after eating may cut down on decay, but not “fre- 
quent brushing” per se. There is no evidence 
that brushing vigorously has anything to do with 
reducing tooth decay. (с) Some statements are 
contradicted by existing knowledge. “The union 
of sperm and egg is possible only on one day in 
every twenty-eight,” (Inventory 2, Item 94) is 
keyed as “true.” Doctors say that fertilization 
may take place during a two or three day period. 
Another item keyed as “false” states, “It is 
possible to determine definitely when a person 
has been completely cured of gonorrhea” (In- 
ventory 2, Item 76). Experts now say that this 
statement is "true" or “probably true.” (d) 
Some statements cannot be answered “true” or 
“false” because reliable data are lacking: “Ap- 
proximately 25% of the American population 
has a serious deficiency in one or more vitamins.” 
(Inventory 2, Item 27) This statement may 
have been correct in 1940, but we do not know 
whether it still holds today. 

С. COMMENTS ABOUT SPECIFIC INVENTORIES. 
The general comments about content and the 
accuracy of factual items summarize this re- 
viewer’s opinions about all but two of the in- 
ventories. These require separate discussion. 

Inventory 4. Health Attitudes Inventory. The 
consultants ‘who reviewed the Health Attitudes 
inventory raised some. fundamental questions 
about two of the assumptions endorsed by the 
people who prepared the inventory. (a) As- 
sumption 2 states “That qualified agencies should 
have a right to control in health problems.” This 
might be an acceptable statement from the view- 
point of public health if it were modified to read 
“where the health of others is involved.” With- 
out such a qualification, the assumption implies 
that any agency has a right to interfere in mat- 
ters of purely personal health. About half of the 
items which presumably test this assumption are 
very weak, either because of ambiguities, wrong 
emphasis, or incorrect information. One item 
proposes that “Every person, whether or not he 
has athlete’s foot, should have to treat his feet 
(stepping into a solution) before entering a 
swimming pool” (Item 6). This may be an un- 
fortunate illustration since the thinking on this 
practice has changed. Several consultants were 
of the opinion that these baths may actually trans- 
mit athlete’s foot. One item states that “School 
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officials should have a right to send malnour- 
ished children to those agencies where they may 
receive an adequate diet” (Item 10). What 
agencies ? What is meant by “send” ? Would this 
be done with or without parental consent? An- 
other item proposes that "A severe penalty 
should be imposed by law on any individual who 
runs his motor unnecessarily in public garages 
where the individual knows there is no pro- 
vision for the removal of carbon monoxide" 
(Item 35). Might it be wiser to require the 
garage owner to install exhaust fans? Here the 
difference of opinion is not over the desirability 
of social control but rather as to the best method. 
(b) Premise 3, which states “That individuals 
have a responsibility to fulfill in maintaining at 
a maximum their own health as well as the health 
of others" might be acceptable if we knew where 
"responsibility" began or ended. This reviewer 
would certainly not endorse such a premise if 
he thought it implied interference in the per- 
sonal affairs of others. From the type of items 
used to test this assumption, one wonders 
whether the item writers themselves had a clear 
idea of what the statement implied. Here, for 
example, are two items with which students are 
expected to agree: 

Failure to have a periodic physical examination is 
one characterization of poor citizenship (Item 38). 

Any person buying unpasteurized milk in a locality 
where pasteurizing is required is as much a "slacker" 
X er as is the person selling the milk (Item 
9). 
Here are two items with which students are ex- 
pected to disagree: ] 


It is no one else's business if a person decides to go 
on a Hollywood diet (a meatless diet) or any similar 
kind of a diet (Item 36). 


Even though the use of high heels may cause falls 
and crippled feet, it is a woman's own business and no 
one else's if she wishes to wear high heels (Item 15). 

Inventory 6. Judging Sources of Information 
in Health Problems. The test idea is a good one, 
especially as a stimulus for a discussion which 
might explore the strengths and weaknesses of 
each suggested source. However, the present test 
has some serious shortcomings: (a) The date 
of publication is not given for many of the titles 
listed as good sources. In the opinion of this re- 
viewer, the student who rejects such material 
would be showing sound judgment. (b) Some of 
the references listed are long outmoded. For ex- 
ample, several consultants indicated that neither 
Personal Hygiene Applied, by Williams, nor 
Shadow on the Land, by Parran, are considered 


the best references today, more than ten years 
after publication. (c) Registered pharmacists 
are listed as secondary sources of information on 
questions like, “Are the advertised throat anti- 
septics and cold remedies of any value?” (Item 
21) No doubt pharmacists would try to provide 
reliable information on this topic, but this re- 
viewer would hesitate to suggest going to a 
source whose viewpoint may be colored by eco- 
nomic self-interest. (d) Careful editing of this 
test would have revealed that the following refer- 
ences need to be brought up-to-date: (1) The 
magazine Hygeia is now called Today's Health 
(Item 28). (2) Dr. Thomas Parran is no longer 
Surgeon General of the U.S. Public Health 
Service. (q. 38.) (3) National Society for the 
Control of Cancer is now the American Cancer 
Society (Item 12). 

D. QUALITY OF TEST ITEMS. The presence of 
many poor test items in the inventories leaves 
one with the feeling that they were published 
prematurely. Many items are ambiguous, clum- 
sily worded, double-barreled. A frequent com- 
plaint heard from persons who reviewed the 
tests was that some items, especially in Inven- 
tory 1, could not be answered in terms of the di- 
rections. ; 

This reviewer found it difficult to understand 
why the staff of ETS had refrained from making 
editorial changes in obviously poor items. The 
test manual states that the ETS staff “has at- 
tempted to keep revisions to a minimum in order 
to retain the original character of the inven- 
tories,” In personal correspondence with this 
reviewer, ETS has further clarified its reasons 
for not making extensive editorial revisions. The 
following excerpts seem relevant: 


It was felt that many changes were needed. IR 
changes were not made because they would have Su 
stantially altered the test to the extent that they Ade : 
no longer be comparable to the original tests. ж for. kn 
reason (research) the tests were not substanti x 
changed even though it was felt that such changes M s 
necessary in order to produce good tests., In o йе 
words, these inventories were reprinted without 
usual ETS treatment of tests. 


If the prime motive for reprinting the tests 0 
their original form was to make them available 
for research, this reviewer feels that they shou 

have been made available only for this рр 
By including them in the general catalog of the 
Cooperative Test Division without pointing 0 
that they had not been edited or brought up-t 
date, ETS has not fulfilled its responsibility Y 
prospective purchasers, who rely on reputa 
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test organizations to furnish them with materials 
of high technical quality. 

IV. Comments ABOUT THE Test MANUAL. 
A. SCOPE OF MANUAL. The test manual includes 
material оп the background of the study, de- 
scriptions of each inventory, and suggestions for 
interpreting the scores. Much of this material 
was taken directly from Chapter 7 of the final 
report of the Cooperative Study, Cooperation їп 
General Education (т). No normative data 
are given. A table shows the maximum score 
attainable for each inventory and for each sub- 
test within an inventory. Suggestions given for 
interpreting the various scores seem adequate. 
However, the number of keys provided could 
probably be reduced without sacrificing any use- 
ful data. 

в, STATISTICAL ANALYSIS. There is no indi- 
cation in the manual that these inventories have 
been subjected to any form of statistical analy- 
sis. (a) Validity. The validity of these inven- 
tories is not discussed, nor are any data pre- 
sented to indicate that they measure what their 
titles say they measure. If the basic thinking 
which underlies these inventories is sound, there 
should be a demonstrable relationship between 
high scores, desirable health behavior, and good 
health. (b) Reliability. In view of the quality of 
the test items, the questions raised by the health 
specialists, and the brevity of some of the tests, 
this reviewer would predict low reliability on 
most of the tests. Unfortunately, reliability co- 
efficients are not reported. (с) Intercorrelations. 
No intercorrelations have been computed among 
the various tests, hence we do not know to what 
extent the various tests are measuring the same 
factors. It would also be desirable to know how 
these tests correlate with measures of mental 
ability and with standard science achievement 
tests. (d) Scoring Procedure. Scoring is ac- 
complished by means of tissue paper overlays, 
which seem to work best when only a count of 
right or wrong answers is desired. When one 
tries to identify specific problems about which a 
student is uninformed or misinformed, it is 
necessary to lift the scoring key to indicate on 
the answer sheet that an item is wrong. In this 
reviewer’s opinion, punched out scoring keys 
would be more satisfactory. 

V. Summary. This reviewer believes that the 
basic idea which underlies these inventories is 
sound and that some of the tools reveal a good 
deal of creative imagination. 

However in their present form, these inven- 


tories fall far short of what one might expect in 
the light of current standards of test develop- 
ment. The health topics covered are not up-to- 
date. The keys do not always reflect the current 
thinking or scientific progress which has oc- 
curred during the past decade. The items are 
generally of poor quality. No data are given as 
to the validity or reliability of these tests. 

This reviewer hopes that ETS and others who 
are interested in this problem will resume work 
on these inventories, so that sound, up-to-date 
instruments will be available to those who would 
like to use them. 
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*Health Test: National Achievement Tests. 
Grades 3-8; 1937-49; 5 scores: recognizing best hab- 
its, health comparisons, causes and effects, health facts, 
total; Forms A (49), В (40); no data оп reliability 
and validity and no description of normative population 
in manuals; no norms for part scores; manuals (738) ; 
$1.75 рег 25; 356 per specimen set; postage extra; non- 
timed (40) minutes; Robert K. Speer and Samuel 
Smith; Acorn Publishing Co. 


Јлсов S. ORLEANS, Professor of Education, and 
Director of Research and Evaluation, Division 
of Teacher Education, The College of the City 
of New York, New York, New York. 

The test consists of four parts, one each on 
recognizing health habits, health comparisons, 
causes and effects, and health facts. Each part 
contains 20 multiple choice questions. In Part 
I each question is followed by 3 choices ; in Parts 
II and IV, by 4 choices; and in Part III, by 5 
choices. 

Information was not available to this reviewer 
concerning the basis for testing these four as- 
pects of the subject or the specific items of con- 
tent covered by the 80 questions. Most of them 
would appear to measure significant knowledge. 
A few of the items may be regarded as question- 
able from the standpoint of either content or Eng- 
lish. Should grade children be expected to know 
what is meant by “cholera infantum" ог 
“tetanus”? Does “failing to wear rubbers in 
wet weather” cause colds or is it a condition that 
permits the cause of colds to become effective? 

Although the test as a whole appears to have 
merit, there is occasional evidence of looseness 
in question preparation such as the inclusion of 
two answers both of which cannot be true, thus 
limiting a four-choice question to two choices ; 
and occasional questionable use of English (e.g. 
“the food and air we use"). The questions are 
largely factual except for those on recognizing 
health habits in which the pupils are asked to 
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judge the best of three ways of performing var- 
ious acts as done by three children. This type of 
question is probably more interesting and mean- 
ingful to children than would be the simple fac- 
tual form. 

The instructions are simple and clear, although 
third graders and some fourth graders may be 
expected to have difficulty with them. The same 
reading difficulty may hold true for the ques- 
tions themselves in the case of third and fourth 
graders. If the questions are arranged in the 
order of difficulty, again pupils in the lower 
grades for which the test is intended probably 
need to be instructed to go on to the further 
parts of the test without spending an undue 
amount of time on each part. 

No data were available on the reliability of 
this test. Norms are furnished for the test as a 
whole for grades 3 through 8. Norms are not 
furnished separately for the four parts. The typ- 
ical achievement of pupils at the start of the 
third grade is represented by 25 questions an- 
swered correctly, and at the end of the eighth 
grade by 62 questions answered correctly. The 
range of 5 grades from the end of the third grade 
to the end of the eighth grade is 27 points, which 
means that one point of score in this range repre- 
sents approximately two months of achievement. 
No data were available on the number of pupils 
in each grade on whom the norms are based, or 
the geographical or other type of distribution 
they represent. The class record provides spaces 
for scores on the four parts of the test. It is 
doubtful that, with only 20 questions per part 
covering a range of six grades, the scores on 
the separate parts are reliable enough for indi- 
vidual measurement. 

The scoring is based on the clever device of 
a code word. Each choice is preceded by a letter, 
the letters differing from question to question. 
The letter that is part of the code word indicates 
the correct answer. This scheme is undoubtedly 
a time saver for hand scoring. No evidence is 
available on errors in scoring by this device. It 
runs the risk of making the test useless if pupils 
learn the scoring scheme. The score is the num- 
ber of questions answered correctly. 

There is no time limit, although the instruc- 
tions carry the comment that the time generally 
taken is 40 minutes and will vary according to 
age and grade. 

This test may have value when used as part 
of a battery of achievement tests or for instruc- 


- tional purposes to determine what further learn- 
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ing individual pupils need on specific items of 
content. 
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*Patient’s Self History Form. Physicians and hos- 
pital administrátors; 1948; І form ; $2 per 25; 15¢ per 
specimen set; cash orders postpaid; Oliver E. Byrd; 
Stanford University Press. * : 
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*Personal Health Inventory. Grades 1-16; 1947; 


$2 per 25; 1 5€ per specimen set; cash orders postpaid; 
Oliver E. Byrd; Stanford University Press. * 


WirLAnD W. Parry, Dean, School of Health, 
Physical Education, and Recreation, Indiana 
University, Bloomington, Indiana. 

This inventory is a device for securing health 
information about a student that supplements 
the facts obtained by doctors', dentists', and 
nurses' examinations. This information, based 
on remembered facts and symptoms rather than 
medical tests, is given by the high school or col- 
lege student himself or, in the case of elementary 
school students, by his parents. While directly 
concerned with the health of each individual, the 
questions asked are varied and quite compre- 
hensive and do not duplicate the findings of the 
usual examination. 

The author recommends that the inventory 
form be filled in at the time of school entry an 
that it become a part of the school records of 
the pupil. This health inventory should be ac- 
cessible to the school physician as an aid to diag- 
nosis and recommendations at the time of medi- 
cal examinations; to school nurses as а partial 
basis for health counseling; to amy teacher or 
other member of the school staff in order to 
understand health problems of students; and to 
health educators as an aid to individualizing 
health education. The inventory should be ап 
open, continuous record and should be kept uP 
to date by the student or parent concerned. ^ 

As a whole the inventory is well organized 
and clear in meaning. One may raise the question 
of ambiguity concerning an item such as “Drink 
alcohol?’ to which some students or parents 
might answer “no” although they habitually con- 
sume drinks with some alcoholic content. A1- 
other ambiguous question is “Use drugs of any 
kind?" No doubt the author refers to such se 
ously harmful substances as marijuana, opium, 
heroin and other similar narcotics. But many 
other substances are drugs and are used for 
legitimate purposes. js 

The subjectivity of responses to such questions 
as “Is your home a good one?" and “Do yo" 
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worry much?" may be considered a weakness. 
In the section Nutrition the respondent may be 
prejudiced in his responses by the preliminary 
statement : “Check yourself against the National 
Research Council’s Daily Food Guide.” Impli- 
cations as to proper answers are clear and place 
pressure upon the respondent’s honesty. 

Perhaps the most serious adverse criticism of 
the Personal Health Inventory is the incom- 
pleteness and lack of balance in the section 
Health Habits. The preliminary question con- 
cerning habits of sleeping is the only one where 
a beneficial habit is investigated. The remaining 
five questions inquire about the use of alcohol, 
tobacco, coffee, tea, and drugs. It would seem 
that the habits investigated should be more var- 
ied and extensive. Opportunities for the respond- 
ent to develop an accurate individual health habit 
pattern by indicating beneficial as well as detri- 
mental practices might well be given. 

No manual of directions for the use and in- 
terpretations of results of this inventory is avail- 
able, Such a publication would be helpful to 
those who wish to obtain the optimal value of 
the inventory as a health service and health edu- 
cation tool. 

The Inventory’s chief weaknesses seem to be: 
(a) absence of a manual, (b) underemphasis of 
good health habits in the inquiry list, (c) in- 
clusion of some questions which are so subjec- 
tive as to be low in validity and reliability, (d) 
a few ambiguous questions, and (e) a few ques- 
tions where directions may influence responses. 
Nevertheless, the Personal Health Inventory as 
a whole seems to be a very useful tool in a health 
program. 
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*Sex Knowledge Inventory: Experimental Edi- 
tion. Marriage counselors and teachers of sex educa- 
tion; 1950; 2 parts (labeled Forms X and Y); no 
norms; $2.50 per S ecimen set; postage extra; Gelolo 
McHugh; Family Life Publications. * 

а) FORM X: FOR MARRIAGE COUNSELING. Knowledge of 
13 areas: general, sex-act techniques, the hymen, pos- 
sible causes of poor sexual adjustment, sex dreams, 
birth control, sterilization and circumcision, menstrua- 
tion, conception-pre nancy-child-birth, superstitions- 
misconceptions-misin ormation, masturbation venereal 
diseases, effect of menopause on sex life; 75d per test; 
separate answer pads must be used ; $2.25 per 25 an- 
swer pads; $1.50 per manual; nontimed (45) minutes. 
b) ковм Y: VOCABULARY AND ANATOMY. 2 scores: vo- 
cabulary, anatomy ; $2.25 per 25; nontimed (25) min- 
utes, — 


Атвквт Ertis, Consulting Psychologist, 56 
Park Ave., New York 16, New York. 

This inventory is a new objective test which 
attempts to discover some of the basic factual 
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information and attitudes, which the subject has 
about human sex relations. Form Y of the in- 
ventory consists largely of an achievement test 
in sex physiology and hygiene. It is a fairly com- 
prehensive instrument which appears to serve 
the purpose for which it was intended. Only two 
minor objections could be found to it: (а) It 
defines frigidity as a “low degree of sex desire 
—especially in- women.” Actually, however, 
frigid women may have considerable sex desire 
but little or no capacity for fulfilling that desire 
in satisfactory orgasmic release. (b) Promis- 
cuity is defined as "sex activities with more 
than one person during the same period." This 
is quite inaccurate since promiscuity actually 
means indiscriminate sex activities. An indi- 
vidual may have sex relations with two or three 
persons during the same period while still re- 
maining quite discriminating in his or her sex 
choices. 

Form X has been developed specifically for 
use in conjunction with marriage counseling. It 
covers information on a wide variety of sexual 
topics and, considering that it is a pioneering 
test, it does so in a competent manner. If it were 
to be employed as a screening instrument to dis- 
cover which subjects out of an unselected group 
were in need of specific sex education and coun- 
seling, the inventory would probably serve a 
useful purpose. Its actual function, however, is 
stated to be its use with individuals who come 
for marriage counseling ; it is only supposed. to 
be used, according to the author’s accompanying 
manual, as an adjunct to such counseling, This 
reviewer is heartily opposed to using the Sex 
Knowledge Inventory in this manner, for the 
following reasons: 

a) The inventory, it is claimed, will save the 
counselor’s time “by providing quickly a com- 
prehensive picture of his counselees’ sex knowl- 
edge and attitudes, pointing out those areas 
where counseling is most needed." This time- 
saving advantage of the inventory appears to be 
quite chimerical : it will take the counselor up 
to a half-hour to check the subject's responses 
with the answers in the manual and, when he 
has done so, he will have only a most partial 
picture of his counselee’s sex knowledge and 
attitudes. 

b) The inventory and its manual seem to as- 
sume that all counselees will have fairly similar 
sex problems and that any subject who cor- 
rectly answers most of the questions will have 
virtually no serious sex or marriage problems. 
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Both these assumptions are not substantiated by 
the experience of the present writer or of any 
of the experienced marriage counselors of his 
acquaintance. 

c) While the inventory and the manual are, 
in general, good enough as far as they go, both 
fall short in the probing of sex fears, supersti- 
tions, prejudices, fixations, ignorances, disturb- 
ances, and emotional blockings and rigidities 
which frequently preclude sexual and marital 
adjustment. The manual gives mild lip service 
to the consideration of deep-seated psychologi- 
cal feelings and attitudes which normally under- 
lie sexual and marital difficulties ; but, when ex- 
amined more closely, it is seen to be a distinctly 
undynamic, superficial document. 

d) The philosophy behind the inventory and 
the manual seems to be that sex problems are 
almost entirely based on ignorance of sex facts 
and that virtually anyone who has read a few 
books on sex is qualified to do sex and marriage 
counseling. Nothing could be further from the 
truth. The practice of marriage counseling, to be 
done effectively, should be limited to those who 
have the deepest possible understanding of hu- 
man nature and who are specifically trained and 
supervised in the practice of psychotherapy. Mc- 
Hugh's inventory and manual, by encouraging 
marriage counseling by virtually any "teacher, 
doctor, minister, or other counselor" (and never 
even mentioning the role of the clinical psycholo- 
gist, psychiatrist, or psychiatric social worker !), 
is likely to add considerably to the existing chaos 
and lack of professional standards in this area. 

e) While McHugh, has managed to avoid 
most of the namby-pamby, puritanical sex atti- 
tudes which are still so common in our sex 
educational literature, he has also neglected most 
of the hard hitting information which would 
serve to allay the serious sex fears that com- 
monly overwhelm millions of our citizens. His 
treatment of masturbation, unconventional sex 
behavior, impotence, sexual deviations, and 
other troubling aspects of human sexuality is 
either nonexistent or carefully toned down. Any 
sexually disturbed individual who is counseled 
by one of McHugh’s disciples would apparently 
have little chance of getting solid answers to 
most of his questions or of resolving his more 
serious difficulties. 

In sum: while Gelolo McHugh is to be com- 
mended for his pioneer job of constructing a 
sex knowledge inventory which, when used as 
a screening device, might well serve the purpose 
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of discovering many individuals who are in need 
of sex and marriage counseling, his attempt to 
use this inventory as ап integral part of the 
counseling procedure itself is most inadvisable 
and is likely to lead to considerably more harm 
than good. Any marriage counselor who is so 
naive and so nonclinically oriented that he can 
make good counseling use of this inventory and 
manual most assuredly has no business being 
a marriage counselor—or any type of psycho- 
therapist—at all. 

Marriage & Family Living 13:45 w’51. Don- 
ald F. Schroeder. * a useful instrument for the 
trained marriage counselor. In the hands of the 
untrained it may be easily misused. Tt is assumed 
that the counselor has an adequate background 
in the biological, psychological and cultural sci- 
ences and the interrelations in the area of the 
psychosexual. The “Marriage Counselors Man- 
ual"....serves as a guide in the use of the Inven- 
tory but is not a textbook for the untrained. * 
In marital counseling I have found the Inven- 
tory of value as a means of discovering beliefs 
and attitudes which I had failed to catch through 
other methods. There has been as much value 
in noting the reasons for answering some ques- 
tions “correct” as well as those which were “in- 
correct." Some questions checked as "best" in 
the Inventory were marked although the reasons 
and attitudes for so marking were unsatisfac- 
tory. A discussion of these reasons often gave 
valuable leads for follow-up in later sessions. Of 
course, not all the questions in the Inventory 
need to be discussed. I feel the Sex Knowledge 
Inventory is a step in the right direction in pro- 
viding marriage counselors with instruments In 
the improving of their counseling. 
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zel; NEA Service, Inc. * 

а) THE BABY GRID: A GUIDE TO INDIVIDUA 
DURING INFANCY. Ages birth-3; 1946-48. MS 
b) GRID FOR EVALUATING PHYSICAL FITNESS IN TER a 
OF PHYSIQUE (BODY BUILD), DEVELOPMENT LEVEL A {5 
BASAL METABOLISM—A GUIDE TO INDIVIDUAL no 
FROM INFANCY TO MATURITY. Ages 2-18; 1940 te in 
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ferred to by publisher as the “Big” grid; ауа! 
white or buff. í 
REFERENCES S КАНЕ 

т. WETZ Norman C. “Physical Fitness 1 ‘de to 
Physique, Development and Basal Metabolism: With а Guide d 
Individual Progress From Infancy to Maturity: A No * 
for Evaluation. NT Am Med Assn 11611187295 аав and 

2. ETZEL, Norman C. “The Simu eou: fi 
Assessment of School Children." 7 Health & Phys Ed 13:57 
7 "42. j 
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з. Hatt, SUE Marion. A Comparative Study of the Carlson 
Fatigue Curve Test, the Brouha Step Test, the Wetzel Grid, and 
the Medical Examination, Doctor's thesis, Ohio State University 
(Columbus, Ohio) 1948. (Abstracts of Dissertations... Autumn 
Quarter-W inter Quarter 1947-48, 1949, pp. 263-8.) 

4. WETZEL, Norman C. The Treatment of CO M Failure in 
Children: An Application of the Grid Technique. Cleveland, 
Ohio: NEA Service Inc., 1945 Pp. xi, 102. * 

5. GRUENINGER, Rosert McKixLev. Physical Performance of 
High School Boys and College Men Classified by the Wetzel 
“Grid for Evaluating Physical Fitness." Doctor's thesis, Uni- 
versity of Michigan (Ann Arbor, Mich.), 1949. Abstract : 
илне Abstracts 9:139-40 no 2 '49. * (PA 24:3988, title 
only t 

6. KROGMAN, WILTON MARION, A Handbook of the Measure- 
ment and чег нон of Height and Weight in the Growing 
Child, pp. 53—03. Monographs of the Society for Research in 
Child Development, Vol. 13, No. 3, Serial No. 48. Evanston, Ill.: 
Child Development Publications, the Society, 1950. Pp. ix, 68 
Paper, lithotyped. * (PA 25:6061) 

7. WETZEL, Norman C. “Growth,” BR 513-69, In Medical 
Physics, Vol. I, Edited by Otto Glasser. Chicago, Ill.: Year Book 
Publishers, Inc., 1950. Pp. xxvi, 1227. * 

8. ENGLISH ‘Horace B. Child Psychology, 
York: Henry Holt & Co., 1951. Pp. xiv, 561. * 

9. MiLLER, KENNETH D. "The Wetzel Grid as a Performance 
CM With College Men." Res Q 22:63-70 Mr "sr. * (PA 
25:7221 


pp. 282-7. New 
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*Width-Weight Tables, Second Revised Edition. 
Ages 1 and over; 1936-40; 1 form, '40; 75¢ per copy; 
1-99 record charts, 6¢ each; 100 or more, 5€ each; cash 
emet postpaid ; Helen B. Pryor; Stanford University 
ress. 


REFERENCES 

т. Jenss, RAcHEL M., AND SOUTHER, Susan P. Methods of 
Assessing the Physical Fitness of Children: A Study оў Cer- 
tain Methods Based on Anthropometric, Clinical, and Socioeco- 
nomic Observations Made of 713 7-Year-Old White Boys and 
Girls in New Haven, Conn., Over a of 19 or 20 Months 
During 1934-36. United States Department of Labor, Children’s 
Bureau, Bureau Publication No. 263. Washington, D.C.: Gov- 
ernment Printing Office, 1940. Pp. vi, 121. Paper. * 


For excerpts jrom reviews of the original edi- 
tion, see 38:B455 and 36:B211. 
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*Cooperative Test in Foods and Nutrition. Col- 
lege; 1947-51; IBM; Forms X ('50—ѕате as test copy- 
righted in 1947) ; Y (48); no data on reliability and 
validity; directions for administering ['50]; general 
Cooperative manual ('51) ; norms (750) ; $2.50 per 25; 
separate answer sheets must be used; 8o¢ per 25 IBM 
answer sheets; 15¢ per scoring stencil; cash orders 
postpaid; 50€ per specimen set, postpaid; 80(90) min- 
utes; prepared by the Subcommittee on Foods an 
Nutrition (Clara Brown Arny, Ercel Eppright, Eliza- 
beth Hepworth Feniak, Henrietta Fleck, Doroty 
Schnell, Esther F. Segner, and Willa Vaughn Tinsley 
of the Evaluation Committee, American Home Eco- 
nomics Association; Cooperative Test Division, Edu- 
cational Testing Service. * 


REFERENCES 
1, Report on the National College Home Economics Testing 


Retort Seng азе Erasm ка: p d 17: Paper: н. 
Ковккт L, Eset, Director, University Exami- 
nations Service; and Associate Professor of Ed- 
ucation, State University о} Iowa, Iowa City, 
Тоша. | { , * 

The items in this test place emphasis on prac- 
tical information and practical situations, as the 
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following paraphrases of typical questions will 
indicate. 

What are the best conditions for storing winter 
onions? 

When meat supplies are limited, what group in the 
population should be given first consideration? 

What misleading techniques are illustrated by the 
following statements from advertisements ? 

What time schedule should be followed in preparing 
a dinner for guests? 

How should dishes be arranged on the serving table 

for a buffet luncheon? _ 
Special praise is due the test authors for their 
effective use of complex problem situations in- 
volved in such activities as marketing, planning 
a dinner, and arranging a table setting as bases 
for related groups of practical questions. 

A few flaws appear in the item writing. There 
are too many items similar to those below, 
stressing terminology and insignificant factual 
details. 

What is the vitamin which the body may absorb less 
of when mineral oil is used in salad dressings ? 

What is the name of the best method for preparing 
sirloin steak? 


How many calories are used when one gram of pure 
carbohydrate is burned in the body? 


Which disease is transmitted from cows to humans 

through unpasteurized milk? 
This difficulty results mainly from a preference 
for items which have very brief responses. A 
canon of good item writing is violated in the 
items dealing with definitions. The authors usu- 
ally place the definition rather than the term to 
be defined in the stem of the item ; such a reversal 
often makes the items somewhat artificial and 
ineffective. 

The test contains groups of related items hav- 
ing a common set of responses for different item 
stems. In most cases this approach is highly ef- 
fective, but in the series of items dealing with the 
correlation between variables it leads to some 
awkwardness. In this group of items the com- 
mon set of responses is: 

1. There will be an increase 

2, There will be a decrease 

3. There will be little or no change 
The implication of one item that under certain 
conditions “there will be an increase” in the 
color of carrots is a bit odd. In other instances 
the correlation could be expressed more effec- 
tively if the attempt to fit item stems to a set 
pattern of responses were abandoned and if 
each item were written as a separate unit. 

In general, the responses keyed as correct 
appear thoroughly defensible as best answers. 
The distractors appear likely to attract poorly 
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informed examinees. One may wonder, however, 
if “the market price per unit of weight" is ac- 
tually a better guide in buying an unfamiliar 
brand of food than “your grocer's recommenda- 
tion." 

Regional and national norms based on sub- 
stantial numbers of institutions and students are 
reported. The usefulness of the norms is some- 
what limited by the fact that “many institutions 
seemingly tested only selected groups of stu- 
dents.” Reliability coefficients, item analysis 
summaries, and other analytic data which would 
be useful in demonstrating the quality of the 
tests are not provided. 

Despite these minor shortcomings, the tests 
give evidence of competent construction, They 
should prove to be highly satisfactory instru- 
ments for measuring achievement in the field of 
nutrition. 
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*Cooperative Test in Home Management. Col- 
lege; 1950-51; IBM; Form X ('50) ; no data on relia- 
bility and validity; directions for administering ['50]; 
general Cooperative manual ('51) ; norms C$o ; $2.25 
per 25; separate answer sheets must be used; per 
25 IBM answer sheets; 15€ per scoring stencil; cash 
orders postpaid; 50€ per specimen set, postpaid; 40 
Go) minutes; ЕЕ by the Subcommittee оп Home 

anagement (Clara Brown Arny, Elizabeth Crandall, 
Jean Muir Dorsey, Irma H. Gross, Hazel M. Hatcher, 
and Florence McKinney) of the Evaluation Commit- 
tee, American Home Economics Association; Cooper- 
ative Test Division, Educational Testing Service. * 

REFERENCES 
1. Report on the National egi Home Economics Testing 


Program, Spring 1950. Princeton, J.: Cooperative Test Divi- 
sion, Educational Testing Service, 1950. Pp. i 17. Paper. * 
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*Cooperative Test in Household Equipment. 
College; 1950—51; IBM; Form X (750); no data on 
reliability and validity; directions for administering 
p ; general Cooperative manual ('51) ; norms ('50) ; 

.25 per 25; separate answer sheets must be used ; 
per 25 IBM answer sheets; 15¢ per scoring stencil; 
cash orders postpaid; 50¢ per specimen set, postpaid; 
40(50) minutes; prepared by the Subcommittee on 
Household Equipment (Clara Brown Arny, Daisy 
Davis, V. Enid Sater, Bernice Strawn, and Virginia 
Van Nostrand) of the Evaluation Committee, Ameri- 
can Home Economics Association; Cooperative Test 
Division, Educational Testing Service. * 

REFERENCES 
1. Report on the National College Home Economics Testing 


Шы ыш еее ОР Re. tt in Puer ана 
Farru Маррем, Associate Professor in House- 
hold Equipment, Iowa State College, Ames, 
Towa. 

The directions for administering and scoring 
are clear cut, easy to follow, and adequate. How- 
ever, the section marked “Directions for the 
Student” contains symbols and marks not ex- 
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plained and, so far as can be observed, having 
no particular function. The test can be easily 
scored by hand or machine. Percentile norms 
are presented. The directions do not indicate the 
objectives of the test or the subject matter areas 
which it covers. 

The format of the test is pleasing. Most items 
are keyed soundly and are free of ambiguity. The 
items, largely of the recognition type, do not 
require the student to indicate the principles in- 
volved. Many of the items are in story form; 
this results in items which are unnecessarily ver- 
bose. As an example, Item 34 states that “Mrs. 
Potter has a new gas range with a white porce- 
lain top. The porcelain between the burners has 
become crazed. This is probably the results of 
Mrs. Potter's.." Eleven words could be re- 
moved by stating the problem as follows: “The 
porcelain between the burners of a new gas 
range has become crazed. This is probably the 
result of..." 

The publishers describe this test in their cata- 
log as follows: “This test covers the material 
generally taught in college classes preceding 
professional work. Areas included are: knowl- 
edge of terms relating to household equipment 
and labels ; selection, use and care of equipment; 
planning and evaluating arrangements of equip- 
ment." An analysis of the test items shows that 
major emphasis is given to a knowledge of 
terms, whereas use and care receive second and 
third place respectively. Selection, planning, and 
evaluating arrangements of equipment are Ш- 
cluded in four to six test items each. Further 
analysis reveals that many important pieces of 
equipment commonly found in the home of to- 
day are not considered. There are no items 0" 
freezers, clothes dryers, automatic washers; 
none on many utensils of major importance 10 
the kitchen such as cutlery, surface cooking Ware 
and measuring devices. Only two items relate to 
storage and none to the overall problem of the 
fundamental principles of arranging equipment 
in the kitchen. Lighting, a major factor 11 M 
ducing fatigue and preventing accidents, 13 no 
included. 

"This test would be improved (а) by the au- 
thors' stating the objectives on which the pd 
based and the areas of subject matter covere | 
(b) by including items testing applications, sinc 
educational aims should include the use as W°" 
as the recognition of knowledge; (¢) by a 
creasing the number of items in order to aad 
the field more adequately ; and (@) by distribu 
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ing the items more equally among the various 
areas. 


Victor Н. Nott, Professor of E dücation, Mich- 
igan State College, East Lansing, Michigan. 

"This test consists of 70 items, about two thirds 
being of the multiple choice type and about one 
third of the classification or matching type. It 
purports to measure “knowledge of the selec- 
tion, use, and care of various kinds of household 
equipment and appliances." Although no data 
on reliability or validity are given, the test has 
face validity. Whether the knowledge covered 
can truly be said to belong in courses in home- 
making or might be better taught by the physi- 
cist ‘as а part of a course in household physics 
seems debatable. Many items seem designed to 
measure understanding of the principles under- 
lying the operation and maintenance of house- 
hold appliances rather than mere “gadgeteer- 
ing." 

In scoring the test, a correction for guessing 
is made by subtracting one third of the wrong 
answers from the right answers. This implies a 
four-response item ; but the majority of multiple 
choice items in this test have five responses re- 
quiring a different correction ; moreover, a third 
of the items are classification or matching items 
which do not lend themselves to this correction. 
It would seem better to score this test simply 
the number right. 

Percentile norms are provided for five regions 
of the United States. They reveal fairly substan- 
tial differences in achievement between some 
regions, both as to central tendency and variabil- 
ity. The range of scores is from 8 to 63 for all 
regions, with a mean of 36. The test may be a 
little too easy, though the range does not seem 
to indicate that it is. Many of the test items could 
probably be answered by the average college 
freshman, even though he or she had had no 
formal courses in physics or home economics ; it 
would be interesting to test this hypothesis. 

New and up-to-date tests of achievement in 
home economics have long been needed. It is 
hoped that the project of which this test is a part 
will provide such tests in multiple forms for col- 
lege students and, eventually for the secondary 
level. 


[494] 
XCooperative Test in Textiles and Clothing. 
College; 1948-51; IBM; Forms X (49), Y (48); 
no data on reliability and validity ; directions for ad- 
ministering [50]; general Cooperative manual ('51) ; 
norms ('50); $2.50 per 25; separate answer sheets 


must be used; 80€ рег 25 IBM answer sheets; 15¢ per 
scoring stencil; cash orders postpaid; 50¢ per specimen 
set, postpaid ; 87 (100) minutes ; prepared by the Sub- 
committee on Textiles and Clothing (Clara Brown 
Arny, Frances S. Goldsmith, Edna Meshke, Jane Sad- 
dler and Elizabeth Tarpley) of the Evaluation Com- 
mittee, American Home Economics Association; Co- 
operative Test Division, Educational Testing Service. * 
REFERENCES 


т. Report on the National College Home Economics Testing 
Program, Spring 1950. Princeton, .J.: Cooperative Test Divi- 
sion, Educational Testing Service, 1950. Рр. ili, 17. Paper. * 
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*Foods I, Food Selection and Preparation: State 
High School Tests for Indiana. High school; 
1934-51; Form C [531]; Forms A, B out of print; 
mimeographed ; no data on reliability ; no manual ; 4% 
per test; 15¢ per specimen set} postpaid; 55(60) min- 
utes; Mary T. Swickard and Muriel G. McFarland ; 
State High School Testing Service for Indiana, Pur- 
due University. * 


For a review by Hester Chadderdon of the 
1936 edition, see 38:1029. 


[496] 
*Foods II, Planning for Family Food Needs: 
State High School Tests for Indiana. High school ; 
1941-48; Form C [48]; Forms A, B out of print; 
mimeographed ; no data on reliability ; no manual ; 4 be 
per test; 156 per specimen set; post aid; $550) min- 
utes; Mary T. Swickard, Muriel . McFarland, and 


Louise Gentry; State High School Testing Service for _ 


Indiana, Purdue University. * 


[497] 
*Graduate Record Examinations Advanced 
Home Economics Test. Senior year college through 
graduate school ; 1946-51 ; available only in the Insti- 
tutional Testing Program of Graduate Record Ex- 
aminations (see 527); 105(125) minutes ; Educational 
Testing Service. * 

[498] 
«Нер; With Food in the Home: State High 
School Tests for Indiana. Grades 7-8; 109. 1-48; 
Form А [48]; mimeographed; no data on reliability ; 
no manual; ad per test; 15¢ per specimen set; post- 
paid; 28(35) minutes; Elizabeth Anderson and Muriel 
С. McFarland; State High School Testing Service for 
Indiana, Purdue University. * 


Jean D. AMBERSON, Professor of Home Eco- 
nomics Education and Head of the Department, 
The Pennsylvania State C ollege, State College, 
Pennsylvania. $ 

This test is one of a series of standardized sub- 
ject matter tests based upon Indiana courses of 
study. The тоб well worded items are related to 
specific home situations described in a phrase 
or in a sentence or two. Items of the true-false, 
matching, and completion types are all included. 

The test is intended for use with both boys 
and girls of the seventh and eighth grades but 
the language of many of the items indicates that 
the writers were thinking chiefly of girls. The 
scope of the test is fairly wide and includes those 
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phases of the food problem with which pupils 
at these grade levels might be expected to assist 
at home. The length of the test is also well 
adapted to these grade levels, but the desirability 
of timing the test is open to question. Directions 
are clear and concise. 

As a test of knowledge of facts and principles 
related to the problem of helping with food in 
the home, this test would seem to accomplish its 
purpose. The assumption is that accurate infor- 
mation is essential to the intelligent performance 
of activities of daily home living. Modern educa- 
tional thinking would remind us that evaluation 
of pupil growth should extend beyond this point. 
Other goals of achievement in this area of home 
living, such as ability to discover and solve a 
problem, apply principles, formulate sound 
judgments, make adaptations to existing cir- 
cumstances, develop acceptable standards of 
workmanship, and work cooperatively, should 
also be recognized by the users of the test as im- 
portant aspects of pupil development. 

The scoring of the test is quite simple and 
speedy. Norms, based upon performance of pu- 
pils in Indiana schools, are provided, making 
comparisons among groups in the state possible. 
Nevertheless, since within any state or region 
in the country, programs in education for family 
living, to be functional, require that purposes, 
experiences, and ideas be keyed to the local 
needs of homes and people, tests made by the 
teacher, if she understands evaluation and its 
function, will likely be more valid for her own 
pupil groups than any standardized test. 
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Helping With the Housekeeping: State High 
School Tests for Indiana. Grades 7-8; 1941-45; 
Form A ['45]; mimeographed; no data on reliability ; 
no manual; 36 per test; 15€ per specimen set; post- 
paid ; 28(35) minutes; Evelyn Swaim, Kathleen Mc- 
Gillicuddy, and Muriel G. McFarland; State High 
School 'Testing Service for Indiana, Purdue Univer- 
sity. 

HESTER CHADDERDON, Professor of Home Eco- 
nomics Education, Iowa State College, Ames, 
Iowa. 

This test, developed for use in Indiana, is 
based on the following objectives presented in 
the State Course of Study: knowledge of some 
of the housekeeping responsibilities ; knowledge 
of appropriate procedures to use in various 
housekeeping activities; understanding of how 


to increase safety in the home; and knowledge. 


of first aid treatment for common emergencies. 
Although these objectives are pertinent to any 
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junior high school program, the extent to which 
they are accepted as important in a given school 
usually depends on the amount of time devoted 
to home economics. 

Three sets of norms are furnished : one based 
on 206 pupils in short term school year, one on 
141 pupils in long term school year, and the 
third combining the two. The small number of 
cases limits the usefulness of the norms even in 
Indiana. 

Since families can vary in their solutions to 
such problems as dividing home responsibilities, 
it is difficult to develop a sound key. If the test 
required responses in terms of factors to be con- 
sidered rather than merely solutions to specific 
problems, this dilemma might be avoided. The 
items dealing with the housekeeping, safety, and 
first aid also fail to provide an opportunity to 
determine whether there is an understanding of 
the principles involved. 


[ 500] 

*Home Care of the Sick Test: State High School 
Tests for Indiana. High school; 1941-48; Form 
[48]; Form A out of print; mimeographed; no data 
on reliability; no manual; 41€ per test; 15€ per speci- 
men set; postpaid; 55(60) minutes; Louise Gentry, 
Gleela Ratcliffe, and Muriel G. McFarland; State 
High School Testing Service {ог Indiana, Purdue 
University. * 


[501] 
*Housing the Family: State High School Tests 
for Indiana. High school; 1941-48; Form B ['48]; 
Form A out of print; mimeographed; no data on гё 
liability ; no manual; 4€ per test; 156 per specimen set; 
postpaid; 55(60) minutes; Jeannette O. Parvis, De 
th S. Kurtz, and Muriel G. McFarland; State Hig! 


School Testing Service for Indiana, Purdue Univer- 


sity. * 

Jean D. Амвевѕом, Professor of Home Eco- 
nomics Education and Head of the Department, 
The Pennsylvania State College, State C ollege, 
Pennsylvania. 

This test is one of a series designed for 0p- 
tional use by the high schools of Indiana. Its 1 
well stated items sample widely the field of 
family housing and assume comprehensive study 
in this area by the group of students to be tested. 
The testee must pick from a group of stated facts 
or principles those which are correct or true as 
they relate to a specific situation. These situa- 
tions, though promoting objectivity in response 
are limited in the factors portrayed. The vae 
bles in a family situation may be many. It hok, 
be desirable, then, if students are to become real- 
istic about family problems, to help them develop 
abilities to form judgments about housing the 
family, knowing that “best family practice 
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varies in terms of many factors and that no an- 
swer is true for all situations. As seen most 
clearly in Part IV, the technique of the test does 
not give adequate recognition to this need. 

Though no purposes are stated (a serious 
omission from the standpoint of both the person 
selecting a test and the person being tested), a 
study of the test indicates that knowledge of 
facts and principles is one objective being tested 
and ability to judge good practice is another. 
The question to be asked is this: Does the test 
provide sufficient evidence of change in student 
behavior for those judging achievement to say 
with some degree of assurance that the student 
has or has not attained the objective? It is gen- 
erally assumed that a test may provide reliable 
evidence by giving the student opportunity to 
demonstrate the behavior many times in varied 
situations. If the number of aspects of the hous- 
ing field handled by the test were reduced, 
greater emphasis placed upon fewer facts and 
principles, and many situations offered the 
student to demonstrate recognition of these facts 
and principles or to judge when a principle is 
operating, more dependence could be placed on 
the evidence. This proposal is made in the belief 
that learning experiences of students in relation 
to family housing are less superficial than this 
test implies in its present form. 

Additional important objectives which might 
be considered in teaching high school students 
аге: (a) to understand and to explain the rea- 
sons for choices made in given situations; (b) 
to apply principles in varied situations; (c) to 
do critical thinking in a problematic situation ; 
and (d) to develop intelligent attitudes toward 
housing as it affects family welfare. It is to be 
hoped that this test is only the beginning. of a 
group of instruments that will assist the teacher 
in evaluating these objectives as well as those 
measured by this test. 

Timing the test, which for classroom use may 
seem feasible and practical, leads the reader to 
assume that the authors believe speed to be a 
significant goal; this belief might well be chal- 
lenged. 

The authors have helpfully provided norms 
for the test in terms of percentile and standard 
scores. Teachers of family life education may 
need considerable assistance in interpreting in- 
telligently the very probable differences between 
the distribution of scores in their classes and 
the norms caused by such factors as selection of 
students and reduced size of class. 


In summarizing, it may be said that the test, 
by dealing with many aspects of the field of fam- 
ily housing, makes the teacher and the student 
aware of the scope of this fairly new field, but 
that the question of whether depth or breadth 
of learning should be the major concern must 
be decided. The test presents opportunity to 
gather evidence about the student's knowledge 
of facts and principles and his ability to judge 
good housing practice but it requires a state- 
ment of the objectives to which it is related. Tt 
should be supplemented by additional tests re- 
lated to other objectives in the housing field. 


[502] 
Nutrition Information. Grades 3, 4-6, 7-8; 1947- 
50; 3 levels; 1 form, 147; no data on reliability; manual 
(47); norms (50); distribution gratis; postpaid; 
nontimed (40-50) minutes; prepared under the direc- 
tion of Clara Brown Arny; Willa Vaughn Tinsley ; 
Department of Public Services, General Mills, Inc. * 
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Mechanical Drawing Test: State High School 
Tests for Indiana. High school; 1944-45; Form А 
(44) ; mimeographed ; no data on reliability and valid- 
ity; no manual; norms [45] ; s¢ per test; 156 per spec- 
imen set; postpaid; 80(85) minutes; Justus Rising; 
State High School Testing Service for Indiana, Pur- 
due University. * 

Wis J. Micueets, Professor of Industrial 
Education, University of Minnesota, Minne- 
apolis, Minnesota. 

This is a good test but one that is in dire need 
of some mechanical improvement. Being a test 
in mechanical drawing, it should exemplify the 
best drawing practices. Its failure to do this is 
the result of the use of the mimeograph process 
for duplication rather than of any lack of draw- 
ing skill on the part of the test author. 

Although no information is given concerning 
test objectives, it appears that the test has been 
designed to evaluate the following abilities: (a) 
to understand working drawings and the rela- 
tionship of views, (5) to visualize through se- 
lection of correct end views, (с) to provide 
missing lines, (d) to recognize correct auxiliary 
and sectional views, and (е) to construct draw- 
ings concerning problems related to angles, cir- 
cles, arcs and tangents, isometric drawings, front 
and side views, and auxiliary views. While it 
is intended primarily as an end-of-course test, 
the reviewer can picture teaching situations in 
which it could serve as a very useful pretest as 
well. Its use in both pretesting and posttesting 
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is out of question at present since only one form 
is provided. 

The test is entirely one of application. There 
are no items of a strictly verbal nature. Rather, 
the first part of the test contains six sections in 
which the student is shown a variety of actual 
drawings, and asked to indicate his understand- 
ing by checking correct and incorrect views, 
noting missing lines, and so on. The second part 
contains construction problems in which the 
student is expected to use actual drawing equip- 
ment and complete the problem in accordance 
with the directions given. 

Some of the directions are difficult to compre- 
hend. The insertion of a sample item at the be- 
ginning of each section might help to clarify in- 
volved directions. The poor reproduction job 
makes some items unnecessarily difficult because 
the broken and solid lines are hard to follow. 

The test has a total possible score of 240 
points. One wonders why some of the items are 
weighted in the scoring since most studies on 
weighting items for tests of this kind show that 
little is gained by the weighting. For the greater 
part of the test the scoring is a somewhat cum- 
bersome process. In an effort to simplify scoring, 
the construction part of the test is devised in such 
a manner that it can be scored with the use of a 
stencil ; however, the manipulation of the stencil 
requires considerable figuring. The last page of 
the scoring key contains a set of norms based on 
three groups of T-scores. With the meager sug- 
gestions given, many teachers will be com- 
pletely puzzled when they attempt to interpret 
the scores of their students in terms of the 
T-score norms. 

There is no manual accompanying the test. 
This lack should certainly be remedied. There is 
need to apprise teachers of the competencies 
being sampled and to help them to interpret the 
. scores so that more effective use may be made 

of test results. There is also need to furnish in- 
formation concerning the validity and reliability 
of the test. 

"This test samples primarily those abilities re- 
lated to visualization of correct and incorrect 
views and certain abilities used in actual draw- 
ing construction. The testing situations are aptly 
selected and the items well constructed. In its 
present form the test should do a fairly good 
job of evaluating desirable outcomes of the me- 
chanical drawing course. It could do an even 
better job if it were printed on paper with a more 
practical method of duplication, if a second form 


were made available, and if a well-written man- 
ual were prepared to accompany it. 


[504] 

*Middleton Industrial Arts Test. High school; 
1951; Forms А, В; $1.05 per 25; 20€ per specimen set; 
postpaid; 60(65) minutes; Jean Ellis Middleton and 
H. E. Schrammel (manual); Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria. * 

WiLLIAM J. Micnzzrs, Professor of Industrial 
Education, University of Minnesota, Minne- 
apolis, Minnesota. + 

Each of the 190-item forms contains two parts, 
Part I being made up of true-false items and 
Part II, of multiple choice items. Each part in 
turn is divided into ten sections covering hand 
tools, lumbering, machine tools, abrasives, hard- 
ware, glues and fasteners, joints, wood finish- 
ing, drawing, and crafts. 

The manual states that the test was con- 
structed “to measure basic knowledge outcomes 
of high school students enrolled in industrial 
arts courses.” This statement and the title of the 
test may be misleading to potential users. Ac- 
tually, the test is primarily concerned with the 
area of woodworking. Roughly 75 per cent of 
the items relate to this field; about 15 per cent 
of the items concern mechanical drawing; and 
the remaining 10 per cent touch the areas of 
plastics, metal work, electricity and leatherwork. 
The sampling of these latter fields is very inade- 
quate. For example, in Form A there are 42 
items relating to hand tools in woodworking and 
only two items relating to the whole field of igi 
tricity; and there are 29 items in the area 9 
woodfinishing and only five which sample the 
broad field of metal work. There are no items ОП 
the graphic arts, although this is a part of b 
industrial arts curriculum in many schools. 
There is need for clarification in the title. 

The statement regarding test validity 1s very 
general in nature: r4 

i k content and budget- 
the of cai phe iden рок о in these ШЫ 
is basic to the test's validity. This was supplemen 
by suggestions from instructors....and test construc i i 
specialists. Studies of errors made by students en k 
liminary editions were also made and used in re ot 
the statements and in setting up the two forms 
equivalent difficulty and content. 

Two studies of the test's reliability are ad 
ported with coefficients of .86 and .87. No 02 
are included on the methods used in determining 
the reliability. : cile 

The manual contains a table of percen 


norms based on two sample groups: (4) 112 7 
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students whose scores were reported by 58 co- 
operating schools in the January 1951 Nation- 
Wide Every Pupil Scholarship Testing Pro- 
gram, and (b) 1,082 students whose scores were 
reported by 48 schools in the April 1951 Pro- 
gram. There is also a table of percentile norms 
for class median scores. This provides the user 
with а basis of comparison for individual stu- 
dents and total class results. j 

The test is typical of standardized tests avail- 
able in this field in that it concentrates exclu- 
sively on the evaluation of textbook achievement. 
None of the items endeavors to measure prob- 
lem solving abilities. For example, there are 29 
items relating to drawing, but no attempt is made 
to measure the student’s ability to visualize or 
interpret drawings. Such abilities are more de- 
sirable outcomes of drawing activities than mere 
knowledge of such facts as “Dividers are used 
to transfer distances” (Item 79, Form A). This 
same criticism is applicable to other sections of 
the test. 

Several other inadequacies should be men- 
tioned. The test samples minute details in some 
areas (“The jack plane has an adjustable sole” — 
Item 19, Form B), while glossing over or 
omitting equally important areas. For example, 
the machine tool section in Form B contains no 
mention of the drill press. There are some “so 
what?” items, as, for example, “A saw horse is 
a holding tool” (Item 2, Form А); and a few 
of the true-false items have specific determiners, 
as, for example, “АП saws should be started on 
the push stroke” (Item 12, Form B) and “Any 
cleansing solution used on glass can be used to 
clean plastics” (Item 99, Form B). 

In summary, this test provides a fairly com- 
prehensive sampling of factual type learnings in 
woodworking, with some attention given to the 
area of mechanical drawing. There is an in- 
sufficient number of items related to the other 
areas of industrial arts. Problem solving abili- 
ties are not sampled directly in the test. The 
teacher who wishes a check on the factual re- 
tention of his students will find this test useful 
in the areas listed, and he will have a basis of 
comparison through use of the norms included 
in the manual. 


PHILOSOPHY 


[505]. 
*Graduate Record Examinations Advanced Phi- 
losophy .Test. Senior year college through graduate 
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school and candidates for graduate schools; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 180(220) minutes; prepared by the 
Advanced Philosophy Test Committee appointed by 
Educational Testing Service: John Herman Randall, 
Jr. (Chairman), George D. W. Berry, J. Keith Mc- 
Gary, Herbert L. Searles, and Charles L. Stevenson; 
Educational Testing Service. * 

[506] 
xGraduate Record Examinations Scholastic Phi- 
losophy Test. Senior year college through graduate 
school; 1951; available only in the Institutional Test- 
ing Program of Graduate Record Examinations (see 
527); 105(125) minutes; prepared by the Scholastic 
Philosophy Test Committee appointed by the National 


Catholic Educational Association: Paul C. Reinert 


(Chairman), Sister Rita Marie, Leo R. Ward, and 
Robert J. Henle; Educational Testing Service. * 


PSYCHOLOGY 


[507] 
*Comprehensive Examination in Psychology. 
College; 1949; 1 form; manual ['49] ; $2.25 per 25 sets 
of test and answer sheet; separate answer s eets must 
be used; 2¢ per answer sheet; postage extra ; 35¢ per 
specimen set, postpaid; 50(60) minutes; Mae Pullins 
Claytor; C. A. Gregory Co. » 
Ештн M. HupprEsTON, Supervisor of Test 
Development in Social Studies, Educational 
Testing Service, Princeton, Меш Jersey. l 

This is a test which does much less than jus- 
tice to college curricula in psychology. Limited 
in approach, it picks up a mass of detail at the 
expense of basic concepts and interpretations. 
Many areas stressed in typical undergraduate 
programs are omitted altogether or are repre- 
sented by only a few questions testing isolated 
facts. 

The test contains 140 questions, of which 65 
are under the heading “Vocabulary” and 75 are 
under the heading “General Information.” Ac- 
cording to the description in the catalog of the 
С. A. Gregory Company, “Emotional reactions, 
the effects of heredity, reflex actions, laws of 
learning, gland secretions, tests used in the dis- 
covery of important traits, contributions of emi- 
nent psychologists, statistical data, manias and 
phobias are some of the subjects covered in the 
test.” This would appear to be a fair descrip- 
tion of the test content. However, it does not 
prepare the reader for some of the strange words 
he will encounter in the vocabulary section, or 
for the continued emphasis on detail in the in- 
formation section. ; 

The best coverage is given to physiological 
psychology and to psychological statistics. The 
statistics questions require thought and manipu- 
lation by the student, and for this reason are 
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among the best in the test. While there are 
thought questions on other topics, they are rela- 
tively infrequent. The reviewer would list the 
following fields as those in which coverage is 
seriously deficient : learning, personality, motiva- 
tion, psychology of childhood and adolescence, 
social psychology, applied psychology, compara- 
tive psychology, experimental psychology, the- 
ories and systems of psychology, and scientific 
methodology. In the field of abnormal psychol- 
ogy the questions are adequate in number but 
are unsatisfactory because they are all of the vo- 
cabulary type. 

The corrected odd-even reliability, based on 
a sample of 113 psychology students at Morgan 
State College, was .96 for the total test, 88 for 
the vocabulary section, and .92 for the general 
information section. In the reviewer’s judgment, 
the test is not excessively speeded; no data on 
speededness are provided in the manual. 

No data on validity are given. In a paragraph 
entitled “Validity” the manual states : “The con- 
tent is based on selected facts from various texts 
which are accepted as authentic. Indirect meth- 
ods of validation included the Index of Relia- 
bility which is sufficiently high and indicates that 
the test is measuring true ability in the function 
which it purports to measure.” 

A table is presented for converting raw scores 
to scaled scores, and percentile norms are given 
for a sample of 500 students from four institu- 
tions. The students are “Juniors or Seniors who 
are majors or minors in Psychology, or those 
who have had a number of courses in this area.” 
The norms indicate that the test is much too 
difficult for this particular population; a raw 
score of 72 (from a total of 140 items) is at 
the goth percentile, and the mean score is 51.3 
with a standard deviation of 9.05. While it is 
to be expected that psychology majors in some 
institutions would make higher scores than did 
the norms group, the reviewer would estimate 
that the test is more than optimally difficult even 
for graduate students. 

In summary, the reviewer would regard the 
test as incapable of measuring the desired out- 
comes of an undergraduate program in psychol- 
ogy. 


[508] 
*Graduate Record Examinations Advanced Psy- 
chology Test. Senior year college through graduate 
school and candidates for graduate school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527); 180(220) minutes; prepared by the 
Advanced Psychology Test Committee appointed by 
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Educational Testing Service: Norman L. Munn 
(Chairman), Daniel Katz, Harold Schlosberg, Laur- 
ance F. Shaffer, and Delos D. Wickens; Educational 
Testing Service. * 


[509] 

*Hogan Psychology Test. High school and col- 
lege; 1951; IBM; Forms A, В; $1.05 per 25; 20€ per 
specimen set; separate answer sheets (IBM Form ITS 
1000 B 108) may be used; 856 per 25 IBM answer 
sheets; 25€ per set of stencils for machine scoring of 
answer sheets; postpaid; 50(55) minutes; Irene Ho- 
gan and H. E. Schrammel; Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria, * 


RECORD AND REPORT 
FORMS 


[510] 
*American Council on Education Cumulative 
Record Folders. Grades 1-3, 4-6, 7-12, 13-16; 1928- 
47 ; 4 levels; 6¢ per copy of any one level ; 5e e direc- 
tions for administering for any one level; 30 per bat- 
tery manual ('47); postage extra; I 5¢ per specimen 
set of any one level (does not include battery manual), 
postpaid; battery manual by the Committee on the Re- 
vision of Cumulative Records; prepared by the Com- 
mittee on Cumulative Records of the American Coun- 
cil on Education; American Council on Education. 
а) AMERICAN COUNCIL ON EDUCATION CUMULATIVE 
RECORD FOLDER FOR GRADES I, 2, AND 3. Grades 1-3; 
1944-47; directions for administering [44]. 
b) AMERICAN COUNCIL ON EDUCATION CUMULATIPE 
RECORD FOLDER FOR GRADES 4, 5, AND б. Grades 4-9; 
1944-47 ; directions for administering [44]. 
с) AMERICAN COUNCIL ON EDUCATION CUMULATIVE 
RECORD FOLDER FOR JUNIOR AND SENIOR HIGH seo 
1941 Ln pfit 7-12; 1928-47; directions or 
administerin 7]. 
а) vcn Фонда. ON EDUCATION CUMULATIVE 
RECORD FOLDER FOR COLLEGES, 1942 REVISION. Grades 13 
16; 1928-47; directions for administering [47]- 


Warren R. BALLER, Professor of Educational 
Psychology and Measurements, University 0 
Nebraska, Lincoln, Nebraska. [Review of rec- 
ord folders for grades 1-3 and 4-6.] 4 

RECORD FOLDER FOR GRADES 1-3. The 800 
features of this folder lose much of their effective" 
ness because of two detracting features. These 


are the drab appearance of the printing 21 ое 
unsatisfactory format. The specific EC * 


which the folder was designed to serve x 
harmony with approved guidance practices; y 
they deserve to be more effectively presente ч 

Users of record folders are helped ge 
headings which are prominent and which QU 
quickly the separate divisions. Headings on x 
folder are not consistently helpful; some 14 
poorly placed іп marginal ог other relatively ifs : 
functional positions ; and some are entirely !@ 
ing. 
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Clarity of directions is another desirable fea- 
ture of record folders. While some of the direc- 
tions are not necessarily misleading, they are 
distracting. For example, in the section “De- 
scription of General Behavior” one is instructed 
to write H (High), U (Usual), or L (Low) 
for such traits as Interest and Persistence, and 
this can readily be done. But then to apply these 
marks to Passive, Neutral, and Withdrawn, and 
other negative traits is, to say the least, bother- 
some, It also seems unnecessarily awkward to 
work with four different schemes of trait evalu- 
ation in proceeding through the six categories 
that comprise this section of the folder. 

Perhaps the most disconcerting arrangement 
of the parts of the folder is that pertaining to 
physical health data. Several sections related to 
other kinds of entries were allowed to intervene 
between “General Physical Condition” and 
“Health and Physical Characteristics.” The 
folder is not well organized for the effective use 
of space. There are sections where space allot- 
ments are too restricted; and, by contrast, there 
is a waste area of 23 square inches on the back 
surface of the folder. 

Among the better features of the folder is the 
encouragement which it gives to the assessment 
of development in relation to the handicaps and 
advantages of the individual child. Use of the 
expressions “Above Usual” and “Usual” are 
welcome substitutes for the more common refer- 
ence to “Normal.” Emphasis is also given to 
the importance of recording information about a 
pupil’s achievement other than that which is 
indicated by school marks. 

The section “Description of General Behav- 
ior” invites attention to important aspects of 
the child’s development which too frequently 
do not occupy a place in school records. The 
development of a record of the child’s work 
habits, general conduct, and personal and social 
adjustment is accorded approximately one third 
of the space which the folder provides. Persons 
who believe that a cumulative record should re- 
flect interest in the entire child will approve 
the scope of this section and will also be pleased 
to find a section for the record of the “Significant 
Interests and Experiences” of the child. 

Persons who use the folder will be helped 
considerably by a careful reading of the Manual 
for Cumulative Records and especially the sec- 
tion which pertains to the record of academic 
achievement. Also worthy of attention are ap- 
pendices which illustrate the use of the record 


folders and the comprehensive and well chosen 
bibliography. 

RECORD FOLDER FOR GRADES 4-6. Most of what 
has been said about the folder for grades 1—3 
would apply to the form for grades 4-6. This 
folder, too, clearly was planned to take into ac- 
count the many-sidedness of child behavior and 
development, with special attention to traits of 
personal and social development. It encourages 
the recording of illustrative statements to sup- 
plement the checking processes. It also gives 
encouragement to the viewing of development 
as a continuous process; in this latter connec- 
tion it provides for a summary of the child's 
previous school record and, at another point, 
for suggestions concerning his future. These 
highly desirable features are deserving of a 
more presentable appearance and a more effec- 
tive arrangement. 


AnrHUR H. BRAYFIELD, Professor of Psychol- 
оду and Head of the Department, Kansas State 
College, Manhattan, Kansas. 

According to the manual accompanying the 
record folders, the purpose of the American 
Council on Education in providing these forms 
is “to stimulate improvements in the counseling 
services." Again, it is stated in the manual that 
the committee which prepared the forms hopes 
that they “will stimulate interest in more com- 
plete study of pupils, and that adequate time and 
attention will be given to counseling and guid- 
ance.” Thus the reviewer is invited to evaluate 
the cumulative record folders as tactical tools 
of a major strategy aimed at developing and im- 
proving counseling services. 

Certainly personnel records may furnish a ve- 
hicle for promoting a modern student personnel 
program; as a tactical move, this may be a fruit- 
ful approach. The reviewer has successfully 
used this gambit along with others, such as edi- 
torship of catalog copy, administration of an 
outmoded scholastic delinquency program, and 
control of attendance reporting, to reach the same 
goal. 

More specifically, however, we should ask 
how effective these particular record forms are 
likely to be as operating personnel tools. This 
requires a statement of specific functions to be 
served by the forms. Unfortunately for this pur- 
pose, the committee which prepared the forms 
neglected to provide such specifications ; for ex- 
ample, it did not distinguish between adminis- 
trative and counseling use of records. That there 
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is some confusion on this point of function is 
demonstrated in the following excerpts from the 
manual: “The form is not a substitute for a 
counselor, nor can it be used by a novice. It re- 
quires careful study, understanding, background, 
and training on the part of a counselor" contrasts 
with "It is emphasized that they [the forms] 
should be placed where they can be consulted 
Íreely by faculty members." 

In the absence of a more clearly defined state- 
ment of specific functions, it is not possible to 
evaluate adequately the record form as a per- 
sonnel tool, These points may be suggested, how- 
ever: (a) it is an open question whether the 
items of religion and colot should be included 
as they are on these forms; (b) the problem of 
"privileged information" is not adequately dis- 
cussed in connection with the forms. The com- 
mittee recognizes the need for a "confidential 
letter file kept by the counselor" but does not 
fully consider the question of confidential in- 
formation. In fact, in one place it is suggested 
that some schools have found it possible to use 
student assistants to enter the material on the 
forms. The reviewer would insist that student 
personnel records should not be maintained if 
it is necessary to use student help in their prepa- 
ration; (c) a more practical method of cumu- 
lating information in most operating situations 
is to use blank file folders in which are placed 
from time to time original data, special records, 
or protocols. 

Any school interested in inaugurating or im- 
proving an existing personnel record system 
should start its work with a consideration of 
these materials. The bibliography in the manual 
will prove useful. The forms themselves will be 
highly suggestive of types of information to be 
included. The committee and the American 
Council on Education generously invite schools 
to make whatever use they may desire of the 
materials provided here. 


For a review by Herbert A. Toops of the sec- 
ondary and college forms, see 3:444-5. 


[511] 

California Cumulative Guidance Record for 
Elementary Schools. Grades 1-8; 1944; 4 parts; 
postage extra; developed by the California School 
Supervisors’ Association; A. Carlisle & Co. * 
D BASIC RECORD FOLDER. Form A ['44] ; $11.25 per 250. 
b) HEALTH AND PHYSICAL DEVELOPMENT, BASIC SUP- 
PLEMENT I. Form B ['44] ; $7.50 per 250. 
С) CURRICULAR EXPERIENCES, BASIC SUPPLEMENT II. 

orm C ['44]; $7.50 per 250. 
d) ADJUSTMENT FACTORS, BASIC SUPPLEMENT III. Form 
D [44]; $7.50 per 250. 
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WannEN R. Batter, Professor of Educational 
Psychology and Measurements, University of 
Nebraska, Lincoln, Nebraska. 

This guidance record consists of a basic record 
folder plus three removable supplementary cards 
which provide records of health and physical 
development, curricular experiences, and ad- 
justment factors, respectively. The basic folder 
is printed on good quality, buff manila and the 
supplements appear in distinctive, separate col- 
ors. The general effect of this arrangement is 
pleasing as well as contributory to the service- 
ableness of the record. 

The plan of the record reflects the desire of 
the committee which constructed it to provide 
fully for the dynamic and developmental aspects 
of the pupil's experience. Provision is made at 
numerous points for supplementing the record 
of present status with statements of apparent 
causes, related factors, and recommended follow- 
up action. Superficial or skimpy knowledge about 
a child will not result if this folder is conscien- 
tiously filled out and used. The only discordant 
note in this connection is a tendency, especially 
in the record of health and physical development, 
to emphasize almost exclusively the conditions 
which are unfavorable and need remedial atten- 
tion; the invitation to record evidences of de- 
sirable conditions is, for the most part, lacking. 

Effective use of the record is greatly enhanced 
by the presence of clearly printed, well located 
headings and well arranged divisions. Space 
allocations are consistently adequate and the 
directions which appear on the parts of the 
record are clear. Much attention was given 1n 
the construction of the record to details which 
have considerable importance in the day-to-day 
use of records: the verification of the birthdate 
is to be indicated by checking one or more of 
nine different kinds of evidence, space is pro- 
vided for noting the removal of any part of the 
record and the present location of it, and pro- 
vision is made for indicating the presence in the 
main folder of supplementary material and the 
existence of other related records to be found 
elsewhere. The words, “Attach Signals Here, 
for identification signals, is another of many 
features which attest to the committee's com 
cern for practical considerations. k 

The material of the record is accompanied by 
a 24-page manual which describes in soni 
able detail the processes to be followed in 15 
effective use. It includes many helpful sugges- 
tions about such matters as “Installation of 6 
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Record System,” “Collecting Data,” and “Маіп- 
taining the Record,” and there is a particularly 
valuable part which deals with the relation of 
the form ofa record to its use. 


[512] 
xCumulative Guidance Record. Grades 1-12; 1950; 
$6 per 100; lots of 1000, $3.05 per 100; sample copy 
free; postage extra; Н. & W. B. Drew Co. * 


[513] 
xCumulative Personnel Record. Grades 7-12; 
1946; 3 forms ; cash orders postpaid; prepared by Com- 
mittee on Cumulative Records (William. Н. Bristow, 
Chairman), National Association of Secondary-School 
Principals; the Association. * 
а) CARD FORM. I-99, 5¢ each; 100-499, $2 per 100; 
500-999, $1.75 per 100; 1000 or more, $1.50, Der 100. 
b) FOLDER FORM. I-99, 56 each; 100-499, $5 per 100; 
500-999, $4.50 per 100; 1000 or more, $4 per 100. 
С) ENVELOPE FORM. 1—00, 5¢ each; 100-499, $2.50 per 
100; 500-999, $2 per 100; 1000 or more, $1.75 per 100. 


; [ 514] 

*Ontario School Record System. Grades 1-13; 
1950-51; 6 parts; 8¢ per revised preliminary manual 
(31); 25¢ per specimen set; postage extra; Voca- 
tional Guidance Centre. * 

а) ONTARIO SCHOOL RECORD FOLDER, PART I. Grades 1-8; 
Form OSR-I (0); $1 per 25. 

b) ONTARIO SCHOOL RECORD FOLDER, part II. Grades 9- 
13; Form OSR-II (^50) ; $1.60 per 25. 

с) ONTARIO SCHOOL OFFICE RECORD, PART І. Grades 1- 
13; Form OSOR-I (0); 20€ per 25. 

d) ONTARIO SCHOOL OFFICE RECORD, PART п, Grades 1- 
13; Form OSOR-II (^50) ; 20% per 25. 

е) ONTARIO STUDENT INFORMATION FORM, 
Grades 7-13; I form, ’50; 30¢ per 50. 
f) ONTARIO SCHOOL SUBJECT RECORD. 
Form OSSR (50); 95% per 25. 


[515] 
Permanent Record Card. Gradés 9-12; 1041; 1- 
99, 56 each; 100-499, $2 per 100; 500-999, $1.50 per 
100; 1000 or more, $1 per 100; cash orders postpaid ; 
National Association of Secondary-School Princi- 
pals. * 


PART II. 


Grades 7-13; 


REFERENCES 
т. Exrcxer, PAvr E. "Record Forms for, Secondary Schools." 
B Nat Assn Sec Sch Prin 31 139748 N '47. " 


1-99, 5¢ each; 100-499, 
100; 1000 or more, $I per 100; cash orders postpaid; 
National Association of Secondary-School Princi- 
pals. * 


REFERENCES 


aut E. “Record Forms for Secondary Schools,” 


т, ELICKE 2 


R, Р. 
B Nat Assn Sec Sch Prin 31:39-48 N '47. 


[517] 

*V.G.C. Cumulative Record Folder. Grades kgn- 
8, 7-13; 1944747; 2 levels; $1 per 25 of any one level; 
postage extra; M. D. Parmenter ; Vocational Guidance 
Centre. * ; р 

D) ELEMENTARY FOLDER. 1044; 5f per specimen set | 
b) SECONDARY FOLDER, REVISED. 1944-47 ; 10€ per speci- 
men set. * 
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[518] , 
*Jewish Information Test. Jewish schools; 1944- 
49, C1945; 4 scores: Jewish history, present day Jew- 
ish life, holidays and customs, total; Forms A (47), 
В (45); manual ('49) ; tentative norms ; 10€ per test, 
postage extra; 156 per sample test, postpaid; nontimed 
(30-45) minutes; Noah Nardi; Jewish Education 
Committee of New York, Inc. * 


[519] 
XOpinion Survey. Jewish youths and adults; 1948 
(2) ; 1 form ['48] ; no data on reliability and validity ; 
no manual; no‘norms; 10¢ per booklet, postage extra; 
15¢ per sample booklet, postpaid; (60) minutes; Noah 
Nargis Jewish Education ommittee of New York, 
nc. 


. [520] 
*Peters Biblical Knowledge Test. Grades 9-16 
and adults; 1948; Form A; 2 parts; $1.15 per 2 of 
any one part; 20€ per specimen set; postpaid; 60(65) 
minutes per part; Frank С, Peters; Bureau of Edu- 
cational Measurements, Kansas State Teachers Col- 
lege of Emporia. * 
3 TEST I, OLD TESTAMENT. 
b) TEST II, NEW TESTAMENT. 


SAFETY EDUCATION . 


[521] 
*[American Automobile Association Driver 
Testing Apparatus.] Drivers; 1939-50; individual; . 
12 tests; no description of normative population; tenta- 
tive norms; manual ('50) for all tests except Driver 
Evaluator and Complex Reaction Time; approxi- 
mately 20% discount to schools; postage extra; Amer- 
ican Automobile Association. * 
a) COLOR VISION. 1947; $22 per instrument; (2) min- 
utes. . 
b) DETONATORS. 1939-50; а device which fires .22 cali- 
ber blank cartridges and aids in the measurement of 
reaction and braking distance; $1.35 per 100 blanks ; 
$1.25 per 100 yellow powder marking capsules, 
1) Model A. 1939; a mechanically operated, double- 
barreled instrument; $11 per detonator. 
2) Model B. 1945; an electrically operated, double- 
barreled instrument; $23.50 per detonator. 
3) Model C. 1946; an electrically operated, single- 
barreled instrument measuring braking distance 
only; $15 per detonator. 
4) Model D. 1950; an electrically operated, triple- 
barreled instrument; $31.25 per detonator. 
с) DISTANCE JUDGMENT. 1939; $50 per instrument; 
(10) minutes. 
d) DISTANCE JUDGMENT AND VISION. 19 ; a single in- 
strument incorporating the essential features of tests 
c and 1; $68.75 per instrument ; (15) minutes. 
e) DRIVER EVALUATOR. 1950; а single instrument incor- 
porating the essential features of tests a, c, f, i, and l; 
$195 per instrument ; (15) minutes, 
f) FIELD OF VISION. 1939; $25.75 per instrument; (5) 
minutes. 
д) GLARE ACUITY. 1940; $60 per instrument; (10) min- 
utes. 
h) FINGER REACTION TIME. 1939; discontinued. 
i) FOOT REACTION TIME. 1043; $67.25 per portable in- 
strument; $99.75 per upright instrument; (2) minutes. 
j) COMPLEX REACTION TIME. 1940-49; а single instru- 
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ment measuring the average reaction time for stop- 
ping and turning; manual ('49) ; $234 per instrument ; 
(5) minutes. à ; 
B STEADINESS. 1939; $15.75 per instrument; (3) min- 
utes. 5 
1) VISUAL ACUITY. 1943; $41 per instrument; (4) min- 
utes. 


[ 522] 
*Examination for Driving Instructors. High 
school driving instructors; 1951; Forms A, B; tenta- 
tive norms; 73¢ per 25; specimen set free to qualified 
test users; separate answer sheets may be used; 74€ 
per 250 answer sheets; postage extra; nontimed (45) 
minutes; American Automobile Association. * 


[523] 1 
*General Achievement Test in Fire Safety: Na- 
tional Fire Prevention Tests. Grades 4-8, 9-12 and 
adults; 1940-42; 2 levels (labeled Forms 1A and 2A) ; 
I form; Form 3А out of print; no norms; manual 
['42] ; 756 per 25; 15€ per specimen set; postpaid; 20 
(25) minutes; Charles C. Hawkins; Center for Safety 
lucation, New York University. * 
5 FORM IA. Grades 4-8 


; 1940. 
FORM 24. Grades 9-12 and adults; 1942. 

[524] 
*Rating Scale for Automobile-Driver Skills: 
The Abercrombie Driver Test. Student drivers; 
1940; individual; 1 form; no data on reliability and 
validity; no manual; no norms; 756 per 25; 5€ рег 
specimen set; postpaid; (30-45) minutes; Stanley A. 
Abercrombie ; Center for Safety Education, New York 
University. * 

[525] 

*Revere Safety Test, Second Edition. Industrial 
employees especially in the nonferrous metal indus- 
tries; 1947-50; 5 scores: general safety, piling-carry- 
ing-traffic, tools, machine operation, total; 1 form, '49; 
no norms for part scores; manual (’50) ; 33€ per test; 
separate answer sheets must be used; з? рег 25 ап- 
swer sheets; 10¢ per scoring stencil; per safety 
handbook; 756 per specimen set; cash orders postpaid ; 
20(25) minutes; Revere Copper and Brass, Inc. and 
the Psychological Evaluation and Services Center, 
IM University; Science Research Associates, 
nc. 


Wirrarp A. Kerr, Associate Professor of Psy- 
chology, Illinois Institute of Technology, Chi- 
cago, Illinois. 

The Revere Safety Test is definitely a train- 
ing and discussion aid rather than a selection 
test. Admitting this limitation, much must be 
said in its favor. It and its accompanying aids 
are ingeniously constructed and even glamor- 
ously printed. 

The authors offer no validity coefficients, but 
do provide some validity evidence which this re- 
viewer was able to convert into rough validity 
coefficients. The manual gives frequency dis- 
tributions of test scores for 239 accident free 
workers and 90 workers who had had accidents. 
Converting these data into a tetrachoric coeffi- 
cient of correlation resulted in a validity coeffi- 
cient of .22, which suggests only a slight tend- 


ency for the accident free to score higher on the 
test. The fact that the test correlates .30 with 
intelligence scores brings the validity into even 
sharper question. 

Also provided in the manual are frequency 
distributions on the test for three levels (95 ap- 
plicants, 103 operators, 40 supervisors) of per- 
sonnel by rank. Breaking these data into a 
tetrachoric scatterplot resulted in a validity co- 
efficient of .54 between scores and rank. But this 
coefficient obviously is contaminated by both ex- 
perience and intelligence. Were these two vari- 
ables held constant, the new coefficient probably 
would not exceed .25. One other bit of evidence 
is the declining trend of accident frequency and 
severity in Revere Copper and Brass, Incorpo- 
rated from 1945 through 1949. Since use of the 
test was not the only change during that period, 


` the trend cannot reasonably be attributed to it, 


particularly in view of the fact that those years _ 
witnessed the transformation of an emergency— 
and partly marginal—labor force into а more 4 
efficient labor force with possible accompanying - 
technological improvements. i 

The case for the reliability of this 20-minute — 
all-pictorial test seems as strong as the case for 
its validity is weak. Split half coefficients of .94 
and .98 are reported. 

On the positive side, also, it must be strongly 
emphasized that the authors apparently intend 
the test primarily as a safety training aid. Using 
it in the selection battery to help make the new 
worker safety conscious may be worthwhile. Its 
great value, in any case, appears to stem from 
the fact that the answer sheet is handed back 
to the worker along with a 57-page booklet 
which is keyed to the answer sheet so that the 
worker can look up the correct safety practices 
which he violated in making errors on the test. 
This is a valuable contribution to the training 
efforts of the industrial safety supervisor. 


Hanorp С. SeasHore, Director, Test Division, 
The Psychological Corporation, New York, 
New York. 

This test was designed to measure the 
worker's knowledge of good safety practices. It 
is advocated primarily as an effective device 11 
a safety education program, although its us¢ 
as a selection tool is suggested. k 

The items are closely related to a handboo 1 
on safety rules put out by Revere Copper 2" 
Brass, Incorporated. While the test may have 
general use, it is most appropriate for the n0? 
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ferrous metal industries and more particularly 
for those industries having the same safety rules 
as the Revere organization. 

There are 192 true-false items printed in 
groups of three. “Joe Brasco's" cartooned safety 
behavior is judged as being right or wrong. The 
three items in each group all cover the same 
topic, but each is to be marked separately. The 
items fall into four categories: general safety; 
piling, carrying, and traffic; tools; and machine 
operation. Two answer sheets are available, 
which are alike except that one has page refer- 
ences printed under each answer cell referring 
to the page in the handbook which states the 
rules, Presumably a company which had safety 
rules at variance with these could still use the 
test and discuss the differences in rules in a 
training session. A company could key the test 
differently if it wished. 

The authors encounter the usual difficulties 
inherent in trying to make a simple picture ade- 
quately tell a relevant story. While satisfactory 
as stimulus materials for safety discussions, 
some of the pictures appear inadequate as test 
items. For instance, the third picture in Item 18 
is said to show “safe behavior” while the second 
one in Item 20 is considered “unsafe behavior.” 
The subtle distinction seems to be in the detail 
of whether the simple tools in the rear pockets 
can be “safely carried” as shown. One wonders 
whether item analysis has been carried out on 
populations of factory workmen other than those 
who have been through the Revere training pro- 
gram. 

The standardization information in the manual 
is not sufficiently complete to permit the reviewer 
to evaluate the data properly. It is reported that 
37 per cent of Revere employees scored above 
140. One cannot tell whether the test was given 
before, during, or after learning the local rule 
book, If based on applicants, these data show 
that the test is too easy since 37 per cent have 
test scores within the small range from 140 to 
a perfect score of 162; if based on post-training 
cases, such a high percentage of high scores 
probably would be satisfactory. Good test man- 
uals describe the groups on which interpretive 
data are presented. ] 

Тһе reliability (of the experimental form) is 
given as .98 on a split half basis. Scoring each 
set of three cartoons as a single unit (that is, 
making this a 54-item test) yields a split half 
coefficient of .94. Both of these values seem 
much too high for this type of test material. The 
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publisher has suggested to the reviewer that the 
high coefficients are probably due to a wide 
range of scores and to the relatively large stand- 
ard deviations, which resulted in a more or less 
rectangular distribution. Before releasing the 
test, the publisher should have checked the re- 
liability of the current edition by applying it to 
a new and non-Revere sample. 

One validity study (again, with the experi- 
mental edition) shows a small difference in mean 
scores between 239 accident free employees and 
go having one or more accidents. Quite prop- 
erly the authors stress that because of the large 
overlap in these distributions, the test is only 
a rough indicator of accident proneness. In an- 
other study the results show that a management 
group scored higher than an operating group, 
and the latter scored higher than a hiring group. 
As additional validity information, the manual 
reports the reduction in accident frequency rates 
at the Revere plant over a period during which 
this test has been used. It is not clear how much 
of this reduction is due to the use of the test as 
a selection device, and how much is due to an 
improved safety program, including the use of 
the very clever safety handbook. 

The only norms are shown in a table of stand- 
ards which divides the range of scores into six 
levels. Two other tables of distributions are 
given in relation to the validation studies noted 
above. Neither of these is properly labeled as 
being derived from the experimental form. 

The test is shown to correlate .30 with Re- 
vere’s Otis-like test of intelligence for a sample 
of 78 persons. This is the only bit of reported 
standardization data based on the edition of the ` 
test being sold. 

Although the Revere Safety Test is an in- 
teresting and perhaps acceptable device to mo- 
tivate safety training, its standardization is in- 
adequate and its usefulness as a test is not as yet 
demonstrated. 


TESTING PROGRAMS 


[ 526] 
*College Entrance Examination Board Admis- 
sions Testing Program. Candidates for college en- 
trance; 1901-51; also called College Boards; IBM; 
2 parts; tests administered five times annually (De- 
cember, January, March, May, August) at centers 
established by Educational Testing Service; applica- 
tion form and bulletin of information may be obtained 
from Educational Testing Service; morning sessions 
only (Scholastic Aptitude Test), $6; afternoon session 
only, (achievement and special aptitude tests), $8; both 
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morning and afternoon sessions, $12; fees include 
scoring service and reporting of scores to any three 
designated colleges; scores not reported to examinees ; 
$т per 1-3 additional reports; tests administered in 2 


sessions; programs administered by Educational Test- 
ing Service for the College Entrance Examination 
Board. * See separate test entries referred to below 
{ог additional information, references, and reviews. 

а) SCHOLASTIC APTITUDE TEST. 1926—51; 2 scores: 
verbal, mathematical; 180(210) minutes ; ЕЯ һу 
the Staff of Educational Testing Service. See 285. 

b) ACHIEVEMENT AND SPECIAL APTITUDE TESTS. 1901- 
51; candidates elect I to 3 tests as specified by indi- 
vidual college requirements; 13 achievement tests pre- 
pared by College Entrance Examination Board Com- 
mittees of Examiners: English Composition (see 178), 
Social Studies (see 662), French Reading (see 237), 
German Reading (see 244), Greek Reading (see 248), 
Italian Reading (see 249), Latin Reading (see 250), 
Spanish Reading (see 259), Biology (see 600), Chem- 
istry (see 617), Physics (see 633), Intermediate Math- 
ematics (see 368), Advanced Mathematics (see 367) ; 
2 special aptitude tests prepared by the Staff of Educa- 
tional Testing Service: Pre-Engineering Science 
Comprehension (see 812), Spatial Relations (see 
808) ; 60(70) minutes per test. 

REFERENCES 


1. Weaver, LiLLAS JEAN, AND STOKE, Stuart M. “Тһе New 
C.E.E.B. Examinations." J Higher Ed 16:37-8 Ja '45. * 

2. TURNBULL, WiLLIAM W. “A Normalized Graphic Method 
T TAM Analysis." J Ed Psychol 37:129-41 Mr '46. * (PA 20: 
2 

е . TURNBULL, WiLLIAM W. “Tests of the College Board." 
, сд Ва К 8:99-102 Му E * (PA 2: 35951) 
. AsHBURN, Frank D. “How Do You Test a Student?” 
Atlantic Mo 1863537 Л ’so. * L 

s. Снлихскү, Henry. “From High School Into College.” 
J Nat Assn Deans Women 13:124-33 Mr '$о. * 4 

6. Furss, CLAupE M. The College Board: Its First Fifty 
Years. New York: Columbia University Press, 1950. Pp. vii, 
222. * (London: Oxford University Press.) (РА 25:3418) 

7. LANDIS, JOHN, AND SWINEFORD, FRANCES. “Questioning the 
Questions." Col Board R ib Je 'so. MEA 25:3419) 

8. TURNBULL, Mary E. “Building the Board's Tests." Col 
Board R 10:126-8 Mr *so. * (PA 25 33424 

о. Кил, FRANK R. “The Value of College-Entrance Exam- 
inations to the Student." Sch & Soc 74:247-50 О 20 '51. * 
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*The Graduate Record Examinations. Sophomore 
year college through graduate school and candidates 
for graduate school; 1937-51; tests administered in 2 
programs; Educational Testing Service. * See separate 
test entries referred to below for additional informa- 
tion, references, and reviews. 
а) NATIONAL PROGRAM FOR GRADUATE SCHOOL SELEC- 
TION. Candidates for graduate school; 1942-51; tests 
administered four times annually (October, February, 
May, and August) at centers established by Educa- 
tional Testing Service; application form and bulletin 
of information may be obtained from publisher ; 3 parts; 
$8 per any one part; $10 per any two parts; $12 per 
three parts; fee includes scoring service and reporting 
of scores to examinee and any three schools designated 
at time of application; $1 per each additional report re- 
quested at that time or any time thereafter; tests ad- 
ministered in 3 half-day sessions. ^ 

1) Aptitude Test. 1949-51; 2 scores: verbal, quanti- 

tative; 180(220) minutes; prepared by the Staff of 

Educational Testing Service. See 293. 

2) Profile Tests. 1937-51; 6 scores: physics, chem- 

istry, biological science, social studies, literature, fine 

arts; IBM; 240(260) minutes; prepared under the 

direction of Educational Testing Service. See 10. 

3) Advanced Tests. 1939-5? ; candidates may elect 1 

of 17 tests: Biology (see 604) ; Chemistry (see 620), 

Economics (see 672), Education (see 797), Engi- 

neering (see 811), French (see 241), Geology (see 

630), German (see 247), Government (see 703), 
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History (see 690), Literature (see 186), Mathe- 
matics (see 374), Philosophy (see 505), Physics 
(see 637), Psychology (see 5505), Ociology (see 
709), Spanish (see 263); Fren i 
Spanish 105(125) minutes per test; all others, 180 
_ (200) minutes per test; all tests (except French, 
"German, and Spanish) prepared by Advanced Test 
Committees appointed by Educational Testing Serv- 
ice. 
b) INSTITUTIONAL TESTING PROGRAM. Sophomore year 
college through graduate school; 1937-51; tests avail- 
able for institutional testing at any time except during 


weeks National Program for Graduate School Selec-. 


tion is scheduled; institution must agree to test all 
students in at least one administrative group; applica- 
tion form and handbook for deans and examiners may 
be obtained from publisher; 4 parts; Aptitude Test 
only, $2 per student; Profile Test only, $2 per student; 
advanced tests (each), $2 per student; any two of the 
receding, $3 per student; any three of the preceding, 
EA т student; Tests of General Education, alone or 
with one advanced test, $3 per student; fee includes 
scoring service, 3 individual score reports on each stu- 
dent, and 1 alphabetical report of entire group; admin- 
istration must be completed in one week. 

1) Aptitude Test. See a above. 

2) Profile Tests. Sce a above. 
3) Advanced Tesis. Same tests as for a above, plus 
Agriculture (see 442), Fine Arts (see 219), Home 
Economics (see 497), Music (see 227), Scholastic 
Philosophy (see 506); the 5 additional tests 105 
(125) minutes per test; Scholastic Philosophy and 
Music Tests prepared by Advanced Test Committees 
appointed by Educational Testing Service. 
4) Tests of General Education. 1946-51; 9 scores: 
general mathematics, physical science, biological sci- 
ence, social studies, literature, arts, effectiveness of 
expression, vocabulary, general education index; 4 
(540) minutes in 2 sessions. See 11. 


REFERENCES 
1. "The Graduate Record Examination." Sch & Soc 54:24172 


S 27 41. * 

$ Thé Graduate Record Examination: A Memorandum ой 
the General Character and Purpose of the Examination Includ- 
ing a Summary of Initial Studies of Its Validity. New York: 
Carnegie Foun dation for the, Advancement of Teaching. 1941. Pp. 
iii, 38. Paper. 15: Ч H 

3. CRAWFORD, ‘ALBERT BEECHER, Measuring Promise 1% 
Graduate Work: А Report оў Studies Undertaken at Yale 9; 
versity With Reference to Validity of the Graduate Record 
ente Мет Haven, Coana Departinent of Personn 

tui ale University, 1942. Pp. iv, бо. Paper. > 

ETERSON, STUART e The Measur t and Prediction of 

Schotaatic аи оп ae Стана el. Doctor's thesis, 

niversity of Iowa (Iowa City, Iowa), 1943. ‘ae 

5. LEARNED, WiLLIAM S. “Graduate Record Examination, an 
Returning Service Personnel," pp. 25-36. In the Thirty- 
Annual Report, 1943-44, ца the Carnegie Foundation 1 | 
Advancement of Teaching. New York: the Foundation, [1944:+ 
Pp. vii, 169. * ther 

6. Reen, Homer B. “Why Do Some Colleges Reach A Hig! Й 
Level of Achievement Than Others?” J Ed Res 38:161-72 Mad 
* (PA 19:1348) Same: Trans Kans Acad Sci 471-21 A п,” 

PACEPA; R. H. “The Graduate Record Examination: 
J Higher Ed 16:447-8 N '45. * ination 

8. LEARNED, WILLIAM & “Graduate Record Ехатіпа 197 
eit pp. 11-4. In the Forticth Annual Report, 1944-45. p 
the Carnegie Foundation for the Advancement of Teac 
New York: the Foundation, [1945]. Pp. vii, 140. Higher 

2 Улоснм, К. W. “Measuring the Outcomes of wi 
Education," pp: 14-40. In the Fortteth Annual Report, 194407 
0 the Carne i a oundation P. we won of Teac 

ew York: the Foundation, [1 , Pp. vii, 140. А 

10. SPEER, GEORGE S. “The Use of the Graduate à A 
Tanani in me зена fe Gredeate Engineer Stude: 

ng 37:313 6, 21:129 

11, The Ыыы - Жеме. 1298) te Students No JM 
P Wen Tests, 1945-1947. Graduate Record Examination, eee 
Pal чч No. i rineeton, N TS rawcations! Testing 9€ 

ecember 1947. Pp. ii, 27. Paper, lithotyped. ” 

12. VAUGHN, K. W. “The rosse Record Office, 194641], 
рр. 28-32. In the Forty-Second Annual Report, 1946-47, О, 
SOME юну " и аванса of Teaching. 

ork: the Foundation, [1947]. „ v, 136. + sation 

13. Heston, Joserx C. “The "Graduate Record Examinatio 
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ch, German, and. 
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Vs. Other Measures of Aptitude and Achievement.” Ed & Psy- 
chol Meas 7:618-30 au "f tee 2213210) 7 

14. VauGHN, К, W. The Graduate Record Examination: A 
Statement of Policy to Cooperatin Colleges and Universities. 
Graduate Record Office Bulletin, No. 1. rinceton, N.J.: Edu- 
cational Testing Service, February 1947. Pp. v, 17. Paper, * 

1s. Улисин, К. W. “The Graduate Record minations.”” 
Col & Univ 23:307490 547. * (PA 22:4124) j * 

16. VaucHN, К. W. “The Graduate Record Examinations.” 
Ed & Psychol Meas AES, w 47. * (PA 23:315) y 

17. VAuGHN, К. W. erformance on the 1947 Professional 
Aptitude Test: A Norms Bulletin for Committees on Admission 
in Colleges of Medicine, New York: Graduate Record Office, 
October 1947. Pp. i, 24. Paper, lithotyped. * 

18. VAUGHN, К. W. “The Projects of the Graduate Record 
Office,” pp. 41-50. (PA 22:3216) In National Projects in Edu- 
cational Measurement: A Report of the 1946 Invitational Con- 
ference on Testin Problems, New York City, November 2, 
1946, Herschel T. anuel, Chairman. Edited by K. W. Vaughn. 
Sponsored by the Committee on Measurement and Guidance. 
"American Council on Education Studies, Vol. 11; Series 1, Re- 
ports of Committees and Conferences, No. 28. Washington, D.C.: 
the Council, August 1947. Pp. vii, 8o. Paper. * 
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19. The Performance of Senior Students on the Advanced 
Tests, 1947. Graduate Record Examination Norms Bulletin, 
No. 3. Princeton, N.J.: Educational Testing Service, April 1948, 
Pp. 1, А Paper, lithotyped. * , 

20. The erformance of Senior Students on the Profile Tests, 
1946 1940 Graduate Record Examination Norms Bul etin, No, 
2. Princeton, N.J.: Educational Testing Service, April 1948. 
Pp. i, 9. Paper, lithotyped. * 

21. Heston, Josep C. “The Graduate Record Examination 
vs. Other Measures of Aptitude and Achievement." J Ed Res 
41 133847 I * (PA 22:3643) б 

22. The Performance of Sophomores and Seniors on the Tests 
of General Education, 1946-1948. Graduate Record Examination 

orms Bulletin, No. 4. Princeton, N.J.: Educational Testing 
Service, February 1949. Pp. ii, 29. Paper, lithotyped. * 

23, LANNHOLM, GERALD V. AND SCHRADER, WiLLIAM B. 
Predicting Graduate School Success: An Evaluation of the Ef- 
(eene of the Graduate Record Examinations. Foreword by 

епгу Chauncey. Princeton, N.J.: Educational Testing Service, 
1951. Pp. 50. Paper. * (PA 25 16470) 

_24. STUIT, Dewey B., AND ‘Peterson, Stuart С. “The Pre- 
diction of Scholastic Success in the Graduate College of the State 
University of Iowa.” Col & Univ 26:265-79 Jarit" 
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Reviews sy Warren R. Baller, Robert Murray Bear, Ivan A. Booker, Harold D. Carter, 
Frederick B. Davis, Robert L. Ebel, John C. Flanagan, C. M. Fleming, Gustav J. Froehlich, 
Eric F. Gardner, J. Raymond Gerberich, William S. Gray, J. Thomas Hastings, James R. Hob- 
son, Katherine G. Keneally, Martha E. Layman, Margaret G. McKim, Clarence H. Nelson, 
Victor H. Noll, Jacob S. Orleans, Worth J. Osburn, M. L. Kellmer Pringle, Henry D. Rins- 
land, Holland Roberts, Н elen M. Robinson, David H. Russell, F. J. Schonell, William D. Sheldon, 
Louis Shores, Henry P. Smith, Nila Banton Smith, William W. Turnbull, Henry Weitz, and 
Frederick L. Westover. 
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XA.C.E.R. Silent Reading Test, Form С: Grades 
3-6; 1946-50; 3 parts; Form C [46]; manual [ 50]; 
8s. 4d. per 10 sets of all parts; od. per key; 5s. per 
manual; 6s. 6d. per specimen set; cash orders postpaid 
within Australia; Australian Council for Educational 
Research. * 
а) PART I, WORD KNOWLEDGE. 2$. 3d. per 10; 10(20) 
minutes. 
b) PART 2, SPEED OF READING. 2$. 7d. per 10; 6(10) 
minutes, ч 
с) РААТ 3, READING FOR MEANING. 45. per 10; 20(30) 
minutes, E g 
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*Achievement Tests in Silent Reading: Domin- 
ion Tests. Grades I, 2, 2-3, 3-4. 4-6, 5-6; 1941-50; 
$1 per 2 of апу one test; 256 рег manual (separate 
manuals for a and b, c and d, e and f, g, h, and i; post- 
age extrà; prepared b the Department of Educational 
Research, Ontario Co lege of Education, University of 
Toronto; distributed by Vocational Guidance Centre. * 
а) PRIMARY: TYPE I, WORD RECOGNITION. Grade 1; 
1941-50; Forms A (C41), В (41) ; manual (45: re- 
vised norms (’50) ; 356 per specimen set; IO 20) min- 


utes. , 
b) PRIMARY: TYPE II, PHRASE AND SENTENCE READING. 


Grade т; 1941-50; Forms A 40; В (лї); manual 
(4D ; revised norms (^50) ; 35€ per specimen set; 20 
30) minutes. \ : 

с) PRIMARY: TYPE III, PARAGRAPH READING, Grade 1; 
1943-49; 1 form, '43; manual ('45);, revised norms 
(^50) ; 30€ per specimen set; 30(40) minutes. 

d) PRIMARY: TYPE IV, DIAGNOSTIC TEST IN WORD RECOG- 
nition. Grade 1; 1943-50; Form А (43); manual 


(245) ; revised norms ('50) ; 30¢ per specimen set; 30 
(40) minutes. 

e) ADVANCED PRIMARY : TYPE I, VOCABULARY. Grades 2- 
3; 1943-46; Forms А (743), В (43) ; manual (746); 
35€ per specimen set; EE minutes. 

f) ADVANCED PRIMARY: TYPE П, DIAGNOSTIC TEST IN 
PARAGRAPH READING. Grade 2; 1943-46; Forms 
С), В (743) ; manual (746) ; 35€ per specimen set; 
30(40) minutes. 

9) TYPE П, DIAGNOSTIC TEST IN PARAGRAPH READING. 
Grades 3-4; 1946-47; Forms А (46), B (^46) ; pre- 
liminary manual (47); 356 per specimen set; CST 
minutes. 

h) JUNIOR: TYPE I, VOCABULARY. Grades 4-6; 1943-46; 
Forms А (43), В (43) ; manual (746) ; 35€ per speci- 
men set; 20(30) minutes. 

i) TYPE II, DIAGNOSTIC TEST IN PARAGRAPH READING. 
Grades 5-6; 1948; Forms A, B; preliminary manual ; 
35€ per specimen set; 30(40) minutes. 


Henry P. Situ, Associate Professor of Edu- 
cation, and Director of the Reading Laboratory, 
The University of Kansas, Lawrence, Kansas. 

These tests, designed for use in Canadian 
schools, show strong relationship to the Gates 
reading tests in form, vocabulary, and general 
testing procedure; and the first-grade tests could 
well be additional forms of the Gates Primary 
Reading Tests. 

Norms based on the performance of rural and 
urban groups of Canadian children are pro- 
vided. To the extent that the authors have 
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achieved their goal of adapting the vocabulary 
and action of the reading material to the back- 
ground and interests of Canadian children, they 
have a strong case for using these tests in Ca- 
‘nadian schools but weaken the value of the tests 
for use elsewhere. However, the difference in 
the vocabulary and reading material employed 
in these tests and in the Gates and similar tests 
is not so great as to be noticed readily. In cer- 
tain of the tests, notably the tests of paragraph 
reading and of vocabulary, rather clever tech- 
niques have been employed for identifying the 
specific reading difficulties of the child. 

The quality of paper and printing, clearness 
of pictures, and general format is excellent. The 
statistical analysis accompanying the tests is 
complete and clear. 

The care which appears to have gone into the 
selection of items, the analysis of results, the 
preparation of norms, and the handling of other 
details of test construction and standardization 
is pleasing to the critical user of tests. The bat- 
tery appears to be a worthwhile contribution. 


For a review by Margaret G. McKim, see 
3:476. 


[530] 
*California Reading Test. Grades 1-4.5, 4-6, 7-9, 
9-14; 1933-50; a revision of Progressive Reading Tests 
(see 40:1563); a subtest of California Achievement 
Tests (see 2); 3 scores: reading vocabulary, reading 
comprehension, total; IBM for grades 4-14; 4 levels ; 
Forms AA (о), BB (зо), CC (so), DD (50); 
manuals ( 50); = сан; 356) per specimen - of 
any one level, postpaid; Ernest W. Tiegs and Willis 
W. Clark; Сайогша Test Bureau. * lí 
a) PRIMARY. Grades 1-4.5; $1.50 per 25; 30(35) min- 
utes, 
b) ELEMENTARY. Grades 4-6; $1.50 per 2 ; separate 
answer sheets may be used; 4¢ per IB a 
7¢ рег Scoreze answer sheet; боё per stencil for та- 
chine scoring of answer sheets; 20€ per stencil for hand 
scoring of answer sheets; 35(40) minutes. 
с) INTERMEDIATE. Grades 7-9; prices same as for Ele- 
mentary ; 50(55) minutes. 
d) ADVANCED. Grades 9-14; Forms AA, BB, CC only ; 
prices same as for Elementary; 50(55) minutes, 


Jonn C. FrANAGAN, President, American Insti- 
tute for Research; and Professor of Psychology, 
U niversity of Pittsburgh; Pittsburgh, Pennsyl- 
vania. 

As one step of preparation, this reviewer read 
previous reviews of earlier editions of this test. 
It was encouraging to note that a number of 
points raised by previous reviewers in The 
Mental Measurements Yearbook have now been 
corrected by the authors and publishers. For ex- 
ample, a review of earlier editions mentioned 
that standard errors of measurement were not 


mm. 
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reported. The manual for the 1950 edition re- 
ports reliability coefficients for a single grade 
range and also standard errors of measurement 
in terms of grade scores. These are given for 
the reading vocabulary, the reading comprehen- 
sion, and the total reading scores on the various 
levels of the test. 

Another criticism made by a reviewer of one 
of the early editions was that there was no warn- 
ing of the unreliability of the seven subscores 
used in the Diagnostic Profile. The manual for 
the 1950 edition states : "Because of the limited 
number of items (10-30), scores on the seven 
sections of this test should be used only as guides 
to indicate the presence of student difficulties in 
the seven major diagnostic areas." 

Although it is likely that the reviewer of the 
earlier edition would like to have a more specific 
warning than that quoted above, and would 
probably also like to see standard errors of meas- 
urement for the seven sections of the test which 
are so prominently featured in the Diagnostic 
Profile, these additions to the manual should be 
helpful to those trying to interpret the test re- 
sults, 

The manuals for all four levels contain the fol- 
lowing statement : “Careful study of tryouts in- 
dicates that most students (about 90%) can 
respond to the test situation and reveal their 
skill mastery within the time limits given in eac 
section, and the remaining 10% complete all the 
items they are capable of comprehending before 
the time limits are reached ; hence, the test 15 2 
power rather than a speed test.” Earlier editions 
of the test have been criticized because, although 
specific time limits were given, the examiner was 
told to stop the students when 90 per cent 0 
them had finished. Although it has certainly been 
established that speed and power, or level of test 
ability, are positively correlated to a substanti 
extent, the statement that all people not complet- 
ing the test within the time limits would have 
finished all the items they were capable of com- 
prehending seemed somewhat arbitrary. 

To test this statement іп a crude sort of W3), 
an eighth grade student was given Form AA H 
the Intermediate level. This student was НЙ 
amined under standard conditions, except ; ii 
at the end of each time limit he was given 2 sy 
tional time to attempt any items not prev 
tried. For the time limits given in the manua». 
reading vocabulary score was grade 7.2, 2^ 00. 
reading comprehension score was grade жо 
With additional time he got 11 more vocab 
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items and 9 more comprehension items correct. 
Thus, his scores under power-test conditions 
were grade 8.3 and grade 13.5 in vocabulary and 
comprehension, respectively, and total reading 
grade score was raised from 8.1 to 9.7 when he 
was given additional time. This represents a 
change in his percentile rank in the low eighth 
grade from below the soth percentile to above 
the 85th percentile. Form CC of the Advanced 
level was given to a high school senior. This 
student finished practically all of the parts within 
the time limits allowed. Although the above in- 
dividual cases cannot be accepted as having 
demonstrated that the Intermediate level is not 
a power test for most students, they do suggest 
that somewhat more objective and specific in- 
formation about the extent to which students 
might profit from additional time would be de- 
sirable in the manual. 

Ап examination of the test items suggests that 
they should do fairly well in discriminating be- 
tween individuals with respect to their knowl- 
edge of vocabulary and ability to comprehend 
what they read. The better students might, how- 
ever, find some items rather annoying with re- 
spect to the instructions given to them. For ex- 
ample, in selecting the second word in a group 
which means the “opposite or about the oppo- 
site” of the first word, the student is asked to 
pick “solid” as being the opposite of “plane,” 
“obtuse” as being the opposite of “acute,” “bot- 
any” as the opposite of “Zoology,” and “cylinder” 
as the opposite of “prism.” 

The content of the test appears to have been 
changed very little from that in the Progressive 
Reading Tests published in 1933, 1937; and 1943. 
For more detailed comment on the content and 
other aspects of the test, the reader is referred 
to the reviews of earlier editions. The norms for 
the 1950 edition are based on somewhat larger 
populations than were those for earlier editions. 
According to the manual, more than 100,000 
cases were used in standardizing each of the 
three higher levels; and the Primary level was 
standardized on more than 50,000 cases. The 
manual does not describe the manner in which 
the school districts were selected, but states that 
they were “of all sizes and types throughout the 
United States.” ЁЁ, 

Although the test user might wish for some- 
what more precise technical information regard- 
ing the test, it is this reviewer’s opinion that he 
will find the California Reading Test a valuable 
tool in appraising the progress of pupils with 


respect to these important skills of vocabulary 
and reading comprehension. 


James R. Hosson, Director of Child Placement, 
Brookline Public Schools, Brookline, Massa- 
chusetts. 

This successful and widely used series of read- 
ing tests, appearing under a new name, possesses 
some general and some specific virtues as well 


‚ аз some of the faults which seem common to 


most popular standardized tests. The most com- 
mon of these common faults is the almost total 
absence of proof of validity. Now, this reviewer 
has no doubt that these tests are valid within 
reasonable limits. However, to begin a discus- 
sion on validity with the statement, “All levels 
and forms of the California Achievement Test 
Series possess a high degree of validity,” and 
to end it with an invitation to write to the Cali- 
fornia Test Bureau for “further and more de- 
tailed data” with little specific information be- 
tween gives the impression that this subject is 
to be glossed over. It would be much more to 
the point to introduce briefly such specific evi- 
dence as correlation studies showing the rela- 
tionship with scores on other widely used read- 
ing tests, both group and individual, at specific 
grade levels; the power of prediction of later 
school success in such school subjects requiring 
well developed reading skills as literature or so- 
cial studies; or even the identification of the 
recognized experts in the field of reading whose 
pooled judgments determined what items should 
be included. 

The next important criticism is the grade 
range covered by each test. The greatest grade 
range is in the Primary test, where it does the 
most harm. This covers grades 1 through lower 
4. Such a wide range to be covered by the same 
test results in a spuriously high reliability co- 
efficient so far as any one grade is concerned 
and, as pointed out by Frederick B. Davis in 
The 1940 Mental Measurements Yearbook (see 
40: 1563), also results in “unreliability of sub- 
test scores based on only a small number of 
items." It is obvious that if you include enough 
items in one test to measure fourth grade ability, 
you include many which are too difficult and 
therefore useless to measure first grade ability ; 
conversely, the easy items necessary to measure 
first grade abilities are useless in the measure- 
ment of fourth grade abilities. As a result, the 
number of items which are valid measures at 
any one grade level is greatly reduced with a 
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resulting decrease in reliability. It appears that 
this defect is more serious in an instrument de- 
signed for analysis and diagnosis than in one 
intended only for survey purposes. 

Other criticisms and cautions occurring to this 
reviewer, which have been mentioned in previous 
reviews and so will not be dwelt upon here, are 
(a) the direction which instructs the examiner 
to stop work on a test when 9o per cent of the 


class have completed it and (b) the caution that . 


in the case of any individual child the analytical 
profile, improperly referred to as diagnostic, is 
useful only as a basis for genuine diagnostic 
testing or as a set of clues for the remedial 
worker. The former makes it difficult to under- 
stand how the test could be truly standardized in 
the first place, while the latter is a caution which 
should be emphasized in the manual. Inciden- 
tally, there is a distinct paucity of suggestion as 
to what to do about the class weaknesses dis- 
closed by the analysis. 

Fortunately, there are also many entries to 
be made on the credit side of the ledger regard- 
ing this series of reading tests. There is no ques- 
tion that the reading skills measured are among 
the most important ones we strive to develop. 
The test materials appear to this reviewer to be 
well chosen and representative of material en- 
countered by the child in his daily school work. 
The analytic feature should be a great aid to a 
teacher in assaying class and, to a lesser extent, 
individual strengths and weaknesses. The man- 


. . ual, despite the criticisms made above, has much 
'. to recommend it, particularly the sections on the 


diagnostic profile, the uses of test results by ad- 
ministrators, and the adjustment of norms with 
relation to intelligence quotient medians, al- 
though in the last case something should have 
been said about the desirability of developing 
one's own norms. Beginning with grade 4, the 
test user is offered his choice of hand scoring or 
of using the California Test Bureau's Scoreze 
answer sheet or an IBM answer sheet. This is 
commendable and should serve to widen the use- 
fulness of this test series. 

In general, this is a well thought-out series of 
tests which deserves its wide use. However, this 
reviewer would like to hold up to its publishers 
for emulation the statement on validity, relia- 
bility, and standardization found in the manual 
accompanying the Shank Tests of Reading Com- 
prehension, and the section on how to provide 
for and remedy the revealed differences and 
difficulties found in the manual accompanying 
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the reading test of the Jowa Every-Pupil Tests 
of Basic Skills. 


J Consult Psychol 14:333 Ag ’50. Laurance 
Е. Shaffer. [Review of the Intermediate level.] 
* The test and manual formats, redesigned by 
Raymond Loewy Associates, are attractive but 
violate at least a few experimentally determined 
findings about legibility, such as the use of white 
letters on a colored background. * 


For reviews by Warren G. Findley, Alvin W. 
Schindler, and J. Harlan Shores of the complete 
battery, see 2; for a review by Paul А. Witty 
of an earlier edition of the complete battery, see 
3:15; for a review by Frederick B. Davis of an 
earlier edition, see 40:1563; for reviews by C. 
W. Odell and Hugh B. Wood of an earlier edi- 
tion of the complete battery, see 40:1193; for 
reviews by Ivan A. Booker and Joseph C. Dewey 
of an earlier edition, see 38:1110; for a review 
by D. Welty Lefever of an earlier edition of the 
complete battery, see 38:876. 
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*Diagnostic Reading Tests. Grades 7-13; 1947-52; 
IBM except for Section IV, Part 1; $1.50 per construc- 
tion and validation booklet ('52—see I9 below); 15¢ 
per booklet of directions for administering (separate 
booklets required for each part of each section) ; $3.15 
per specimen set (does not include construction and 
validation booklet); separate answer sheets may be 
used except with Section IV, Part 1; postage extra; 
Committee on Diagnostic Reading Tests, Inc. * — 

а) SURVEY SECTION. 1047—52; 5 scores: rate of reading, 
comprehension check, general vocabulary, comprehen- 
sion, total; IBM; Forms А (’50), B ('47), C (зо), D 
(50), Е (’50), Е (50), G (50), Н (50) ; revised 
directions for administering ('50) ; 15€ per test; 3€ per 
IBM answer sheet; (Forms A and B also published by 
Science Research Associates : 256 per test; 156 per copy 


of directions for administering; 36 рег IBM answer = 


sheet; 60€ per specimen set; cash orders postpaid) ; 40 
(50) minutes. (Forms A and B were published in the 
high school and college educational edition of Reader’. 5 
Digest in November 1949 and May 1950 respectively.) 
b) SECTION I, VOCABULARY, 1947-52; 5 scores: English, 
mathematics, science, social studies, total ; IBM ; Forms 
A (зт), В (51); directions for administering (48); 
I5É per test; 26 per IBM answer sheet; 35(40) min- 
utes. 
с) SECTION П, COMPREHENSION. 1947-52; ІВМ; 2 
parts; Forms A (748), В (’48) ; revised directions for 
administering (50). Rs ЖЫЛУ, 
1) Part т, Silent. 20€ per test; 2€ рег IBM answer 
sheet; nontimed (40-60) minutes. b. 
2) Part 2, Auditory. Booklet of test paragraphs 
(47); 15€ per test; 20€ per copy of directions for 
administering; 50@ per booklet of test paragraphs 
(only опе copy needed for administrator); 3€ per 
IBM answer sheet; (40-60) minutes. 
d) SECTION III, RATES OF READING. 1947-52; IBM; 3 
parts; Forms А ('47), В (’47); revised directions for 
administering ('50). { 
I) Part т, General. 4 scores: normal rate of reading, 
comprehension at normal rate, maximum rate O 
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reading, comprehension at maximum rate; 16# per 
test; 26 per IBM answer sheet; 30(35) minutes. 
2) Part 2, Social Studies. 2 scores: rate of reading, 
comprehension check; 156 per test; 2% per IBM an- 
swer sheet; 15(20) minutes, 
3) Part 3, Science. 2 scores: same as Part 2; 15¢ per 
test; 26 per IBM answer sheet; 15(20) minutes. 

е) SECTION IV, WORD ATTACK. 1047—52; 2 parts. 
1) Part 1, Oral. 1948-52; individual; Forms A 
(448), В C48); directions for administering (^50) ; 
18¢ per test; (20) minutes. 
2) Part 2, Silent. 1947-52; 3 scores: identification 
of sounds, syllabication, total; IBM; Forms А (47), 
B (47) ; directions for administering ('48); 15¢ per 
test; 3¢ per IBM answer sheet; nontimed (30) min- 
utes, 
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Freperick B. Davis, Professor of Education, 
and Director, Educational Clinic, Hunter Col- 
lege, New York, New York; Director, Test Re- 
search Service, Bronxville, New York. 

This series of tests yields a number of scores 
for which norms (of one sort or another) are 
provided. The plan for use of the series is to ad- 
minister the survey section to ascertain the gen- 
eral level of reading competence. Pupils at a low 
level of proficiency can then be given the longer 
diagnostic section. 

The development of these tests was an ambi- 
tious undertaking, and it is obvious from an in- 
spection of the mass of testing materials that 
have been assembled that a great deal of energy 
and labor have been expended on this project. 
As a result, teachers and clinicians have avail- 
able a useful set of diagnostic tools. Their use-. 
fulness would be greatly increased if suitable 
norms were provided. For a battery of diag- 
nostic tests of this kind, norms should be es- 
tablished by having each child in the sample take 
all of the tests in the battery. Large representa- 
tive samples in each one of a wide range of 
grades should be used. Satisfactory norms are 
not available for the Diagnostic Reading Tests, 
perhaps because to obtain them is so expensive. 

Reliability coefficients have been reported for 


» 


many of the part scores ; some of them are based’ gy K 


on reasonably large samples at single grade lev- 
els while others are based on as few as 28 cases 
and on samples not clearly identified as con- 
sisting of pupils drawn from a single grade. 
Parallel forms coefficients are available for cer- 
tain of the rate tests, but Kuder-Richardson 
formula No. 21 was used for computing others. 
Since the latter are not supposed to be speeded, 
the estimates of reliability f them are prob- 
ably worth reporting. In genera 
that the individual part scores are not sufficiently 
reliable to identify, with confidence, weaknesses 
in an individual’s abili - to read. 

The validity of the various part scores can 


only be determined by subjective judgment. In . 


the reviewer's opinion, the comprehension, vo- 


‘cabulary, and oral reading scores seem accept- 


ably valid measures ; the tests of silent word at- 
tack do not seem to provide measures of skills 
that are crucial in this important area; the rate 
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tests are probably about as valid as tests of this 
kind usually are, but the time and space devoted 
to them seem to be entirely out of proportion to 
their very limited meaningfulness or usefulness. 
If all of Section III were omitted from the bat- 
tery, the reviewer believes that virtually noth- 
ing would be lost. The Committee’s intercorre- 
lation data support this belief. 

In passing, it should be noted that the reading 
passages for the rate tests seem interesting and 
that the comprehension items following them are 
not generally answerable without reading the 
passages. 

As in almost any test, faulty items can be 
found; the Diagnostic Reading Tests seem to 
have perhaps a few more than their share. An 
example of these is Item 32 of the Survey Sec- 
tion, Form A, where choice 5 (perimeter) in- 
cludes choice 4 (circumference), which is the 
keyed response. Webster defines circumference 
as “the perimeter of a circle.” Items of this kind 
tend to reduce test validity. 

The publications of the Committee on Diag- 
nostic Reading Tests are written in a labored 
and clumsy style (See Question 21 in “Ques- 
tions on Reading”) and contain an admixture 
of information helpful to teachers and test users 
and misinformation about reading. The answer 
to Question 11 in "Questions on Reading" will 
cause statisticians to shudder and will help make 
percentile norms seem incomprehensible to most 
teachers. 

The format and organization of the tests and 
manuals are satisfactory for materials in the 
developmental stage, as these are said to be. 


WinLiAM W. TURNBULL, Vice President, Edu- 
cational Testing Service, Princeton, N ew Jersey. 

The Diagnostic Reading Tests constitute a 
well integrated set of instruments for screen- 
ing students to discover those whose reading 
skills need special attention and for diagnosing 
the particular difficulties toward which that at- 
tention should be directed. The conception of a 
survey test supplemented by a set of diagnostic 
instruments is a sound one, and the tests them- 
selves give evidence of the conscientious work 
that has gone into bringing them to their present 
stage of development. The preparation of these 
tests is an ambitious project, and it has been 
carried forward energetically. 

The tests are designed for use over a wide 
range: grades 7 to 13 inclusive. The normative 
material suggests that the tests are reasonably 
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useful at the two extremes of this widespread 
group, but the scores are crowding the ends of 
the scale. In the Survey Section, Form B, for 
example, the median vocabulary score made by 
362 public school students in the seventh grade 
was 18 out of бо, when the chance score is 12. 
The 25th percentile for these students was 13— 
one point above the chance value. Among 1,447 
twelfth grade independent school students taking 
the same form, however, the median vocabulary 
score was 48 out of 60. The median comprehen- 
sion score was 33 out of 40 for these students, 
and the 25th and 75th percentiles were 30 and 
35 respectively, indicating a failure of the ques- 
tions to provide adequate discrimination at this 
level of ability. 

The Survey Section is designed to give a rela- 
tively brief (40-minute) overview of general 
reading ability, yielding separate scores for rate 
of reading, vocabulary, and comprehension, and 
a total score summing vocabulary and compre- 
hension. Rate is determined by number of words 
read in three minutes, using well chosen story 
type material in the area of natural science. The 
comprehension score is based on this same ma- 
terial (allowing the student 15 minutes to com- 
plete the reading and answer the questions) and 
on four shorter passages from the natural and 
social science fields. The rate and comprehension 
scores are indicated to have reliability coeffi- 
cients of about .80, while the vocabulary score 
has a reliability of about .85—fairly satisfactory 
figures. The reliability of the total comprehen- 
sion score is about .go. 

The skills tested by the comprehension ma- 
terials are broad and of central importance in 
reading. Emphasis is given to such questions as 
“The main idea of the paragraph 15...” and “A 
conclusion which can be drawn is..." rather 
than to factual memory. . 

Despite the fact that 30 of the 40 minutes are 
devoted to comprehension materials, 60 of the 
100 items are vocabulary rather than compre- 
hension. While standard deviations are not re- 
ported, the quartile figures given suggest that 
the vocabulary score contributes more heavily 
to the score called "total comprehension" than 
does the separate comprehension subscore. 

The intercorrelations of the separate scores 
indicate a median correlation of about .33 be- 
tween rate and comprehension, showing a de- 
gree of independence quite adequate to permit 
their use in diagnosing the particular area of a 
student's strength or weakness. The median cor- 
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relation of .62 between vocabulary and compre- 
hension indicates a less useful degree of inde- 
pendence for these measures, but is as low as 
could reasonably be asked in view of the close 
association of the skills involved. The fact that 
the correlation of the vocabulary score with total 
comprehension (a composite score summing vo- 
cabulary and comprehension) is substantially 
higher than that of the separate comprehension 
score with total comprehension indicates that the 
latter composite depends less heavily on the ma- 
terial termed “comprehension” than its name 
might imply, and suggests that the composite 
might benefit from renaming. 

A notable feature is the existence of eight 
parallel forms of the Survey Section, permitting 
reuse without repetition year after year. The 
fact that four of these forms (A, B, C, and D) 
have been published through the Reader’s Di- 
gest Educational Service detracts from their use- 
fulness in some situations, but should not be a 
serious disadvantage now that eight forms are 
available. 

The diagnostic reading sections, designed for 
use with students who do poorly on the Survey 
Section, are available in’ parallel editions. The 
four sections are: I, Vocabulary; II, Compre- 
hension; III, Rates of Reading; and IV, Word 
Attack (oral and silent). 

The Vocabulary Section is 35 minutes in 
length and is divided into four parts (English 
grammar and literature, mathematics, science, 
and social studies) on which separate scores are 
obtained. The reliabilities of these scores were 
found to be between .82 and .94, based on a 
random selection of 100 papers from students in 
various colleges., This indicates a quite satisfac- 
tory degree of stability and reflects the care that 
has been taken in the construction, analysis, and 
selection of items. While the reliability of the 
total score is not reported, it should constitute 
a useful and highly reliable vocabulary index. 

The intercorrelations of the part scores within 
the vocabulary test range between .43 and .67, 
and so are well below the reliabilities of the parts. 
(Since the reliabilities and intercorrelations 
were established on different samples, both small, 
one would not push the relations between the 
two sets of figures too far.) In any case, it seems 
clear that the part scores are stable and usefully 
independent, 

The Comprehension Section is presented in 
parallel parts, silent and auditory, in both of 
Which passages from high school texts in sci- 
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ence, social science, and literature are used. This 
test is untimed. The provision of an auditory test 
of comprehension is a distinct contribution in 
its own right, and teachers may well find scores 
on this part of the test of real interest. The ra- 
tionale given for use of the auditory section is 
that since it is free from the influence of the stu- 
dent's reading skill, it provides an indication of 
his capacity for understanding the materials of 
the test and hence of the improvability of his 
silent comprehension through remedial training. 
This is an interesting and reasonable hypothesis 
that remains to be checked. Meanwhile, it seems 
equally plausible that the silent comprehension 
section should be considered an indicator of ca- 
pacity for the improvement of auditory compre- 
hension skills. Neither possibility can be checked 
in any way from the data given. There are no 


^ reported intercorrelations between the silent and 


auditory parts: Kuder-Richardson reliabilities 
in the high .80’s are reported for both parts. 

А circumstance that detracts from the useful- 
ness of this test in the battery is the fact that 
some of the passages and questions are repeated 
verbatim from the Survey Section, This will 
surely result in differential advantage to stu- 
dents who have and those who have not studied 
the key for the Survey Section in relation to 
these materials. 

Section III, Rates of Reading, is designed to 
“make it possible to compare....reading abilities 
in two major areas of study and in situations 
demanding adjustment in rate of reading." The 
main purpose for including this section is to 
provide a measure of flexibility of reading—the 
degree to which a student adapts his reading rate 
to the type of material being read and to the 
amount of pressure under which he is asked to 
perform. 

Two types of material—social studies and 
science—are used to investigate the differentia- 
tion of rates according to content. As the au- 
thors point out, however, the intercorrelations 
of rate scores on the different types of materials 
"are high, in many cases as high as the relia- 
bilities themselves * To the extent that these 
tests are accurate, students vary their rate of 
reading very little according to the content, but 
instead tend to read all materials at a similar 
rate." Under the circumstances, it is hard to 
justify the continuance of separate scores for the 
two types of material. 

Flexibility of rate according to differential 
time pressure is measured by having the stu- 
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dents read first at a leisurely pace for the best 
possible comprehension and second as rapidly as 
possible with understanding. Data are presented 
to show that the second set of instructions does 
indeed result in increased speed, but correlations 
of scores under the two conditions are not pre- 
sented, leaving open the question of whether or 
not anything concerning an individual's “flexi- 
bility" of rate is being measured. It is difficult 
to obtain reliable scores for such a variable, 
measured by the difference between two rate 
scores which are themselves unstable. It re- 
mains for the authors to present evidence that 
their goal has been accomplished. 

Since it has not been demonstrated that the 
test does provide a measure of an individual's 
flexibility of rate according to time pressure, and 
since differentiation according to type of ma- 
terial has not been achieved, the authors' de- 
cision to retain this section of the test appears 
unwise to this reviewer. It would seem prefer- 
able to omit the section until its value can be 
demonstrated. 

Section IV, Word Attack, is also divided into 
two parts, oral and silent. The usefulness of the 
oral part, which constitutes one of the very few 
tests of its kind available, will depend very heav- 
ily on the skill of the teacher giving the test. 
Several good suggestions for interpretation are 
provided in the directions, and an alert and in- 
terested teacher should find this section reward- 
ing despite the difficulties introduced when a 
test must be given to pupils individually. The 
silent part is more readily standardized. The re- 
liabilities of its two parts (rhyming words and 
division into syllables) are in the .go’s, and their 
intercorrelation is below 60. This test may pro- 
vide valuable leads to the teacher in individual 
cases. The correlation between the scores pro- 
vided by the oral and silent parts is not indicated. 

The strength of the entire battery is clearly the 
fact that it was planned as a unit: one section 
supports and supplements another. The authors 
of the plan are to be congratulated upon it. The 
test materials themselves are reasonably good in 
the main, with inevitable lapses of detail A 
wealth of suggestions for making the test results 
useful in the teaching situation is provided. The 
committee's primary intent was to furnish tests 
that could be translated into guides for teaching, 
and the consistent orientation of the materials 
to this end is one of its greatest strengths. From 
this same aim, however, stems a real difficulty. 


. The material is so finely subdivided, in line with 


its diagnostic purpose, that the interpretation of 
the scores becomes dubious from the technical 
standpoint of their reliability, and, reliability 
aside, extremely difficult from the sheer complex- 
ity of the battery. 

The authors are to be congratulated also on 
the recently published compilation of extensive 
and helpful data on the construction and valida- 
tion of the tests, wherein the user can find in- 
formation on most aspects of their development 
and performance. As has been indicated in this 
review, however, there are still some significant 
omissions of data—a fact that is understandable 
and excusable in the early stages of an enterprise 
of this scope. 

In summary, the Survey Section stands al- 
ready as one of the better instruments for the 
evaluation of overall reading ability. The Vo- 
cabulary and Comprehension sections are likely 
to be widely useful, and the Word Attack Sec- 
tion should provide valuable information in in- 
dividual cases. One may hope that the committee 
responsible for these tests will streamline the 
battery wherever possible and will persevere in 
the collection and consolidation of information 
about them, 


Henry Werz, Director, Bureau of Testing and 
Guidance, Duke University, Durham, North 
Carolina. 

I. Purpose. These tests were designed to fur- 
nish teachers with information about the essen- 
tial reading skills of students from the seventh 
grade through the first year of college. The Com- 
mittee on Diagnostic Reading Tests, Inc. be- 
lieves that by providing teachers with adequate 
measures of reading skills, “a large part of the 
‘remedial’ work which has developed widely in 
our schools could thus be eliminated, because in- 
struction would be individualized to the extent 
necessary to meet the needs of most students.” 

II. DESCRIPTION or THE TEST BATTERY. A. 
GENERAL. The Diagnostic Reading Tests consist 
of a battery of nine tests grouped into five sec- 
tions: a Survey Section and four special diag- 
nostic sections. The Committee recommends 
that the Survey Section of the tests be adminis- 
tered to all students in a class “either as an inde- 
pendent test or as a screening test to segregate 
those students to whom the total battery or se- 
lected portions of [it] should be administered. 
It is recommended that students scoring below 
the 3oth percentile on the Survey Section be 
given other appropriate portions of the battery 
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(Vocabulary, Comprehension, Rates of Reading, 
and Word Attack) “unless other data are avail- 
able on factors such as low general ability, which 
may account for the low scores on a reading 
test." 

An attempt has been made throughout the test 
battery to relate the various measures of reading 
skill to subject matter areas: English, mathema- 
tics, and the like. Although most of the items on 
all tests are drawn from typical textbook ma- 
terials, only the Vocabulary Section yields scores 
in all the areas from which items are drawn, but 
these separate vocabulary subtests are generally 
too unreliable for use in individual measurement. 
The Rates of Reading Section yields separate 
scores for rate of general reading and for rate 
of reading science and social studies materials. 
The authors report that the correlation between 
rates of reading different kinds of material is 
high. If this is so, the separate subject matter 
tests would appear to be unnecessary. In view 
of the fact that only two sections yield subject 
matter scores and that these scores have only 
limited usefulness, it is unfortunate that the au- 
thors have emphasized this subject matter as- 
pect in their discussions of the battery. 

B. SURVEY SECTION. The Survey Section of 
the battery yields measures of reading rate, story 
comprehension, vocabulary, and general com- 
prehension, and a total reading score. The rate 
of reading score is not included in computing the 
total score. Sixty per cent of the total reading 
items are vocabulary. 

. While the average reliability for the total score 
is adequate to permit the use of the test for in- 
dividual measurement, the subtest scores do not 
yield such high reliability. Some of the reliabili- 
ties reported for the vocabulary subtest approach 
a satisfactorily high level, but others, for some 
grade groups, are still so low as to require great 
caution in the use of the scores. Reliabilities for 
the other subtests of this section are not suffi- 
ciently high to permit measurement of individu- 


als with a high degree of confidence. The Sur- : 


and science, as well as a total vocabulary score. 
Most of the items appear to be taken from one 
or another of the eight forms of the Survey Sec- 
tion. The degree of overlap is not reported al- 
though a sample tabulation suggests that it is 
fairly high. A revised edition of the Vocabulary 
Section has recently been made available. This 
revision drops the general vocabulary subtest 
and increases the length of the remaining sub- 
sections by about so per cent, thus increasing 
their reliability. 

According to data given in the 1948 edition of 
Directions for Administering, the unrevised edi- 
tion of this section seems to provide total scores 
of fairly high reliability. It should be noted, 
however, that for two grade groups the relia- 
bility of this r50-item test is less than the relia- 
bility of the 6o-item vocabulary subtest of the 
Survey Section. The reliabilities of the subtests 
in the unrevised edition are seriously low, clus- 
tering in the .70's, with some falling in the .50's. 
The diagnostic value of the section is, therefore, 
seriously impaired. While the revised edition 
may correct these faults, intercorrelations be- 
tween the subtests appear to be so high as to 
raise a question regarding the value of the sepa- 
rate scores anyway. 

D. SECTION 11, COMPREHENSION. The Com- 
prehension Section consists of two parts: Part 
I, Silent, providing a single silent comprehen- 
sion score, and Part 2, Auditory, providing a 
single auditory comprehension score. Each part 
contains reading selections of one or more para- 
graphs followed by several questions on each se- 
lection. These questions are principally of the 
“What is the main idea?" type. Part 1 is ad- 
ministered in the usual manner with the students 
reading each selection silently and answering the 
questions which follow. In Part 2, the selections 
and questions are read aloud to the students. 
Each student has a copy of the questions but not 
of the selections. The student answers the ques- 
tions either on the test booklet or on a separate 

E sheet. & 


vey Section, therefore, has very limited value as. . The assumption made by the Committee in 


an independent measure of the separate reading 
skills accounted for in the subtests. However, 
the use of the total score, as a general device for 
screening students with reading handicaps is. 
acceptable. 

C. SECTION I, VOCABULARY. The Vocabulary 
Section yields separate measures of general vo- 
. cabulary and of the vocabulary of English gram- 
mar and literature, mathematics, social studies, 


presenting -th e two parts together is that the 


auditory part gives a measure of the student's 
“potential” reading ability, his general back- 
ground of understanding, while the silent part 
measures his actual comprehension, Remedial 
instruction is likely to be profitable, the Commit- 
tee believes, when the auditory score exceeds the 
silent score. It considers this method of estimat- 
ing reading potential to be more effective, for 
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this purpose, than the use of intelligence tests. 
While these hypotheses are intriguing, the Com- 
mittee offers no evidence to support them, No 
information is given as to what range of differ- 
ences between these scores would be necessary 
to suggest that remedial instruction would be 
desirable or effective. 

The only reliability data supplied for this sec- 
tion indicate reliabilities of .87, .83, and .86 for 
grades то, 11, and 12, respectively, on the silent 
part and reliabilities of .84 and .89 for an eighth 
grade group on the auditory part. This level of 
reliability suggests the need for great caution in 
using these tests for individual diagnosis. 

The auditory part has recently been tempo- 
rarily withdrawn until such time as recorded ver- 
sions of the material to be read to the students 
are available. Since both the silent and auditory 
parts are used together in evaluating a student’s 
comprehension, this temporary withdrawal of 
the auditory part, in effect, withdraws the entire 
Section II. 

E. SECTION III, RATES OF READING. The Rates 
of Reading Section consists of three parts: Part 
I, General, provides measures of “normal” and 
“speeded” reading rates with comprehension 
check scores for each rate ; Part 2, Social Studies, 
and Part 3, Science, provide measures of normal 
reading rates with the specialized subject matter 
as well as comprehension check scores for each 
test. 

Although the science and social studies parts 
represent fairly conventional tests of this type, 
the general reading rate part introduces an addi- 
tional element. This part consists of two sub- 
parts, each containing a long selection followed 
by a series of questions. In administering this 
part, the teacher introduces the first selection to 
be read with the statement : “Read as rapidly as 
you can and still understand what you read” ; 
later this statement is added: “Remember that 
you are to read the article in this test in the same 
way you would ordinarily read such materials.” 
The second passage is introduced with this 
statement, “You were asked to read the first 
part at your regular everyday rate. Read the 
next article as rapidly as you can, and still un- 
derstand what you read. Keep pushing yourself 
constantly to speed your reading—to get the 
ideas as quickly as you possibly can.” Later this 
statement is added: “Remember that you are 
going to try constantly to read the article as rap- 
pae. as you can and still understand what you 
read,” 
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These two sets of directions are intended to 
provide motivation for different reading rates, 
In terms of the reported norms data, it is doubt- 
ful that they do. In the case of two of the seven 
median scores reported for Form A and three 
of the seven median scores reported for Form 
B, the “speeded” rate is slower than the “nor- 
mal” rate, and in only 4 of the 14 instances re- 
ported did the median number of words read 
per minute at the “speeded” rate exceed the 


normal rate by more than a line. No evidence 


is given to support the view that the differences 
in rate are significant. Therefore, although this 
approach to measuring reading rate may be 
promising, the reported evidence suggests that, 
as presently designed, this section contributes 
little to the understanding of a student's reading 
problem. 

The estimated reliability of .80 reported for 
the several parts of the Rates of Reading Sec- 
tion can be achieved only by using both forms 
of each part, thus doubling the time and cost 
to achieve a level of reliability still inadequate 
for individual measurement, This level of relia- 
bility is the same as that reported for the rate 
subtest of the Survey Section. 

F. SECTION IV, WORD ATTACK. The Word At- 
tack Section consists of two parts: Part I, Oral, 
provides a record of the kinds of oral word at- 
tack errors made and a count of the total errors; 
Part 2, Silent, provides separate measures of 
the student's skill in recognizing syllable sounds 
and his ability to recognize the number of sylla- 
bles in а word, as well as а total silent word 
attack score. The oral part of this test must be 
administered individually and is consequently 
very time consuming. It does, however, offer the 
teacher a good opportunity to gain some direct 
insight into the kinds of reading problems being 
encountered by the student. The silent part also 
seems promising as a diagnostic device but in a 
more limited area. Reliability coefficients re- 
ported for the oral and silent parts of this sec- 
tion appear to be sufficiently high for individual 
measurement both by subtests and total score. 

ПІ. Manua. One of the major handicaps 
to an adequate evaluation and effective use of 
this test battery is the absence of a manual. Each 
of the nine parts of the test has its separate pam- 
phlet on directions for administering, containing 
a statement of the purpose, a brief description of 
the construction and standardization procedures, 
instructions for administration, directions for 
scoring, a statement about the norms, and some 
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norms data. Much of the material in the instruc- 
tions for administering, the directions for scor- 
ing, and the statement about the norms is re- 
peated from pamphlet to pamphlet. Some of the 
norms tables present only the three quartiles. 
Reliability data is frequently either meager or 
contradictory. Median reliabilities reported in 
the body of the pamphlets frequently contradict 
the reliability data reported in the tables. Almost 
no information is given about the interrelation- 
ships of the parts of the battery. These pam- 
phlets are poorly edited as, for example, in the 
case of the March 1950 Directions for Adminis- 
tering Section III, Part 1, which states on page 
2, “Reliabilities of scores are given on each norm 
table for each grade level for which percentiles 
are furnished," but which fails to provide these 
data in the tables. 

Recently the Committee prepared a pamphlet 
which consolidates much of the statistical ma- 
terial published in the older directions for ad- 
ministering and provides a small amount of addi- 
tional material The new publication does not 
replace the directions, nor is it in any sense a 
manual. Its publication provides relatively little 
help in understanding the tests and does much 
to add to the confusion resulting from so much 
piecemeal material. 

The major difficulty of abbreviated directions 
for administering is that they give insufficient in- 
formation about the test results, thus making it 
extremely difficult, if not impossible, for the 
teacher to apply the results as they were intended 
as a basis for individualizing instruction. Inter- 
pretative data, when it does appear is sketchy, 
as, for example, in the Directions for Administer- 
ing Section II, which suggests ways in which the 
test scores may be interpreted but fails to indi- 
cate what levels of scores are to be interpreted 
in the ways suggested. Nor are the methods of 
implementing even these sketchy interpreta- 
tions recommended. 

The Committee plans to publish a complete 
manual in the near future. When it does, and if 
the manual is adequate, one of the major obsta- 
cles to the effective use of this battery will be 
removed, 

IV. Norms. Tentative percentile norms are 
available for most parts of the test battery. In 
some cases, however, only abbreviated norm 
tables (presenting only the three quartiles) are 
provided, and in other cases norms are provided 
on mimeographed inserts to the directions for 
administering. Not all available norms are pro- 
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vided. In order to secure certain norms, the test 
user has to make a special request for them. 

The method suggested by the authors for us- 
ing the norm tables presents an unnecessary 
problem. They suggest that the teacher compute 
the three quartiles for each population tested and 
then match these statistics with those in the norm 
tables to select the appropriate norms. Since the 
authors also suggest that the diagnostic portions 
of the battery be administered only to those stu- 
dents who fall below the зо percentile’on the 
Survey Section, this method of selecting an ap- 
propriate norms group for a diagnostic section 
administered only to the “screened” students 
becomes meaningless. More adequate norms are 
promised for the future. 

V. Summary. The Diagnostic Reading Tests 
represent an attempt to provide teachers with 
measures of essential reading skills in order that 
they may individualize subject matter instruc- 
tion to meet the reading needs of students and 
thereby reduce the need for remedial instruc- 
tion. Although such a purpose is an entirely de- 
sirable one, ‘this test battery does not achieve its 
purpose. The reliability information supplied by 
the authors suggests that many subtests are not 
sufficiently reliable for individual diagnosis; the 
norms and interpretative information supplied 
are not adequate to help the teacher translate 
the test results into meaningful individualized 
instruction ; the intercorrelations between parts 
of the battery suggest that much of it may be 
unnecessary overelaboration of a basically sim- 
ple measurement problem ; and some of the sec- 
tions appear to be based upon hypotheses which 
are not supported by the data thus far presented. 


[532] 
*Elementary Reading: Every Pupil Test. Grades 
4-6; 1936-51; new form usually published each April 
and December; 2 parts; form December 1 5r; no ta 
on reliability and validity į revised manual Ж ; norms 


('s1); 210€ per test; 1¢ per answer key; postpaid ; 
Ohio Scholarship Tests, Ohio State Department of 
Education. * 


a) GENERAL ABILITY. 17(30) minutes. | 
b) SPEED AND COMPREHENSION, 6(10) minutes, 


[ 533] 
*Garvey Primary Reading Test. Grades 1-3; 1931- 
36; 4 scores: form recognition, vocabulary, compre- 
nasion total; Forms 1 36), 2 (36); ЖАШ, 
1.50 per 25, postage extra; 35€ per specimen set, post- 
sd 4o ($0) minutes; Helen Sue Read and May V. 
Seagoe; California Test Bureau. * 


[534] 
*General Reading Test: Ohio Senior Survey 
"Tests. Grade 12; 1935-41; 4 scores: paragra h mean- 
ing, general vocabulary, outlining, total; Forms A 
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C35), B (37); no data on reliability and validity ; 3° 
per test; 25€ per manual (^41) ; postpaid ; 60(65) min- 
utes; S. L. Pressey and Maurice E. Troyer (B) ; Ohio 
Scholarship Tests, Ohio State Department of Educa- 
tion. 
[535] 

Haggerty Reading Examination. Grades 1-3, 6-12; 
1920-29; 2 levels; specimen set not available; postage 
extra; М. Е. Haggerty, Margaret E. Noonan (Sigma 
1), and Laura C. Haggerty (Sigma 3); World Book 
Со 


а) "SIGMA 1. КОЗ) 1-3; I form, '20; manual (’29) ; 
$1.55 per 25; 25(35) minutes. 
b) SIGMA 3. Grades 6-12; Forms A (720), В (22); 
revised manual ('29) ; $1.80 per 25; 20(30) minutes. 
REFERENCES 
т. Haccerty, M. E. Chap. 8, “Basis for Grouping Elementa 


School Pupils," pp. 114-57, and Chap, 9, “Ноу Virginia Chil- 
dren are Grou ed," pp. 15 93: In irginia Public Schools : 
Survey of a Southern State Public School System: Part Two, 


Educational Tests. By the Virginia Education Commission and 
the Virginia Survey Staff. Yonkers, N.Y.: World Book Co., 


1921. Pp. xii, 235. Й y 
2. Current, W. F., AND RucH, С. M. "Further Studies, on 
the ВШМ: of Reading Tests." J Ed Psychol 17:476-81 О '26. 
1: P 1 
а ‘SyMonDs Percivat M. Ability Standards for Standardized 
Wane ests in the High School, New York: Bureau of 
Publications, Teachers College, Columbia University, 1927. Pp. 
1. d 
"E BANI М. Eustace; DOUGLAS, 
D n 


OSEPHINE; AND RUDD, 
Marion the Validity of Silent Reading Tests.” J Appl 
Psychol 15:35-8 F ’31. * 


., JACOBSEN, CARLYLE Ё. A Endo and Attitude as Factors in 
Achievement in Medical School," J Assn Ат nog Col 21:152-9 
WirLiAM S. Gray, Director of Research in 
Reading, and Emeritus Professor of Education, 
The University of Chicago, Chicago, Illinois. 

SIGMA I. This test is designed to measure 
reading ability in grades 1—3. The aspects of 
reading selected as valid measures in the primary 
grades are not discussed in the manual accom- 
panying the test. An analysis of the test exer- 
cises shows that the designers of the test con- 
centrated on two basic aspects of silent reading ; 
namely, sentence meaning and paragraph mean- 
ing. The child's grasp of meaning in Part I is 
indicated by his ability to perform an act sug- 
gested by the sentence or paragraph which is 
read; for example, “Make two lines under the 
horse." Grasp of meaning is indicated in Part 
II of the test by underlining “Yes” or “No” in 
response to such a question as, “Are men larger 
than boys?” Such tests provide a reasonably 
valid measure of grasp of meaning of the kinds 
of sentences and paragraphs often read in the 
early grades. Since, however, the test is timed, 
the final scores represent a composite of capacity 
to interpret and speed of performance. 

The adequacy of the test is discussed in terms 
of two criteria. The first is its value in discrim- 
inating the individuals in a group in respect to 
reading ability, as measured by the test. In this 
connection, two requirements were set up. In 


the first place, the scores secured on the test 
must distribute themselves over a reasonably 
wide range; and in the second place, the test 
exercises must vary sufficiently widely in diffi- 
culty so that very few, if any, individuals will 
make zero scores, a large number will make 
average scores, and few or no individuals will 
make perfect scores. According to the data pre- 
sented, Sigma 1 meets these requirements very 
well. The reliability of the test was determined 
by correlating the results secured on repeated 
uses of the test. A reasonably high correlation 
(.84) was secured in this way. 

Only one form of the test is available. Direc- 
tions for giving and scoring the test have been 
worked out carefully. Grade norms (1—4) and 
age norms (6-11) accompany the test. As indi- 
cated above, the score is a composite measure of 
achievement and therefore has very little diag- 

nostic value. 

SIGMA 3. This test is designed to measure 
reading ability in grades 6-12. Three aspects of 
reading are involved; namely, vocabulary, sen- 
tence reading, and paragraph reading. The vo- 
cabulary test includes 50 words ranging from 
relatively simple words, such as сайт and coast 
to relatively difficult words, such as epaulets and 
chalice. No information is given concerning the 
source of the words or the basis of their selection. 
Ability to understand sentences is measured by 
checking “Yes” or “No” in response to ques- 
tions. Ability to understand paragraphs is meas- 
ured by the reader's responses to directions 
which arean integral part of the paragraphs read. 
The directions are of two kinds: one calls for 
underlining, and the other for checking. 

No specific discussion of the validity of the 
three tests as measures of reading ability is pre- 
sented. Ability to recognize the meanings of 
words, sentences, and paragraphs are indispu- 
table aspects of reading ability in junior and sen- 
ior high schools. Data are presented which show 
that the scores on the tests correlate fairly highly 
(.61) with a criterion composed of grade loca- 
tion, age, and teachers' estimates of scholarship. 
The correlation with intelligence scores is .64. 
On the basis of such data the designers con- 
clude that the “Sigma 3 examination is a really 
significant measure," The. vidence presented 
does not make clear, however, whether the test 
is a significant measure of general scholastic 
ability, reading competence, general intelligence, 
or a combination of the three. 
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In order to determine if the tests were satis- 
factory, the same criteria that were used with 
Sigma I were applied. The data presented indi- 
cate that the test has adequate discriminative 
capacity and is highly reliable. Correlations be- 
tween the scores on repeated use of the test are 
reported as .88 for the test as a whole, .86 for vo- 
cabulary, .77 for sentence meaning, and .81 for 
paragraph meaning. 

Two forms of the test are provided. Grade 
norms and age norms have been developed for 
each form, the former extending from grade 5 
to grade 12, and the latter, from age Io to age 
20. Carefully prepared directions for giving and 
scoring the tests are available. Because of the 
composition of the test, the scores may be used 
both as gross measures of reading achievement 
and, to a limited extent, for diagnostic purposes. 

The fact that both Sigma 1 and Sigma 3 are 
still in use and conform reasonably closely to 
current standards of test construction is a rare 
tribute to M. E. Haggerty whose pioneer work 
in the development of tests began almost four 
decades ago. 


[536] 

*High School Reading Test: National Achieve- 
ment Tests. Grades 7-12; 1939-52; 6 scores: vocabu- 
lary, word discrimination, sentence meaning, noting de- 
tails, interpreting paragraphs, total; Forms A (52), В 
(’51—same as test copyrighted in 1940) ; general series 
manual ['44]; Form A norms (’52), Form B norms 
('51) ; $2.50 per 25; 35€ per specimen set; postage ех- 
tra; nontimed (40) minutes; Robert K. Speer and 
Samuel Smith; Acorn Publishing Co. * 


Hotranp Roserts, Director, California Labor 
School, 321 Divisadero St., San Francisco, Cali- 
fornia. 

The simplicity and clarity of its construction, 
its reliability, and the ease with which it may 
be administered have helped this reading test 
maintain itself beyond the usual term of life. 
However, it no longer meets today’s standards 
and should be thoroughly revised or replaced. 
Its major deficiencies include: (а) no specific 
description of its construction ; (b) no norms for 
the subtests or for European or Asian foreign 
language groups, Negroes, Mexican Americans, 
different geographic areas, and varying socio- 
economic levels; (c). no specific criteria for the 
establishment of validity; (d) failure to es- 
tablish relationships between ability to pass the 
test and the specific reading abilities required in 
school and in daily life. In particular, those who 
use this test must assume—without supporting 


~ 


evidence—that it is a valid measure of the read- 
ing abilities which pupils need for success in 
their school work. 

In common with all standard measures of 
reading ability so far developed, the limitations 
of this test have become standardized. They are 
not only implicit but explicit in the routine pro- 
cedures of the schools, the mechanical mass pro- 
duction approach to American education, and 
the dominant contradictory philosophy of com- 
petitive individualism. From its very nature as 
an objective instrument such a standardized test 
becomes an important factor in crystallizing the 
educational process and so serving as a brake 
on change and development. Unfounded and 
dangerous conclusions are likely to be drawn 
from the results of this test by administrators, 
teachers, pupils, and testmakers. For example, 
in proposing it as a measure of “the student’s 

-mastery of factual material and, especially, his 
power to use information and skills," and in 
assuring teachers that in diagnosis it empha- 
sizes “the pupil’s ability to judge, appreciate and 
apply data or principles," the authors state 
claims which they make no effort to substantiate. 
What do “mastery of factual material” and 
“ability to judge, appreciate, and apply data or 
principles” mean? Certainly the former must 
include critical examination of the use of “facts” 
in propaganda to support special interests, while 
the latter opens up the whole field of moral 
values, Both these important areas in the devel- 
opment of reading ability are clearly outside 
the range of this simple reading test. 

Teachers who adopt this or any other current 
reading test for use in their school programs 
should be aware of these three basic dangers: 
using the test to measure reading abilities en- 
tirely outside its limits ; placing the emphasis in 
reading upon skills and mechanics, and ignoring 
more significant areas such as appreciation of 
values and judgment of principles; assuming 
that the important reading abilities needed for 
successful study in school and for work, recrea- 
tion, and enjoyment are measured by a reading 
test of this or any other character. Teachers and 
administrators can no longer meet their respon- 
sibilities by ordering tests by brand names or 
upon the unsupported assurances of the authors. 
Rather, test authors must prepare to meet exact- 
ing demands for proof that their tests are valid 
and that they make an essential contribution to 
the school program. 
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For areview by Robert L. McCaul, see 3:488. 


[537] 

*Holborn Reading Scale. Ages 55-11; 1948; 2 
scores: word recognition, comprehension; individual; 
I form; 3s. per 25; 3d. per single copy; 1s. 6d. per 
manual; postage extra; (20-30) minutes; A. F. Watts; 
George @ Harrap & Co. Ltd. * 

C. M. FrEMiNG, Reader in Education, Uni- 
versity of London, London, England. 

This scale consists of a set of 33 sentences ar- 
ranged in order of difficulty in terms of their 
mechanical elements and their comprehensibility. 
The sentences were so selected that in the stand- 
ardisation group of more than 2,000 pupils aged 
five and a half to ten and a half years the progress 
shown in mastering the difficulties of reading 
aloud was at a rate represented by an increase 
of one sentence every three months. The scale is 
offered as a means of making valid comparisons 
between the oral reading of pupils and the de- 
gree of their comprehension as assessed by a 
series of questions on the same sentences read 
silently. 

Apparently no attempt has been made to de- 
rive figures for validity or reliability by calcu- 
lating correlations with other measures of read- 
ing ability or by assessing the consistency of 
scores on split halves of the test or on its repeti- 
tion on a second occasion. The test, however, 
bears signs of expert construction. 

For the pupil's use the sentences are printed 
on both sides of a single sheet of paper. The com- 
plete test is printed in the manual, together with 
an account of the construction of the test, direc- 
tions for administering and scoring the test, and 
suggestions for interpreting test results. The 
manual also includes several pages of discussion 
of reading ability and its assessment, extracted 
from the author's book on the language and 
mental development of children. These pages, 
both interesting and suggestive, serve to present 
the test not only in its twofold function as a 
measure of oral and of silent reading but also in 
its setting of earlier reading researches in Amer- 
ica and England. 


[538] 

Ingraham-Clark Diagnostic Reading Tests. 
Grades 1-3, 4-8; 1929; 2 levels; 2 parts; Forms r, 2; 
$1.50 per 25 of any one level, postage extra; 356 per 
specimen set of any one level, postpaid; Jessie E. 
Ingraham and Willis W. Clark; California Test Bu- 
reau. 

а) PRIMARY: PART I, WORD FORM AND MEANINGS. 
Grades 1-3; 6 scores: word form, likenesses and dif- 
ferences, auditory stimuli, visual stimuli, opposites, 
total; 20(25) minutes. 


—— 


b) PRIMARY: PART П, SENTENCES AND PARAGRAPHS, 
Grades 1-3; 6 scores: following directions, answering 
questions, directly stated facts, qualified Statements, 
inferences and conclusions, total; 20(25) minutes. 

C) INTERMEDIATE: PART I, WORD FORM AND MEANINGS, 
Grades 4-6; 6 scores: likenesses and differences, audi- 
tory-visual, association, opposites, similarities, total ; 


30(35) minutes. 
d) INTERMEDIATE: PART II, SENTENCES AND PARA- 


GRAPHS. Grades 4-6; 5 scores: relevant and irrelevant 
statements, true and false deductions, selecting and 
classifying information, form and mechanics of or- 
ganization-sequence of events, total; 30(35) minutes, 
KATHERINE С. KENEALLY, Assistant Professor 
of Psychology, and Director, Remedial Reading 
Clinic, Catholic University of America, Wash- 
ington, D.C. 

These tests are intended to measure the in- 
structional needs of individual pupils and to fur- 
nish a sufficiently reliable instrument for classifi- 
cation and grouping within the classroom. A val- 
uable feature is a diagnostic outline and profile 
chart which enables the teacher to locate pupil 
needs quickly in order to plan corrective or 
remedial instruction. 

When published in 1929, this test was prob- 


ably a valuable diagnostic instrument for teach- 


ers who wished to locate pupil difficulties in 
order to plan remedial programs. Parts of the 
test now need revision, however. The test of 
word form would be strengthened by the in- 
clusion of some capitalized words, italics, and 
script instead of limiting this exercise to words 
of lower case letters, as some first grade pupils 
have difficulty in these areas. Some of the pic- 
tures should also be brought up to date. 

There is a need for a good diagnostic test of 
reading, particularly for remedial teachers work- 
ing at the elementary school level. It is hoped 
that the authors will consider revision and pre- 
sent new norms in order to compete with the 
more recent tests now available for the elemen- 
tary grades. 


[539] | 
Kansas Primary Reading Test. Grades 1-3; 1935; 
4 scores: sentence reading, word knowledge, para; 
graph comprehension, total; Forms A, В; $1.05 per 25; 
20€ per specimen set; postpaid; 12(20) minutes; Аша 
Hoag, Emma Humble, Bertha Robinson, Adeline үр 
and Н. E. Schrammel; Bureau of Educational Meas- 
азе Kansas State Teachers College of Em 
poria, 


Nira Banton Smiru, Professor of Education 
New York University, New York, New X ork. 

This test is divided into three distinct parts: 
Part т, a test of ability to read sentences, СО! 
sists of 22 yes-no questions; Part 2, a test 0 
word knowledge, consists of 20 multiple chok* 
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items ; and Part 3 consists of 8 paragraphs, each 
of which is followed by two multiple choice 
items. The exercises in all three parts begin 
with very simple sentences or paragraphs and 
gradually increase in difficulty. 

A helpful manual of directions accompanies 
the test. It contains explicit directions for ad- 
ministering the test, suggestions for interpreting 
and using the results, and tables of norms and 
percentile scores. According to the manual, va- 
lidity was established by checking each item 
"against reputable criteria, such as courses of 
study, text books, word lists, criticisms of 
teachers and supervisors, and analyses of pupils' 
test papers." This check, although a valuable 
safeguard, does not prove that the test measures 
what it purports to measure. Validity should 
also have been ascertained through intercorre- 
lations with other reading tests. Reliability co- 
efficients of .81, .93, and .84 are reported for 
grades т, 2, and 3, respectively. No reliability 
coefficients are reported for the part scores. No 
information is given as to how the reliability 
coefficients were computed or the numbers of 
pupils tested. 

Very complete information is given in regard 
to the norms. These norms were computed from 
scores made by 4,586 pupils at midyear, and 
3,960 pupils at end-of-year testing, reported 
from 159 schools in 19 different states. Percen- 
tile tables for interpreting both part and total 
scores are provided. 

'The strong points of the test are as follows: 
(a) The format and organization are excellent. 
(b) The increments of difficulty within each 
part appear to be gradual but decisive. (c) The 
test calls for one type of response only in all 
three parts—underlining. Because of this sim- 
plification pupils are freed from learning new 
techniques of working and can thus direct their 
major attention and efforts to the reading situa- 
tion itself. (d) The scoring is extremely easy. 

The disadvantages of the test are these: (a) 
Data are not presented for intercorrelations of 
the results of this test with those of other reading 
tests. (b) More information is needed in regard 
to the reliability of part scores and methods 
used in determining reliability coefficients. (c) 
The norms and percentiles indicate that the test 
is too difficult for first grade children. 

In summary, it might be said that this test 
has a sufficient number of good features to justify 
its revision, If the disadvantages mentioned 
above were removed through further experi- 


mentation and revision, this would be an ac- 
ceptable test for the early grades. 


For a review by Alice K. Liveright, see 40: 
1549. 


[540] 

*Lee-Clark Reading Test—First Reader, 1943 
Edition. Grades 1-2; 1931-43; 6 scores: auditory 
stimuli, visual stimuli, following directions, comple- 
tion, inference, total; Forms А (43), B (’43); no 
norms for part scores; manual (43); $1.25 per 25, 
postage extra; 356 per specimen set, postpaid; non- 
timed in part (25) minutes; J. Murray Lee and Willis 
W. Clark; California Test Bureau. * 


[54] 

Los Angeles Elementary Reading Test. Grades 
3-9; 1926-31; Forms 1 (726), 2 (26), 3 (26), 4 (26) ; 
manual ('31) ; $1 per 25, postage extra; 35€ per speci- 
men set, postpaid; 30(35) minutes; Jessie E. Ingra- 
ham; California Test Bureau. * 

Henry P. Ѕмітн, Associate Professor of Edu- 
cation, and Director of the Reading Laboratory, 
The University of Kansas, Lawrence, Kansas. 

This test is now over 25 years old. The age 
of the test makes the norms of questionable value 
because of the possible changes in the effective- 
ness of methods of teaching reading during the 
25-year period. 

For their time, the care used in selecting the 
items and in obtaining data concerning the effec- 
tiveness of the test is rather remarkable. How- 
ever, the usefulness of many of these data is 
lessened by the fact that some of the tests with 
which comparisons are made are no longer in 
use. In addition school curriculum and methods 
for teaching reading have been modified rather 
greatly in the past 25 years. 

While the selections chosen for the test situa- 
tions offer considerable variety in type of read- 
ing demanded, they do not appear typical of the 
current curriculum in reading. For example, an- 
cient kings and their problems, which are the 
basis for two selections in each form of the test, 
and a study of fables which is the basis for one 
selection in each form is not nearly as typical 
of reading instruction today as it was in 1926. 

It appears that while a reading test with a 
copyright of 1926, with norms revised in 1931, 
is not quite so out of date as a chemistry or 
physics test might be, changes in teaching meth- 
ods and materials, and general modifications of 
philosophy of education would serve greatly to 
reduce its worth. 


[542] 
Los Angeles Primary Reading Test. Grades 1-3; 
1925; Forms 1, 2, 3, 4; manual [25]; $1 per 25, post- 
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age extra; 35¢ per specimen set, postpaid ; 10(15) min- 
utes; Jessie E. Ingraham; California Test Bureau. * 


REFERENCES 
1. BAKER, FLoRENCE, AND Broom, M. E. “Concerning One 
Criterion for the Choice of Primary Reading Tests.” J Appl 
Psychol 16:419-20 Ag ’32. * (PA 7:4098) 


Niza Banton Ѕмітн, Professor of Education, 
New York University, New York, New York. 

This test purports to measure six reading 
abilities: (а) to follow a simple direction; (5) 
to follow a direction and to make a discrimina- 
Чоп ; (c) to interpret printed material in answer- 
ing a question and to draw an inference; (d) 
to discriminate between true and false conclu- 
sions as to implied facts and characterization ; 
(е) to perceive causes ; and (f) to perceive word 
meanings. 

Each of the four forms consists of a practice 
exercise and 24 test items. The test items are 
not grouped according to the abilities to be tested 
but are arranged miscellaneously. Several differ- 
ent types of responses are called for: marking 
symbols, underlining, crossing out, and writing 
answers to questions. 

The manual contains information about the 
test, clear-cut directions for giving and scoring 
the test, a table of age and grade norms, and a 
table of percentile ranks. 

Information given in the manual concerning 
the validity of the content is too general to be 
convincing. This statement is made: “The test 
materials are based on a careful study of pri- 
mary courses of study, research concerning word 
lists, and an examination of studies concerning 
reading materials and children's reading." It is 
questionable that a study of primary courses of 
study would contribute materially to the valida- 
tion of the content of the test. Research studies 
in regard to “word lists," “reading materials,” 
and “children’s reading" are legion. The content 
of this test would probably have had greater 
validity if the vocabulary had been drawn from 
one thoroughly scientific study or perhaps from 
a combination of two or three such studies. 

According to the manual, validity of function 
was determined through intercorrelations of the 
test with eight other reading tests. These inter- 
correlations, based on the scores of 120 second 
grade pupils on the several tests, yielded coeffi- 
cients ranging from .67 to .84, acceptable though 
not outstandingly high. No data are presented 
to show that the test actually measures each of 
the different types of reading ability which it 
is designed to measure. 

The test was originally standardized by giv- 


ing all four forms, together with the Haggerty 
Reading Examination, Sigma 1, and unnamed 
intelligence tests to 100 pupils in each of grades 
1—3. In this original standardization, the relia- 
bilities of the four test forms varied from .86 to 
-91 for a two-grade range as determined by in- 
tercorrelations. No data are given to support 
the claim that “extensive use indicates that the 
four forms are equivalent in difficulty and that 
the test provides a satisfactory measure of read- 
ing." The test is intended for use in grades 1-3. 
The grade placement norms, however, place a 
child in grade 1.5 if he makes a score of only 1 
on this test of 24 items. The child who gets all 
24 items correct is assigned a grade placement 
of 5.5. The norms confirm the impression that 
the test is too difficult for first grade children 
and better suits the second and third grade levels. 

In summary, it might be said that (a) the test 
is too ambitious in its attempt to measure so 
many different reading abilities with the use of 
only 24 test items; (b) it fails to make any dis- 
tinction among the items designed to measure 
different abilities either by grouping or label- 
ing, or in scoring procedures; (c) it is not care- 
fully graded in increments of difficulty; (d) it 
is much too difficult for first grade pupils in re- 
gard to vocabulary, the number of different types 
of responses which children are required to make, 
and inferential thinking which is beyond their 
level of maturity. 

On the other hand, the test has some good 
points. It is printed in large, clear, bold type. It 
can be given in the relatively short time of 10 
minutes. Its validity and reliability as deter- 
mined through intercorrelations are favorable. 
It taps some very important reading abilities 
which have largely been lost sight of in the more 
recent primary tests, such as drawing infer- 
ences, perceiving causes, and perceiving word 
meanings. As a rough check of general reading 
ability at second and third grade levels, it still 
might serve a useful purpose. 
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*Metropolitan Achievement Tests [Reading]. 
Grades 3-4, 5-7.5, 7-9.5; 1933-49; a subtest of Metro- 
politan Achievement Tests (see 18) ; 3 scores: reading, 
vocabulary, total; 3 levels; Forms R (дб), S (747), 
T ('49); directions for administering (’47) ; 80¢ рег 
manual ('48); postage extra; 356 per specimen set О 
any one level, postpaid; 35(45) minutes; Richard 
Allen, Harold H. Bixler, William L. Connor, ios 
ick D. Graham, and Gertrude Н. Hildreth; Worl 
Book Co. * 8: 
а) ELEMENTARY READING TEST. Grades 3-4; 1933-49; 
Forms R, S only ; $1.80 per 25. 


583 ] TESTS & REVIEWS: READING 


543 


b) INTERMEDIATE READING TEST. Grades 5-7.5; 1933- 


; $1.55 per 25. 
? ee READING TEST. Grades 7-9.5; 1933-49; 


$1.55 per 25. 
REFERENCES 

1-3. See 40:1189. 

"m [UT ЖЕКЕ “Deriving Contant Rate and 
Accuracy of Reading Norms for a Short Form of the Metropoli- 
tan Achievement Reading Test.” J Ed Psychol 32:359-64 My 
эт. * (PA 16:1207) ex : VE 

12. Stone, CLamENCE R. “Validity of Tests in Beginning 
Reading.” El Sch J 43:361-5 F '43. * (PA 18:2605) 

James К. Новзох, Director of Child Placement, 
Brookline Public Schools, Brookline, Massachu- 
setts. 

This series of reading tests is part of a de- 
servedly popular series of test batteries intro- 
duced in New York City about twenty years 
ago. They have undergone three revisions dur- 
ing this period. The 1947 revision and restand- 
ardization involved the largest and most com- 
plete standardizing population yet used in the 
development of a standardized measure of 
achievement. The normative material available 
in separate Booklets of Norms is the most com- 
plete this reviewer has seen. It includes not only 
age, grade, percentile, and modal age norms but 
also a wide variety of geographical norms as 
well as norms for Negro де eene schools. 

The reading sections of the batteries are pub- 
lished as separate tests beginning with the Ele- 
mentary Reading Test for grades 3-4. It is un- 
fortunate that the reading tests of the Primary I 
and Primary II Batteries have not been com- 
bined in a Primary Reading Test since their 
value has been demonstrated to this reviewer 
through 17 years of personal use. In fact, the 
three reading tests in the Primary I Battery ap- 
pear to be more outstanding for their grade level 
than are any of the other reading tests in the 
battery. 

In addition to the three features alluded to 
above, the chief virtues of the Metropolitan read- 
ing tests appear to be: (a) Each test is designed 
to cover a narrow grade range. Other things 
being equal, the smaller the grade range a test 
is designed to cover, the greater the proportion 
of the test which is a valid measure for the 
achievement of any given grade. (b) The au- 
thors have done an exceedingly good job of 
scaling as to difficulty of items and standardiza- 
tion to produce the optimum degree of com- 
parability from grade to grade and from form 
to form. This is exceedingly important if a 
cumulative test record from year to year is to be 
kept of individuals and classes. (c) Finally, 
there has been available soon after each revision 


an excellent Manual for Interpreting for the 
battery of which this reading test is a part. The 
present manual, a book of 122 pages, gives ex- 
tensive data regarding the development and 
standardization of the battery, has valuable sec- 
tions on various aspects of measurement and 
evaluation in general, and has a particularly 
helpful section on “Improving the Achievement 
of Individual Pupils through the Use of Test 
Results." Such a manual should be required 
reading in teacher training courses in measure- 
ment. 

While validity has been obtained by the usual 
method of adhering rather closely to the results 
of surveys of courses of study and the pooled 
judgments of "experts," the several revisions 
have made it possible to drop test items which 
did not distinguish between the capable and the 
less capable pupils and thus increase the validity 
of the instrument. This is at least a first step 
in the experimental determination of validity. 

Various minor criticisms of these tests have 
been made by previous reviewers, such as con- 
fusing directions to examiners in some instances, 
low diagnostic value, subjectivity of scoring, and 
the lack of fineness in scaling which allows the 
difference of one item to make a difference of 
from two to six months in grade level. This re- 
viewer and the 200 teachers who have adminis- 
tered these tests or their previous editions for 17 
years have not found the directions confusing 
although a set of supplementary directions for 
each test level is issued annually from the re- 
viewer's office. 

These tests could be arranged for analytical 
purposes along the lines of the California Read- 
ing Tests without detracting from their basic 
virtues. Many school systems cannot or do not 
use more than one standardized instrument in 
a grade in any one year. If an analytical feature 
based on skills measured were added to the 
present class analysis feature, it would be a 
boon to many teachers, particularly those with 
large classes. 

The so-called lack of fineness of scaling oc- 
curs only at the upper extremes of the range, 
two or more grade levels above the class median 
in practically every case. In the middle of the 
range it is more likely to take a difference of 
two items correct to make one month's differ- 
ence in grade level. The fact that at the extreme 
upper part of the range the norms are able to 
show even a rather coarse differentiation of two 
or three months in grade level for one item may 
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be considered indicative of high power of dis- 
crimination, since the values were obtained by 
actual administration of the test at the levels in- 
dicated and not by extrapolation. 

It is necessary to agree with the criticism of 
subjectivity in scoring the Paragraph Meaning 
sections of these tests since many acceptable an- 
swers are not included on the keys furnished. 
In fact, the reviewer has found it advisable, if 
not necessary, to have all Paragraph Meaning 
tests scored at a central office to assure some 
degree of uniformity. 

The strongest indictment the writer would 
make against these tests and the Metropolitan 
batteries in general is the failure to provide for 
machine scoring in the cases of those tests which 
are machine scorable and at the grade levels at 
which machine scoring is practical and desir- 
able. In school systems which have or which 
have access to an electric scoring machine, this 
failure to consider the wishes and convenience 
of test users will result eventually in deserved 
loss of business. 

To summarize, this reviewer would rate the 
Metropolitan reading tests as outstanding for 
general purpose every-pupil measurement and 
evaluation in reading, although they leave some- 
thing to be desired in the way of analysis and 
diagnosis if no other more specialized instru- 
ments are to be used. 


Marcaret G. McKim, Associate Professor of 
Education, University of Cincinnati, Cincinnati, 
Ohio. 

The value of these tests lies in their general 
estimate of paragraph reading ability and word 
knowledge. Although the questions in the para- 
graph reading sections call for varied reading 
skills, there is no way in which definite diagnos- 
tic evidence can be obtained. At the elementary 
level, particularly, the emphasis seems to be on 
noting details. The vocabulary test is general 
in content. 

At all three levels the tests are easy to admin- 
ister. The vocabulary test calls for the choice 
of a correct synonym from four words. The two 
sections in the paragraph reading part of the ele- 
mentary test call for the child, first, to select 
from four possibilities, the correct answer to a 
question and, second, to write in blanks at the 
side of the paragraph the words which are needed 
to complete numbered blanks in the paragraph. 

The paragraph reading sections of the inter- 
mediate and advanced tests call almost entirely 


for written responses. Since the test booklets are 
about the width of a typical textbook, the space 
allowed for writing is limited, even for an adult 
hand. This seems to place an unnecessary hurdle 
before the immature writer. Children who have 
spelling difficulties may also be handicapped, al- 
though spelling errors are not counted. Scoring 
the tests will be slow, as time must be taken to 
check on the correctness of alternate answers. 
Scoring standards are established so as to allow 
any reasonable response. 

Supplementary booklets of norms provide per- 
centiles, modal age-grade norms based on the 
proportion of pupils who are considered at grade 
for their age, and the norms for special popula- 
tion groups, including parochial schools, Negro 
schools, and selected states or groups of states. 
These supplementary norms should prove par- 
ticularly helpful to persons responsible for ad- 
ministering and interpreting city testing pro- 
grams. They will probably not be used exten- 
sively by the classroom teacher. 

The manuals accompanying the separate tests 
provide helpful suggestions to the teacher inex- 
perienced in giving standardized tests. A more 
extensive battery manual gives detailed sugges- 
tions on the planning of a school testing program 
and the interpreting and reporting of results. 
Of particular value to teachers is a section 
discussing general diagnostic and remedial 
procedures. The battery manual also gives a de- 
tailed discussion of the methods used in stand- 
ardizing all the tests in the battery and develop- 
ing norms. 

Inspection of the norms for the reading tests 
suggests that the elementary test will prove un- 
duly difficult for a typical class in the early part 
of the third grade; and that the advanced test 18 
not likely to provide enough challenge for the 
best readers in a typical class in the first half 
of the ninth grade. For the greater part of the 
grade range for which these tests are recom- 
mended, however, they should be appropriate. 
The tests will probably be of greatest value as 
general achievement tests in reading at the var- 
ious grade levels; for diagnostic purposes, they 
will need to be supplemented by reading tests 
which explore specific skills in more detail. 


For a review by Warren G. Findley of the 
complete battery, see 18; for a review by D. 4 
Worcester of an earlier edition, see 40:15515 for 
reviews by E. V. Pullias and Hugh B. W ood oj 
an earlier edition of the complete battery, $6 
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40:1189; for reviews by Ivan A. Booker and 
Joseph C. Dewey of an earlier edition, see 38: 
1105; for reviews by Jack W. Dunlap, Charles 
W. Odell, and Richard Ledgerwood of an earlier 
edition of the complete battery, see 38:874. 


[544] 
The Nelson-Denny Reading Test: Vocabulary 
and Paragraph: The Clapp-Young Self-Marking 
Tests. Grades 9-16; 1929-38 ; 3 scores: vocabulary, 
paragraph comprehension, total ; IBM; Forms A C203, 
В (30) ; manual ['38]; $1.35 per 25; $2.20 per 25 sets 
of test and answer booklet; separate answer booklets 
or sheets must be used; $1 per 25 answer booklets ; 50¢ 
per 25 IBM answer sheets ; 20¢ per stencil for machine 
scoring of answer sheets ; 27€ per specimen set; aya 
age extra; 30(35) minutes; M. J. Nelson and С; 
Denny; Houghton Mifflin Co. * 
REFERENCES 


1-6. See 40:1557. T 

7. Davis, NELSON W. A Study in Prediction Based on the 
Records of First-Year Students of the University of Arizona for 
1934-35. Master’s thesis, University of Arizona (Tucson, Ariz.), 
1937 and 1938. (Abstracts of Theses....1938, 1939, P- 19.) 

8. Larsen, ROBERT P. Common and Differential Factors in 
Reading Comprehension and Hearing Comprehension. Doctor's 
thesis, University of lowa (Lowa City, Lowa), 1938. 
. 9. LAWRENCE, WILLIAM А. da | ivalwation of Achievement 
in the Various Colleges of the Louisiana State University with 
Special Keference to Certain Aspects of the Junior Division. 
Doctor’s thesis, Louisiana State University (Baton Rouge, La.), 
1939. (Abstracts of Theses,...1939-1940, 1941, рр. 10-11.) = 

то. Lorr, Hiram V. A Comparative Study of Five Criteria 
for Predicting Achievement in Freshman History in the Junior 
Division at Louisiana State University. Master’s thesis, Louisi- 
ana State University (Baton Rouge, La.), 1939. (Abstracts of 
Theses....1939, 1940, р. 45.) 

11. Roy, Exic ArtHUR, Correcting High School Marks as a 
Means of Better Predicting College Success, Master's thesis, 
Clark University (Worcester, Mass.), 1939. (Abstracts of Dis- 
sertations....1939, pp. 170-2.) Pus 

12. VARNADO, GLADYS R. A Further Study of the Predictive 
Value of Various Criteria on Achievement in Freshman Mathe- 
matics at Louisiana State University for the Session 1938—1939. 
Master’s thesis, Louisiana State University (Baton Rouge, La.), 
1939. (Abstracts оў Theses....1939, 1940, P. 92 
‚ 13. ANDERSON, IRVING H., AND Dearsorn, WALTER Е. “Веай- 
ing Ability as Related to College Achievement." J Psychol 11; 
387-96 Ap '41. * (PA 15:3570) 

14. GREENE, PAUL C. "Some Relationships Between Place- 
ment Scores and Scholastic Rating." Proc lowa Acad Sci 48: 
361-6 '41. * (PA 16:2867) i { 

15. LANGSAM, ROSALIND STREEP, “A Factorial Analysis of 
Reading Ability.” J Exp Ed 10:57-63 541. * (PA 16:1187 

16. CRIDER, BLAKE. “А School of Nursing Selection. Pro- 
gram.” J Appl Psychol 27:452-7 O '43. * (P. 182283) 3 

17. GeNTRY, ровотну E. An абан to Predict Sci larship 
From a Variety of Objective Tests: College Ability Test, Nelson- 
Denny Reading Tesi, Kuhimann-Anderson Intelligence Test, 
and English Placement Test. Master's thesis, Ohio University 
(Athens, Ohio), 1943. Pp. 33. (Abstracts of ‘Masters’ Theses... 
1943, 1944, рр. 36-7.) c 

18. Portenier, LILLIAN С. “Predicting Success in Introduc- 
ry Paychologyi" Ed & Psychol Meas 8:117-26 sp '48. * C 

19. Моврнү, Hanorp D., AND Davis, FREDERICK B. "A Note 
on the Measurement of Progress in Remedial Reading." Pea- 
body J Ed 27:108-11 S '49. * 

728 Sirvey, HERBERT M. Change in Status of Iowa State 
vachers College Students as Revealed by хера Placement 
ей Iowa State Teachers College, Research Report No. 58. 

edar Falls, Iowa: Bureau of Research, the College, July 20, 
1949. Pp. 27. Paper, lithotyped. * 

Tar MONSON, LAWRENCE Davis, Comparative Analyses of a 
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e University of Missouri. Doctor's thesis, University of Mis- 
vu (Columbia, Mo.), 1949. Abstract: Microfilm Abstracts 9: 
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Ivan A. Booxer, Assistant Director, Division 
of Press and Labor Relations, National Educa- 
tion Association, Washington, D.C. 
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For more than 20 years this test has been a 
familiar tool in the kit of those concerned with 
the improvement of silent reading among college 
students. Although designed for use at all grade 
levels in high school and college, the test prob- 
ably performs its greatest service when used 
with high school seniors and college freshmen. 

The test is in two parts: a 100-item multiple 
choice vocabulary test and a 36-item compre- 
hension test consisting of 9 paragraphs, each of 
which contains about 200 words and is followed 
by 4 comprehension questions with multiple 
choice answers. Each vocabulary test item is al- 
lowed one point; each comprehension item, two 
points. The maximum score, therefore, is 172. 

This test was quite carefully constructed. 
Items were selected and scaled in terms of their 
difficulty, and the two forms of the test were 
carefully equated. This sound workmanship in 
constructing the test undoubtedly has been an 
important factor in its continued popularity and 
usefulness for more than two decades. 

The authors make no claim as to the validity 
of the test except to explain their purpose in con- 
structing it and their reasons for developing it 
as they did. Experience has shown, however, 
that its results correlate well with achievement 
in academic subjects. Its reliability is approxi- 
mately .90. 

The format of the test makes it economical to 
use. Students record their answers on separate 
answer sheets. For test users with machine scor- 
ing facilities, IBM answer sheets are provided. 
For other test users, Clapp- Young Self-Mark- 
ing answer booklets are provided. 

The time limits for the test are rather brief, 
especially for high school students, 10 minutes 
for the 100-item vocabulary test and 20 minutes 
for the comprehension test. This places a pre- 
mium upon rapid reading, immediate reactions, 
and lucky guesses. Careful reading, rereading, 
and deliberation are penalized, thus preventing 
a great many students from displaying their 
maximum performance. Yet, norms are invali- 
dated unless strict timing is observed. Since the 
test yields no rate of reading score, and since 
the measurement of speed is not one of the spe- 
cific purposes of the test, the indirect effect of 
rate of work introduced through the brief tim- 
ing may be regarded as an extraneous factor. If 
norms were available for the test administered 
as a generously timed or as an untimed test, its 
diagnostic value as a measure of power of vocabu- 
lary and comprehension would be increased. 
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The test manual emphasizes the total score, 
which is simply the sum of the vocabulary score 
and the comprehension scores. Grade equiva- 
lents are reported only in terms of the total score. 
Virtually nothing is said about the value of the 
test as a diagnostic instrument. Although this 
total score, obtained from the timed administra- 
tion of the test, provides a useful survey of the 
general level of reading performance for a group, 
this reviewer believes that the Nelson-Denny 
test is of greatest value when used in diagnosis. 
The manual does give the 1st and goth per- 
centiles, the deciles, and the quartiles for the 
vocabulary and comprehension scores. These, 
like the total score, are affected by the brief time 
limits but can be helpful if used with discrimina- 
tion. Moreover, the competent test user can re- 
move the time limits and soon establish local 
norms for his own diagnostic use. Proper evalu- 
ation of the separate vocabulary and comprehen- 
sion scores, together with analyses of the stu- 
dents’ responses to individual test items, will be 
far more significant, in many cases, than any 
comparisons of total scores that can be made. 
The latter, at best, are merely arbitrary combi- 
nations of separate elements which lose signifi- 
cance, rather than gain it, when their separate 
identities are obscured. 

Wherever a quick overview of the vocabulary 
knowledge and comprehension skill of high 
school seniors or college students is desired, this 
test is an efficient and dependable device. 


For a review by Hans C. Gordon, see 40: 
1557. 


[545] 
The Nelson Silent Reading Test: Vocabulary 
and Paragraph: The Clapp-Young Self-Marking 
Tests. Grades 3-9; 1931-39; 5 scores: vocabulary, 
general paragraph comprehension, ability to note de- 
tails, ability to predict probable outcome, total; IBM 
for Forms A, B; Forms A ('a1), B C32), С (39) ; 
manual ['39]; $1.35 per 25; $2.40 per 25 sets of test 
and answer booklet; separate answer booklets or sheets 
must be used; $1 per 25 answer booklets; 756 per 25 
IBM answer sheets; 40€ per set of stencils for ma- 
chine scoring of answer sheets; 306 per specimen set ; 
postage extra; 30(40) minutes; M. J. Nelson; Hough- 
ton Mifflin Co, * 3 

REFERENCES 


, 1. GRANT, ALBERT. “Results of Nelson Silent Reading Test 
in Grade IX.” Sch R 48:34-9 Ja '40. * (PA 14:2604) 


WirrrAM D. SHELDON, Associate Professor of 
Education, and Director of the Reading Labora- 
tory, Syracuse University, Syracuse, New York. 

This reviewer would make the same general 
comments concerning this test as did Constance 


M. McCullough (see 3:492). In addition, he 
would stress the unattractive format of the test 
as a feature in need of revision. 

Both the advertising literature and the man- 
ual for this test indicate the importance of its 
value as a diagnostic instrument. However, 
neither statistical validation nor the opinions of 
competent judges confirm the merits of the test 
as an aid in diagnosis. There seems to be no evi- 
dence that the paragraph reading subtests ac- 
tually measure the specific aspects of reading 
comprehension claimed by the author. 

The books suggested in the manual as being 
helpful to the teacher range in publication date 
from 1931 to 1939. While these older books 
might be valuable, there are available at least a 
dozen books and several hundred other refer- 
ences which are more recent. It would be help- 
ful if the publishers of the manual brought their 
references up to date. 

This reviewer suggests that even though the 
merits of this test as a measure of specific weak- 
nesses in comprehension are questionable, the 
total scores yielded by the test might be valuable 
as a general index of a pupil's grade placement 
in reading. 


For a review by Constance M. McCullough, 
see 3:492; for an excerpt from a review, see 
40:1558. 


[546] 

*Primary Reading: Every Pupil Test. Grades 2- 
3; 1936—51; new form usually published each April 
and December; form December 1951 ; no data оп relia- 
bility and validity; revised manual [’49] ; norms (’51) 3 
2156 per test; ІФ per answer key; postpaid; 19(30) 
minutes; Ohio Scholarship Tests, Ohio State Depart- 
ment of Education. * 


For reviews by William S. Gray and Virginia 


Seavey of forms for April and December 1046, 
see 3:493. 
[547] . 

*Reading Comprehension: Cooperative English 
Test: Lower and Higher Levels, Test C1 and C2. 
Grades 7-12, 11-16; 1940-51; also called Cooperative 
Reading Comprehension Test; 4 scores: vocabulary, 
speed of comprehension, level of comprehension, total; 
IBM; 2 levels; Forms R (’s0—same as test сору 
righted in 1941), S (Lower Level, '42; Higher Level, 
*5I—same as test copyrighted in 1942), T (Lower 
Level, ’43; Higher Level, 'so—same as test copy- 
righted in 1943), Y ('48); Form Q out of print; no 
data on validity; no specific manual; descriptive 
folder ['51] ; general Cooperative manual (’51) ; norms 
[40] ; $2.50 per 25 of any one level; 50¢ per specimen 
set of any one level, postpaid; separate answer sheets 
may be used; 80¢ per 25 ІВМ answer sheets; 15¢ рер 
stencil for scoring answer sheets; cash orders p. 
paid; 40(45) minutes; Frederick B. Davis, Harold V. 
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King (S), Mary Willis (T), Clarence Derrick (Ү), 
Harry R. Neville (Y), Jeanne M. Bradford (Y), and 
Geraldine Spaulding (Y) ; Cooperative Test Division, 
Educational Testing Service. * 
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For reviews by Robert Murray Bear and 
J. B. Stroud, see 3:407; for reviews by J. Paul 


Leonard, Edward S. Noyes, and Robert C. 
Pooley of the Forms R, S, and T of the complete 
battery, see 3:120. 


[548] 

*Reading: Seven Plus Assessment: Northumber- 
land Series. Ages 7-8; 1951; for complete battery, see 
24; 1 form ['51] ; no data on reliability ; manual ['51] ; 
75. 6d. per 25; Od. per single сору; 1s. per manual ; 25. 
6d. per specimen set (includes the other 2 tests in the 
series) ; postage extra; (75) minutes; C. M. Lambert; 
University of London Press Ltd. * 


[549] 

*Reading Test (Comprehension and Speed): 
Municipal Tests: National Achievement Tests. 
Grades 3-6, 6-8; 1938-50; same as reading tests in 
Municipal Battery (see 20); 5 scores: following di- 
rections, sentence meaning, paragraph meaning, read- 
ing speed, total; 2 levels; Forms A (50), В (39) ; 
no data on reliability and validity and no description 
of normative population in manuals; no norms for part 
scores; Form A manual ('50), Form B manual (39) ; 
$1.75 рег 25; 356 per specimen set of any one level; 
postage extra; Robert K. Speer and Samuel Smith; 
Acorn Publishing Co. * 

a) GRADES 3-6. 33(38) minutes. 

b) craves 6-8. 32(37) minutes. 


[ 550] 

SRA Reading Record. Grades 8-13; 1047 ; 11 scores: 
reading speed, comprehension, paragraph meaning, di- 
rectory reading, map-table-graph reading, advertise- 
ment reading, index usage, technical vocabulary, sen- 
tence meaning, general vocabulary, total; 1 form; 
49€ per set of test and answer pad; separate answer 
pads must be used; $1.80 per 25 answer pads; 556 per 
25 profile and norm sheets ; 756 per specimen set; cash 
orders postpaid; 28(40) minutes; Guy T. Buswell; 
Science Research Associates, Inc. * 


REFERENCES 
т. Buprow, Стлруѕ F. "Reading the SRA Reading Record,” 
рр. 141—3. In Claremont College Reading Conference, Four- 
teenth Yearbook, 1949: Conference Theme: The Problems and 


deck ud Involved in Reading Social Relationships. Clare- 
mont, Calif.: Claremont College Curriculum Laboratory, 1949. 
Pp. viii, 191. Paper. * 


2. SELDERS, GILBERT R. W, A Study of the Academic, Social 
and Persona! Needs of Students of the Pennsylvania State Col- 
lege for the Years 1948—1950 Through an Analysis of Instru- 
ments Administered in Education 105. Doctor’s thesis, Pennsyl- 
vania State College (State College, Pa.), 1950. (Abstracts of 
Doctoral Dissertations....1950, 1951, PP- 289-97.) (PA 26: 
2405, title only) 

WiLLiAM М. TURNBULL, Vice President, Edu- 
cational Testing Service, Princeton, New Jersey. 

This is a short, highly speeded test divided 
into 10 sections, each of which is designed to 
measure one of 10 “basic skills in reading.” Of 
these, 5 are rather general in nature (rate of 
reading, reading comprehension, paragraph 
meaning, sentence meaning, and general vocabu- 
lary), 4 are specific applications of reading skills 
to various kinds of material (directory reading, 
map-table-graph reading, advertisement read- 
ing, index reading), and 1 is technical vocabu- 
lary. 

' The rationale underlying the analysis of read- 
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ing into this mixture of general functional skills 
and specific areas of application is not presented. 
It seems unlikely, however, that a priori agree- 
ment on such a classification would be found 
among authorities in the field. This raises two 
questions : (a) does the test appear to cover most 
of the reading skills that are usually posited, and 
(b) does it yield reliable and independent meas- 
ures of the skills it is designed to test ? 

The test appears to do the most adequate job 
in the measurement of vocabulary. This seems 
almost inevitable in view of its brevity: there is 
time to measure little beyond word knowledge 
and memory for factual detail. Broader skills 
such as interpretation, inference, formulation of 
judgments and conclusions, and comparison of 
ideas are largely neglected. 

The manual presents exceedingly high relia- 
bility coefficients for the 11 scores. It fails to 
point out that such coefficients are spurious when 
based on sections as highly speeded as these. 
(Time limits are 3 minutes for each of eight sec- 
tions, 2 minutes for the other two. The direc- 
tions warn that a timing error “of even five 
seconds can increase a score ten points," a huge 
gain in view of the fact that the number of items 
in the sections varies from 10 to 25.) Assuming 
that the coefficients reported were obtained by 
an internal consistency method rather than by 
a test-retest method (not specified in the man- 
ual, but likely in view of the existence of only 
one form of the test), the reviewer must con- 
clude that no useful estimate of reliability has as 
yet been established, since the internal consist- 
ency methods give gross overestimates of the 
reliabilities of highly speeded tests. 

Since no intercorrelations of scores on the 
IO sections are given, it is impossible to judge 
the independence of these scores—a vital matter 
in view of the fact that they are to be plotted in 
profile form and interpreted differentially. The 
manual states : “While relationships between the 
tests of the Record are moderate to high, a pro- 
file of specific reading skills is essential for 
complete diagnosis and remedial treatment of 
individual students.” This statement suggests 
that the Record, in view of its sizeable intercor- 
relations of parts, will not be found useful as a 
diagnostic instrument. The prospective user 
would be able to make a much more intelligent 
judgment on this point if the intercorrelations, 
which have apparently been found, were pre- 
sented. Í { 

No statistical data pertaining to validity are 
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reported. The manual contains the unsupported 
assertion that "the tests have proven successful 
in differentiating good from poor readers in 
the skills needed for scholastic and occupational 
success." In the absence of the evidence on which 
this comment is based, the power of the test to 
discriminate between good and poor readers 
must be guessed at from inspection of the kinds 
of abilities apparently tested. As indicated ear- 
lier, the results of the inspection are not reassur- 
ing. It is fair to say, however, that skills in this 
area tend to be highly correlated, and a test 
which omits several of them, as this one appears 
to, may still provide a useful and discriminat- 
ing overall measure of reading ability. 

In format the test is handsome. It is attrac- 
tively and conveniently printed, as are the man- 
ual and the self-interpreting profile. The direc- 
tions and discussion are clearly and simply 
written. The use of progressively narrower ques- 
tion sheets to expose successive columns of an- 
swer spaces makes for convenience in taking the 
test. 

A stylus is used to mark the answers to the 
questions by punching holes through the re- 
sponse positions. The key is printed on the re- 
verse of the answer sheet, eliminating the need 
for a stencil. 

In summary, the SRA Reading Record is a 
short, easily administered, highly speeded test 
of reading ability. It covers a somewhat con- 
glomerate and incomplete group of skills and 
yields scores of indeterminate reliability and 
unspecified validity. It is probably of some value 
in obtaining a level of general reading ability for 
a group, and to a lesser degree, for an individual. 
In view of the paucity of data reported, its use- 
fulness in individual diagnosis is questionable. 


For a review by Frances Oralind Triggs and 
an excerpt from a review, see 3:502. 


[551] 
Sangren-Woody Reading Test. Grades 4-8; 1927- 
28; 8 scores: word meaning, rate, fact material, total 
meaning, central thought, following directions, Or- 
ganization, total; Forms A (’27), B (728); revise 


. manual ('28); $2 per 25, postage extra; 356 per speci- 


men set, postpaid; 27(35-40) minutes; Paul V. San- 
gren and Clifford Woody; World Book Co. * 


REFERENCES 
1-7. See 40:1565. 
Davi H. RusseLL, Professor of Education, 
University of California, Berkeley, С alifornia. 2 
А complete description of this test is given їп 
The 1940 Mental Measurements Yearbook 10 
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a review by Liveright (40: 1565) and will not 
be repeated here. The present review attempts 
to evaluate the test in light of recent develop- 
ments in testing and instruction in reading. 

A first survey of the test gives the impression 
that, despite lack of revision, the test follows 
rather closely modern trends in reading instruc- 
tion. Its seven subtests suggest that reading is 
not a unitary ability applicable to all situations 
but rather a complex process of several abilities 
or factors. Although these are interdependent 
variables, the test properly offers separate meas- 
ures of comprehension and speed in reading. It 
provides a profile sheet for individual pupils that 
indicates strengths and weaknesses in seven dif- 
ferent areas and which therefore may have more 
diagnostic value than some more recent reading 
tests, Furthermore, the accompanying manual 
gives a clear statement of the steps taken in 
standardizing the test and in determining its 
validity by correlations with external criteria. 
A coefficient of reliability, based on the alternate 
forms method, is given for each subtest and for 
the total score. 

Further study of the test raises two questions 
which should be investigated and answered by 
the authors or the users of the test. The first con- 
cerns the selection of factors presumed to make 
up reading ability. The factors measured, word 
meaning, rate, and five varieties of comprehen- 
sion, are often included, in whole or in part, in 
other reading tests. However, Part I, Word 
Meaning, is just one more test measuring the 
superficial recognition of a synonym and neg- 
lecting one important aim of vocabulary work 
and school lessons—the depth and breadth of 
concepts. In Part II, Rate, the score is based on 
the child's speed for the first minute only, al- 
though he has the opportunity to read for a 
total of three minutes. An average score on a 
longer passage might give a more valid result. 
The comprehension parts deal with getting facts, 
understanding the main idea, following direc- 
tions, and stating a sequence of events. These are 
all important phases of reading ability which 
should be tested, but other reading abilities which 
are stated aims in current programs are omitted. 
For example, the test does not include some of 
the more creative aspects of reading, such as 
critical reading of controversial material, read- 
ing to interpret motive or character, and reading 
to predict future or related outcomes. The neg- 
lect of some of the more thoughtful forms of 
reading comprehension is not an omission in the 


Sangren-Woody test alone, but is a factor to be 
considered in relating a testing program to an 
instructional program. 

A second question about the test is the matter 
of nomenclature. As Cronbach ! has pointed out, 
makers of reading tests often develop similar 
tests but call them by different names. There 
can probably be no complete agreement as to 
what a subtest is testing, but the type of exercise 
presented in Part III, Fact Material, is often 
called “Reading for Details," and Part IV, 
Total Meaning, and Part V, Central Thought, 
seem to overlap considerably in what they meas- 
ure. Some recent data on the intercorrelations 
of the various subtests, which are not presented 
by the authors in the manual, would answer this 
question of overlap positively or negatively. 
Finally, Part VII, Organization, seems to be 
named too generally since it deals only with find- 
ing a sequence of ideas or events. 

The two questions raised above should not be 
regarded as fundamental criticisms of what has 
been a very useful reading test. They point 
rather to further study and possible revision of 
a test which is now over twenty years old. In the 
absence of such study and revision, schools will 
find that the Sangren-Woody test offers a less 
intensive diagnosis of reading than individual 
diagnostic series such as the test batteries of 
Durrell and of Gates, but that it has survey value 
combined with more diagnostic value than many 
other current survey tests. 


For a review by Alice К. Liveright, see 40: 
1565. 


[552] 
*The Schonell Reading Tests. 1942-51; tests are 
reproduced in full in Diagnostic and Attainment Test- 
ing ('50—see 3 below) ; tests Rs, R6, and R7 not avail- 
able as separates; I form; 2s. 6d. per 12 of any one 
test; 25s. per copy of Backwardness in the Basic Sub- 
jects, Third Edition (746—ѕее 1 below) which serves 
as the manual for tests R2, R3, R4, Rs, R6, and R7; 
6s. per copy of The Psychology and Teaching of Read- 
ing, Second Edition ('46—see 2 below) which serves 
as the manual for the Graded Word Reading Test; 
postage extra; Fred J. Schonell ; Oliver & Boyd Ltd. * 
а) TEST R2, SIMPLE PROSE READING TEST. Ages 6-9; 
1942; also called My Dog Test; individual; (3-8) 
minutes. 
b) TEST R3, SILENT READING TEST A. Ages 7-11; 1042; 
9(15) minutes. 
с) TEST R4, SILENT READING TEST B. Ages 9-13; 1942; 
15(20) minutes. 
àj TEST RI, GRADED WORD READING TEST. Ages 5-15; 
1942; also called Graded Reading Vocabulary Test; 
individual; (5-15) minutes. 
е) TEST R5, TEST OF ANALYSIS AND SYNTHESIS OF 


1 Cronbach, Lee J. Essentials of Psychological Testing. New 
York: Harper and Brothers, 1949. Pp. xiii, 475. * 
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WORDS CONTAINING COMMON PHONIC UNITS. Individ- 
ual; (5-15) minutes. 

f) TEST RÓ, TEST OF DIRECTIONAL ATTACK ON WORDS. In- 
dividual; (5-10) minutes. j 
g) TEST R7, VISUAL WORD DISCRIMINATION TEST. Indi- 
vidual; (10-15) minutes. 


REFERENCES 
І. SCHONELL, Fren J. Backwardness in the Basic Subjects, 
Third Edition. Edinburgh, Scotland: Oliver & Boyd Ltd., 1946. 


Pp. xix, 560. * È 
E БЕКЕНЕ FreD J. The Psychology and Teaching of 


Reading, Second Edition. Edinburgh, Scotland: Oliver & Boyd 
Ltd., 1946. Pp. 128, * ) 

3. ScHoNELL, FRED J., AND ScHoxELL, Е. ELEANOR, Diag- 

mostic апа Attainment Testing: Including a Manual of Tests, 
Their Nature, Use, Recording and Interpretation. Edinburgh, 
Scotland: Oliver & Boyd Ltd., 1950. Pp. viii, 168, * 
M. L. KELLMER PRINGLE, Lecturer in Educa- 
tion, and Acting Deputy Head, Remedial Edu- 
cation Centre, University of Birmingham, Birm- 
ingham, England. 

Together these tests make a comprehensive 
battery for the measurement of reading attain- 
ment levels and for the detailed diagnosis of diffi- 
culties. They consist of a test of word recogni- 
tion skill. (R1), three comprehension tests at 
different levels of difficulty (R2, R3, and R4) 
and three unstandardised tests for diagnosing 
different aspects of mechanical failure in read- 
ing (R5, R6, and R7). With the exception of 
R3 and R4, all the tests are administered indi- 
vidually and depend to a considerable degree 
on the insight of the clinician, though diagnostic 
pointers are fully discussed in Schonell's ВасЁ- 
wardness in the Basic Subjects (1) in which they 
first appeared. 

The Graded Word Reading Test, К, consists 
of 100 words graded in difficulty to give me- 
chanical word recognition levels from 5 to 15 
years. It thus supplies a first rapid assessment 
of reading level. Though similar to Burt’s and 
Vernon’s reading vocabulary tests, it differs in 
one important respect when one is dealing with 
young or poor readers: Кт is tied to a “look- 
and-say" approach to reading, whereas the other 
two presume early phonic teaching. Thus young 
children will achieve results on Schonell's test 
which are higher or lower than those on the 
other two according to the method by which they 
have been taught. 

The Simple Prose Reading Test, R2, pri- 
marily measures comprehension. Norms for 
speed and accuracy of reading as well as for 
comprehension are provided, adding further di- 
agnostic and attainment pointers. The print, 
lay-out, and content of this test make it suitable 
for young children. However, the gradient of 
difficulty is too steep for weaker readers and is 
not a sensitive enough measuring device for the 


lower ranges of ability. Furthermore, the story 
would make a better test for measuring compre- 
hension if it contained a greater number of con- 
crete incidents instead of the rather diffuse de- 
scriptive sequences of the second half of the test, 

The Silent Reading Tests, R3 and R4, which 
may be given as group or individual tests, con- 
sist of short, disconnected paragraphs followed 
by questions to be answered or instructions to 
be followed in the case of R3, and by multiple 
choice vocabulary items, in the case of R4. So 
that the same test booklets can be used again, 
separate answer slips are provided. While this 
is a welcome economy to the hard pressed head- 
master, its suitability particularly for Juniors, 
is in some doubt. It tends to discourage slow 
and untidy writers and to lead to more con- 
fusions than underlining a word in the actual 
test booklet. The layout of the tests would be 
improved by more generous spacing between 
paragraphs and the use of a different type face 
for the questions or words. But the most serious 
criticism of R3 and R4 is that the prewar norms 
for these two tests given in Backwardness in the 
Basic Subjects differ materially from the post- 
war norms in Diagnostic and Attainment Test- 
ing, the difference increasing from 9 months at 
the beginning of the test to 2 years 3 months 
at the maximum age level. No explanation is 
offered for this; in fact, the difference in norms 
is not even mentioned. | 

Тезїз R5, R6, and R7 constitute the main di- 
agnostic battery. R5 measures the reader's 
knowledge of common phonic units and powers 
of analysis and synthesis of phonic constituents. 
The whole test consists of 9o words, but it 1s 
not necessary to give the test in its entirety 1n 
every case. RÓ indicates the consistency of the 
left-right approach, tendencies for reversals or 
partial reversals, and confusions of similar let- 
ters (such as b and d). R7 is concerned with 
visual perceptual recognition and supplies quali- 
tative information on short-term recall of wor 
patterns. á 

In addition to these three diagnostic tests, 
Schonell supplies tests—similar to those sug- 
gested by Burt in The Backward Child and by 
Whipple in the Manual of Mental and Physical 
Tests—for assessing auditory and visual recall 
of nonsense material of different types. 

It is very useful and convenient to have all 
Schonell’s reading tests available in a separate 
handbook ; regrettably, however, one is still re- 
ferred to Backwardness in the Basic Subjects 
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and The Psychology and Teaching of Reading 
for the detailed instructions for administering 
them. Furthermore, although it is over eight 
years since Backwardness in the Basic Subjects 
was published, data on the standardisation, re- 
liability, and validity of the tests remain inade- 
quate. For the Graded Word Reading Test, Кї, 
only norms are given—no particulars. The Sim- 
ple Prose Reading Test, R2, is said to be based 
on 512 cases, but no further details are given. 
For the Silent Reading Tests, R3 and Ry, it is 
stated that “1,865 cases were used to obtain 
approximate norms,” but no information about 
the subjects is offered. Estimates of the relation- 
ship between silent reading ability and general 
intelligence, based on the results of 210 children, 
age 8-11, on R3 and R4 and a verbal group 
intelligence test are reported ; correlation coeffi- 
cients range from .67 to .86, but the number of 
cases in each age group is small (from 40 to 63). 
The reliability coefficient for R4 is given*as .92. 
For the new norms in Diagnostic and Attain- 
ment Testing no particulars are given at all. 

Itis worth noting that, in common with almost 
every diagnostic battery, these tests are con- 
structed rather to indicate superficial defects in 
acquired skills than to push the diagnosis deeper 
into the underlying causes of reading difficulties. 
For this purpose, as Schonell himself indicates 
in his extensive research, a wide, detailed and 
subtle study of the child and his environment is 
necessary. Useful as they are, diagnostic and 
attainment tests are merely the beginning ; they 
are ad juncts and not substitutes for trained clin- 
ical insight and informed judgement. For the 
purpose for which they were designed, however, 
the Schonell Reading Tests—despite their short- 
comings—are the most comprehensive so far 
available in England.. 


For a review by Edith I. M. Thomson, see 
3:499. 
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Shank Tests of Reading Comprehension. Grades 
deir. I0-12; 1929; 8 scores: direct details, implied 
а Gun general sense of a paragraph, determination of 
ds nite ideas, recognition of references, truth of idea, 
SOCIUS, total; 3 levels; Forms A, B, C; original 
manual out of print; $1.60 per 25 of any one level ; 15 
eng sabbreviated manual ['29—a reprint of Chapter V 
m the original manual]; postage extra; 10€ per speci- 
п set of any one level (does not include manual) ; 
EM per specimen set of all forms of all levels; post- 
Es ; Spencer Shank; C. A. Gregory Co. * 
TEST I. Grades 3-6; 18(28) minutes. 
cj TEST П, Grades 7-9; 20(30) minutes. 
TEST пт, Grades 10—12; 20(30) minutes. 


REFERENCES 

1-3. See 40:1567. 

WirLrAM D. Ѕнегрох, Associate Professor of 
Education, and Director of the Reading Labor- 
atory, Syracuse University, Syracuse, New 
York. 

This test is designed to measure reading com- 
prehension. Each form of the test provides 10 
paragraphs of reading content. In Test I, for 
grades 3-6, each paragraph is followed by six 
questions, each of which purports to measure a 
specific aspect of comprehension, e.g., answering 
a direct question on content details and determin- 
ing whether the content stated a given idea. Test 
II, for grades 7-9, and Test III, for grades 10- 
12, have, in addition to the six questions meas- 
uring comprehension, a question requiring the 
student to supply a synonym from the paragraph 
for a word given in the question. The test is hand 
scored by means of a scoring key, and answers 
are written on the test booklet. There is no evi- 
dence that a machine scored answer sheet is 
available. A summary statement of data neces- 
sary for evaluating scores is provided for teacher 
use. Correlations with IQ’s, but no correlations 
with other reading tests, have been provided. 

Within-grade reliability coefficients between 
different forms of the test range from .89 to .92. 
The summary statement provides tables of age 
and grade norms for each test and subtest, based 
on a population of approximately 5,000 students. 

The difficulty level of the paragraphs has been 
determined by a system of weights derived from 
the success of pupils on the various paragraphs. 
The content of the paragraphs is largely drawn 
from textbook-type materials from literature, 
science, mathematics, and history backgrounds. 

The material is dated and deals largely with 
knowledge of specific facts rather than with gen- 
eral ideas or inferential material. There is no 
statistical evidence presented which would con- 
firm the author's contention that the individual 
questions do provide measures of specific aspects 
of comprehension. Hence, the diagnostic value 
of the test is questionable when success on groups 
of items is used to measure ability in a specific 
comprehension skill. 

The construction of the test seems to have 
been quite limited; there is no evidence that 
either the construction of the items or the sta- 
tistical evaluation of the test compares in refine- 
ment with tests of a later design. There seems 
to be no reason for a teacher to use this test when 
others of a superior design such as the 5 tanford 
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Achievement Test in Reading, the California 
Reading Test, and the Metropolitan Achieve- 
ment Test in Reading are available. 


For a review by James R. Hobson, see 40: 
1567. 
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*Silent Reading Comprehension: Iowa Every- 
Pupil Tests of Basic Skills, Test A. Grades 3-5, 
5-9; 1940-47; for complete battery, see 15; 3 scores: 
reading comprehension, vocabulary, total; IBM for 
grades 5-9; 2 levels; Forms L (’40), M (41), N 
(42), О (43); manual ('45) ; battery manual (47) ; 
336 per specimen set of any one level; postage extra; 
H. F. Spitzer in collaboration with Ernest Horn, 
Maude McBroom, H. A. Greene, and E. Е. Lindquist; 
Houghton Mifflin Co. * 

а) ELEMENTARY BATTERY. Grades 3-5; $1.60 per 25; 
46(60) minutes. 

b) ADVANCED BATTERY. Grades 5-9; IBM ; $1.75 per 
25; separate answer sheets may be used; 636 per 25 
IBM answer sheets; 20€ per stencil for machine scor- 


ing of answer sheets; 68(85) minutes. 


For reviews by Miriam M. Bryan and Anton 
Thompson of the complete battery, see 15; for 
reviews by James R. Hobson and Constance M. 
McCullough, see 3: 501; for reviews by Frederic 
L. Ayer, Gustav J. Froehlich, and Ralph C. 
Preston of the complete battery, see 3:10; for 
reviews by William A. Brownell, J. Murray Lee, 
and Charles W. Odell of the 1937 form of the 
complete battery, see 38:872. 


[555] 
Stanford Achievement Test [Reading]. Grades 
2-3, 4-6, 7-9; 1923-43; a subtest of Stanford Achieve- 
ment Test (see 25); 3 scores: paragraph meaning, 
word meaning, total; 3 levels; Forms D (до), E 
(40), F (41), С (42), Н 43) ; directions for ad- 
ministering (40); the Manual for Interpreting re- 
ferred to in the directions for administering has not 
been published ; $145 per 25 of any one level, postage 
extra; 35¢ per specimen set of any one level, postpaid ; 
Truman L. Kelley, Giles M. Ruch, and Lewis M. Ter- 
man; World Book Co, * 
а) PRIMARY READING TEST. Grades 2-3; 1023-41; 
Forms D, E, F only; 25(30) minutes. 
b) INTERMEDIATE READING TEST. Grades 4-6; 1923-43; 
30(35) minutes. 
C) ADVANCED READING TEST. Grades 7-9; 1923-43; 35 
(40) minutes. 


REFERENCES 

T: CURRENT, W. F., AND Rucu, С. M. “Further Studies on 

the Ea abiiy of Reading Tests.” J Ed Psychol 17:476-81 O '26. 
1:19 

2. BAKER, FromENCE, AND Broom, M. E. “Concerning One 
Criterion for the Choice of Primary Reading Tests." T Appi 
Psychol 16:419-20 Ag Кы * (PA 7:4098) 

3. Mirramp, Сес. V. “The Nature and Character of Pre- 
adolescent Growth in Reading Achievement." Child Develop 
11:71-114 Je '4o. * (PA 14:4797) 

4. PrLIEGER, Ermer Е. “A Study of Reading Grade Levels.” 
J Ed Res 42:541-6 Mr '49. * (РА 23:5752) 


James R. Hosson, Director of Child Placement, 
Brookline Public Schools, Brookline, Massa- 
chusetts. 

This is the oldest of the reading tests in a 


widely used achievement test battery and, de- 
spite certain basic limitations to be pointed out, 
remains one of the more satisfactory ones for 
accomplishing its purpose. Whether it deserves 
its wide use in the face of its faults and omissions 
is a moot question. 

The Stanford tests have three basic advan- 
tages over competing tests: authorship of as yet 
unparalleled distinction, the earliest start in the 
field, and the first really adequate job of stand- 
ardization, These assets have resulted in the gen- 
eral acceptance of the tests as a standard. Several 
revisions have resulted in such improvements 
and modernization as the machine scored edition 
of all tests for grades 4 and over and the publica- 
tion of an edition of one form of the Partial Bat- 
tery, Intermediate and Advanced, in extra large 
type for use in sightsaving classes. (This edition 
may be obtained from Stanwix House, 336 
Fourth Ave., Pittsburgh 22, Pa.) 

For obtaining valid and reliable measures of 
ability in paragraph meaning and word meaning 
in terms of age or grade norms, between the ages 
of 7 years 8 months and 15 years, or between the 
eighth month of the second grade and the begin- 
ning of the tenth grade, the Stanford reading 
tests are entirely adequate. The remainder of 
this review will be devoted to their inadequacies 
and unfulfilled functions. 

On the debit side of the ledger this reviewer 
would list the following: | 

a) There is no reading test covering the skills 
acquired in grade т. Yet, several skills whose de- 
velopment is begun in grade 1 can be adequately 
measured at that level. In most school systems 
the degree to which a child has acquired these 
skills is the most important criterion for his 
grade or class placement. In fact, there is no 
reliable measure below the eighth month of the 
second grade since all such values are obtaine 
by extrapolation. This series of tests neglects 
entirely the most crucial grade levels in begin- 
ning reading, consequently preventing the keep- 
ing of any cumulative standardized test recor! 
based on comparable tests for any child or class 
until near the end of the second grade. г 

b) Since there is no manual for interpreting 
the current edition and forms of these tests, their 
full and proper use is difficult except for users 0 
long standing who are familiar with the manuals 
of previous editions. The 2- to 4-page leaflet cov- 
ering directions for administering which accom- 
panies the various levels of these reading fests 
neglects to mention reliability or validity data, 
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although several hundred words are devoted to 
an explanation as to why the age and grade 
norms on the front of the test booklet are based 
on modal age groups. This concept is both inter- 
esting and worth while. Its practical usefulness 
in any given school situation is governed, how- 
ever, by the extent to which the given school’s 
entrance age, and hence its modal age in each 
grade, coincides with that of the standardization 
population. This of course points up the desira- 
bility of local standardization and norms. 

c) There are no suggestions of any kind for 
making use of individual or class test results in 
an analytical or remedial way. Perhaps these are 
to be included in the Manual for Interpreting 
which remains unpublished after 11 years. 

d) This reviewer doubts the validity of a read- 
ing test at the second grade level which requires 
a pupil to write his answers. Incidentally, as 
mentioned by Margaret McKim in a previous 
review of these tests (see 3:503), the spaces pro- 
vided for writing the omitted words in Test 1 
of the Primary Reading Test are too small for 
some words for some children. 

In conclusion and summary, it appears to this 
reviewer that this series of reading tests, despite 
distinguished authorship, adequate standardiza- 
tion, and a history of past usefulness is, because 
of its omissions, somewhat outmoded for present- 
day classroom uses and needs. For survey pur- 
poses and in the hands of experienced users it 
remains satisfactory within self-set limits. 


For reviews by Paul R. Hanna and Claude E. 
Norcross and Vergil E. Herrick of the complete 
battery, see 25; for review by Margaret G. Mc- 
Kim, see 3:503; for reviews by Walter W. Cook 
and Ralph C. Preston of the complete battery, 
see 3:16. 
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*Techniques in Reading Comprehension for 
Junior-Senior High School: Every Pupil Test. 
Grades 7-12; 1937-51; new form usually published 
each April and December; form December 1951; no 
data on reliability and validity; revised manual [49]; 
norms (’51) ; 210% per test; 16 per answer key; post- 
paid; 18(25) minutes; Ohio Scholarship Tests, Ohio 
State Department of Education. * 


For reviews by Ivan A. Booker and James M. 
McCallister of April and December 1946 forms, 
sce 3:505. 

[ 557] 
*Tests of Reading: Cooperative Inter-American 
Tests. Grades 1-3, 4-7, 8-13; 1950; 3 scores: vocabu- 


lary, comprehension, total; IBM for grades 4 and 
over ; 3 levels; 2 editions; English edition: Forms AE, 
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BE; Spanish edition: Forms AS, BS; no data on re- 
liability and validity; tentative norms; 50€ per series 
manual; cash orders postpaid; 50¢ per specimen set of 
any one level, postpaid; prepared under the auspices of 
the Committee on Modern guages of the American 
Council on Education, Herschel Т. Manuel, Director 


of Test Construction ; Cooperative Test Division, Edu- 
cational Testing Service. * 
а) PRIMARY LEVEL. Grades 1-3; $2.25 per 25; nontimed 
(25) minutes. 
b) INTERMEDIATE LEVEL. Grades 4-7; IBM; $2.50 per 
25; separate answer sheets must be used; per 25 
IBM answer sheets ; 15¢ per stencil for scoring answer 
sheets; 40(50) minutes. 
C) ADVANCED LEVEL. Grades 8-13; IBM; prices same 
as for Intermediate Level. 
REFERENCES 
1. Воо, Іѕмлк, Ropricuez. A Study of the Parallelism of 


, 
MN and Spanish Vocabwlaries. Doctor's thesis, University 
of Texas (Austin, Tex.), 1944. 

2. MCCRANIE, ЈоѕЕРНІМЕ, А Study of Fowr Inter-American 
Tests Applied, to High School Seniors. Master's thesis, Uni- 
X of Texas (Austin, Tex.), 1944. 

з. KELLEY, Frances. A Study of the Inter-American Tests 
at the Hi p "School Level. Master's thesis, University of Texas 
pig x), 1945. M = ТОТ 

4. Fire, ROBERT HERNDON, AND MANUEL, HERSCHEL Т, 
Teaching of English in Puerto Rico, рр. 171—313, 337—410. 
Prepared for the American Council on Education. San Juan, 
P.R.: Department of Education Press, 1951. Pp. xix, 410. * 


]лсов S. OrLEANS, Professor of Education, and 
Director of Research and Evaluation, Division 
of Teacher Education, The College of the City 
of New York, New York, New York. 

This is a series of reading tests with parallel 
editions in both English and Spanish. There are 
three levels. Each level consists of two parts, 
vocabulary and comprehension. The Primary 
Level contains 40 vocabulary items (each a pic- 
ture followed by 4 words) and 30 comprehen- 
sion items (each containing 4 pictures followed 
by verbal context). The Intermediate Level 
contains 65 vocabulary items each in the form of 
a completion item followed by 4 choices; and 57 
comprehension questions (4-response multiple 
choice) in groups of 3 to 5 questions referring to 
short paragraphs. The Advanced Level is simi- 
lar to the Intermediate Level with 70 vocabu- 
lary questions and 58 comprehension questions, 
the latter based on longer and more difficult par- 
agraphs. 

The Spanish edition is an exact translation of 
the English. No evidence is presented that the 
English context is suitable for testing reading 
achievement at the levels for which the test is 
intended, or that the Spanish context is suitable 
for testing reading achievement at the levels for 
which it is intended, or that context that is ap- 
propriate for measuring reading achievement in 
English of American children is also appropriate 
for measuring reading achievement in Spanish 
of Spanish speaking children. Norms are avail- 
able for both editions, but the norms represent 
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typical achievement and are not necessarily 
measures of the suitability of the context. In 
view of the differences in the cultures of the 
United States and the Spanish speaking coun- 
tries of the Americas one may well question the 
content validity of either or both of the tests for 
lack of supporting evidence. 

One effect of the exact translation from the 
English to the Spanish edition may be illustrated 
by Item 3 of Part I (vocabulary) of the Primary 
Level (Forms AE and AS). It consists of a pic- 
ture of a woman washing clothes followed in the 
English edition by the words wash, wake, walk, 
and call and in the Spanish edition by the words 
lavar, despertar, andar, and llamar. In the Eng- 
lish edition it is clear that three of the alternates 
ask the pupil to distinguish among words start- 


ing with the same letter. The same is not.true 


in the Spanish translation (“11” is not the same 
as "I" in Spanish). Theré are a number of in- 
stances where the same type of distinction (be- 
tween words starting with the same letter or 
containing the same group of letters) holds true 
in the one edition but not in the other. It is pos- 
sible that such differences in the context of the 
test, or differences in suitability of the same 
words, paragraphs, or comprehension questions, 
make no difference in the validity of the tests for 
measuring reading achievement in the respective 
languages ; but there is no evidence in the man- 
ual to support such an assumption. 
Both parts of the Primary Level employ pic- 
tures, the same pictures being used in both lan- 
. guage editions. Although many of the pictures 
are recognizable, on the whole they are of rather 
poor quality. Some would appear to be really 
difficult to recognize, and it is possible that on oc- 
casion the examinee identifies the picture only 
through a clue given by the verbal context. 
Each part of each test is preceded by at least 
five sample questions. The instructions seem to 
be clear. АП levels of both editions are prepared 
for machine scoring, although they can also be 
used for hand scoring. The manual states that 
at the Primary Level the pupils are to record 
their answers in the test booklets. It is difficult 
to see why the booklets at this level are prepared 
for machine scoring, except perhaps when used 
by older pupils who are retarded in their reading 
achievement. The Primary Level is intended for 
grades 1, 2, and the lower part of grade 3; the 
Intermediate Level for grades 4—7 ; and the Ad- 
vanced Level for grades 8-13. The working time 
is 16 minutes for the Primary Level, and до 


minutes for each of the other two levels. There 
is apparently no level appropriate for the upper 
half of grade 3. The norms indicate that there is 
no overlapping between the Primary and Inter- 
mediate, but a great deal between the Interme- 
diate and Advanced levels which essentially 
make them a continuous test. The first 33 vocab- 
ulary questions of the Advanced Level are the 
same as the last 33 of the Intermediate Level, 

The tests are accompanied by an Examiner's 
Manual which provides information concerning 
all five tests in the Cooperative Inter-American 
Test series, the purposes, levels, uses, adminis- 
tration, applications, validity, reliability, and 
preliminary interpretative data. There is a sep- 
arate set of directions for administering and 
scoring for the Primary Leyel, and one for both 
the Intermediate and Advanced Levels. These 
directions are very detailed. 

The manual contains an explanation of the use 
of the reliability coefficient and the probable and 
standard errors of a pupil’s score, illustrating 
their meaning with a correlation coefficient of 
:92, a probable error of 4, and a standard error 
of 6. A hasty reading may leave the impression 
that these are the reliability data for the tests. 
However, the last sentence in this section of the 
manual reads : “Because the Cooperative Inter- 
American Tests are being offered for general 
use for the first time, reliability statistics are not 
available at present.” The user has a right to as- 
sume that before the tests are offered for general 
use they have been adequately validated and 
standardized, and that therefore the reliability of 
the tests should have been determined. 

In place of the customary “norms” the manual 
presents “interpretative data.” This is a com- 
mendable practice since norms, which represent 
only typical achievement usually of unevaluated 
quality, are too often used as standards. The data 
are presented in two tables, one furnishing me- 
dian scores on the Spanish edition based on 
6,000 Mexican children in three cities and 20,- 
ооо Puerto Rican children in grades 1 through 
12, The data were obtained for experimental 
forms and “adjusted approximately for changes 
made in preparing final forms of the tests.” The 
second table furnishes medians based on 10,000 
pupils in about 30 cities of the United States, 
about half in Texas. These medians for the 
English edition were also “adjusted approxi- 
mately for changes made in preparing the final 
forms." The medians are given only for total 
scores, Ў 
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For the Primary Level of the English edition 
a spread of 30 points represents the range of 
achievement from grade 1 to grade 2 and only 9 
points from grade 2 to grade 3. The correspond- 
ing values for the Spanish edition are 15 points 
and 5 points with the median for the second half 
of each of the first three grades being lower than 
the median for the first half. Obviously the con- 
tent of the Primary Level is too easy. There was 
nothing available to the reviewer to explain why 
the second half of each grade did less well on 
the Spanish edition than did the lower half. In 
the case of the Intermediate Level there are large 
differences between medians for grades 2 to 5 on 
both editions. Beyond the fifth grade the spread 


is very much less, again indicating that the tests, _ 


are too easy for the upper grades. The same con- 
dition holds true for the medians for grades 4 
through 7 on the English edition of the Ad- 
vanced Level and grades 4 through 9 of the 
Spanish edition. Again for the Spanish edition 
the median for the second half of each grade is 
lower than the median for the first half of the 
grade. It is questionable that test scores can 
have much significance if a difference of 5 points 
represents two years of achievement—particu- 
larly when the probable error of a pupil’s score 
is likely to be at least a year of achievement. No 
information is furnished concerning the charac- 
teristics of the populations for whom the inter- 
pretative data were derived other than those in- 
dicated above. No evidence is furnished concern- 
ing the comparability of the populations. 

These tests represent a commendable effort to 
provide comparable measures of reading achieve- 
ment in Spanish and English. The manual offers 
suggestions and cautions concerning the uses of 
the test results for selection, placement, guidance 
and counseling, control of learning, motivation, 
and evaluation. Much work still needs to be done 
in validating the tests, determining their reliabil- 
ity, furnishing adequate interpretative data, and 
determining comparability of scores on the two 
forms. 


Freperick L. Westover, Associate Professor 
of Educational Psychology, University of Ala- 
bama, University, Alabama. [Review of the 
English edition. ] 

The unique feature of the series of Coopera- 
tive Inter-American Tests is the provision of 
parallel forms in English and Spanish in order to 
facilitate comparisons of pupil performance in 
each of the two languages. The present review 


considers only the English editions of the tests 
of reading. 

DESCRIPTION OF THE TESTS. The Primary 
Level test is designed for use in the first three 
grades. It covers vocabulary and comprehension, 
and is intended to give a measure of general 
reading ability. In the vocabulary section, a pic- 
ture is presented with four accompanying words, 
and the pupil indicates the word appropriate to 
the picture. In the comprehension section, the 
pupil indicates to which of four pictures a given 
phrase, sentence, or paragraph refers. 

The Intermediate Level is designed for use 
from the fourth through the seventh grade ; and 
the Advanced Level, for use from the eighth 


grade through the first year of college. These 
‚ tests, like the Primary Level test, cover vocabu- 
| lary and comprehension, and are intended to 


give a measure of general reading ability. The 


words in the vocabulary section and the para- 


graphs in the comprehension section are drawn 
from a wide range of materials. 

EVALUATION OF THE TESTS. Some of the draw- 
ings in the Primary Level test suffer from 
crowding and poor detail. This deficiency im- 
pairs the appearance of the test and the ease of 
perception of some of the figures. The useful- 
ness of this test would have been increased for 
many teachers if the vocabulary section had been 
modified to give more information about the pu- 
pil’s word-recognition skills, instead of being 
principally concerned with the amount of the 
pupil’s word knowledge. The tests at all three 


levels measure only two aspects of reading: vo- 


cabulary and comprehension. The value of the 


tests at the intermediate and advanced levels ' 


would have been enhanced for many users if a 
measure of reading speed had been included. 

All of the tests have been well constructed 
from a variety of intrinsically interesting mate- 
rials and appear to possess face validity. How- 
ever, no statistics on reliability are reported. 
Norms are based on administration of the tests 
to 10,000 pupils in cities of the United States, 
half of them in Texas. The norms provide in- 
formation only on the median total scores at each 
level for which each test is scored. The manual 
is well organized and easy to follow. 

suMMARY. These tests offer satisfactory brief 
surveys of vocabulary and comprehension in 
reading. They are of special value to those who 
wish to use them in connection with the Spanish 
editions in order to compare the performance of 
pupils in English and Spanish. Others may find 


Sa 
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that older established tests possess some advan- 
tages, especially in the provision of more ade- 
quate norms, diagnostic information, and data 
on reliability. 


[558] 
*Thorndike-Lorge Reading Test. Grades 7-9; 
1941-47; Forms A Rev. ('47), B Rev. (47) ; manual 
(47) ; $5.40 per 100; 35¢ per specimen set; postpaid ; 
40(45) minutes; E. L. Thorndike and Irving Lorge; 
Bureau of Publications, Teachers College, Columbia 
University. * 


REFERENCES 
т. Lorce, Irvine, “The Thorndike-Lorge Rearing 
Grades 7-9." Teach Col Rec 46:453-9 Ap '4s. * (Р. 


Test for 
19:2754) 
Ivan A. Воокев, Assistant Director, Division 
of Press and Radio Relations, National Educa- 
tion Association, Washington, D.C. 

The authors of this test boldly start out to 
measure “all the important factors in silent read- 
ing and to give reasonable weight to each of 
them." To say that the test falls far short of this 
ambitious goal is not to condemn it, but merely 
to look at it more realistically. 

The test emphasizes the ability to read and 
answer fact questions and the ability to read and 
understand sentences of the proverb or aphorism 
type. The latter, of course, involves considerable 
vocabulary skill and the ability to interpret 
idioms and figures of speech, but the emphasis is 
not on these subsidiary skills. The test is quite 
heavily weighted with items that call for the 
interpretation of proverbs or quotations. In fact, 
20 of the 53 to 55 items are of that type. One 
wonders whether, in a balanced test, two fifths 
of the items should be of this type. It should be 
pointed out, too, that for many of these items 
the difficulty which the reader encounters is in 
interpreting the supposedly synonymous sen- 
tences rather than in interpreting the test items. 
Perhaps this is not too grave a fault, but some 
testmakers insist that the test item, not the an- 
swer, should constitute the test of skill. Aside 
from these items, the remainder of the test is 
devoted quite largely to fact questions. Undoubt- 
edly, both types of items are useful measures of 
the general level of reading comprehension, but 
they scarcely give the test that comprehensive- 
ness and balance which the manual claims for it. 

The statistical evidence as to the validity and 
reliability of this test seems satisfactory as far as 
it goes. However, the vague way in which the 
procedure for checking validity is described and 
the limited sampling used to establish reliability 
leave some doubt as to the accuracy of the con- 
clusions drawn. Correlations of about .80 with 


scores from “four reputable reading tests” are 
cited as evidence of validity. The four tests are 
not named. Moreover, facts are entirely lacking 
to support the manual’s claim that correlations 
of about .96 would be obtained against “a more 
reliable criterion of reading ability.” If so, why 
was such a criterion not used ? Such probabilities 
are unconvincing, and many users of the test will 
prefer to think of it in terms of the demonstrated 
correlations of about .80 with other reading 
tests. The reliabilities of about .90 between 
scores on Forms A and B are based on only four 
samples of about 200 students each. This is sat- 
isfactory, but it would be reassuring to have sim- 
ilar results from a considerably larger number of 
samples. 

The manual suggests a few possible diagnostic 
uses of the test. In view of the nature of the test, 
however, it seems highly questionable that it 
yields any significant results other than those 
which a discriminating teacher can get from ana- 
lyzing students' responses to single test items. 
The test is most valuable as a survey instrument 
to measure the general level of reading compre- 
hension. It needs no other justification. Used in 
that way, it should provide a useful measure “of 
the level of reading comprehension in junior 
high classes" and should help to identify those 
in need of careful diagnosis and special instruc- 
tion in reading. 

Each form of the test opens with a short pas- 
sage followed by 7 fact questions. Items 8 to 27 
call for the interpretation of sentences. Finally, 
there are 3 prose passages, each passage followed 
by 5 to 12 questions on the content, that are 
largely fact questions, which call for very little 
use of reasoning or of the ability to draw conclu- 
sions or inferences from what is read. 

The time allowance is generous enough to 
make the test a power test for most junior high 
school students. The authors estimate that the 
relative contribution of quality and speed to the 
students' total scores are in the ratio of 12 to I. 
The test gives no measure of reading rate, but 
students who fail to finish it in the 40 minutes 
allowed should be suspected of unduly slow read- 
ing and should be tested with a good rate test. 
The directions for administering the test are 
brief, simple, and clear. Students record their 
answers to the test questions directly on the test 
booklets, not on separate answer sheets. 

The format of the test is somewhat cumber- 
some for both student and scorer. For example, 
the student must read the first test selection at 
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the top of a page, then find and study a series of 
questions near the middle of the page, select the 
right answers to those questions from a list at 
the bottom of the page, and then go back and 
record the numbers of these answers alongside 
the questions. The amount of such shifting about 
from place to place seems unnecessarily great. 

For the scorer, the answer keys in the test 
manual are poorly arranged. They are not set 
in the form of matching stencils, which can be 
laid alongside each page of the test for quick 
comparison. Instead, there is simply a list of the 
numbered items and the correct responses. The 
scorer must either move the answer sheet along 
from one item to the next or construct his own 
scoring stencil. The manual itself is quite simple 
and usable, if one disregards its “overselling” 
claims. Grade and reading age equivalents are 
given for each score on each form of the test. 
These are based on approximately 2,000 pupils. 
No medians, quartiles, percentiles by grades, 
standard scores, or other interpretative data are 
included. 


For a review by Robert L. McCaul and an 
excerpt from a review, see 3:506. 


[559] 
*Traxler High School Reading Test. Grades 10- 
12; 1938-42; 5 scores: reading rate, story comprehen- 
sion, main ideas, total comprehension, total; IBM; 
Forms A (’38), В (’39) ; $1.80 per 25, postage extra; 
12@ per revised manual (’42); 30% per specimen set; 
postpaid; separate answer sheets (IBM Form I.T.S. 
1000 B 108) may be used; IBM answer sheets and 
scoring stencils must be purchased directly from Inter- 
national Business Machines Corporation; 50(55) min- 
M } Arthur E. Traxler; Public School Publishing 
о. 


REFERENCES 


I. TRAXLER, ARTHUR E. “Relationship 


Between the Length 
and the Reliability of a Test of Rate o! Reading." J Ed Kes 
32:1-2 S '38 12:6660) 


2. TRAXrER, AgTHUR E. “One Reading Test Serves the Pur- 
PON Clearing House 14:419-21 Mr '40. * (PA 14:5233, title 
3. Bromaters, PAUL, AND LINDQUIST, E. F. “Rate of Com- 
prehension of Reading: Its Measurement and Its Relation to 
Е romprehension2? J Ed Psychol 35:449-73 N '44. * (РА 19: 

4. Preston, RALPH C., AND TUFT, EDWIN N. “The Reading 
Habits of Superior College Students.” J Exp Ed 16:196-202 
Mr '48. * (PA 20:5126) 

Hanorp D. Carrer, Professor of Education, 
University of California, Berkeley, California. 

This test, designed in two equivalent forms for 
use in grades ro through 12, yields a rate score 
and three comprehension scores. The test re- 
quires 50 minutes of working time, a require- 
ment that may be awkward in many school situ- 
ations, 

The manual for the test, written in clear and 


meaningful style, indicates a fine background of 
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scholarship in the fields of test construction and 
reading instruction. It contains a description of 
the test and information concerning its validity 
and reliability, percentile norms for each score 
and for each grade level, and practical sugges- 
tions for the interpretation of test results. 

Face validity of the test is assured by the care- 
ful item analysis which preceded selection of the 
items. The reported reliabilities of .90 for the 
rate score, .72 for the story comprehension score, 
and .80 for the paragraph comprehension score 
are highly satisfactory. The norms are based on 
the results of over 7,000 pupils. 

Among the commendable features of the test 
are the emphasis upon comprehension of mate- 
rial read at a natural rate chosen by the pupil, 
and the provision of two nicely equated forms, a 
convenient device for obtaining the rate score, 
and the manual packed with essential informa- 
tion. While one might hope for further reports 
of experimental studies dealing with the use of 
test results, the test as published appears to fur- 
nish the basis for an effective program of reading 
instruction through providing a high quality in- 
strument for the appraisal of reading ability and 
achievement. 


For reviews by Alvin C. Eurich, Constance 
М. McCullough, and С. Gilbert Wrenn and ex- 
cerpts from reviews, see 40:1578. 
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*Traxler Silent Reading Test. Grades 7-10; 1934- 
42; 6 scores: reading rate, story comprehension, word 
meaning, paragraph meaning, total comprehension, 
total; IBM; Forms т (734), 2 (34), 3 (41), 4 (42) ; 
$1.80 per 25; 18¢ per revised manual ('42), postpaid ; 
36€ per specimen set, postpaid; separate answer sheets 
may be used; 4¢ per 1BM answer sheet; 18¢ per sten- 
cil for scoring answer sheets; postage extra; 46(55) 
minutes; Arthur Е. Traxler; Public School Publish- 
ing Co. * 
REFERENCES 

4 SA aaa OW ARO І. А бие, of Teachers’ Marks 

With an ‘Actual Battery of Aptitude Test Percentile Scores. 


Doctor's “Field Study No. 1,” Colorado State College of Edu- 
cation (Greeley, Colo.), 1949. (Abstracts of Field Studies.... 


. “Correlations Between Scores on 


pp. 78-82. (PA 24:748 i 
gram in Independent ols and Supplementary Studies. Edu- 
cational Records Bulletin, No. 52, New York: Educational 


Records Bureau, July 1949. 


J. Tuomas Hastines, Associate Professor of 
Education, Director, Unit on Evaluation, Bu- 
reau of Research and Service, College of Educa- 
tion; University Examiner; University of Ili- 
nois, Urbana, Illinois. 

In both the 1938 and 1940 Mental Measure- 
ments Yearbooks, Forms 1 and 2 of this test 
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were thoroughly and critically evaluated. Form 
3 (1941) and Form 4 (1942) have since been 
added. The manual, which was originally pub- 
lished in 1934, was revised in 1942. In addition 
to these materials a single loose sheet for con- 
verting vocabulary scores to mental ages has 
been added. This sheet, with no explanatory in- 
formation on validity, reliability, or construction 
and no suggestions to the user about the dangers 
involved in use, is likely to be misleading and 
harmful. 

The previous reviews were explicit in both 
adverse and complimentary criticism. It is quite 
unfortunate that virtually all the adverse criti- 
cisms made at least 11 years ago are still valid. 
Even though two new forms have been published 
and the manual has been revised, very little has 
been done actually to improve the statistics and 
interpretation. 

The “tentative norms" have been replaced in 
the 1942 revision of the manual by new norms. 
These consist of (a) some kind of average score 
given for each grade, 7 through то, on each test 
part and total (in the descriptive material re- 
lating to this table these averages are called 
"median scores," but in the table heading they 
are called “mean scores") ; (b) a table of grade 
norms by months for the total score; and (c) 
percentile scores for each part score and the 
totals by grade level, 7 through 1o. The manual 
says that these new grade norms are based upon 
scores of about 25,000 pupils. There is no indica- 
tion of how these cases are distributed according 
to grade level and no description of the samples 
in terms of ranges, standard deviations, or 
chronological ages. These omissions are espe- 
cially interesting in view of the fact that under 
the heading “Use of Test Results" the test user 
is instructed to answer in regard to his pupils' 
test scores such questions as "What is the range 
of the scores in each grade?" and “What is the 
standard deviation of the scores?" He is then 
told that with the answers to these questions he 
will know whether the pupils within a grade (in 
his school) are relatively homogeneous in read- 
ing ability. The word "relative" obviously refers 
to his other classes and not the norm group on 
which the tests were standardized, since it did 
not seem important to present such figures on 
the standardization group. 

The data presented in the manual concerning 
validity, reliability, and equivalence of forms are 
still insufficient. Surely there has been time to 
collect new data ; yet it appears that old, unsatis- 


factory data have been used. For example, the 
manual presents the following data on the valid- 
ity of the comprehension scores: a combination 
score of story comprehension and power com- 
prehension correlated with a composite criterion 
of other reading tests for 54 sixth grade pupils, 
This is done in spite of the fact that sixth grad- 
ers were apparently not used in establishing the 
norms nor for obtaining any of the other data 
concerning the test. The grade norms by months 
explicitly state that the grade equivalents below 
the seventh grade are extrapolated ! 

Fortunately, the business of not doing much 
about the material has allowed the test to retain 
the good points mentioned in the previous re- 
views: (a) comprehension is properly empha- 
sized in the rate test; (b) the rate test is inter- 
esting reading and should appeal to pupils of the 
age for which it is intended; (c) use of sen- 
tences or phrases in the word meaning test is a 
more meaningful device than the use of words by 
themselves; (d) the time limits on the various 
parts are such that it is not likely to be a speeded 
test at any of the intended grade levels. 

In summary, we have here a reading test for 
the junior high school level—a level at which 
much attention should be paid to reading—which 
appears to be relatively good for survey purposes 
but on which the data presented are extremely 
scarce even though the test was first copyrighted 
some 17 years ago. It is discouraging to note that 
even in the face of sound critical reviews which 
pointed out serious shortcomings, a decade has 
been allowed to elapse with little attempt at im- 
provement. 


For reviews by Robert L. McCaul and Miles 
А. Tinker and an excerpt from a review of the 
1939 edition, see 40:1579; for reviews by Fred- 
erick B. Davis and Spencer Shank of the 1934 
edition, see 38: 1114. 
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Durrell Analysis of Reading Difficulty. Grades 
1-6; 1933-37; 7 scores: oral reading-comprehension, 
oral reading-recall, silent reading, word recognition, 
word pronunciation, spelling, handwriting ; individual ; 
1 form, ’37; no data on reliability; manual (^37) ; $2.30 
per 25 individual record blanks; stopwatch essentia 
for administration; $1.30 per booklet of reading para" 
graphs (37); 90€ рег tachistoscope and cards; $1.20 
рег 30 blank tachistoscope cards; 30€ per manua; 
postage extra; $3 per examiner's kit including 5 11 ve 
vidual record blanks, postpaid; (40-45) minutes; Don- 


ald D. Durrell; World Book Co. * 
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Heren M. Rosinson, Associate Professor of 
Education, and Director of the Reading Clinic, 
The University of Chicago, Chicago, Illinois. 

This instrument was designed to be used by a 
skilled observer to detect basic weaknesses in 
reading. Durrell has chosen to test six phases of 
both oral and silent reading. Supplementary tests 
of writing and spelling are provided. The man- 
ual states that “а teacher unacquainted with the 
methods and objectives of teaching reading will 
find difficulty in using the Analysis intelli- 
gently" ; it should be used by experienced read- 
ing teachers only. 

'The first subtest implies that comprehension 
of oral reading is to be tested, yet the norms for 
this subtest consider only the time required to 
read each paragraph. Although space is pro- 
vided for recording the number of errors and 
the answers to comprehension questions, the 
former appears to be used only to determine 
the appropriate paragraphs to be read, while 
the latter is interpreted qualitatively by such à 
term as "low." Therefore, the scoring of this 
subtest appears to be inconsistent with the pur- 
pose stated. Furthermore, this test requires only 
recall of the story read. Many pupils can repeat 
material without comprehension. This test is of 
little value in analysis of comprehension of 
different types, when word recognition is ade- 
quate. 

Norms for the second subtest, Oral Reading 
—Unaided Oral Recall, are based on a combi- 
nation of the number of thought units remem- 
bered and the time required for oral reading. The 
manual suggests that the examiner record both 
the number of errors made in oral reading and 
the number of ideas recalled when direct ques- 
tions are asked, although only a qualitative analy- 
sis is made of these data. No standard questions 
are provided, so that the extent and accuracy 
of the measure of comprehension depends on the 
skill of the examiner. 

The third subtest, Silent Reading—Unaided 
Oral Recall, is similar to the second, except that 
reading errors cannot be noted. These two sub- 
tests enable the examiner to compare silent with 


oral reading since paragraphs are matched in 
difficulty. " 


y 


The fourth subtest combines a measure of 
rapid word recognition with ability to analyze 
words which are not a part of the pupil's sight 
vocabulary. The cardboard tachistoscope and 
shutter are manually operated at a suggested 
speed of exposure of one half second. The man- 
ual states that the purpose for quick exposure 
is so that the child may have “just one glance 
without time for an extra eye movement." Con- 
siderable practice is required to operate a man- 
ual tachistoscope consistently at this speed. Fur- 
thermore, studies have shown that several eye 
movements can be made in a half second. 

The Phonetic Inventory is especially useful 
in determining the nature of the difficulty in one 
phase of word analysis. The reviewer would 
question the recommended practice of excusing 
those who score above third grade on the flash 
and pronunciation tests, since experience shows 
that pupils who reach this level nevertheless 
often exhibit weaknesses which can readily be 
identified by such an inventory. 

The manual is written so that the directions 
for administration are clear. However, consid- 
erable difficulty is experienced in using and in- 
terpreting the norms. The latter were estab- 
lished on 1,000 children, but there is no descrip- 
tion of the population sampled, or of the meth- 
ods used in teaching the pupils to read. Further- 
more, neither reliability nor validity is mentioned 
in the manual, and there is no indication of the 
consistency of the tests. 

The author states that “since this test is essen- 
tially a method of standard observation of errors 
and faulty habits in reading, the checklist of er- 
rors is more important than the norms." To this 
end, the checklists for observing errors in oral 
reading and faulty reading habits are quite com- 
plete and detailed. In short, it appears that 
herein lie the major values of the test. 

The Durrell Analysis is not as complete as 
the Gates Reading Diagnostic Tests, but has the 
advantage of requiring less time to administer. 
It includes no measures of aptitudes for learning 
to read by different methods as does the Gates 
test. к» 

The Durrell Analysis of Reading Difficulty 
provides an excellent opportunity for an expe- 
rienced reading teacher to observe difficulties in 
word recognition and oral reading, as the check- 
list of errors is the best available. However, the 
silent reading section is of limited value, and 
the norms must be used with full realization of 
the limitations pointed out previously. 
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*Durrell-Sullivan Reading Capacity and Achieve- 
ment Tests. Grades 2.5-4.5, 3-0; 1937-45; 5 scores: 
word meaning, paragraph meaning, spelling (optional), 
written recall (optional), total; 2 levels; per man- 
ual (45) ; postage extra; 35¢ per specimen set of any 
one level, postpaid; 30(45) minutes; Donald D. Dur- 
rell and Helen Blair Sullivan; World Book Co. * _ 
а) PRIMARY TEST. Grades 2.5-4.5; 2 tests published in 
1 booklet; Form A (°37) ; directions for administering 
(39) ; $2.65 per 25. | 
1) Reading Capacity Test. 
Ч Reading Achievement Test. 
b) INTERMEDIATE TEST. Grades 3-6; 2 tests. 7 

1) Reading Capacity Test. Form А (’37); direc- 

tions for administering (37) ; $1.70 per 25. 

2) Reading Achievement Test. Forms А (37), B 

(744) ; directions for administering ('45) ; $2 per 25. 
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Influence on the Validity of а Vocabulary Test of the Method 
of HERR Responses.” J Ed Psychol 31:153-4 F '40. * (PA 
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Ed 62:32-6 S '41. * x Ў 

4. Росн, Grapvs S. “Summaries From ‘Appraisal of the 

Abilities of Acoustically Handicapped Chil- 
nn Deaf 91:331-49 S '46. * (PA 21:290) 


Silent Readin 
dren.'" Am 
Herren M. Ковімѕом, Associate Professor of 
Education, and Director of the Reading Clinic, 
The University of Chicago, Chicago, Illinois. 
Although these tests were first published 14 
years ago, they remain unique in their purpose 
of providing a simple means for identifying poor 
readers among pupils in grades 2-6. Pupils who 
have specific reading disabilities often appear 
to be dull or mentally retarded if they are meas- 
ured by a group intelligence test requiring them 
to read. Hence, a reading achievement test shows 
that such pupils are reading in harmony with 
their capacities and, as a result, no attempts are 
made to correct their reading problems. While 
individual intelligence tests may be more dis- 
criminating and more useful, few schools have 
the number of adequately trained staff members 
required to administer and interpret them. Au- 
thorities estimate that Io to 20 per cent of pupils 
in grades 2-6 fall in the category of poor readers; 
consequently, the need for tests to distinguish 
pupils with reading deficiencies from those who 
are intellectually slow in development is urgent. 
The Durrell-Sullivan Reading Capacity and 
Achievement Tests meet this need in two ways. 
First, the tests may be given by a teacher without 
special psychological training. Second, the tests 
may be given to groups of children, hence they 
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are an economical means for identifying poor 
readers without major language deficiencies, 
who are capable of reading considerably beyond 
their present achievement level. 

The tests were carefully constructed so that 
the capacity and achievement sections are com- 
parable. It is essential that both sections be ad- 
ministered if a comparison is to be made between 
the levels of achievement and capacity of a given 
pupil. 

Since these tests include measures of word 
meaning and paragraph meaning, it is possible 
to determine whether a pupil's difficulty lies in 
word recognition or in understanding the mean- 
ing of larger speech units. Furthermore, if the 
latter is the case, clues as to the specific types of 
comprehension problems are provided by an 
analysis of the types of questions most frequently 
answered incorrectly. This is possible because 
the paragraph meaning section includes ques- 
tions requiring the reader to understand sen- 
tences and words, to note details, to’ get the 
central thought, and to locate information. Such 
hints enable the teacher or reading clinician to 
select the proper diagnostic tests to investigate 
these problems more analytically. Supplemen- 
tary tests of spelling and written recall are avail- 
able if further information about these areas is 
needed. 

Тһе manual of directions is written clearly 
and simply. Both age and grade norms are pre- 
sented for word meaning, paragraph meaning, 
and the total score. The norms for the Inter- 
mediate Test are based upon 6,000 cases in 19 
communities. These communities were largely 
in the East, but this selection shows no limita- 
tion so long as the tests are used for comparison 
purposes. No statement is made concerning the 
standardization of the Primary Test, but, since it 
includes the easier items of the Intermediate 
Test, it is assumed that the same data were used. 

The selection of the test items was carefully 
planned to produce a valid test. Clinical expe- 
rience has shown a close relationship between the 
capacity test and scores on individually adminis- 
tered intelligence tests which are heavily 
weighted with verbal items. The reliability co- 
efficients provided by the authors are highly sat- 
isfactory. 

In the light of recent studies of the language 
limitations of underprivileged socio-economic 
groups, the reviewer questions the assumption 
that the Capacity Tests are as accurate with these 
groups as with pupils of the more privileg 
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groups. More probably, they determine whether 
a pupil has acquired the necessary language fa- 
cility to make progress in reading. However, 
they fail to distinguish between pupils who have 
general language handicaps, with potentialities 
for rapid increase in language facility and read- 
ing improvement, and those who are intellectu- 
ally dull without these potentialities. The Ca- 
pacity Tests are really tests of hearing compre- 
hension or language facility and should not be 
confused with intelligence tests. Further study 
in this area holds promise for a better under- 
standing of the value of these tests. They can be 
used with confidence, provided pupils with for- 
eign language backgrounds and those suspected 
of having language handicaps are given more 
careful appraisal. 


For reviews by William S. Gray and Marion 
Monroe and an excerpt from a review, see 38: 
1099. 
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Gates Reading Diagnostic Tests, Revised Edi- 
tion. Grades 1-8; 1926-45; 2I scores: categories of 
mispronunciations, speed of reading paragraphs, oral 
reading total, oral vocabulary, reversals, phrase per- 
ception, word perception-flash presentation, word per- 
ception and analysis-untimed presentation, spelling, 
syllabication, recognition of syllables, recognition of 
phonograms, blending letter sounds, giving letter 
sounds, speed of reading capital letters, errors in read- 
ing capital letters, speed of reading small letters, errors 
in reading small letters, giving letters for sounds, giv- 
ing words with stated initial sounds, giving words with 
stated final sound; individual; Forms I (42), П 
(42); manual (45); 20€ per pupil record booklet; 
55€ per oral response test material for any one form; 
Тоё per set of tachistoscopic cards; 456 per manual; 
$1.50 per specimen set; postpaid; (60-90) minutes; 
Arthur I. Gates; Bureau of Publications, Teachers 
College, Columbia University. * 


REFERENCES 

1-3. See 3:510. 
vé CoLLINS, ELIZABETH Mary. The Measurement of Visual- 

isual and Visual-Auditory Association, Master’s thesis, Ford- 
ham University (New_York, ШУ, 1939. Рр. 39. 

"m GATES, ARTHUR I. “A Corre tional Study of a Battery 
of Reading Diagnostic Tests." J Ed Res 40:436-47 F '47. * 
(PA 21:3743) 

Wonrz J. Ossur, Professor of Remedial and 
Experimental Education, University oj Wash- 
ington, Seattle, Washington. 

These are individual tests designed for use 
with pupils who have failed or made low scores 
оп group tests. The purpose of the tests is to 
disclose why the pupil has failed to learn to read. 


They include material on oral reading, vocabu- 


lary, reversals, phrase perception, word percep- 


tion and analysis, spelling, visual perception, 
and auditory techniques. As a reminder, the 
booklet in which the tester records the pupil’s 


responses includes blanks for recording results 
of tests of vision, hearing, and speech, and in- 
formation about eye movements, emotional ten- 
sion, special interests and distastes, home and 
other out-of-school influences, and school his- 
tory. Since there are two forms of the series, it 
is possible to measure improvement. 

The manual contains explicit aids for those 
who are relatively inexperienced in diagnosis. 
Twenty-one tables of norms enable the examiner 
to ascertain the pupil’s grade level in the sub- 
elements of the tests. Other tables show degree 
of retardation, age-grade scores, and the like. 
Suggestions for appropriate remedial treatment 
are to be found in the author’s Improvement of 
Reading (3). 

No estimate is given in the manual of the 
time required to administer the entire series of 
tests individually ; however, the author has men- 
tioned elsewhere that 60 to 9o minutes should 
be a reasonable allowance. Even if the author's 
estimate is not too conservative, the testing time 
required is still long enough so that it seems 
doubtful that the ordinary school system will 
have anyone around who has sufficient time to 
use the series to any great extent. The tests were 
apparently designed to diagnose everything that 
ever gets wrong with any child anywhere. АП 
such tests need, some device by which the tester 
can find quickly just which part of the series is 
needed for a given child and thus effect the saving 
of badly needed time. 

In the estimation of the reviewer, more atten- 
tion should have been given to reading rate and 
to emotional blocks. In spite of these adverse ele- 
ments the tests are well worth using. 


For related reviews, see 564; for a review by 
T. L. Torgerson, see 3:510. 


[564] 
[Re Gates Reading Diagnostic Tests] Gates, AR- 
тнок L The Improvement of Reading: A Pro- 
gram of Diagnostic and Remedial Methods, 
Third Edition. New York: Macmillan Co., 1947. Pp. 
xxi, 657. $4.75. * (РА 22: 3195) 

J Consult Psychol 12:124 Mr-Ap '48. Lau- 
rance F. Shaffer. * rewritten entirely * An ap- 
pendix gives full directions and norms for the 
Gates Diagnostic Reading Tests. These tests 
have progressed markedly, since their first pub- 
lication in 1927, toward a wholistic approach to 
reading functions in place of the original com- 
partmented approach. ^ 

J Ed Res 42:76-7 S ’48. Miles A. Tinker. * 
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primarily a manual of directions for diagnosing 
and remedying reading defects. It describes the 
author’s series of reading achievement and diag- 
nostic tests. More space has been devoted to 
discussions of reading activities, especially com- 
prehension, than in earlier editions. * Only a 
few of the many excellent features of this text 
can be mentioned here: (1) Heavy emphasis is 
properly placed upon motivation as a factor in 
causing and in correcting difficulties in reading. 
(2) Adequate weight is given to training in 
phonetics. (3) Physical disabilities as causes of 
reading disability are satisfactorily evaluated. 
(4) Non-verbal intelligence tests are unsatis- 
factory to use with reading cases. (5) Caution 
should be exercised in assigning reading diffi- 
culty to nervous instability since emotional 
symptoms are frequently the result of the diffi- 
culty and will disappear when the difficulty is 
remedied. (6) Remedial insttuction is the same 
as regular developmental classroom teaching but 
involves greater individual prescription for in- 
dividual needs. (7) Strong emphasis is placed 
upon training left-to-right sequences of percep- 
tion. (8) Proper weight is assigned to various 
clues to word recognition. (9) Mechanical de- 
vices are properly de-emphasized. (10) It is 
unnecessary to photograph eye movements in 
diagnosing a reading case. (11) Good readers 
are flexible in their reading attack, modifying it 
in accordance with the difficulty and complexity 
of the context. There are a few questions of em- 
phasis and a few errors that may be noted: (т) 
In several places low mentality is given as a 
cause of reading deficiency. This will confuse 
the less careful reader for at no place is it clearly 
explained that low mentality, although asso- 
ciated with poor reading ability, is not the cause 
of reading disability. (2) The “eye recognition 
span” is mistakenly called the “eye voice span.” 
(3) Eight per cent rather than 4 per cent of 
boys are color-blind, and, contrary to the au- 
thor’s statement, the Jensen Test for Color- 
Blindness does not yield “reliable” evidence of 
color deficiencies. A recent study shows poor 
validity for this test. (4) In several instances, 
emphasis is placed upon identifying little words 
in big words. This procedure is apt to create 
many difficulties unless handled with extreme 
caution. To avoid trouble, it is best to confine 
the technique to distinguishing parts of com- 
pound words and root words in variants and 
derivatives. (5) Greater emphasis might well 
be placed upon teaching syllabification. (6) “It 


is very useful to learn to count the number of 
stops per line by observing the pupil’s eyes as 
he is... reading . . ." (p. 344). This tech- 
nique is both inaccurate and useless. Further- 
more it is contrary to the view expressed on. 
page 314. (7) There should be more emphasis 
upon the thinking aspects of reading, especially 
in discussion of comprehension. (8) On page 
543, the term presbyopia is erroneously applied 
to a child. It is a defect of vision associated with 
advancing age, i.e., middle-aged people. (9) A 
boy (p. 566) 6 years, 11 months old and with 
an I.Q. of 110 is considered deficient because 
he did not use syllabic divisions in spelling, and 
because he spelled letter by letter. How skillful 
a speller is a normal child of less than seven 
years of age supposed to be? (10) Traxler's two 
important summaries are unfortunately absent 
from Gate's bibliography. This book is un- 
doubtedly the best text we have in systematic 
diagnostic and remedial reading. It is thorough, 
well written and practical. This revision is a 
decided improvement over previous editions 
which in themselves were excellent books. 


ORAL 
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XOral Diagnostic Test of Word-Analysis Skills, 
Primary: Dominion Tests. Grades 1—2; 1947; indi- 
vidual; no data on reliability; 1 form; $1 per 25; 15 
per set of word slide cards and sleeve; 256 per manual; 
45€ per specimen set; postage extra; (20-40) min- 
utes; prepared by the Department of Educational Re- 
search, Ontario College of Education, University of 
Toronto; distributed by Vocational Guidance Centre. 
Nira Banton Situ, Professor of Education, 
New York University, New York, New York. 

This test is designed primarily to assist the 
teacher in making a diagnosis of the word recog- 
nition and word ‘analysis skills of individual 
children. It is supposed to supplement the word- 
recognition and vocabulary test in the Achieve- 
ment Tests in Silent Reading: Dominion Tests, 
a group test. While these two tests are intended 
largely for survey purposes, the Oral Diagnostic 
Test is designed to reveal more of specific indi- 
vidual weaknesses. It is intended to answer such 
questions as : Is the child careless ? Are his meth- 
ods of word-analysis immature? Has he any 
method? Does he have difficulty with specific 
sounds and blends? : 

The test consists of 100 words, printed on 
two oblong strips, with 25 words on each side. 
A tagboard sleeve or envelope with an opening 
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cut in the upper part of one side is provided. As 
the word strips are pulled up through the sleeve, 
one word at a time is revealed in the opening. A 
Teacher’s Record Sheet and Check List is pro- 
vided for use in recording errors and methods 
of attack. Information about the test, directions 
for administering the test and recording re- 
sponses, and suggestions for using the check list 
are included in the manual. 

The selection of test items is open to question. 
The manual states that the 100 words were so 
chosen that all of the common letter sounds and 
blends are represented, the more important ones 
several times. No information is given in regard 
to the procedure used in determining the com- 
monness or the relative importance of the letter 
sounds and blends. The manual states further 
that the words are roughly graded for difficulty, 
with more than half of them well within the 
scope of the average child at the end of Grade 1, 
but that some words are included with which all 
but exceptionally good primary readers are un- 
familiar. No explanation is given of the pro- 
cedure used in determining the relative diffi- 
culty of these words. 

No reference is made to any attempts to as- 
certain the validity or reliability of the test. 
Norms are not provided. 

This test cannot be considered a scientific 
instrument for use in diagnosing word analysis 
skills. However, as an informal check device, it 
would undoubtedly prove valuable to those seek- 
ing additional information about an individual's 
word analysis habits and skills. It is to be com- 
mended as a pioneer effort in an aspect of read- 
ing which has been little explored in the prepa- 
ration of standardized tests. 


READINESS 
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Gates Reading Readiness Tests. Grade 1; 1939; 5 
Scores: picture directions, word matching, word-car 
matching, rhyming, letters and numbers; 1 form; $4.85 
рег 100; 30% per specimen set; postpaid; nontimed 
(so) minutes; Arthur I. Gates; Bureau of Publica- 
tions, Teachers College, Columbia University. * 
7 REFERENCES 

1-5. 3 

6:8. Seo. sister N 
F. J. Ѕсномкі,, Professor of Education and 
Head of the Department, University of Queens- 
land, St. Lucia, Brisbane, Australia. [Review of 
the Australian adaptation.] 

The manual and tests have been reprinted in 


their entirety by the Australian Council for Edu- 
cational Research for distribution throughout 
Australia. In addition, tentative Australian 
norms are presented based upon 500 children 
selected at random from 50 different schools in 
Western Australia. 

Evidence gathered in Australia indicates that 
a weighted combination of the scores in the Gates 
Subtests 1, 2, 3, and 4 and the IQ obtained from 
an individual intelligence test gives the best pre- 
diction of reading success as measured by the 
A.C.E.R. Individual Reading Test. In the re- 
port of the Australian testing it is indicated that 
“busy teachers with large classes and little time 
at their disposal might well use only Gates Sub- 
tests 2 and 3. The scores in Test 3 should be 
doubled and added to those of Test 2. A table 
for determining the percentile rank scores ob- 
tained in this manner has been constructed. 

The material of the foregoing paragraph in- 
dicates in some measure what the reviewer has 
already urged in his Psychology and Teaching 
of Reading that teachers can obtain an effective 
indication of the readiness of children for formal 
reading by use of an intelligence test result to- 
gether with an estimate of the extent of the 
meaning vocabulary. But in respect to reading 
readiness, what is perhaps much more important 
for Australian schools is the introduction of 
“look and say” and sentence methods based on 
vocabulary-controlled readers which are almost 
universal in American schools. The word whole- ` 
sentence method and  vocabulary-controlled 
readers presuppose a preparatory or readiness 
period in reading instruction, and it is during 
this time that teachers can so readily determine 
which group of children is ready to go forward 
with reading. The lock step use of phonic meth- 
ods in many Australian schools gives no such 
chance to determine which children are ready 
for reading, and which require time to mature 
intellectually and in vocabulary growth. In other 
words, readiness tests do not replace the in- 
formed estimate of discerning teachers using 
modern methods of teaching reading which in 
themselves provide a period to discover indi- 
vidual differences in equipment and to allow 
for the necessary maturation of some pupils. 
While state education systems use reading meth- 
ods that push pupils willy nilly into reading in- 
‘struction as soon as they enter school, a read- 
ing readiness test may be a very useful way 
of revealing individual differences in equip- 
ment. à 
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For reviews by Marion Monroe Cox and Paul 
A. Witty, see 3:516; for excerpts from reviews, 
see 40:1537. 

[567] 
*Group Test of Reading Readiness: Dominion 
Tests. Grades kgn-1; 1949-51; 6 scores: discrimina- 
tion of objects-symbols-words, listening-remembering- 
observing, familiarity with word forms, memory for 
word forms, motor coordination, total; Forms A (’49), 
B (’49); mimeographed manual (’51) ; profile chart 
(^51) ; $1.50 per 25; 15¢ per set of flash cards for any 
one form; 25¢ per manual ; 65¢ per specimen set; post- 
age extra; 31(45) minutes; prepared by the Depart- 
ment of Educational Research, Ontario College of 
Education, University of Toronto; distributed by Vo- 
cational Guidance Centre. * 
[ 568] 
*The Harrison-Stroud Reading Readiness Tests. 
Grades kgn-1; 1949-50; 6 scores: making visual dis- 
criminations (attention span controlled), making vis- 
ual discriminations (attention span uncontrolled), us- 
ing the context, making auditory discriminations, using 
context and auditory clues, using symbols; 1 form, '50; 
manual ('50); $2.10 per 25, postage extra; 44€ per 
specimen set, postpaid; (90) minutes in 3 sessions; M. 
MS Harrison and James B. Stroud; Houghton Mif- 
in Co. 


МҮпллАМ S. Gray, Director of Research -in 
Reading, and Emeritus Professor of Education, 
The University of Chicago, Chicago, Illinois. 

The scope and content of this test are based 
on the assumption that reasonable readiness in 
learning to read includes the ability to make 
visual and auditory discriminations, the ability 
to use context clues in identifying items that are 
inherent in that context, the ability to use audi- 
tory and context clues separately or in combi- 
nation in identifying ideas represented by words, 
and the understanding that printed words stand 
for ideas. No evidence is presented in support 
of the validity of these assumptions. The test 
designers draw, rather, on their wide experi- 
ence and on the extensive body of available sci- 
entific literature relating to requisites for prog- 
ress in learning to read. There is little question 
about the validity of including tests of these 
abilities and understandings in a reading readi- 
ness test. However, it could be justly argued 
that the selection of test items should have been 
preceded by considerable specific research to 
justify their selection, as contrasted with others 
which might have been included. 

The tests used in this battery have been care- 
fully designed as measures of the abilities and 
understandings selected as basic factors in read- 
ing readiness. For example, visual discrimina- 
tion of the pattern used in reading is tested by 
asking the pupil to identify a given word (do) 


in a series of four words. Ability to use con- 
tent is measured by asking the pupil to choose 
and mark one of three pictures which best sup- 
plies an element missing in oral text. Ability to 
use auditory clues with context clues is meas- 
ured by asking the pupil to listen to oral state- 
ments which suggest two possible responses illus- 
trated in a group of three pictures. As implied 
by these examples the designers of the test have 
been quite successful in selecting test situations 
which measure capacity to use specific abilities 
in situations which approach closely the act of 
reading. 

The form in which the test has been prepared 
is attractive and convenient. The directions for 
giving and scoring the test have been very care- 
fully prepared. Six different scores are pro- 
vided, on the basis of which recommendations 
are proposed as to the placement of a pupil in 
a reading group which moves rapidly, normally, 
or slowly, or his retention in a nonreading group 
in grade one, the kindergarten, or a transition 
group. 

Unfortunately, only one form of the test has 
been prepared, the norms provided are based on 
results of experimental tryouts only, and no 
data relating to the reliability of the test have 
been published. Furthermore, no data are avail- 
able concerning the prognostic value of the 
scores when used alone or in combination with 
intelligence scores and other available types of 
information. 


[ 569 ] К 
*Lee-Clark Reading Readiness Test, 1951 Revi- 
sion. Grades kgn-1; 1931-51; 4 scores: letter symbols, 
concepts, word symbols, total; 1 form, ’51—same as 
test copyrighted in 1943; manual (’51) ; $1.25 per 25, 
postage extra; 356 per specimen set, postpaid; non- 
timed in part (15) minutes; J. Murray Lee and Willis 
W. Clark; California Test Bureau. * 


REFERENCES 
1. Lez, J. Murray; Crank, WinLis W.; AND LEE, Donja 
May, Меша Seong Readiness." El Sch J 34:656- 
y "34. 14741 
2. Wirwons, Warbo W. Relative Validity of Three Grip 
Readiness Tests in Predicting Reading Achievement. Master 
thesis, University of Kansas (Lawrence, Kan.), 1939- 


For reviews by Marion Monroe Сох and 
David Н. Russell of the 1943 edition, see 3:517- 


[570] 

*Metropolitan Readiness Tests. End of Кап and 

rst grade entrants; 1933-50; 4 scores: reading reg 
ness, Дре. readiness, drawing-a-man (optional), 
total; Forms R (740), S (о); Form R manual (49) 
Form S manual (50); $2.35 per 25, postage еч 
35¢ per specimen set, postpaid; nontimed in part ( PA 
75) minutes; Gertrude H. Hildreth and Nellie L. Gri 
fiths; World Book Co. * 
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1-10. See 40:1552. 
11-15. See 3:518. 3 , 

16. BAGLEY, NET W. Reading Readiness as Related to 
Southern Methodist Universit; 


(Dallas, Tex.), 1941. (Abstracts of Theses...., 1941, ЇР 5-7. 
h le Train- 


Eric F. GARDNER, Associate Professor of Edu- 
cation, Syracuse University, Syracuse, New 
York. 

Two alternate forms, R and S, of the widely 
used Metropolitan Readiness Tests have re- 
placed the single older form. The new forms con- 
tain the same subtests as those in the original 
edition, except that the Similarities Test, which 
called for identifying dissimilar pairs of pictures, 
has been replaced by the Matching Test in which 
one of four objects similar to a given object is 
selected. 

Several improvements in test format, related 
materials, and procedures have been introduced. 
Size of the test booklet has been increased, mak- 
ing picture material clearer and working space 
greater. Recommended time for administration 
has been reduced by ten minutes, and new forms 
now require three sittings instead of four. The 
carefully prepared scoring directions now include 
such features as special directions for adminis- 
tering the copying test to left-handed children 
and explicit instructions, with illustrations, for 
scoring subjective material. 

, The new forms include new and revised old 
items, the final items having been selected after 
item analysis from three experimental forms ad- 
ministered to 2,600 children in the beginning 
first grade from localities specified in the manual. 
The comparability of Forms R and S was inde- 
pendently investigated by administering each 
to randomly selected halves of an independent 
sample of 1,500 first grade children. This very 
praiseworthy and conscientious technical refine- 
ment is slightly marred by the sole unelaborated 
conclusion that “Analysis of the data from this 
experiment showed the two forms to be com- 
pletely comparable throughout the range of 
Scores in each subtest." The reader is uninformed 
as to how comparability was demonstrated (165 
mean item difficulty, mean internal consistency, 
Similar shape of frequency distributions of scores, 
difficulty indices or internal consistency indices, 
similar standard errors of measurement at cor- 
responding portions of the distributions of 
Scores, content, etc.) and what evidence was 
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considered adequate to warrant the term “com- 
pletely comparable." 

Norms are based on 15,081 white children 
from 56 communities in 26 states. The publisher 
offers to furnish upon request details regarding 
the normative sample and recently developed 
kindergarten norms. As an aid to teachers, tables 
are provided which permit the translation of 
the total readiness score (total raw score) 
into a percentile rank, and the reading readi- 
ness, number readiness, and the total readiness 
scores into letter ratings with verbal descrip- 
tions. 

Median reliability coefficients (Pearson r’s 
between two forms administered a few days 
apart) of six determinations, based on groups 
of from 9o to 273 of beginning first grade pupils, 
are provided for each subtest, for the sum of 
scores on Tests 1—4 (reading readiness), and for 
total score. Unfortunately, no evidence is pro- 
vided as to the variability of these reliability co- 
efficients. Means, standard deviations, and stand- 
ard errors of measurement are also presented. 
The reported reliability of Tests 1-4 is .83; of 
the Numbers Test, .84;:and of the total score, 
.89. The corresponding standard errors of meas- 
urement are 3.7, 1.9, and 4.6. 

The manual appropriately advises against the 
interpretation of each subtest for an individual, 
pointing out that the reliability warrants the use 
of only reading readiness, number readiness, and 
total readiness scores. Would that other test 
makers would exercise the same technical com- 
petence and restraint. Too often, even in very 
popular achievement tests, do the authors pro- 
pose individual diagnosis by means of subdivi- 
sions containing ten or even fewer items. Compu- 
tation of the standard error of measurement | 
would clearly show the absurdity of such at- 
tempts. 

Face validity is claimed for the test, and some 
statistical data are presented as evidence of va- 
lidity. Additional validity studies are reported 
in progress. In a study involving 487 cases, all 
of 44 pupils rated as “Superior” on the Metro- 
politan Readiness Tests administered in Sep- 
tember 1948 were found to be above the na- 
tional norm in average “reading achievement,” 
and all but one above this norm in Numbers 
when tested in February 1949 on the appropriate 
tests of the Primary I Battery of the Metropoli- 
tan Achievement Tests. Of 46 pupils rated “Poor 
Risk” (lowest category) in 1948, 22 were found 
to be below the national norm in average read- 
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ing achievement and 39 below in Numbers when 
tested in 1949. 

Even if the norm is accepted as a minimal 
level of competence (which is certainly a de- 
batable issue), it is to be noted that about half 
of those who were considered “unready” for 
reading actually reached the norm of achieve- 
ment. Even so, the reported results are likely 
to be somewhat optimistic since the group re- 


ported was the same group from which the pre- ' 


diction formula was derived. The present inter- 
pretation of these data emphasizes even more 
than does the manual the extremely tentative 
meaning of the Metropolitan Readiness Test 
scores for individual diagnosis, especially in the 
more crucial lower range. The importance of 
additional factors in considering a child's readi- 
ness for first grade work cannot be emphasized 
too strongly. ~ 

These and other deficiencies (such as oc- 
casional out-dated pictures and poor phrasing of 
items) do not obscure a general high level of 
excellence and careful workmanship both with 
respect to item construction and statistical analy- 
sis. From the technical point of view, the Metro- 
politan Readiness Tests are among the superior 
readiness tests now available. 


Teach Col J 21:15-6 O "49. Fay Griffith. * 
The correlations between the Metropolitan 
Readiness Tests given in September, 1948, with 
the reading score and the number score of the 
Primary I Battery of the Metropolitan Achieve- 
ment Tests, given in February, 1949, are so 
startling as to indicate the value of the Metro- 
politan Readiness Tests in predicting school suc- 
cess of first grade children. The wise teacher will 
avail herself of their use. 


For a review by Irving H. Anderson of the 
original edition, see 3:518; for a review by W.J. 
Osburn of the original edition, see 40:15 52. 


[571] 

*Murphy-Durrell Diagnostic Reading Readiness 
Test. First grade entrants; 1949, CI947—49; 3 scores: 
auditory, visual, learning rate; Part 3 individual in 
part; I form; $1.70 per 25; $1.10 per set of flash cards ; 
postage extra; 35€ per specimen set, postpaid; Parts 
I-2: nontimed (60) minutes; group administration of 
Part 3: (30-40) minutes; individual administration of 
Part 3: (5-10) minutes; Helen A. Murphy and Don- 
ald D. Durrell; World Book Co. * 


REFERENCES 
т. MunPHY, HELEN A. An Evaluation of Exercises for De- 
veloping Auditory Discrimination in Bee Ruaan Mas- 
ter’s thesis, Boston Universit (Boston, Mass.), 1940. 
2. BiGcv, M. VIRGINIA. The Establishment of a Relative 
Order of Difficulty of Word Elements in Audit Discrimina- 
tion, Master’s thesis, Boston University (Boston, Mass), 1946. 
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[572] 
*School Readiness Inventory. Grades kgn-1; 
1950; Forms A, B; both forms on 1 sheet; no data on 
reliability and validity; $1.25 per 25, postage extra; 
50€ per specimen set, postpaid; Katharine М, Ban. 
ham; Educational Test Bureau, Educational Publish- 
ers, Inc. * 


Dav H. RussELL, Professor of Education, 
University of California, Berkeley, California. 

This inventory is not a standardized test but 
a checklist of 25 items of “ordinary personal and 
social accomplishments of six-year-old children, 
regardless of whether they have attended nursery 
school or kindergarten." The items have been 
selected from developmental scales such as those 
of Gesell, Doll, and Goodenough, with certain 
items added by the author. They involve “foot- 
work and balance," “handwork and vision," 
"speech and language," "personal independ- 
ence," and “social co-operation.” Their inclu- 
sion is a result of “observations in examining 
several hundred six-year-old children in school 
and child guidance clinics.” 

The author is on firm ground in emphasizing 
the motor, social, and emotional factors in readi- 
ness for the usual school work. The scale is also 
useful in presenting specific behavior items 
rather than generalities about six-year-olds’ de- 
velopment. However, the author offers no proof 
that the items listed in the two forms of the scale 
are any more crucial for school adjustment and 
success than dozens of other items that could be 
listed. Even more serious, no evidence is given 
for the recommendation that children scoring 20 
points or over are ready or almost ready to enter 
first grade, whereas children scoring below 15 
points should attend kindergarten or nursery 
school or have a longer period at home. In а 
checklist of 25 items, the setting of specific criti- 
cal points for school placement, particularly 
when unsupported by statistical evidence, vio 
lates both principles of test interpretation an 
knowledge of the continuity of human abilities. 
It is true that the author recommends a psycho- 
logical examination for a more reliable estimate 
of the mental level of children scoring below 15 
points. Throughout the accompanying direc 
tions, however, the only other remedy for i 
low scoring child seems to be postponement 0 
entrance into first grade. The advantages of t й 
child's staying with his age group, with modifica- 
tion of the first grade program, are never jud 
tioned. Most school people would agree thal 
postponement is not the only cure. 


607 | 


TESTS & REVIEWS: READING—SPECIAL FIELDS 


No evidence is offered that the inventory has 
the reliability or validity to predict early school 
success by itself. However, since it stresses so- 
cial and emotional factors, it may be used as a 
supplement to some of the better known readi- 
ness tests, such as those of Hildreth, Gates, and 
others. (This supplemental use is not mentioned 
in the manual.) Used as a checklist it may help 
some teachers in understanding their individual 
pupils better. 


SPECIAL FIELDS 
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*Ability to Interpret Reading Materials in the 
Natural Sciences: Iowa Tests of Educational 
Development, Test 6. Grades 9-13; 1942-51 (first 
published as a separate in 1951) ; for complete battery, 
see 17; Form Y-2 (’51—same as 1949 edition) ; manual 
(51); general manual ('51); $3.75 per 25; separate 
answer pads or answer sheets must be used; $1.95 per 
25 answer pads; $3 per тоо IBM answer sheets; 50¢ 
per scoring stencil; $2.50 per 25 first semester ('46) 
or second semester (’48) profiles for any one of grades 
9-12; $1 per 25 self-interpreting profiles for students 
(151); a per school summary report ('44) ; 4€ per 
pupil score sheet (48) ; 5o€ per specimen set; cash or- 
ders postpaid; 60(65) minutes; edited by E. F. Lind- 
gm i K. W. Vaughn; Science Research Associates, 
nc. 


For a review by Eric F. Gardner of the total 
battery, see 17; for reviews by Henry Chauncey, 
Gustav J. Froehlich, and Lavone A. Hanna of 
Forms X-1 and У-т of the complete battery, see 
3:12. 


[574] 
*Ability to Interpret Reading Materials in the 
Social Studies: Iowa Tests of Educational De- 
velopment, Test 5. Grades 9-13; 1942-51 (first pub- 
lished as a separate in 1951) ; for complete battery, see 
17; Form Y-2 (’51—same as 1949 edition) ; manual 
(51); general manual (’51) ; $3.75 per 25; separate 
answer pads or answer sheets must be used; $1.95 per 
25 answer pads; $3 per 100 IBM answer sheets; 50¢ 
per scoring stencil; $2.50 per 25 first semester (’46) 
or second semester (’48) profiles for any one of grades 
9-12; $1 per 25 self-interpreting profiles for students 
C51); 256 per school summary report ('44) ; 4€ per 
pupil score sheet (748); 50% per specimen set; cash 
orders postpaid ; 60(65) minutes; edited by E. F. Lind- 
guise i K. W. Vaughn; Science Research Associates, 
c. 


For a review by Eric F. Gardner of the total 
battery, see 17; for reviews by Henry Chauncey, 
Gustav J. Froehlich, and Lavone A. Hanna of 
Forms X-1 and Y-1 of the complete battery, see 
32. 


[575] 
*[Robinson-Hall Reading Tests.] College; 1940- 
49; 2 scores: rate, comprehension; 4 tests; no descrip- 
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tion of normative population; manual ('49); 4€ per 

reading selection for any one test; 2€ per any one test ; 

15¢ per manual; postage extra; 520 per specimen set, 

postpaid; (15) minutes per test; Francis P. Robinson 

and Prudence Hall; Ohio State University Press. * 

а) A TEST OF READING ABILITY FOR ART. I form ['41]. 

р) i TEST OF READING ABILITY FOR GEOLOGY. I form 
41]. 

C) А TEST OF READING ABILITY FOR HISTORY. Forms 

Canada ['41], Russia ['41]. 

р А TEST OF READING ABILITY FOR FICTION. I Íorm 
41]. 


REFERENCES 

178 Seo i bur ior A., AND THomson, КеМмЕТН F. 
“Test Scores Examined With the Lexis Ratio." Psychometrika 
7:281-8 D ’42. * (PA 17:1032) 

s. Hart, WiLLIAM E., AND Ковіхѕом, Francis Р. “An Ana- 
lyfical Approach to the Study of Reading Skills." J Ed Psychol 
36:429-42 О '4s. * (PA 20:1680) 

Ковевт Murray Bram, Professor of Psychol- 
ogy, and Director of the Reading Clinic, Dart- 
mouth College, Hanover, New Hampshire. 

These tests, devised in experimental studies 
of reading performance in four subject fields, 
provide scores for rate in words read per min- 
ute and for percentage of comprehension ac- 
curacy. Effort was made to achieve high face 
validity through use of assignment type material 
and control of testing conditions. Each test con- 
sists of one continuous passage of from 3,000 
to 4,500 words in length. With the exception of 
the fiction passage—a story from Collier's—the 
material was taken from Compton’s Pictured 
Encyclopedia. The student is instructed to read 
in his usual manner of reading assignments and 
told that after ten minutes he will be asked to 
answer questions of the type generally asked in 
class on such material. Test questions, chiefly 
multiple choice in form and dealing with the 
more important points, are printed on separate 
answer sheets. These features make the tests 
superior to those like the Michigan Speed of 
Reading Test which consist of unrelated short 
paragraphs in which the testee is asked to mark 
out an incongruous word. The separation be- 
tween reading and answering questions makes 
the reading and the testing resemble the school 
situation and facilitates the ready identification 
of the shallow fast reader and the accurate plod- 
ding one. 

One of the arguments for the short unrelated 
paragraph type of rate test has been the diffi- 
culty of securing high reliability for the contin- 
uous passage type. The authors state in the 
manual that for “our test” the reliability of rate 
over the 10-minute reading period is .o1. A 
potential user should also be told the reliabilities 
for rate and comprehension accuracy of the in- 
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dividual tests. They are not shown, but the au- 
thors warn that the comprehension scores, es- 
pecially for fiction, are less reliable than for rate 
and recommend in individual diagnosis the use 
of several tests in determining comprehension. 

While as is often the case, the manual does 
not describe the standardization, it cites a re- 
port (2) by the authors which states that the 
subjects of the original experiment were 205 
students, mostly freshmen and sophomores, in 
the College of Education in a required educa- 
tional psychology course. For interpretative 
purposes, the manual provides tables of per- 
centile ranks for rate and comprehension ac- 
curacy based upon the results of Ohio State 
University freshmen, but no indication is given 
of the number of cases involved or of how repre- 
sentative the sampling was. 

There is only one form each of the tests for 
art, geology, and fiction. For history, there are 
two forms, one on Canada and the other on Rus- 
sia. The history tests are very highly inter- 
correlated as to rate and as comparable as two 
forms usually are. While somewhat lower inter- 
correlations in rate were found between history, 
geology, and art, indicating some difference in 
the ways students read the material or in skills 
and abilities required in the different subject 
fields, they are high enough to indicate some 
overlapping. The still lower correlations be- 
tween fiction rate and rates in art, geology, and 
Canadian history (.74, .59, and .62) suggest 
that differences in reading performance between 
fiction and nonfiction are more substantial. The 
intercorrelations between comprehension ac- 
curacy scores are generally even lower, but lack 
of knowledge of reliabilities of scores makes un- 
certain any generalizations. The authors report 
a factor analysis (5) which showed heavy load- 
ings for what they term "attitude of comprehen- 
sion accuracy" by all the comprehension tests, 
for "rate of inductive reading" by the art rate 
test, and for "rate for unrelated facts" by the 
geology rate test and to a lesser extent by the 
art and history rate tests. 

The tests will be most useful to those wishing 
rate of reading tests based upon materials and 
conditions similar to those faced by college fresh- 
men. They constitute a helpful approach toward 
what has proved a difficult task for testmakers, 
ie, obtaining in a reasonable length of time 
measures of differences in reading performance 
in different subject fields. That the difficulty has 
not been wholly surmounted is apparent from 


the magnitude of rate intercorrelations, How. 
ever, because of the length of the passages a more 
dependable judgment of rate can be obtained 
than is possible for most other tests of this type 
and as satisfactory as for any type. Though the 
comprehension accuracy score is somewhat less 
reliable than the rate score, comparison may be 
made between these scores which afford in- 
sight into some of the student's reading habits 
and levels of proficiency. A different type of 
reading test should be chosen, however, if an 
analysis of comprehension skills is desired. 


[576] 
XTests of Natural Sciences: Vocabulary and In- 
terpretation of Reading Materials: Cooperative 
Inter-American Tests. Grades 8-13; 1950; 3 scores: 
vocabulary, interpretation of reading materials, to- 
tal; IBM; 2 editions; English edition: Forms AE, 
BE; Spanish edition: Forms AS, BS; no data on relia- 
bility and validity; tentative norms; $2.50 per 25; sep- 
arate answer sheets must be used; 80€ per 25 IBM an- 
swer sheets; 15¢ per scoring stencil; 50¢ per series 
manual; cash orders postpaid; 5of per specimen set, 
postpaid ; 35(45) minutes; prepared under the auspices 
of the Committee on Modern Languages of the Ameri- 
can Council on Education, Herschel T. Manuel, Direc- 
tor of Test Construction ; Cooperative Test Division, 
Educational Testing Service. * 
REFERENCES 

т. Bou, IsxaeL Ropricuez. A Study of the Parallelism of 
English and Spanish Vocabularies. Doctor's thesis, University 
of Texas (Austin, Tex. 1944. » 

2. McCrante, Joserutxe, A Study of Four Inter-American 
Tests Applied to High School Seniors. Master's thesis, Uni- 
versity of Texas (Austin, Tex.), 1944. А 

з. KeLLeEY, Frances, А Study oF the Inter-American Tests 
at the Hi igh School Level. Master’s thesis, University of Texas 
(Austin, Tex.), 1945. т. 

4. Fire, Ronert Hernpon, AND Manver, Herscuet T. 
The Teaching of English in Puerto Rico, pp. 171—313, 337—410. 
Prepared for the American Council on Education. San Tuan, 
P.R.: Department of Education Press, 1951. Pp. xix, 410. 


Crarence Н. Netson, Board of Examiners, 
Michigan State College, East Lansing, Michi- 
gan. [Review of the English edition.] 

These tests are designed to measure the pu- 
pil's ability to select appropriate words in vo- 
cabulary exercises and to read and interpret 
materials in the natural sciences. A feature 
which is quite unique is that two parallel forms 
of these tests are also available in the Spanish 
language. 

Each form of the English edition consists of 
a vocabulary section of 75 four-choice items 
which progress from very easy to moderately 
difficult words commonly used in the natural 
science area. The exercises are presented in the 
form of sentences in which the task is to select 
one of the four words listed to fill а blank. 
The items in this section reflect, on the whole, 
careful workmanship on the part of the test 
builder. 

The second section of each form employs à 
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series of well written one-paragraph reading 
passages, of increasing difficulty, drawn from 
different sciences. The three to five multiple 
choice items following each reading passage are 
intended to measure the accuracy of reading and 
the degree to which the passage is understood. 
While the preponderance of the items are de- 
signed to elicit responses which will indicate 
primarily the degree of understanding of the 
language of the paragraph, in some of the more 
difficult exercises the answers to the questions 
are not to be found in the reading material itself 
but must be based upon inferences from the facts 
presented. Items of the latter type measure an 
ability which is especially desirable in the sci- 
ences. 

According to the manual, the English ver- 
sion of this test is designed for grades 8-13 
when given to English-speaking pupils who are 
doing their school work in English, and the 
Spanish edition is designed for Spanish-speaking 
students in the same grades who are doing their 
school work in Spanish. The tests appear to be 
very useful in appraising the status of Spanish- 
speaking students who have enrolled for regular 
courses given in English in American high 
schools, colleges, and universities. By giving 
such students one form of the test in English 
and the alternate form in Spanish, a standard 
of comparison is provided whereby it would be 
possible to ascertain (a) the quality of their 
science background, (b) their relative profi- 
ciency in Spanish and English, and (c) whether 
any deficiency in performance is due to inade- 
quate science background or aptitude, to lack 
of facility with the English language, or to a 
combination of these factors. The parallelism of 
the English and Spanish editions thus becomes 
a feature of special value in the orientation of 
students who wish to transfer from the one 
language area to the other. 

According to the test builder, these tests have 
validity in the sense that the materials which 
provide the basis for the items were drawn from 
different fields within the natural science area. 
Furthermore, every exercise was selected on the 
basis of its power to discriminate between those 
who scored high and those who scored low on 
the particular section of the test under considera- 
tion. Comparable validity of the English and 
Spanish editions was sought by employing bi- 
lingual staff specialists, by using the same exer- 
cises in the two languages, by presentation of 
the exercises in equivalent language, and by 
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checking against standard word lists, such as 
the Thorndike list. 

No index of reliability appears in the data 
accompanying this test, presumably because of 
the recency of the edition. Since it is of crucial 
importance for the prospective test user to have 
this information, it is hoped that reliability data 
will be forthcoming soon. 

Quoting from the manual, “The Cooperative 
Inter-American tests of Natural Sciences repre- 
sent a first attempt to provide measures which 
can be used to compare the performance of stu- 
dents from different linguistic and cultural back- 
grounds. They have been carefully constructed 


-by bilingual specialists to eliminate, insofar as 


possible, idiomatic and cultural bias from the 
test materials themselves.” These tests conform 
to the standards of good test construction. The 
format is attractive and the legibility is good. 
The tests should function well as measures of 
vocabulary and interpretation of reading pro- 
ficiency in the natural science area. 


[577] 
*Tests of Social Studies: Vocabulary and Inter- 
pretation of Reading Materials: Cooperative In- 
ter-American Tests. Grades 8-13; 1950; 3 scores: 
vocabulary, interpretation of reading materials, total ; 
IBM; 2 editions; English edition: Forms AE, BE; 
Spanish edition: Forms AS, BS; no data on reliability 
and validity; tentative norms; $2.50 per 25; separate 
answer sheets must be used; 80¢ per 25 IBM answer 
sheets; 15¢ per scoring stencil ; sol per series manual ; 
cash orders postpaid; 50¢ per specimen set, postpaid ; 
35(45) minutes; prepared under the auspices of the 
Committee on Modern Languages of the American 
Council on Education, Herschel T. Manuel, Director 
of Test Construction; Cooperative Test Division, Edu- 
cational Testing Service. * 
REFERENCES 


ега. of Texas (Austin, Tex.), 194. 

3. Kerrey, Frances. A ‹ 

at the High School Level. Master's thesis, University of Texas 
(Austin, Tex.), 1945 


4. Firg, ROBERT ‘Hernpon, AND MANUEL, HERSCHEL Ti 
The Teaching of English in Puerto Rico, pp. 171—313, 337-410. 
Prepared for the American Council on lucation, San Juan, 
P.R.: Department of Education Press, 1951. Pp. xix, 410. * 
Gustav J. FROEHLICH, Assistant Director, 
Bureau of Institutional Research, University of 
Illinois, Urbana, Illinois. [A review of the Eng- 
lish edition.] 

This test is part of an inter-American series 
designed to provide measures of ability and 
achievement in parallel editions of tests pub- 
lished in English and in Spanish. The Test of 
Social Studies is essentially a test of reading 
skills based upon social studies reading materials. 

The idea of having comparable bilingual tests 
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designed to measure progress in a foreign lan- 
guage in relation to achievement in the native 
tongue is one worthy of development ; however, 
the question as to whether or not the English 
edition of the Test of Social Studies is a valuable 
link in this development cannot be answered by 
this reviewer at this time. There is a dearth of 
objective evidence. The test manual is well 
written ; it contains a certain amount of pertinent 
theoretical discussion; but it presents no con- 
crete supporting evidence. No experimental data 
on validity and reliability have been found any- 
where by this reviewer. The degree of compara- 
bility between the English and Spanish editions 


can only be inferred from the fact that an impos- - 


ing array of experts spent many hours in the 
preparation of both editions. 

To date the only available norms for the Eng- 
lish edition are a single set of median scores for 
each grade from the sixth through the twelfth. 
These norms, furthermore, are highly tentative. 
They are based upon the data obtained from an 
administration of a preliminary form, the data 
being arbitrarily adjusted to obtain a single set 
of medians to be applied to both final forms in 
the English edition. This assumes a high degree 
of correlation between the two English forms; 
although no evidence is presented to substantiate 
such an assumption. 

The reviewer is dubious of the value of the 
English edition when used “to discover more ac- 
curately the level at which pupils can read"— 
especially when used to discriminate between 
adjacent grade levels. The tentative norms show 
total score differences of only six, seven, and 
eight points respectively between adjacent grade 
medians for grades 11 to 12, 10 to 11, and 9 to 
10; and differences of ten to twelve points for 
grades 8 to 9, 7 to 8, 6 to 7. 

The mechanics of the test—its format, word- 
ing, type, and keys—are well executed, except 
that some testees might be bothered by finding 
a question mark splitting each blank in the vo- 
cabulary subtest, especially when a single word 
is the correct answer. A continuous blank with- 
out the question mark might be an improve- 
ment. 

When considered as English tests of reading 
skills using social studies reading materials, the 
Tests of Social Studies comprise just another 
reading test, similar in form and content to a 
number of such tests currently available. When 
they are considered as a link in a series of bi- 


lingual tests of ability and achievement, a judg- 
ment of their worth must be held in abey- 
ance until experimental evidence is made avail- 
able. 


Martua E. Layman, Associate Professor, 
Board of Examiners, M ichigan State C ollege, 
East Lansing, Michigan. 

These tests, utilizing materials from the social 
studies, have been prepared in parallel English 
and Spanish editions. Their purpose is to meas- 
ure achievement in either English or Spanish 
and to compare achievement in the two lan- 
guages on the part of bilingual students, 
In other words, for either Spanish or English 
speaking students one form would serve as a 
test of achievement in the social studies. A stu- 
dent with one of these languages as his native 
language and studying the other one as a for- 
eign language would take Form A in his native 
language and Form B in the foreign language. 
By a study of his two scores authors of these 
tests claim that the teacher would be able to 
form some estimate of his ability in the foreign 
language compared to his ability in his native 
tongue. Thus a teacher of Spanish in the United 
States might discover that a student reads Eng- 
lish at the twelfth grade level but is able to read 
similar materials in Spanish at only the eighth 
grade level. Yet the norms provided for this 
purpose are simply medians by grade levels based 
on the testing of 26,000 students on the Spanish 
forms in Mexico and Puerto Rico in 1943 and 
on the testing of 10,000 students on the English 
forms in the United States in 1943 and 1945. 
One is left wondering just how a teacher could 
determine very accurately how much better a 
student was reading in one language than in the 
other one. 

The manual states, moreover, that “Because 
the Cooperative Inter-American Tests are being 
offered for general use for the first time, reli- 
ability statistics are not available at present. 
This is a most surprising statement in view of 
the claim on page 3 that the authors have had | 
“several years of experience with the tests” and 
have made “many studies.” Most test users eX- - 
pect authors and publishers to undertake the — 
responsibility of providing such data when tests; ! 
are offered for use in the schools. 1 i 

Part I of these tests consists of 75 vocabulary i5 
items of the multiple choice variety. They are 
in the form of sentences, each containing a blank 
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which the student is to fill from four possible op- 
tions. The manual states that the method of 
selection depended primarily on subjective judg- 
ment as to concepts that were common to the 
two languages and which could be expressed in 
English and Spanish of approximately the same 
difficulty. In addition, use was made of Thorn- 
dike's, Buchanan's, and Eaton's word lists, and 
the test constructors were further aided in their 
selection by additional evidence as to rela- 
tive difficulty of the tests within the same lan- 
guage obtained through administration of the 
items. 

The terms are generally well-chosen from the 
various components of social studies courses. In 
the easier items there are wide differences among 
the meanings and sounds of the response words, 
but in the difficult items the response words are 
closer to one another in either classification, 
meaning, or sound. In a few cases the meanings 
given in the correct response are doubtful. For 
example Item 72, Part I, of Form AE reads, 
“The slaves of ancient Greece were known as 
.’ The correct response is given as 

“helots” but helots were serfs rather than slaves, 
and the term is accurately applicable only to 
Sparta, not to Greece as a whole. In the same 
part, Item 58 calls for “sedition” as the correct 
response to define “ап action which causes a 
revolt against the government." Social studies 
teachers can rightly object to the sentence “АП 
citizens have the right to vote because of uni- 
versal suffrage.” Though suffrage is obviously 
the intended response the sentence is mislead- 
ing, for of course not all citizens have the right 
to vote even under so-called "universal" suf- 
frage. 

Part II consists of several paragraphs drawn 
from various social studies materials each fol- 
lowed by a series of questions calling for an in- 
terpretation or evaluation of the reading passage. 
The test builders have avoided one of the cus- 
tomary pitfalls of reading tests by placing in 
almost all of the items one option reading “The 
question cannot be answered from the facts of 
the paragraph." Thus they have included very 
few items which could be answered without 
reference to the reading material. The passages 
are generally well selected and the items com- 

Ж constructed. 

The usefulness of these tests could be con- 
siderably increased by a simpler set of instruc- 
tions and by more complete interpretative data. 


STUDY SKILLS 
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*Bennett Use of Library Test. High school and 
college; 1947; Forms A, В; $1.15 рег 25; 20€ per 
specimen set; postpaid; 50(55) minutes; Alma Ben- 
nett and H. E. Schrammel; Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria. * 

Lours SHores, Dean, School of Library Train- 
ing and Service, Florida State University, Talla- 
hassee, Florida. 

The 130 questions related to “basic accepted 
rules and practices" in libraries aim to provide 
an "achievement test for evaluating the knowl- 
edge concerning library organization and prac- 
tice by high school and college students." Con- 
tent is covered in nine sections: the book (15 
questions), the card catalog (14 or 15), decimal 
classification (10), subject headings (10), li- 
brary terms (10 or 12), filing and alphabetizing 
(10), reference books (25), reference questions 
(24 or 25), and Readers’ Guide (10). More than 
one third of the questions are true-false (54 or 
55). Multiple choice and matching types are 
represented by 44 or 45 and 30 or 32 questions 
respectively. There are clear directions for ad- 
ministering the test in the manual, and there is 
a printed scoring key. The publishers report a 
reliability coefficient, under the corrected split 
half method, of .86. 

Although the manual does not indicate how 
the content was selected, the questions cover 
most of the units usually included in high school 
and college freshman courses and orientation 
periods on “how to use the library.” The sec- 
tion sequence, too, is approximately that of the 
dozen or fewer lessons prescribed for secondary 
schools by various state and regional accrediting 
agencies as well as by standard texts, and to 
that extent the test should fit in with present 
library-use teaching programs. 

Many of the individual questions are well 
conceived. For example, Part IT, Section B pro- 
vides a good matching exercise for locating book 
subjects in the decimal classification. Similarly 
Sections C and E provide excellent practice in 
consulting a card catalog for specific titles or 
subjects. 

The weakest sections, in this reviewer's opin- 
ion, are those which employ the true-false type 
of question. Perhaps it is only the inherent 50-50 
nature of this type of question which appears 
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to reduce its effectiveness. But another objection 
is suggested by Item 4 in Part I (Form B): 
“The publisher of a book is the person who has 
written it." This should, no doubt, be marked 
false, but many a youngster will recall experi- 
ences with books in which that statement was 
true. 

Of the three sections in Part III dealing with 
reference books, the multiple choice questions 
of Section B, requiring the student to select the 
best answer to a specific question from five 
choices, offer the greatest learning opportunity. 
Section A recalls to librarians the old fashioned 
questions given to librarians-in-training in li- 
brary school reference courses. Section C could 
be simplified. For example, in Form A, Item 
121, "Find an article continued in succeeding 
issues of a magazine," it is not clear whether the 
answer should be the title of the article, the 
reference to the first issue, or the reference to 
the second issue. à 

The word pseudonym has been misspelled in 
both forms. 

Despite these suggestions on detail, here is 
the germ for a useful test in a much neglected 
area. 


[579] 

*Cooperative Dictionary Test. Grades 7-12; 1951- 
52; 5 scores: alphabetizing, spelling, pronunciation, 
meaning, total; IBM; Form A ('s1); manual ('52) 
$1.75 per 25; separate answer sheets must be used; 
$1.25 per 25 IBM answer sheets; 30€ per scoring sten- 
Cil; cash orders postpaid; 50€ per specimen set, post- 
paid; 30(40) minutes; S. D. Melville with the editorial 
assistance of Clarence Derrick and Anne W. Henry; 
Cooperative Test Division, Educational Testing Serv- 
ice. 


[580] 
X*Edmiston How to Study Test. Grades 7-13; 
1947-49; Forms A (47), B (47); manual ['48]; 
norms ['49]; $2 per 25; 356 per specimen set; post- 
paid; nontimed in part (45) minutes; R. W. Edmis- 
ton; the Author, Miami University, Oxford, Ohio, * 
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*Interpretation of Data de: General Educa- 
tion Series. Grades 7—12, 12-14; 1939-50; except for 
slight changes in wording, wer Level: Forms A 
(750), B (50) combined are identical to Interpretation 
of Data: Test 2.71 (40), Upper Level: Forms A 
(50), В ("50) combined are identical to Interpretation 
of Data: Test 2.51 (^39) ; original tests published by 
Evaluation in the Eight Year Study, Progressive 
Education Association; 2 levels; data on reliability 
and validity based on the 1939-40 tests; no norms; 
manual ('50); separate answer sheets must be used : 
¢ per 25 answer sheets for any one level; cash or- 
ders postpaid; 50€ per specimen set of any one level, 
postpaid; (40) minutes; Evaluation Staff (Ralph W. 
Tyler, Director) of the Eight Year Study of the Pro- 
gressive Education Association; published in 1950 by 
Cooperative Test Division, Educational Testing Serv- 
ice, 


а) LOWER LEVEL. Grades 7-12; 1940-50; 6 scores: gen- 


eral accuracy, accuracy in recognition of true or false 
statements, accuracy with insufficient data, Overcau- 
tion, going beyond data, crude errors; $2.25 per 25; 
30€ per set of scoring stencils, : 
b) UPPER LEVEL. Grades 12-14; 1939-50; 7 scores: 
same as Lower Level, plus accuracy in recognition of 
probably true and probably false statements ; $2.50 per 
25; 60€ per set of scoring stencils. 


REFERENCES 
1-9. See 40:1544. 
10-13. See 3:535. 
14. Sum UGENE R.; Tyrer, RALPH. W.; AND THE EVALU- 
ATION Starr. Appraising and Recording Student Progress, 
Adventure in American Education, Vol. III. New York: 
Harper & Brothers, 1942. Tp xxiii, 550. * (РА 16:5033) 

15, Weisman, Lean L, Some Factors Related to the Ability 
to Interpret Data in Biological Science. Doctor’s thesis, Uni- 
versity of Chicago (Chicago, Ill.), 1946. Pp. vii, 176. 

16, BROTHERS, iLBUR Leo., The Relationship of Certain 
Factors to Effectiveness in Student Teaching in the Secondary 
Schools. Doctor's thesis, Indiana, University (Bloomington, 
Ind.), 1950. (Thesis Abstract Series....1950, 1951, pp. 12-8.) 

17. LUpLow, HERBERT GLENN, An Analysis of the Ability to 
Interpret Data and Its Relationship to Certain Other Aspects 
of Pupil Status. Doctor's thesis, Indiana University (Blooming- 
ton, Ind.), 1950. (Thesis Abstract Series....1950, 1951, pp. 
75-80.) (PA 25:7082, title only) 

18. PLummer, Rosert Howamp. Characteristics and Needs 
of Selected Ninth Grade Pupils as a Basis for Curricular 

hanges to Meet Life Adjustment Needs. Doctor’s thesis, In- 

diana University (Bloomington, Ind.) 1951. (Thesis Abstract 
Ѕеғіез....1951, 1952, pp. 113-9.) 
J. Raymonp Gerzericu, Director, Bureau of 
Educational Research and Service, and Profes- 
sor of Education, University of Connecticut, 
Storrs, Connecticut. 

These pilot editions of the Interpretation of 
Data Tests for the junior high school through 
junior college grades are evaluation instruments 
of the Eight Year Study issued as part of the 
General Education Series of Cooperative Tests. 
Forms A and B of the Lower Level test employ 
the ten sets of data originally issued as Form 
2.71, while Forms A and B of the Upper Level 
instrument make use of the ten sets of data 
originally found in Form 2.51. Each of the new 
forms is therefore half as long as the “parent 
Forms 2.71 and 2.51 and recommended timing 
has been reduced to 40 minutes. 

Three criteria employed in the selection of 
the data to be interpreted—that they be pre- 
sented in a variety of forms, that they relate to 
major problems in various subject areas, and 
that they deal with various types of relation- 
ships—appear to be well met. At the lower level 
pupils are asked to discriminate on a 3-point 
scale whether each statement based on the data 
presented is true, false, or uncertain as to truth 
or falsity because of insufficient information 11 
the data. At the upper level a 5-point scale pro- 
vides for two additional types of responses— 4 
probably true and probably false, intermediate 
in position between true and indeterminate ап 
between false and indeterminate. 

Unlike the original 2.51-2.52 and 2.71-2:72 
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forms issued by the Progressive Education As- 
sociation in 1939 and 1940 and the 2.51 form 
reproduced by the Cooperative Test Service in 
1947, the new instruments are printed from type 
and they make no provision for machine scor- 
ing. Although a separate answer sheet is used 
with the new editions, it is not machine scorable. 
Instead it is scored by the use of a set of trans- 
parent stencils to be superimposed over the an- 
swer sheet. Six of the seven separate scores of 
the Lower Level test are obtained by the use 
of three such stencils, whereas six stencils are 
required in obtaining seven of the eight scores 
for the Upper Level test. The remaining score, 
for omissions, is obtained for each test by scan- 
ning the answer sheets. The scoring of the an- 
swer sheets hence becomes a rather involved and 
time-consuming. process. 

The publisher recommends that comparability 
of results from score to score for purposes of 
analysis, summarization, and interpretation be 
obtained by converting each score to a percent- 
age of its possible value. Although the reviewer 
recognizes the difficulties involved in establish- 
ing comparability otherwise, he believes that 
these percentage scores are somewhat cumber- 
some, that they may well give those school offi- 
cials indoctrinated with the percentage basis for 
marking pupil achievement an opportunity to 
misinterpret the results, and that the develop- 
ment of some other basis for comparability or 
the facilitation of the present one may be desir- 
able. If the percentage base is considered prefer- 
able to any alternative scale, likelihood of fre- 
quent errors in computing percentages could be 
reduced and much processing time could be 
saved the test user by printing tables on the 
answer sheets for converting from raw scores 
to percentages. 

A chart appearing in each manual shows how 
the results can be interpreted as to their general 
accuracy, the tendency to be overcautious, the 
tendency to go beyond the data, and in terms of 
crude errors, The general accuracy score is also 
broken down into three parts: accuracy with 
true-false items, accuracy with probably true or 
probably false items, and accuracy with items 
for which the data are insufficient. All are ob- 
tainable for the Upper Level test but only the 
first and last of the specialized accuracy scores 
are possible for the Lower Level test. Recom- 
mendations concerning the setting up of a Sam- 
ple Data Sheet for summarizing test results also 
appear in each manual, together with illustra- 


tive data and examples of interpretations of 
sample scores reported in the illustrations. The 
publisher could do much to insure utility of re- 
sults for purposes of class analyses if Sample 
Data Sheet forms were printed and distributed 
with copies of the test booklets and answer 
sheets. 

Norms are not provided by the publisher. It 


' is true that the tests are definitely not keyed to 


particular courses or school curricula and that 
results from their use do not serve traditional 
purposes. However, these characteristics do not 
seem sufficient to preclude the use of norms in 
the interpretation of results. Most test users at 
the junior and senior high school levels doubt- 
less prefer tests with norms to tests for which 
they must establish their own bases for compari- 
sons of results. Norms by grades, and perhaps 
also by types of high school curricula, seem not 
only practicable and feasible but also likely to 
be highly useful. It is even possible that separate 
tests for such subject areas as the biological sci- 
ences, physical sciences, and social studies with 
differentiated norms would supplement these 
tests by providing somewhat more diagnostic 
evaluative instruments. 

The validity of these instruments was con- 
sidered by the publisher from two standpoints : 
(a) asa measure of the student’s ability to judge 
interpretations formulated by others, and (b) 
as an index of the student’s ability to write origi- 
nal interpretations. One validation technique 
was that of logical analysis, through which the 
test situations were so chosen as to allow the 
pupils to demonstrate the behaviors defined by 
the test objectives. The other, abstracted in the 
manuals from a more extensive treatment in 
Smith and Tyler’s Appraising and Recording 
Student Progress (14), showed correlations 
ranging from .44 to .69 for Form 2.71 at the 
junior high school level and of .72 and .74 for 
Form 2.52 at the higher grade level between gen- 
eral accuracy scores and scores on an objectively 
scored essay test designed to measure the same 
outcomes. It is pointed out in the Upper Level 
test manual that these validity coefficients are 
for a test twice as long as each form of the cur- 
rent edition and that comparable coefficients for 
Forms A and B would necessarily be lower. 

The test manuals treat reliability of scores 
only in two brief paragraphs referring the reader 
to Smith and Tyler and noting that the test 
user may obtain reliability data for himself by 
correlating scores on Form A and Form B. 
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Smith and Tyler report reliability data of several 
types for the original 2.71-2.72 and 2.51-2.52 
forms. These data are often based on pupils in 
single schools and in several grades, so they leave 
much to be desired. They are also scattered 
through a number of textual pages and pre- 
sented in tabular form by grades and separate 
schools in the appendix. It is improbable that 
the typical test user could adequately sum- 
marize or synthesize these reliability data if, 
which is highly doubtful, the book were available 
to him and he took the trouble to seek out and 
consult the reference. If and when he did, he 
would obtain evidence on tests twice as long as 
the current editions and data mainly based on 
pupils attending the progressive schools of the 
Eight Year Study. 

‘As a means for obtaining further evidence 
concerning the reliability of scores and other 
characteristics of these tests, Form A both of 
the Lower Level and Upper Level test was ad- 
ministered to 99 twelfth grade chemistry pupils 
in the Crosby High School of Waterbury, Con- 
necticut, in April 1951. Reliability coefficients 
were estimated for the various scores of both 
tests by the use of the Kuder-Richardson “Еоої- 
rule,” Formula 21, with raw score data. Relia- 
bility coefficients for the Upper Level test can 
be compared with a composite of the Kuder- 
Richardson coefficients reported in Smith and 
Tyler, page 511, for twelfth grade pupils on the 
original 2.52 form. The Smith-Tyler results for 
234 pupils, presented separately for five schools, 
were weighted by use of Fisher's z-transforma- 
tion and then transmuted back to Pearson prod- 
uct-moment 7 values. These in turn were stepped 
down to obtain reliability coefficient estimates 
for a test equivalent in length to the current 
Form A or Form B. 

Estimates of reliability coefficients for the 
Crosby High School pupils and from the ad- 
justed Smith-Tyler data respectively were 
found to be, for the various scores: .79 and 79 
for general accuracy, .57 and .56 for accuracy 
with true-false items, .8т and .81 for accuracy 
with probably true or probably false items, .67 
and .77 for accuracy with indeterminate items, 
.85 and .85 for overcaution, .68 and .87 for be- 
yond data, and .62 and .41 for crude errors. In 
four of the seven comparisons the two estimates 
are identical or closely similar; only for scores 
on accuracy with indeterminate items, beyond 
data, and crude errors are the two estimates 
markedly different. For two of these—accuracy 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 614 


with indeterminate items and beyond data—the 
Crosby coefficients are markedly lower than the 
adjusted Smith-Tyler estimates. The degree of 
conformance in results from the two estimates 
seems sufficient to warrant the belief that the 
reliabilities of the scores are relatively stable for 
the pupil groups represented in the two sets of 
data. 

Doubtless more important than the compara- 
bility of results, however, are the magnitudes of 
the estimated reliability coefficients. They range 
from .85 to .41 for the Smith-Tyler adjusted 
data, with a mean coefficient of .72, and from 
.85 to .57 for the Crosby data, with a mean co- 
efficient of .71. That these estimated coefficients 
are too low to justify the use of results from one 
form of the current test for individual pupil di- 
agnosis seems inescapable, even for the more re- 
liable overcaution, accuracy with probably true 
or probably false items, and general accuracy 
scores. Some of the other scores seem to be too 
unreliable even to warrant group comparisons 
when only one form of the test is administered, 
Adequate reliability for individual pupil diag- 
nosis seems likely on scores for general accu- 
racy, accuracy with probably true or probably 
false items, and overcaution scores if scores are 
based on the combined A and B forms, but the 
crude error score appears to be too unreliable 
for group comparisons even if results from Forms 
A and B are combined. 

One other type of evidence on reliability of 
measurement was obtained from the Crosby data 
—correlations between scores on Form A of the 
Upper Level and Lower Level tests for the 99 
twelfth grade pupils. These were .65 for general 
accuracy, .45 for overcaution, .44 for beyond 
data, and .32 for crude errors. When corrected 
for attenuation the intercorrelations for the same 
scores were found to be respectively .89, .74, 64, 
and .64. These results are in essential harmony 
with those presented above. ; 

The fact should doubtless be borne in mind 
when interpreting these results that а diminu- 
tion in reliability may be accompanied by an m- 
crease in validity, especially when a test purports 
to measure relatively intangible as contrasted 
with more traditional skill and knowledge out- 
comes, Insurance against overinterpretation 0 
results seems possible through the exercise of 
proper caution and through the use of Forms 
and B in combination when greater reliability 
than that obtainable from only one form 1s de- 
sired. So employed, these instruments for meas" 
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uring abilities to interpret data should make pos- 
sible for many test users a new and refreshing 
experience in pupil evaluation and the use of 
results distinctly different from, and more sig- 
nificant than, many of those obtained from tra- 
ditional tests. These pilot editions can well be 
used to supplement tests designed primarily for 
use in those content and skill areas closely tied 
to school curricula and courses. The availability 
of such instruments commercially and as part 
of a significant series may well accomplish much 
both directly and indirectly in the improvement 
of school practices. 


Vicror Н. Nott, Professor of Education, Mich- 
igan State College, East Lansing, Michigan. 

The original forms of these tests are now so 
well known that little description is needed. The 
1950 edition of each test consists of Forms A and 
B made by splitting each original test in half. 
By so doing, the time required is said to be re- 
duced to 40 minutes per form. In the report by 
Smith and Tyler (тд) the Kuder-Richardson 
reliability of the Lower Level test (2.71) ranges 
from .80 to .9r. For the Upper Level test the 
reviewer was unable to find comparable reliabil- 
ity coefficients in either the earlier report or the 
newer manual, Correlations between the original 
Tests 2.51 and 2.52 range from .65 to 85, for 
the different part scores obtainable, and the 
Kuder-Richardson reliability of the part scores 
on Test 2.52 ranged from .75 to .95. Splitting 
the original test into the two forms now pub- 
lished would almost certainly reduce the relia- 
bilities. Unfortunately, the manuals give no in- 
formation on this point. All statistical data on 
validity of these forms are also based on the 
earlier tests, Correlations between the general 
accuracy score of the Lower Level test (2.71) 
and an essay form of the test ranged from .44 to 
69. For the Upper Level test (2.52) correla- 
tions between part scores and an essay form 
ranged from .12 to .80, when corrected for un- 
reliability of the criterion. 

Scoring is a rather cumbersome and time- 
consuming process. Both tests are hand scored 
by use of transparent stencils, the Lower Level 
test requiring three stencils and the Upper Level 
test, six. Each stencil yields one or more part 
scores, Seven part scores are obtainable on the 
Upper Level test and six on the Lower Level 
test. Each raw score is then converted to a per- 
centage score. 

One can hardly question the contribution of 


these tests as new departures in evaluating 
rarely measured outcomes of instruction and 
learning. As pioneer attempts in a comparatively 
uncultivated field they were, and to a lesser ex- 
tent still are, unique. There can be little doubt 
also that they have had a good influence on 
measurement practices. It is a matter of regret, 
therefore, that after more than a decade statis- 
tical data on reliability, validity, norms, and 
equivalence of forms are still meager or entirely 
lacking. It is doubtful, moreover, that the new 
editions will find wide use in schools because of 
the complications involved in scoring and inter- 
pretation. Finally, it has never to the reviewer's 
knowledge been shown that these tests actually 
measure unique qualities or abilities. They ap- 
pear to be different, but it would be most inter- 
esting to see such evidence as correlations be- 
tween scores on these tests and on good stand- 
ardized tests of achievement in subject matter 
such as science or mathematics. Perhaps the pub- 
lication of these tests in these so-called pilot 
editions will lead to more evaluation of their 
qualities as measuring instruments. 


For a review by J. Wayne Wrightstone, see 

3:535. 
[582] 

*Logical Reasoning Test: General Education 
Series. Grades 10-12; 1939-50; except for slight 
changes in wording, Forms A (50), B. (50), combined 
are identical to Application of Certain Principles of 
Logical Reasoning: Test 5.12 (40), which was a re- 
vision of Tests 5.1 (730) and 5.11 (’39) ; original tests 
published by Evaluation in the Eight Year Study, Pro- 
gressive Education Association; 10 scores: right con- 
clusions, wrong conclusions, relevant judged relevant, 
irrelevant judged relevant, irrelevant judged irrelevant, 
relevant judged irrelevant, accuracy with definitions, 
accuracy with indirect arguments, accuracy with ad 
hominem arguments, accuracy with if-then arguments ; 
reliability data based on the use of both forms ; manual 
(зо); $2.50 per 25; separate answer sheets must be 
used; боё per 25 answer sheets; 20€ per set of scoring 
stencils; cash orders postpaid; so¢ per specimen set, 
postpaid ; nontimed (40-50) minutes; Evaluation Staff 
(Ralph W. Tyler, Director) of the Eight Year Study 
of the Progressive Education Association; published 
in 1950 by Cooperative Test Division, Educational 
Testing Service. * 


REFERENCES 


1-4. See 40:1528. 


Smitn, EUGENE R.; ТҮ ВлРи W.; AND THE EVALU- 


. LER, [ 
azion Stay. АНЫН кйш, Vd. Te New Cork: 
Harper & Brothers, 1942. Pp. xxiii, 550, * (PA 16:5033) 
RozrnT L. Eset, Director, University Exami- 
nations Service, and Associate Professor of Ed- 
ucation, State University of Iowa, Iowa City, 
Iowa. 

The title of this test carries considerable pres- 
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tige, and the test itself is certain to interest many 
teachers. Unfortunately it possesses shortcom- 
ings which seriously limit its usefulness as a 
measuring instrument. 

In the first place, the test emphasizes the ap- 
plication of several selected principles of rigorous 
logic, which is not generally useful in everyday 
thinking. Most practical problems can not be 
solved by syllogistic reasoning unless one intro- 
duces simplifying assumptions which are quite 
unacceptable on a common sense basis. This is 
recognized by the authors, who point out in the 
manual that the test “is recommended only for 
classes where conscious attention has been di- 
rected toward logical reasoning. Students are 
apt to wonder why they should attempt to reach 
logical conclusions, which are sometimes con- 
trary to their better judgment, unless they have 
previously played the logical game.” Logical 
reasoning as commonly understood involves 
much more than the exercises of this test would 
seem to indicate. 

Secondly, the test places less emphasis on the 
process of arriving at a logical conclusion than 
on the logical defense of the conclusion chosen. 
There are 12 “supporting” statements to be 
judged relevant or irrelevant for each conclusion 
chosen. The authors suggest that one type of be- 
havior which characterizes progress toward logi- 
cal reasoning is “ability to isolate the significant 
elements in the logical structure of an argument 
as shown by distinguishing between statements 
of ideas which are relevant and statements which 
are irrelevant for explaining why a conclusion 
follows logically from given assumptions.” That 
this ability is an essential part of logical reason- 
ing, and that it is revealed in the manner sug- 
gested above, seem to be highly questionable 
assumptions. 

It is worth mention at this point that the 
choice of a particular conclusion makes some of 
the supporting statements obviously relevant 
and others obviously irrelevant, so that very lit- 
tle analysis of the structure of the argument is 
involved. In Problem I of Form A, for example, 
conclusion X and statement 11 clearly favor one 
party to the dispute, conclusion Y and state- 
ments I, 3, 5, and 9, clearly favor the other, 
while conclusion C, and statements 2, 7, and 12, 
are indecisive. This leaves only four of the state- 
ments which are not obviously tied to one or the 
other of the conclusions. If the response to one 
item in a test determines absolutely (except for 
chance errors) the responses to any others, no 


useful purpose is served by including the others 
in the test. It is also worth mention that some of 
the "correct" conclusions can be arrived at by 
very superficial “logic.” In Problem IV of Form 
B, for example, the examinee is asked to accept 
certain statements as true. If he then indicates 
as his conclusion that one of those statements is 
true, he is credited with logically correct reason- 
ing. 

In the third place, the test does not provide a 
single score which can be interpreted as a meas- 
ure of ability to reason logically. While the data 
sheets show columns for 11 separate scores, only 
6 or 7 are obtained if only one form is used. 
Moreover, it seems obvious that some of these 
6 or 7 scores (e.g., number of right conclusions 
and number of wrong conclusions) must be 
highly correlated. Apparently the authors did 
not intend the test to provide any overall meas- 
ure of logical reasoning ability. 

The complexity of the task of interpreting 
scores is indicated by this statement from the 
manual. “The number of patterns of behavior is 
almost as great as the number of students who 
take the test. Each pattern should be considered 
as presenting a unique situation to be inter- 
preted.” Apparently the test is to be regarded 
more as a diagnostic instrument than as a tool 
for measurement, and the usefulness of the 
scores is assumed to lie in their stimulation of 
speculation rather than in the information they 
provide. It is questionable that the contents of 
the test merit the elaborate interpretation of 
scores which has been suggested. 

Finally, while norms and analytic data are 
provided, they are not adequate. The compost 
tion of the norm group appears to have been 
fortuitous rather than deliberately planned. The 
manual recommends the test only for groups 
which have “previously played the logical game, 
but no mention is made of the extent of the game 
experience of the norm group. The reliabilities 
reported for the “conclusions scores” are low, 2$ 
would be expected from the small number of 
conclusions’ involved. Reliabilities of the “rele- 
vance scores” are surprisingly high. Perhaps this 
is because the statements to be judged are very 
much alike, from exercise to exercise. Consistent 
interpretations of these statements are likely to 
lead to consistent responses regardless of the 
reasoning situation. 

As a conscientious effort to open up 4 
field of appraisal of educational outcomes, 
test deserves serious consideration. It is not 


new 
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however, recommended by this reviewer as a 
valid test of logical reasoning. 


[ 583 ] 
Survey of Study Habits, Experimental Edition. 
Grades 8-14; 1944; 1 form; mimeographed manual; 
sé per сору; 25¢ per specimen set; postage extra; non- 
timed (30) minutes; Arthur E. Traxler; Educational 
Records Bureau. * 
REFERENCES 

т. TRAxLER, ARTHUR E. “Some Results of an Experimental 
Survey of the Study Habits of Indes лы ЕЗ Pupils," 
pp. 40-9. In 1944 Fall Testing Program in Independent Schools 
and Жаран Studies. Educational Records Bulletin, No. 


2. New York: Educational Records Bureau, January 1945. 
эр. x, 49. Paper, lithotyped. * (PA 19:1812) 


Warren R. BALLER, Professor of Educational 
Psychology and Measurements, University of 
Nebraska, Lincoln, Nebraska. 

This blank consists of 85 items grouped under 
17 heads : keeping in physical condition for study, 
understanding the assignment, planning a study 
schedule, efficient finding of the necessary study 
materials, applying one’s self consistently, fixing 
material in mind, reflecting, working independ- 
ently, the prompt completion of work, persist- 
ence in overcoming difficulties, paying attention 
in class, participation in class activities, review- 
ing, memorizing, increasing vocabulary, im- 
provement of reading rate, and maintaining an 
attitude of study. The survey items are state- 
ments which do or do not characterize the pupil 
and to which he responds by checking “seldom 
or never,” “sometimes,” or “usually or always.” 

The main purpose of the device is to afford 
the pupil a means of self-analysis of his study 
habits and a basis for related counseling. This 
definition of purpose should serve as a caution 
against the perfunctory use of the blank as a 
method of rating pupils’ proficiency in study. 
Clearly the “score” which a pupil might obtain 
from this process of self-estimation would, at 
best, be a rough index of the satisfactoriness of 
his habits of study. Norms for grades 7 through 
12 are available, however, and are based upon 
the use of the survey on a population of inde- 
pendent school pupils. 

The author reports a split half reliability co- 
efficient of approximately .91 for pupil responses 
at two different grade levels. But, quite appro- 
priately, the point is made in the manual that 
this correlation reflects the internal consistency 
of the survey blank rather than the consistency 
of response on the part of pupils who filled out 
the blank. No repetition of the blank with the 
same group of pupils has yet been reported. 

Correlations between pupil response to the 
blank and such criteria as achievement test per- 


TESTS & REVIEWS: READING—STUDY SKILLS 


585 


formance, school marks, and teacher ratings of 
study habits make the validity of the instrument 
seem low. Median coefficients for these several 
correlations ranged from .14 to .24. (In some 
instances the correlations were as high as .42 
and statistically significant.) It must be ob- 
served, however, that evidence of validity thus 
obtained is inherently capricious. For one thing, 
one would be over credulous to accept self-esti- 
mates as consistently accurate. For example, not 
every pupil with poor study habits would be 
likely to appraise his faults well, and not every 
pupil with good habits would know the extent 
and degree of his good behavior. Difficulties of 
this sort with the determination of validity are 
not unique to the Survey of Study Habits. Tt 
would be interesting to learn what would result 
—in line with this method of determining valid- 
ity—from a carefully planned briefing of pupils 
about how most effectively to evaluate their 
study habits. 

It is probably unfair to the Survey to place 
emphasis upon such tests of validity. The im- 
portant question is whether the blank gets at 
the essentials of effective study and thus guides 
the counseling process along profitable lines. 
The practices which generally are given major 
attention in organized plans for the improvement 
of study habits seem to be well represented in 
the content of the blank. There are reasons for 
believing that the blank might prove even more 
effective for follow-up teaching and counseling 
were the headings, above the classes of items, 
fewer in number and better adapted to repetitive 
emphasis which teachers give to main ideas in 
this area of instruction. 


[ 584] 
*Test of Critical Thinking. Grades 7-9; 1951; 7 
scores: inquiry, interest, relationships, openminded- 
ness, generalizations, accuracy, total; 1 form ['51]; 
mimeographed; no data on reliability and validity; no 
norms for part scores; tentative norms; $8 per 100, 
postage extra; test may be reproduced by users; 256 
per specimen set (specimen set must be purchased to 
obtain manual), postpaid; nontimed (45-50) minutes ; 


M. T. Macy and Hugh B. Wood; University of Ore- 
gon Press, * 


[585] 
*Test on the Use of Books and Libraries: Gen- 
eral Education Series. Grades 7-12; 1939-50; a re- 
vision of Test on the Use of Books and Libraries: Test 
7.3 (39) ; original test published by Evaluation in the 
Eight Year Study, Progressive Education Association 
8 scores: the parts of a book, use of encyclopedias, use 
of the dictionary, sources of information, use of an in- 
dex, use of the library card catalog, use of the Readers" 
Guide to Periodical Literature, total; Forms A (50), 
B ('50) ; no data on reliability and validity ; no norms ; 
manual ('50); $2.50 per 25; separate answer sheets 
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must be used; 75€ per 25 answer sheets; 20€ per set 
of scoring stencils ; cash orders postpaid; 50€ per speci- 
men set, postpaid; nontimed (60) minutes; Evaluation 
Staff (Ralph W. Tyler, Director) of the Eight Year 
Study of the Progressive Education Association; pub- 
lished in 1950 by Cooperative Test Division, Educa- 
tional Testing Service. * 


REFERENCES 

т. Ѕмітн, Evcene R.; Tyrer, RALPH W.; AND THE EVALUA- 
TION STAFF, Apraptsing onadva coraig Student TELA Ad- 
venture in American Éducation, Vol. III. New York: Harper 
& Brothers, 1942. Pp. xxiii, 550. * (PA 16:5033) 

Henry D. RiNsLAND, Professor of Education, 
The University of Oklahoma, Norman, Okla- 
homa. 

This test is designed to measure a very im- 
portant objective in education for the fulfillment 
of which provision is not made in the usual high 
school subject-matter course—knowledge as to 
the proper use of books and libraries. Since the 
test is being issued in a pilot edition and statisti- 
cal data are not available to show how well it 
actually achieves its purpose, it would seem to 
be in order to discuss here certain aspects of its 
face validity as they impress the reviewer. 

The items based on information about the parts 
of a book, Part I, are well chosen, except that 
sometimes the author’s full name is not given on 
the title page (Item 1, Form B) and the table 
of contents does not always contain an outline 
of the book (Item 2, Form B). 

In Part II, Use of Encyclopedias, the student 
is asked to give the number of the volume in 
which he would look for information about a 
particular topic. For Item 20, Form B, the topic 
“The cause of thunder” is undoubtedly to be 
found under “Thunder”; but Volume 4 ends 
with the word “Thrush,” and Volume 5 begins 
with the word “Thyrm,” so there is no 
"Thunder" in the model encyclopedia. In this 
part an opportunity has been missed to discover 
whether students are aware that the alphabetical 
arrangement of topics differs with different en- 
cyclopedias—the Encyclopedia Americana, for 
example, using “word by word” order and the 
Encyclopaedia Britannica and the New Inter- 
national Encyclopedia listing words in "letter by 
letter” order. It is easy for a student to overlook 
a topic in an encyclopedia if he does not know 
the type of alphabetizing used. 

In Part III, Use of the Dictionary, the direc- 
tions should state that the five choices are not 
appropriate to Funk and Wagnalls New Stand- 
ard Dictionary or Webster's Collegiate Diction- 
ary, so that there will be no misunderstanding 
that these items primarily measure ability to 
use Websters New International Dictionary. 
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In Part IV, Sources of Information, the style 
of underlining the names of books, as, for ex. 
ample, Who’s Who in American Education, is 
not acceptable. These titles should be printed in j 
italics or enclosed in quotation marks. In this | 
part also, the reviewer does not favor the use of 
such expressions as "Ask an automobile sales- - 
man” or “Ask your druggist” as possible choices 
when what is needed is some real authority, 

In Part VI, Use of the Library Card Catalog, 
the “word by word” method of alphabetizing 
should be emphasized since card indices in li- 
braries, including the Congressional Library, are 
listed in "word by word" order—íor example, 
East Moline, Illinois; East Orange; Easter; 
Eaton. 

In spite of the specific shortcomings men- 
tioned, this test has much to offer. In itself, the 
test represents an excellent outline for a unit on 
the use of books and libraries. As a test, it can 
be used equally well as a measure of achievement 
in this area of knowledge and for diagnostic pur- 
poses; in the latter connection, tabulations of 
items missed might serve as the basis for effec- 
tive follow-up lessons. In reissuing this test, the 
Cooperative Test Division is, in the opinion of 
the reviewer, making a valuable contribution to 
measurement in a field where little that is really 
worthwhile has been done to date. 


Lours Ѕнокеѕ, Dean, School of Library Traim- 
ing and Service, Florida State University, Talla- 
hassee, Florida. 1 
The 107 questions in this test are distributed 
as follows: 10 on the parts of a book, 10 on the 
use of encyclopedias, 20 on the use of diction- 
aries, 25 on other reference books, 14 on the use 
of indexes, 14 on the use of card catalogs, and 
14 on the use of the Readers’ Guide. Items on the 
use of encyclopedias are matching items; all 
others are multiple choice items. Since this test 
has long been out of print and since, despite 
some changes, the revision is “substantially the 
same” as the original form, the publishers Ш- 
vite suggestions. : 
Part I deals quite effectively with the five 
basic parts of a book, but it neglects preface, ui 
troduction, title-page verso (important 1n Ks 
nection with Item 4 (both forms) dealing W! 
the date), and even the use of running heats: 
Furthermore, there is nothing about the car 
of the physical book, basic in high school library 
instruction. f 10 
Part II, on the encyclopedia, provides 
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questions on one technique of encyclopedia use 
— guide words on the backs of the volumes. One 
or two questions would probably suffice for that 
point. Several questions are needed to test long 
article indexes versus short article cross refer- 
ences, “dummy” and “see also" references, kinds 
of illustrations, alphabetizing, and guide words. 

Part III is confined to Webster’s New Inter- 
national Dictionary and misses an opportunity 
to test dictionary contrasts, especially since most 
school libraries have Funk and Wagnalls New 
Standard Dictionary and several good abridged 
dictionaries, some especially designed for high 
school. Items like 34, 35, and 37, which in both 
forms are concerned with proper nouns, tend to 
confuse the pupils, since not all dictionaries pro- 
vide separate alphabets for persons and places. 
The very important comparison of definition 
sequence is omitted entirely, yet high school stu- 
dents are frequently misled to believe that the 
first definition given is always the most im- 
portant. 

Part IV, on sources of information, needs 
typographical attention. Underlining of titles 
instead of italicizing or enclosing in quotation 
marks gives improper visual emphasis to some 
possible answers. The reference sources selected 
need to be reviewed in the light of newer school 
reference books. 

In Part V, there is an error of detail: for ex- 
ample, ine Form A the index entry for “oil 
gushers” is given as “510-12,” but the best pos- 
sible answer to Item 66 is given as “510 to 512." 
This type of error is repeated in several items in 
both forms. 

The above examples indicate that review of 
details is needed. But even more important is 
a reconsideration of what constitutes library 
aptitude. Unquestionably, the skills necessary to 
use print and audiovisual materials are becom- 
ing an increasingly important aspect of student 
survival in an educational world of growing 
emphasis on many and varied instructional 
sources. The old-fashioned library lessons are 
probably as good a place as any to start, but cer- 
tainly no point to stop. In terms of the conven- 
tional unit on how to use a library, this test is 
fairly adequate, but from the standpoint of criti- 
cal literacy something more is needed. 


[586] 
*The Use of Library and Study Materials. Grades 
9-16; 1930-41; 3 scores: findin information, inter- 
preting information, total; IBM; Forms А ('40— 
puis as test copyrighted in 1939), В (40); norms only 
or entering college freshmen on Form A; manua 
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(41); $2.50 per 30; 256 per specimen set; separate 
answer sheets may be used; 144¢ per IBM answer 
sheet; 25¢ per stencil for machine scoring of answer 
sheets; postpaid; 40(50) minutes; Mary S. Kirkpat- 


rick, Lola Rivers Thompson, and Helen Tomlinson; 
Steck Co. * 


REFERENCES 
т. Уотлу, Davin Е. “А Comparison of Test Scores of En- 


tering College Freshmen_as Instruments for Predicting Subse- 
quent Scholarship.” J Ed Res 40:215-8 N '46. * i: 
1304) 

Rogert Murray Bear, Professor of Psychol- 
оду, and. Director of the Reading Clinic, Dart- 
mouth College, Hanover, New Hampshire. 

This test was designed to help teachers ap- 
praise student skill in the use of library and 
study materials. Since in the present state of 
test development in this field, it is easier to 
build items that check knowledge rather than 
the use of that knowledge, it is not surprising to 
find heavy emphasis upon the former in Part I, 
which concerns location of information. To a 
greater extent, use of knowledge is required in 
Part II, which is intended to test "ability to un- 
derstand the material once it has been found." 
The 88 items of the test are all of the multiple 
choice type. The wide grade range has resulted 
in the inclusion of some tasks too difficult for 
many high school pupils and some references too 
little used or known by them, while for college 
students intensiveness of questioning is wanting 
at points and desirable items are omitted. 

The items in Part I place more emphasis 
upon location of information in dictionaries, 
magazines, and reference books; classification 
and the card catalog ; and using books. Too few 
of the 45 items in this part are devoted to each 
of these groupings to enable the teacher to 
analyze reliably the student's weaknesses, and 
there seems to be a disproportionate number of 
quite similar items requiring alphabetizing. 

Almost half of the items of Part II belong in 
the general area of work-study skills. While li- 
brarians fortunately have a lively interest in the 
improvement of these skills, the emphasis given 
them reduces the amount of time available for 
the more specialized library usage items. Under 
the study skills category are some useful items 
dealing with reading a population growth table, 
interpreting a pie graph, locating items in the 
index of a book, reading a map, understanding a 
bar graph, and interpreting а time zone map of 
the United States. Of the more specialized sec- 
tions, three comprise one third of the items in 
this part: the card catalog, the periodical index, 
and abbreviations found in books. Included with 
the latter are two general vocabulary items in 
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science test. Among these may be mentioned 
Items 2, 4, 7, 9, 24, and 46 in Form DM and 
Items 1, 3, 6, 34, and 38 in Form EM. Item 25 
in Form DM and Item 23 in Form EM are 
identical except for the order of listing of the 
three choices. Item 42 in Form DM seems to 
have escaped needed editorial attention. The 
"male tanager" and the "steam engine" items 
which were singled out for particular criticism 
in Preston's review of the battery editions (see 
3:18) are, happily, absent from the forms un- 
der consideration. | 

Finally, the great majority of items involve 
factual information and vocabulary; few seem 
to involve judgment, reasoning, or problem solv- 
ing. Of course, it is difficult to see how a 70-item 
test given in a r5-minute working time limit 
could be otherwise. 


Paur E. КАмВгу, Professor of Education, Uni- 
versity of Oregon, Eugene, Oregon. 

Each of the forms of this elementary science 
test contains 70 items selected from items orig- 
inally included in Forms D and E of the battery 
edition of the Stanford Achievement Test. The 
science sections of the regular edition contain 
100 items, each with 20 items common to both 
forms. These separate tests, therefore, actually 
contain seven ninths of the items in the original 
tests. The norms were established on the basis 
of determining the equivalence of Forms D and 
E and Forms DM and EM. Comparability of 
Forms DM and EM has also been established 
on the basis of experimentation. 

According to the authors, these forms were 
prepared to meet the needs of those who wish 
to use a separate answer sheet. Any choice be- 
tween Forms D and E and Forms DM and EM 
should be made largely on this basis. If the va- 
lidity and reliability of the original forms were 
determined by adequate research, as they seem 
to have been, then Forms DM and EM may be 
regarded as equally valid and reliable. Raw 
scores on Forms DM and EM can be converted 
to equated scores by means of tables appearing 
in the margin of each answer sheet. These 
equated scores are also comparable with equated 
scores on the regular edition. 

The 70 items in each form are all of the multi- 
ple choice variety. Most of the items are incom- 
plete statements, but there are a few direct ques- 
tions. Each item has three possible answers. It 
might have been desirable to use four or five 
answers and thereby make it unnecessary to cor- 
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rect for guessing by subtracting one half the 
number of wrong answers from the number of 
right answers. 

Usually, the keyed answer to an item is ob- 
viously the best. Of the 140 items included їп 
these forms, this reviewer questions only four 
on the basis of factual accuracy. One question 
concerning the cause of tides includes pull of 
the moon and sun and rotation of the earth as 
two possible answers. Another question when 
completed “correctly” reads, “Humus is soil 
found chiefly under trees," One cannot help won- 
dering how a boy from the corn belt would react 
to such a statement. 

The items are all highly factual and do not 
test for understanding or for problem solving 
abilities. However, if a test of factual knowledge 
of elementary science is desired, these tests are 
good. 


[594] 

*Test of Application of Principles in Physical 
Science: General Education Series. High school; 
1940-50; a revision of Application of Principles tn 
Physical Sciences: Test 1.34 (’40) which was a revi- 
sion of Tests 1.31 and 1.32 ('38); original tests pub- 
lished by Evaluation in the Eight Year Study, Pro- 
gressive Education Association; 1 form, '50; no data 
on reliability and validity; no norms; manual ('50); 
$2.50 per 25; separate answer sheets must be used; € f 
per 25 answer sheets; 40¢ per set of scoring stencils; 
cash orders postpaid; 50¢ per specimen set, postpaid; 
nontimed (40) minutes; Evaluation Staff (Ralph W. 
Tyler, Director) of the Eight Year Study of the A 
gressive Education Association; published in 1950 by 
Cooperative Test Division, Educational Testing Serv- 
ice. 


REFERENCES 


B 5 ive 

1. Клтиз, Louis Е, Application of Principles. Progressivi 
Education Association, Evaluation in the Eight Year Eur 
Bulletin No. s. Chicago, Ill.: the Study, December 1936. Pp. 

aper, mimeographed. 3; 

E Evaluation. Materials Developed for Various des 0 
Thinking. Chicago, Ill.: Evaluation in the Eight "ne taal, 
Brogressiee Education. Association, March 1939. РР. 

aper, mimeographed, Я 

ti Symm, ITE R.; DIM GS Nam mar 
TION STAFF, Appraising and Recording Stude: P Harper 
venture in American Education, Vol. III. New York: Harpe 
& Brothers, 1942. Pp. xxiii, sso. * (РА 16:5033) 


Pater О. Jounson, Professor of Education, 
University of Minnesota, Minneapolis, Min- 
nesota. , Wi 
Although this attempt to measure the pic 
to apply principles is laudable, much refinemen 
would be required before the test could product 
generally useful results. In the first placa 
adequate consideration has been given to 
problem of sampling physical principles: p 
five commonly used principles constitute t i 
test, and the assumption that the test situation 
are new to students in high school physics 18 T 
batable. No measures of absolute or relative 1 
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liability are presented. No normative data are 
available for this form of the test. No considera- 
tion is given to the validity of the test unless one 
accepts an operational definition of application. 
Furthermore, the method of scoring and inter- 
pretation seems unnecessarily complicated, not 
sufficiently objective, and, in some instances, 
quite arbitrary. 


[595] 
*A Test of General Proficiency in the Field of 
Natural Sciences: Cooperative General Achieve- 
ment Tests, Revised Series, Test II. Grades 10-12 
and college entrants; 1940-51; 3 scores: terms and 
concepts, comprehension and interpretation, total; 
IBM; Forms X (47), Y C49) Z (51); Forms OR, 
S, T out of print; no specific manual; no norms for 
part scores; general battery folder ('51); general 
Cooperative manual (’51) ; norms ['41] ; $2.50 per 25; 
50 per specimen set, postpaid; separate answer sheets 
may be used; 80¢ per 25 IBM answer sheets; 15 per 
stencil for scoring answer sheets; cash orders post- 
paid; 40 (43) minutes; Paul J. Burke (X), Carl A. 
Pearson (Y), and John G. Zimmerman (Z) ; Cooper- 
ative Test Division, Educational Testing Service. * 
REFERENCES 


1, Herret, J, P., лхо DiVesta, FRANCIS J. “Ап Evaluation 
of Five Factors for Predicting the Success of Students Enter- 
ing the New York State College of Agriculture." Ed & Psychol 
Meas 8:389-95 au '48. * (PA 23:4412) 1 


For а review by Paul L. Dressel of Forms Т, 
X, Y, and QR of the complete battery, see 5; 
for a review by Palmer O. Johnson of Forms 5, 
T, and X, see 3:548; for a review by John V. 
McQuitty of Forms S and T of the complete 
battery, see 3:3. 
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[596] 

900: Achievement Examinations for Sec- 
ondary schools. High school; 1951; І form; no data 
on reliability and validity; no manual; Minnesota 
norms (median and quartile deviation) available ; sim- 
ilar norms for other regions by special arrangement 
with publisher; 7€ per test, postage extra; 60(65) min- 
utes; edited by Walter W. Cook; prepared by a cur- 
ticulum committee of high school teachers for use in 
the Minnesota State Board Achievement Examina- 
tions Program; Educational Test Bureau, Educational 
Publishers, Inc. * 


[597] 
"Biology: Every Pupil Test. High school; 1935- 
51; new form usually published each April and Decem- 
г; form December 1951; no data on reliability and 
validity; no manual; norms (51); 2%# per test; 1¢ 
Š answer key; postpaid; 40(45) minutes; Ohio 
iM larship Tests, Ohio State Department of Educa- 


ү [598] 
aBlolo y Test: State High School Tests for In- 
lana. First, second semesters high school; 1934-50; 
2 levels; mimeographed; no data on reliability and 
Validity; no manual; 4366 per test; 15¢ per specimen 
set; postpaid; 40(45) minutes; Avis Risk and D. № 
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Elliott; State High School Testing Service for In- 
diana, Purdue University. * 
а) FIRST SEMESTER. 1934-48; Form B ['48]; Form A 
out of print; no norms. 
b) SECOND SEMESTER. 1934-50; Form О [48]; Form N 
out of print; norms ['50]. 

[599] 
*Biology: 20th Century Test. 1, 2 semesters high 
school; 1948; 2 levels; Form A; no data on reliability 
and validity; no norms—author recommends the use 
of local norms; 10 or more copies, 5¢ each, postage 
extra; 35€ per specimen set, postpaid; 40(45) minutes ; 
Benton Review Publishing Co., Inc. * 

[600] 
*College Entrance Examination Board Achieve- 
ment Test in Biology. Candidates for college en- 
trance; 1915-51; available only in College Entrance 
Examination Board Admissions Testing Program (see 
526); 60(70) minutes; prepared by College Entrance 
Examination Board Committee of Examiners in Biol- 
ogy in cooperation with the Staff of Educational Test- 
ing Service: 1951 membership: Charles Tanzer 
(Chairman), Elso S. Barghoorn, Rhoda Garrison, 
Herbert C. Knutson, and Frank Trevor; program ad- 
ministered by Educational Testing Service for the Col- 
lege Entrance Examination Board. * 


Crank W. Horton, Consultant in Educational 
Research, Dartmouth. College, Hanover, New 
Hampshire. [Review of Forms VAC2, WAC2, 
ХАС, Y AC, and ZAC.] 

Since this series of tests is available only in 
the CEEB program, the purpose of a public re- 
view must be to inform teachers whose students 
take the tests, and college officials who use the 
scores, whether the tests are in fact good tests 
of achievement in biology. This assurance can 
be given. The tests are excellent measures of a 
candidate's mastery of the facts and principles 
of biology as it is traditionally taught in the 
high school and also at the beginning college 
level. 

Five forms of the test were made available for 
review. The form designations and copyright 
dates of these were: VAC2, 1947 ; WAC2, 1948; 
XAC, 1949; YAC, 1950; and ZAC, 1951. A 
reviewer's manual covers in a comprehensive 
manner the procedures followed in making the 
tests, and gives detailed test analysis data on 
four of the above forms. This reviewer "took" 
all of the above tests as a candidate would take 
them, compared his answers with the keys, and 
studied the manual. 

The tests range in length from 175 numbered 
items (190 scorable responses) in Form VAC2 
to 145 items (160 scorable responses) in Form 
ZAC, to be done in 60 minutes of working time. 
Even so, the tests are not highly speeded; the 
tendency is to reduce speededness in later forms. 
The test analysis data show that, depending on 
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the form, about 75 to 85 per cent of the College 
Board candidates work through to the last item, 
and practically all work through three-quarters 
of the test. The average candidate answers about 
half of the items correctly. 

The distribution of question difficulty and the 
wide spread of content are such that practically 
no candidate achieves within 15 to 30 items— 
again depending on the form—of a perfect score, 
and few do so poorly that they fall into the 
chance score range at the bottom. In short, there 
is plenty of room to demonstrate superior knowl- 
edge, and there need be no suspicion that a sub- 
stantial score was obtained by chance alone. 
Candidates who fall in the chance score range 
at the bottom will in general have standard rat- 
ings below 300. Those who achieve 700 will have 
given correct answers to roughly 75 to 80 per 
cent of the items. 

The knowledge sampled by the tests is broad 
and comprehensive, ranging over the entire 
scope of plant and animal materials commonly 
included in high school courses. Such aspects as 
knowledge of structures and their functions, 
meaning of technical terms, knowledge of im- 
portant facts and principles, and understanding 
of a host of interrelations and dependencies ap- 
pear to be fairly and thoroughly sampled through 
the use of a variety of item types which have 
changed some, but not very much, over the years. 
Can a candidate be “prepared” for the test? 
Definitely yes—by teaching him an enormous 
amount of biology. 

The test items themselves are very good. They 
are short, clear, penetrating, defensible. In the 
five test forms studied, involving roughly 9oo 
scorable responses, this reviewer would protest 
perhaps less than a dozen as involving an am- 
biguity which made the keyed response not fully 
defensible. 

Reliabilities reported for the several forms are 
consistently .93 or .94. Their mean item validi- 
ties (biserial r) lie in the range .35 to .40. Great 
care is taken to control difficulty from form to 
form and to stabilize the meaning of the stand- 
ard ratings into which raw scores are converted 
for reporting. Persons who understand in gen- 
eral the nature of the College Board standard 
rating scale sometimes ask, “Since the score of 
an individual is relative to the performance of 
the group taking the test, does its meaning vary 
from time to time as the quality of candidates 
varies?" Some extracts from the manual may be 
of general interest in this connection. 
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The standard rating scale....is defined in terms of an 
idealized typical or standard group of CEEB candi- 
dates....so defined that if it were to take any one of the 
tests the standard ratings would have a mean of 500 
and a standard deviation of тоо. * It is necessary to 
define a time of year as standard * March or April, 
{һе {те of the major test series has been so defined * 
several different samples of candidates have been used 
to standardize the scales on several key tests * [In its 
standardization,] any particular achievement test may 
be taken by a sample known to be biased with respect 
to the typical group of candidates. In such a case cor- 
rections for observed atypicalities in aptitude are made 
„using the two parts of the aptitude test as the equat- 
ing variables * Major emphasis is placed on achieving 
continuity of existing scales by equating scores on suc- 
cessive forms of the test * Two methods are in use * 
The part-score method utilizes data on a sub-test which 
is common to both the new form of the test and an old 
form. The item difficulty method utilizes data (delta 
values) on selected items which are common to both 
the new form of the test and an old form. 


The most serious criticism that can be leveled 
at these tests at this time is the charge that they 
are too heavy with detailed factual material, Ac- 
cording to the ETS, the Biology Committee is 
aware of this, and is seeking “ways to cut across 
subject matter divisions and to test the candi- 
date’s understanding of basic principles and his 
ability to apply his biological knowledge to new 
situations.” Certainly the techniques used in the 
construction of the tests, and the statistical con- 
trols exercised to make them sound measuring 
instruments and to insure stability of the stand- 
ard ratings, appear to offer little room for star- 
tling improvement. Possible advance appears to 
lie in the direction of (a) realistic assessment of 
the value that measures of this type now have 
for college officers who presumably use them in 
selection, placement, and guidance ; and (5) de- 
termination of whether a measure of somewhat 
different aspects of achievement might not prove 
more useful. This problem also is recognized. 
Again, according to the manual, “it is reasonable 
to expect that in the next few years the develop- 
ment of the Board biology test will be accel- 
erated in the direction of an achievement sat- 
urated aptitude test.” 

[ 601] 
*Cooperative Biology Test. High School; 1933" 
51; IBM; Forms X (47), Y (748); Forms 1033-97 
N-Q, S, out of print; по data on validity; no spec! 
manual; general Cooperative manual ('51); norms 


; d; 
['37]; $2.50 per 25; 50¢ per specimen set, postpal 
separate answer sheets may be used; 80€ per 25 TBM 
answer sheets; 15@ per stencil for scoring ans É 
sheets; cash orders postpaid; 40(45) minutes; Peru 
Kambly ; Cooperative Test Division, Educational 


ing Service. ‹ 
Letanp P. Јонмѕом, Professor of Biology: 
Drake University, Des Moines, Iowa. [Review 
of Forms Q, S, X, and Y.] 
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Test Forms Q and S are good examples of 
recent subject matter examinations. Form Q is 
composed of 120-items. In each item the exami- 
nee selects the best answer from five choices. Ap- 
proximately one third of the items test applica- 
tion of biological information. Of the remaining 
two thirds, basically factual, about 30 per cent 
deal with the identification of representative 
drawings and 10 per cent test with terminology 
per se. Form S is composed of ттт items similar 
to Form Q. Twenty to 25 per cent of the items 
test application of biological information and the 
interpretation of materials. The remaining items 
test primarily factual material of which approxi- 
mately 15 per cent test terminology. 

Parts I of Forms X and Y are structured 
similarly to Forms Q and S with the traditional 
five-choice type of item. Parts II deal with a 
number of reading passages and their interpre- 
tation. Forms X and Y possess 85 and 89 items 
respectively. The forms appear to be equivalent. 
Approximately 50 per cent of the items demand 
application of biological information or the in- 
terpretation of materials. Of the remaining items, 
approximately five per cent test terminology 
alone. 

The factual material tested in all forms is per- 
tinent to a basic understanding of biology. The 
tests have been well edited and the student should 
have no difficulty in following directions easily. 
While most high school texts discuss osmosis as 
used in Item 40 of Form X, one must seriously 
question choices 3 and 5 as illustrating the prin- 
ciple of osmosis. 

Data concerning the test objectives, construc- 
tion, and validity studies are inadequate. The 
general manual does not contain enough infor- 
mation for one planning to use the examination 
conscientiously, and a specific manual is not 
available for the test. Additional information 
in such a manual should include: upon what 
bases were the questions originally prepared and 
later retained, what are the item difficulty values 
and validity indices, what was the population on 
which the original testing data were obtained, 
how was the test validated, what are the relia- 
bility coefficients and how were they determined. 
'The information which is included in the gen- 
eral Cooperative manual is explicit and well 
presented. 

Raw scores can be converted to scaled scores 
and percentile ranks. The scoring formula is 
R — W/a4. The average raw scores of an indi- 
vidual in tenth grade at the end of one year of 
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biology are 50-51, 29-30, 14, and 25-27 re- 
spectively for test Forms Q, S, X, and Y. This 
causes one to question the difficulty or length 
of the examination for evaluation of students at 
this level. Norms for roth, 11th, and 12th grade 
individuals could well be included in the test 
data. It is also questionable if the present forms 
are equivalent. Forms Q and S are structured 
similarly as are Forms X and Y. Emphases vary 
from test to test. This also appears in the raw 
scores of students. 

All forms of the examination test basic bio- 
logical information and possess adequate cov- 
erage of the field. Forms Q and S are predomi- 
nantly subject matter tests with emphasis upon 
factual material. Forms X and Y appear to be 
comparable and emphasize equally factual mat- 
ter and the application of information and the 
interpretation of data. These forms of the exami- 
nation could well perform a valuable function in 
focusing attention to aspects of teaching so often 
overlooked by the initiate or subject minded 
teacher. 

While the examination and the general manual 
are well prepared and give excellent assistance 
to the user, additional data is essential concern- 
ing the manner by which the examination is pre- 
pared and validated. 


For a review by C. W. Horton of Forms P, 
О, and S, see 3:550; for a review by Ralph W. 
Tyler of Form P, see 40:1585; for reviews by 
Francis D. Curtis of Form 1937 and George W. 
Hunter of Form N, see 38:907. 
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*Cooperative Biology Test: Educational Rec- 
ords Bureau Edition. High school; 1941-51; Forms 
ERB-RX V ERB-SX (40), ERB-IX (о); 
Forms ERB-R, ERB-S, ERB-T out of print; no data 
on reliability and validity with test (for data presented 
elsewhere by the publisher, see 1-2 below) ; no specific 
manual; general Cooperative manual ('51); Form 
ERB-RX norms (51); Form ERB-SX norms (749), 
Form ERB-TX norms (’50) ; $2.50 per 25; 256 per 
specimen set; postpaid ; 4o (45) minutes; the Commit- 
tee on Biology Tests of the ducational Records Bu- 
reau (Thomas F. Morrison, Chairman) ; Educational 
Records Bureau. * 


REFERENCES 


т. TraxLer, AxrHUR E. “Some Data on the Difficulty and 
Validity of (һе Cooperative Tests in Biology, Chemistry, and 
Physics, Form ERB-R,” pp. 64-8. In 1941 ‘Achievement Test- 
ing Program in Independent Schools and 
Studies, Educational Records Bulletin, No, 33. 
Fdncations! Records Bureau, June 1941. Pp. x, 76. Paper, 
ithot А 2 
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[603] 
*General Biology Test: National Achievement 
Test. High school; 1951; 4 scores: uses-processes- 
results, biologists, miscellaneous facts, total; Form А; 
no data on reliability and validity; no norms ; $2.50 per 
25; 50€ per specimen set; postage extra; 35(40) min- 
utes; Lester D. Crow and James G. Murray; Acorn 
Publishing Co. * 

[604] 
*Graduate Record Examinations Advanced Biol- 
ogy Test. Senior year college through graduate school 
and candidates for graduate school; 1939-51; available 
only in Graduate Record Examinations programs (see 
527); 180(220) minutes; prepared by the Advanced 
Biology Committee appointed by Educational Testing 
Service: Clarence H. Nelson (Chairman), Eric G. 
Ball, J. H. Bodine, F. L. Fitzpatrick, and Conway 
Zirkle; Educational Testing Service. * 

[605] 
*Nelson Biology Test: Evaluation and Adjust- 
ment Series. Grades 9-13; 1951-52, c1950-51; IBM; 
Forms AM (51), BM ('52) ; manual (’51) ; $2.50 per 
25; separate answer sheets must be used; 80¢ per 25 
IBM answer sheets; postage extra; 356 per specimen 
set, postpaid; 40(50) minutes; Clarence H. Nelson; 
World Book Co. * 


CLARK W. Horton, Consultant in Educational 
Research, Dartmouth College, Hanover, New 
Hampshire. 

Designed for use at the end of a course in 
high school biology, this test appears to be a 
very good one for the purpose. One of its virtues 
is described in the manual as follows : “This test 
has been developed primarily to measure under- 
standing and the ability to apply knowledge in 
the interpretation of situations and the solution 
of problems. Testing of ability to recall minute, 
isolated facts has been minimized. Rather, the 
student is given an opportunity to demonstrate 
how well he can discern relationships between 
what he has learned and the world of living 
things which he encounters every day.” 

Each of the two forms consists of 75 multiple 
choice items in an 8-page booklet, with the 
first page devoted to directions and sample ques- 
tions. Good type size, good paper, open spacing 
of the items, and good line drawings make the 
material easy to read and generally attractive. 

The questions are, for the most part, unusu- 
ally good with respect to clarity, directness, 
freedom from ambiguity, and the extent to which 
they require application of knowledge to com- 
mon situations. The coverage is good both in 
content and objectives sampled. A few items are 
open to criticism if one assumes that the stem 
of a completion item plus the option keyed as 
correct ought to constitute a true statement. In 
Form BM we find, “Wheat rust can be elimi- 
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nated best by getting rid of all common barberry 
bushes in the vicinity,” which certainly implies 
that wheat rust can be eliminated in an area by 
this means. Use of “controlled” or “reduced” 
would have been closer to the truth. An item in 
Form AM reads, “Plants which are saprophytes 
are distinguished from all other kinds of plants 
by their lack of chlorophyll.” This simply is not 
true; one cannot tell saprophytes from parasites 
in that way. Although they are regrettable, such 
slips will not affect the validity of the test at 
the level for which it is intended ; the possessor 
of knowledge has no difficulty in selecting the 
best option in such items. 

Тһе manual deserves high praise. The author 
has shown commendable concern for the needs 
of the test user. He gives the facts necessary for 
a critical evaluation of his methods, does not 
overstate his case or make false claims, and in 
general proceeds in striking contrast to some 
test authors of an earlier day who were at times 
careless about such matters. Moreover, he takes 
care to inform the reader about such points as 
"measurement error," the difficulty of demon- 
strating a normative group to be representative 
of a "hypothetical national school population," 
the value of standard scores, the desirability of 
supplementary testing in relation to objectives 
emphasized locally, etc. ; and he tries throughout 
to foster understanding leading to sensible test 
use and score interpretation. Teachers who 
know little about the technical aspects of test 
construction can read the manual with profit. 

The methodology and statistical treatment ap- 
pear to be sound. Mean item validities for the 
two forms are given as .43 and .44 (Flanagan 
method); mean item difficulties as 5o and 49 
per cent success. A detailed list of item difficulty 
values is given for those who wish to compare 
them with local results. Corrected split half re- 
liabilities of .87 and .88 and an alternate form 
reliability of .77 are reported. According to the 
manual, the test is essentially a power test. The 
standard error of measurement is given as 43 
standard score points and its interpretation 15 
explained. Care was taken to make the forms 
strictly equivalent from the standpoint of con- 


‘tent and objectives as well as from the stand- 


point of the statistical characteristics of the 
items. The normative group consisted of "4993 
students in 63 schools representing 27 states 
throughout the country." ` 
As in other tests of this series, raw scores are 
converted to a system of “normalized standard 
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scores” which “for all tests in the series are com- 
parable in the sense of being derived according 
to a uniform method.” Raw scores were equated 
to the median and standard deviation of the 
Terman-McNemar IQ's of the standardizing 
group to produce a system of scores that look 
like IQ's. "The standard scores....permit ready 
comparison of achievement with mental ability 
level. * Thus, a direct comparison of the student's 
standard score on the test with his Terman-Mc- 
Nemar IQ will indicate the direction and extent 
of the difference between his ability and achieve- 
ment in this field.” This feature adds greatly to 
the usefulness of the tests in this series. 


Letanp P. Јонмѕом, Professor of Biology, 
Drake University, Des Moines, Iowa. 

Two fundamental questions should be asked 
of an evaluation instrument : What does the test 
attempt to measure? Does it measure what is 
designated as the objectives of the examination? 
The objectives for the Nelson Biology Test as 
stated in the manual are: 

This test has been developed primarily to measure 
understanding and the ability to apply knowledge in 
the interpretation of situations and the solution of 
problema, Testing of ability to recall minute, isolated 
acts has been minimized. Rather, the student is given 
an opportunity to demonstrate how well he can discern 
relationships between what he has learned and the 
world of living things which he encounters every day. 
In addition specific aims of the test are listed 
and example test items from both Forms А and 
B which demonstrate these objectives are in- 
cluded. These are most helpful for anyone seek- 
ing more than superficial information concern- 
ing the examination. 

The examination is composed of 75 questions, 
of which approximately two thirds involve the 
application of information or interpretation of 
materials. Both forms have broad coverage of 
subject matter which is not necessarily essen- 
tial but which undoubtedly will meet the re- 
quirements of many subject centered teachers. 
The examiner is avowedly aiming at a relatively 
high level of achievement often lacking in many 
courses in biology. Items measuring only termi- 
nology are practically nonexistent in the exami- 
nation, yet the student must have a thorough 
understanding of a basic biological vocabulary 
to do well. There appears to have been a con- 
scientious effort to minimize terms wherever 
possible. Directions to students are concise and 
for the most part easily followed. A more opaque 
paper would make reading easier, since printing 
on the opposite side of the page shows through. 
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Very few questions fail to withstand the sever- 
est criticism. Item 18, Form A has two correct 
choices, 1 and 3; this is likely to make the selec- 
tion of the best answer difficult for top students. 
Also, some biologists may object to the use of 
the word sex in connection with style and stigma 
in Item 28 of the same form. 

Both forms are well adjusted for coverage 
and appear to be highly comparable without aid 
of a statistical analysis. The manual includes a 
traditional statistical analysis and contains, in 
addition, item difficulty values for each item in 
both forms of the examination. It would be of 
interest to most teachers to have included item- 
by-item validity indices. Also, it would be help- 
ful to have more detailed data concerning the 
bases on which items were selected, retained, 
and validated. While this would be most useful 
in the hands of a testing expert, more and more 
teachers are now able to use this data. 

Raw scores, the number correct, may be ob- 
tained by machine or hand scoring. The mean 
item difficulty is 50 and 49, respectively, for 
Forms A and B. Standard scores and percen- 
tile rankings are available. Standardized scores 
are related to the Terman-McNemar Test of 
Mental Ability. Aids are available in the manual 
to show the average teacher and school adminis- 
trator how to make use of the tests. The manual 
contains more than the usual information con- 
cerning the test, its structure, and development ; 
yet additional data would be useful and necessary 
if one were to conduct a comprehensive evalua- 
tion program. 

Since professional tests are so widely used, it 
becomes important that examiners construct 
tests which are current in their objectives. The 
present examination tests objectives often dis- 
cussed by biologists, but seldom. incorporated 
in their testing programs. As а student's study 
is influenced by the kinds of examinations he 
takes in a course, so is a teacher's teaching in- 
fluenced by the type of examination on which 
his students may be ranked. The present exami- 
nation should be an asset through its influence 
upon the teaching of biology and as a measuring 
instrument. 


[606] 


*Test of Application of Principles in Biology: 
General Education Series. High school; 1940-50; 
Forms А (0), and В (50) com ined are essentially 
the same as Application 0, „Principles in Biological 
Science: Test 1.33a (40) which was a revision of Test 
1.33 ('39); original tests published by Evaluation in 
the Eight Year Study, Progressive Education Associ- 
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ation; no data on reliability and validity; no norms; 
administration of both forms recommended in manual 
but not in publisher's catalog; manual ('50) ; $2.50 per 
25; separate answer sheets must be used; боё рег 25 
answer sheets; 40€ per set of scoring stencils; cash 
orders postpaid; 256 per specimen set, postpaid; non- 
timed (40) minutes; Evaluation Staff (Ralph W. 
Tyler, Director) of the Eight Year Study of the Pro- 
gressive Education Association; published in 1950 by 
Cooperative Test Division, Educational Testing Serv- 
ice, * 
REFERENCES 

D , . icatic inciples. Progressive 
Edicion Оа, Е. Meister ой. dr Ei Yen DR" 
Bulletin No. s. Chicago, I: the Study, December 1936. Pp. 


25. Paper, mimeograp ed. А 

2. Boaluation ‘aterials Developed for Various Aspects of 
Thinking. Chicago, Il: Evaluation in the Eight Year Study, 
Progressive Education Association, March 1939. Pp. [22]. 


Paper, mimeographed, * 

D МАРТИ Маса WEISMAN, LEAH; McMutten, 
Harotp G.; AND TRIMBLE, HAROLD С, "Applications of Princi- 
ples of Science," pp. 77-111. In Aiit jen] and Recording Stu- 
dent Progress. By Eugene R. Smith, Ralph W. Tyler, and the 
Evaluation Staff. Adventure in American Education, Vol. III. 


New Berk: Harper & Brothers, 1942. Pp. xxiii, 550. (PA 
16:50, 
4 dU TRI EvucENE R.; Tyrer, RALPH W.; AND THE EVALUA- 


TION Starr. Appraising and Recording Student Progress. Ad- 
venture in American Education, Vol. тї. New York: Harper 
& Brothers, 1942. Pp. xxiii, sso. * (РА 16:5033) 

Crark W. Horton, Consultant in Educational 
Research, Dartmouth College, Hanover, New 
Hampshire. 

The rationale of this test type and the story of 
its development are given in Smith and Tyler 
(4). Any attempt to use the test without a 
thorough understanding of its origin, purpose, 
and structure is likely to be unsatisfactory. The 
test is designed neither to yield a single, easily 
obtained score, nor to be administered to large 
populations. Rather, its primary purpose is the 
diagnosis of the mental processes the individual 
student employs in applying principles. Although 
ability to make the correct decision in problem 
situations plays a major role in the resulting 
scores, the primary emphasis appears to be on 
analysis of the kinds of reasons the student will 
choose to justify his decisions. It is doubtful 
that the test will yield a reliable index of either 
the number of biological principles the student 
knows and can use or the range of biological 
problem situations in which he can apply such 
principles. Each form of the test contains only 
5 biological problem situations, and while the 
total of то in the 2 forms provides a better sam- 
ple, the actual number of biological principles 
or "understandings" needed to make correct de- 
cisions in these situations is, after all, relatively 
small. The question is not so much “Ноу 
many?” as “How well?” 

The testing device used is, briefly, as follows: 
A problem situation is described and a conclu- 
sion (prediction or interpretation) is given. The 
student indicates whether he agrees with, dis- 
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agrees with, or is uncertain about the conclusion, 
He then selects from listed statements those he 
believes to be in support of his decision. If he is 
uncertain, he may select items from a separate 
list to indicate that he lacks knowledge or wishes 
additional information'about the situation before 
he can decide. If he either agrees or disagrees, 
the student selects his reasons from a list in- 
cluding true statements of principles which 
justify the correct conclusion, false statements 
which justify the incorrect conclusion, good and 
poor analogies, statements indicating reliance on 
authority, mere restatement of the conclusion, 
statements involving ridicule, common miscon- 
ceptions, and teleological explanations. The stu- 
dent’s performance is judged by the number of 
his correct conclusions and the extent to which 
he justifies them by selecting acceptable reasons, 
Diagnosis of his deficiencies involves analysis 
of the extent to which he selects the several kinds 
of unacceptable reasons. 

The manual, which also covers corresponding 
tests in general science and physical science, 
gives clear directions for administering and 
scoring the test. The manual, transparent keys, 
and test booklets are attractively printed. One 
must have true zeal, however, to struggle through 
to understand this device, to use the hand keys 
to derive 14 separate scores from each answer 
sheet, to record these scores on a data sheet, to 
derive others from them, and to proceed with 
an interpretation of the performance of indi- 
vidual students. However, the teacher who is 
interested enough to do so will gain a valuable 
understanding of student strengths and weak- 
nesses. 

It is very difficult to make the items of a test 
of this kind completely defensible. In this review- 
er's opinion there are a number of items, Dm 
cluding one of the major conclusions, in which 
the keying is controversial. Also, there are cases 
of phraseology which may look odd to biologists, 
for example, “Inoculation of antitoxin of disease 
germs during an attack.” But, in spite of defects, 
this test is a very useful device. Perhaps it should 
be regarded less as a test in the usual sense and 
more as a teaching device for expanding the 
horizons of both teachers and pupils. Administra- 
tors who hope to get teachers to emphasize think- 
ing skills in courses now overloaded with factual 
learning may find experimentation with it, if 
accompanied by a thorough discussion of both 
the test itself and the results, a truly profitable 
experience. 
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This is a valid test of ability to apply biological 
principles only at certain levels and within 
limits of meaning of the objective. This reviewer, 
a former teacher of botany who likes to think 
himself adept at applying biological princi- 
ples and who has spent many hours discussing 
both the thinking skills involved and the ways of 
testing them, fared rather badly when he took 
both forms of the test and scored the result. True, 
his performance was much above the expected 
median of a high school group, but it was far 
from perfect. Some of the errors were clerical, 
the result of confusing the "agree" and “dis- 
agree" columns, with the consequence that he 
was charged not only with a wrong conclusion, 
but also with a host of inconsistent reasons. (A 
student may do that too. Either the columns 
should be clearly labeled on the answer sheet 
instead of being merely tagged A, B, and C, or 
a brief of the directions should be reprinted on 
subsequent pages.) In one problem he was “too 
certain" in his conclusion, which considerably 
reduced his score. This and other "errors" ap- 
peared to arise from alternate interpretations of 
statements, and from failure to sense the particu- 
lar rules by which the game was being played. 

That the correctness or incorrectness of a 
response (according to the key) may depend 
entirely on guessing the author's interpretation 
of words used in an item, is well illustrated by 
Item 20, Problem I, Form B. Problem 1 involves 
the dependence of the yield of apples on the pol- 
lination of apple flowers by bees. Item 20 is an 
analogy stated: "Just as bumblebees in a field 
of red clover increase the yield of red clover, so 


honeybees in an apple orchard will increase the. 


yield of apples." This reviewer accepted the 
item as a good analogy in support of the correct 
conclusion to Problem I, but the key says he is 
wrong because the item is a poor analogy. Now 
“yield of red clover” can mean yield of hay, in 
which case the analogy is a poor analogy, or 
it can mean yield of seed in which case it is a 
good one. Was the author of the key thinking 
only of the yield of hay? It is comforting to 
infer from Darwin’s classic linking of the old 
maids of England, through cats, field mice, 
bumblebees, clover, beef, to the strength of the 
British Navy, that he also would have missed 
this item. Perhaps, as this test defines the ability, 
Darwin couldn’t apply biological principles 
either. 

Limitations of space prevent a detailed critical 
review of all points of disagreement. Neverthe- 
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less, the test is definitely recommended for use 
by people who thoroughly know what it is and 
what it is not. 


CrAnENCE Н. Netson, Board of Examiners, 
Michigan State College, East Lansing, Michi- 
gan. 

This test is not just another subject matter 
test. In fact, there is little resemblance between 
the test exercises and the activities which take 
place in many high school biology classes. This 
is not meant to be an indictment of the test. On 
the contrary, it would be highly desirable if more 
high school biology classes were conducted so 
as to foster and encourage rather than to stifle 
the kinds of activities embodied in the test. 

The stated purpose of the test is to measure 
some of the less tangible results of education. It 
attempts to obtain information about the extent 
to which high school students understand cer- 
tain commonly taught principles of biology as 
evidenced by the degree to which they are able 
to apply these principles in new situations. For 
the purpose of testing application, the meaning 
of the term “principle” has been broadened to 
include any scientific information, such as fact, 
generalization, understanding, concept, or law, 
which proves to be useful for predictive or ex- 
ploratory purposes. Students are confronted 
with problems, which presumably have not been 
discussed in class, relating to some of the funda- 
mental concepts and principles of biological 
science. A preliminary paragraph describes a 
situation which would be likely to fall within the ` 
everyday experience of practically all high school 
students. The paragraph closes with a conclusion 
suggesting а certain course of action to be taken 
in regard to the situation. The students are asked, 
first, to decide whether they agree with, are un- 
certain about, or disagree with the stated con- 
clusion; then, to select, from a list provided, 
reasons to explain or justify their decision about 
the conclusion. 

The following account of the derivation of the 
reasons is given by Smith and Tyler: 

The reasons used in the test were arrived at by first 
supporting the correct conclusion by formulating three 
statements of facts or principles which support it and 
by implication eliminate the other two conclusions. 
Four wrong reasons which, if accepted as true, would 
support the other conclusions were next formulated. 
Two of these would tend to support one of the wrong 
conclusions and two the other. They would all tend by 
implication to eliminate the right conclusion. One 
statement was formulated so as to be true but irrele- 


vant to the explanation of the problem. One each of 
the following kinds of reasons completed the pattern— 
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a teleological statement, ridicule statement, assuming 
the conclusion, unacceptable analogy, unacceptable au- 
thority and unacceptable common practice. Each of 
these was worded to appear to be consistent with the 
conclusion keyed as right. (4, pp. 90-1) 

The student’s responses are recorded on a 
special answer sheet which is scored by hand, 
with the use of four keys ; machine scoring is not 
feasible. The student’s performance is appraised 
in terms of 14 scores which are recorded on a 

. 30-column data sheet from which interpreta- 
tions are made. For each student, group pat- 
terns of responses can be worked out from the 
data sheet to provide answers to these five ques- 
tions: 

т. To what extent can the student reach valid con- 
clusions involving the application of selected principles 
of science, which he presumably knows, to new situa- 
tions ? 

2. How does the student explain his uncertainty 
when he marks the stated conclusion “uncertain” ? 

3. To what extent can the student justify logically 
his agreement with, his uncertainty about, or his dis- 
agreement with, the stated conclusion? 

4. What kinds of reasons does the student select to 
explain his decisions about the stated conclusions? 

5. To what extent does the student discriminate be- 
tween acceptable and unacceptable reasons in the vari- 
ous categories? 

If the prospective user of this test is unfa- 
miliar with the Eight Year Study and its ob- 
jectives, he is strongly urged to read the section 
entitled "Application of Principles of Science— 
Analysis of the Objective," pp. 77-111 in Ap- 
praising and. Recording Student Progress (3). 
The following paragraph may be helpful to the 
individual who is trying to decide whether this 
test is likely to function well in his particular 
situation : 

If students have been placed in situations in the 
classroom and laboratory where resourcefulness, adapt- 
ability, and selective thinking have been essential for 
the solution of problems, and if the emphasis given to 
teaching science principles has been upon their appli- 
cations to the solution of problems involving commonly 
occurring natural phenomena rather than on the mas- 
tery of science information as an end in itself, then 
students should have little difficulty in behaving in the 
manner anticipated by this test. Such students would 
have had many opportunities to apply the principles of 
science as they learned them to a number of situations 
in the laboratory and classroom, and would have been 
encouraged to be alert for similar opportunities for ap- 
plication as they occur outside the classroom. (4, p. 111) 

The Test of Application of Principles in Bi- 
ology is much more complex from the standpoint 
of administration, scoring, and interpretation of 
results than most classroom tests. It exempli- 
fies, however, a vital type of educational activity 
which is highly stimulating and very much 


- Chairman), Examinations Committee, 


worthwhile in the training of the citizenry of 
an enlightened democracy. 


CHEMISTRY 


[607] 
*A.C.S. Cooperative Biochemistry Test, 1-2 
semesters college; 1947-51; Form X entitled Cooper- 
ative Biochemistry Test; 3 scores: biological chemis- 
try, physiological chemistry, total; IBM; Forms X 
(47), 2 (50); no data on reliability and validity; di- 
rections for administering (’51) ; Form X norms (49), 
Form Z norms ('50); $3 per 25; separate answer 
sheets must be used; 80€ per 25 IBM answer sheets; 
15¢ per scoring stencil; 50¢ per specimen set; postage 
extra; 100(110) minutes; prepared jointly by the 
Cooperative Test Division and the American Chemi- 
cal Society through its Division of Chemical Educa- 
tion, Examinations Committee, Biochemistry Subcom- 
mittee (Gordon H. Pritham, Chairman); Examina- 
tions Committee, Division of Chemical Education, 
American Chemical Society, St. Louis University. * 
REFERENCES 

„1. РаІтнАм, Gorpon Н. “Survey of Courses and Construc- 
tion of Tests in Biological Chemistry.” J Chem Ed 22:84+ F 
ЖА Asurorp, T. A. “The Testing Program of the Division of 
Chemical Education of the American Chemical Society: A Com- 
mittee Report.” J Chem Ed 25:280-2-+ My '48. * (РА 22: 


4115) 
[608] 
*A.C.S. Cooperative Chemistry Test in Qualita- 
tive Analysis. College; 1940-51; 4 scores: descriptive 
information, ки and equations, chemical equilib- 
rium, total; IBM; Forms Y (’48), Z (50); Forms 
Q-T out of print; no data on reliability and validity; 
directions for administering [’51]; Form Y norms 
(749), Form Z norms ('50); $3 per 25; separate an- 
swer sheets must be used; 80¢ per 25 IBM answer 
sheets; 15¢ per scoring stencil; 50€ per specimen set; 
postage extra; 100(110) minutes; prepared jointly by 
the Cooperative Test Division and the American Chem- 
ical Society through its Division of Chemical Educa- 
tion, Examinations Committee, Qualitative Analysis 
Subcommittee (Y—Rufus D. Reed, Chairman) and 
Analytical Subcommittee (Z—R. L. Van Peursem, 
Division 9 
Chemical Éducation, American Chemical Society, St 
Louis University. * 
REFERENCES 
; AND CALANDRA, 


1. R: W. P. 
ALEXANDER, "Aims or Objectives of Qualitative Analysis. 
e.t 


Ww. RrEMAN III, Professor of Analytical 
Chemistry, Rutgers University, New Bruns- 
wick, New Jersey. [Review of Forms Y and 
7.] 
This test covers well the student's factual 
knowledge, his ability to use this knowledge to 
solve analytical. (nonmathematical) proven 
his understanding of the principles, and his abil- 
ity to balance chemical equations. The time 27 
lowance is rather short. The section that deals 
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with numerical calculations is the least satis- 
factory, chiefly because the multiple choice type 
of question is inherently unsuited for this kind 
of work. 

No wrong answers or ambiguous questions 
were found in the test, but the choice between 
the best and second best answer is too close in 
a few cases. For example, Item 44 in Form Z 
asks the student to select from five reagents the 
best one for separating manganous ion from 
aluminum and chromic ions. Sodium peroxide 
is the best answer, but ammonia runs a close 
second. In a case like this, it is unfortunate that 
the system of scoring does not allow more credit 
for the second-best answer than for an answer 
that is definitely wrong. The reviewer believes 
that the Bronsted theory has won sufficient ac- 
ceptance to justify the use of NH, instead of 
NH,OH, especially since both Н+ and H,Ot 
appear in the test. 

In general, this is a good test and can be used 
with confidence to evaluate the performance of 
a student in the average course in qualitative 
analysis. 


For reviews by William B. Meldrum and Wil- 
liam Rieman III of Form T, see 3:562. 


[609] 

*A.C.S. Cooperative Chemistry Test in Quanti- 
tative Analysis. College; 194- 1; 3 scores: theory, 
problems, total; IBM ; Forms Po». Y (48); Form 
U out of print; no data on reliability and validity ; di- 
rections for administering [’51] ; Form X norms (47), 
Form Y norms (’50); $3 per 25; separate answer 
sheets must be used; 80¢ per 25 IBM answer sheets ; 
15¢ per scoring stencil; 50¢ per specimen set; postage 
extra; 110(120) minutes; prepared jointly by the 
Cooperative Test Division and the American Chemi- 
cal Society through its Division of Chemical Educa- 
tion, Examinations Committee, Quantitative Analysis 
Subcommittee (R. L. Van Peursem, Chairman) ; Ex- 
aminations Committee, Division of Chemical Educa- 
ves American Chemical Society, St. Louis Univer- 
sity. 


‚ For reviews by William B. Meldrum and Wil- 
liam Rieman III of Form U, see 3:563. 


[610] 
*A.C.S. Cooperative General Chemistry Test for 
College Students. College; 1934-51; 7 scores: gen- 
eral knowledge and information, application of prin- 
ciples, quantitative application of principles, scientific 
method, total of preceding 4 scores, laboratory tech- 
nique, total; IBM; Forms 1948, Z; Forms 1934-42, 
1944, 1946 out of print; no data on reliability and va- 
Пау; directions for айаш [51]; Form 1948 
norms ('49), Form Z norms (50); $3 per 25; sepa- 
rate answer sheets must be used; 80¢ per 25 ІВМ an- 
swer sheets; 15@ per scoring stencil; 50¢ per specimen 
Set; postage extra; 110(120) minutes; prepared 
jointly by. the Cooperative Test Division and the 
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American Chemical Society through its Division of 
Chemical Education, Examinations Committee, Gen- 
eral Chemistry Subcommittee (Conrad E. Ronneberg, 
Chairman); Examinations Committee, Division of 
Chemical Education, American Chemical Society, St. 
Louis University. * 


REFERENCES 

CE бел, 

о. Asuromp, Т. А, “The Testing Program of the Division 
of Chemical Education of the rican Chemical Society: 
A Committee Report.” J Chem Ed 25:280-2+ My '48. * (PA 
22:4115) T 
Kennetu E. ANDERSON, Professor of Educa- 
tion, and Director, Bureau of Educational Re- 
search and Service, University of Kansas, Law- 
rence, Kansas. 

This is an achievement test designed for use 
at the end of a two-semester course in general 
college chemistry.. Both forms cover general 
knowledge and information, application of prin- 
ciples, quantitative application of principles, 
scientific method, and laboratory technique and 
procedure. In Form 1948 the part concerned 
with laboratory technique and procedure is op- 
tional ; in Form Z there is an optional part de- 
voted to organic chemistry. Form 1948 is the 
longer of the two forms, with a total of 135 items 
as opposed to 115 items in Form Z. The more 
recent form gives somewhat more emphasis to 
quantitative application of principles than does 
the older form; otherwise, the proportion of 
items in areas covered by both forms is approxi- 
mately the same. 

No information is given concerning either the 
validity or the reliability of parts of the test or 
of the test as a whole. However, a thorough 
reading of the test, item by item, reveals con- 
siderable curricular validity, and the test as a 
whole probably possesses high reliability. 
Throughout both forms the test items are 
thought-provoking, going well beyond the usual 
factual type of item. It is refreshing to see a 
college examination in chemistry measuring ap- 
plication of principles as well as quantitative 
application of principles. These sections, as well 
as the section on scientific method, are especially 
well done. 

Percentile norms for part and total scores are 
provided. For Form Z there is an unusually ex- 
tensive norms leaflet which contains five sepa- 
rate tables of percentile ranks based on the 
scores of groups of students enrolled in two- 
semester courses varying in number of hours de- 
voted to lecture and to laboratory work. A de- 
scription of each group and its main professional 
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goals accompanies each table. The institutions 
whose test results are included in each table are 
listed at the end of the leaflet. While the per- 
centile ranks for the part scores seem to be some- 
what repetitive from table to table, it is con- 
ceivable that these may serve a purpose in situ- 
ations where some diagnosis of strengths and 
weaknesses in specific areas is desired. The per- 
centile ranks for combined and total scores 
should be generally useful. 

In summary, one may reasonably say that this 
test seems capable of measuring quite satisfac- 
torily the objectives of general chemistry as 
reported in the literature. Colleges should find 
the test useful as an achievement test and as a 
test of aptitude for further work in chemistry. 


For reviews by Sidney J. French and Florence 
E. Hooper of Form 1946, see 3:557. 


[611] 

*A.C.S. Cooperative Organic Chemistry Test. 
College; 1942-51; 9 scores: total, first-semester total, 
second-semester total, and the following subtests for 
each semester—general information, application of 
principles, problems; IBM; Form W Rev. (^48); 
Form Y ('49); Forms S-W out of print; no data on 
reliability and validity; directions for administering 
(’51]; Form W norms (’48), Form Y norms ('50); 
$3 per 25; separate answer sheets must be used; 80€ 
per 25 IBM answer sheets; 15¢ per scoring stencil; 
50¢ per specimen set; postage extra; 100(110) min- 
utes; prepared jointly by the Cooperative Test Division 
and the American Chemical Society through its Divi- 
sion of Chemical Education, Examinations Committee, 
Organic Subcommittee (O. C. Dermer, Chairman); 
Examinations Committee, Division of Chemical Edu- 
cation, American Chemical Society, St. Louis Univer- 
sity. 


REFERENCES 
1. ASHFORD, T. A. “The Testing Program of the Divisi f 
Chemical Education of the pape Chemical бону ^ 
сне Report.” J Chem Ed 25:280-2-- My 48. * (PA 22: 
4115 


For a review by Shailer Peterson of Form 
W, see 3:558. 


[612] 

*A.C.S. Cooperative Physical Chemistry Test. 
College; 1946-51; 6 scores: gases-liquids-solids-atomic 
structure, solutions and colloids, homogeneous equilib- 
rium-heterogeneous —equilibrium-kinetics, thermody- 
namics and thermochemistry, electro-chemistry, total ; 
IBM; Form W (46); no data on reliability and va- 
lidity ; directions for administering ['51] ; norms (’50) ; 
$2.50 per 25; separate answer sheets must be used: 
80¢ per 25 IBM answer sheets ; 15¢ per scoring stencil ; 
cash orders postpaid; 50¢ per specimen set; postage 
extra; 110(120) minutes; prepared jointly by the 
Cooperative Test Division and the American Chemi- 
cal Society through its Division of Chemical Educa- 
tion, Committee on Examinations and Tests, Physical 
Chemistry Subcommittee (D. A. Dreisbach, Chair- 
man); Examinations Committee, Division of Chemical 
Education, American Chemical Society, St. Louis Uni- 
versity. 


REFERENCES 


1, AsHFORD, T. A. “The Testing Program of the Division 
of Chemical Éducation of the American Chemical Society: А 
CIR Report.” J Chem Ed 25:280-2+ My '48. * (рд 
22:4115 


For a review by Alfred S. Brown, see 3:559. 


[613] 

*Anderson Chemistry Test: Evaluation and Ad. 
justment Series. Grades 11-13; 1051-52, с1950-51; 
IBM; Forms AM (51), BM ('52) ; manual ('s1); 
$2.50 per 25; separate answer sheets must be used; 
80¢ per 25 IBM answer sheets; postage extra; 35¢ 
per specimen set, postpaid; 40(50) minutes; Kenneth 
E. Anderson; World Book Co. * 

Wm. Riman III, Professor of Analytical 
Chemistry, Rutgers University, New Bruns- 
wick, New Jersey. 

This test is designed for use in secondary 
schools. The two forms are intended to be 
equivalent with regard to the range of subjects, 
the relative emphasis on each subject, and the 
difficulty. This objective has been accomplished 
well. Each form consists of 80 questions to be 
answered by selecting the correct (or best) an- 
swer from the five suggested. Wrong answers 
are scored the same as unanswered questions. 

In general, the reviewer commends the wide 
range of subjects and the relative emphasis on 
each, although he would like to see more than 
three questions requiring computations. This is 
a test not only of factual knowledge but also of 
the student’s understanding of principles, fa- 
miliarity with laboratory work, and ability to 
reason. 

Part B of each form contains seven pairs of 
questions. The first question of each pair con- 
cerns factual knowledge. The second of each 
pair “is a question on the chemical principle 
that best explains the answer to the preceding 
question.” For example, Item 47 of Form A 
reads: 

Hydrogen gas was passed into a test tube con- 
taining copper oxide, Heat was then applied to 
the tube. What probably happened? 

The well informed student would have no diffi- 
culty in selecting the answer “Water dripped 
from the tube.” Item 48 reads: 

Which one of the following statements gives the 
principle that best explains the answer to question 
47! 

6. Copper is a comparatively inactive metal. D 

7. Metal oxides often combine with an aci 
form a carbonate. ith 

8. Many oxygen compounds when heated Wi 
hydrogen will release oxygen. 

9. Noticeable heat and light occur in many 
chemical reactions. 


* 
to. Some chemical compounds are able to ab- 
sorb water from air. 
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Answer 8 is considered correct. In the reviewer's 
opinion, answer 6 is equally good. 

The test contains a number of other inaccura- 
cies and poorly worded questions. In other ques- 
tions of Part B, the choice is almost as close as 
in Item 48. In these cases, it seems desirable to 
devise a method of scoring that would give part 
credit for selecting the second-best answer. In 
each form, a table of solubilities is followed by 
four questions that refer to the table. In the 
table of Form A, the solubility of sulfur at 
100? C is given as 15 g. per 100 cc. of water! In 
Item 80 of Form B, the formula of calcium 
chlorate is incorrectly given. Item 54 of Form 
А asks the student to select the term in the fol- 
lowing five that does not belong with the others : 
boiling point, inert, solubility, specific gravity, 
density. A student with no knowledge of science 
should be able to select the correct answer be- 
cause "inert" is the only adjective among the five 
choices, the other four of which are nouns. Both 
forms contain a question like this: "What will 
be one of the ions in the ionic equation for 
Ca(OH),?” A better wording would be: 
"Which one of the following ions is furnished 
by Ca(OH),?” 

In spite of these shortcomings, the test is 
considerably better than most of the multiple 
choice chemistry tests that the reviewer has seen. 


[614] 

X*Chemistry: Achievement Examinations for 
Secondary Schools. High school; 1951; 1 form; no 
data on reliability and validity ; no manual; Minnesota 
norms (median and quartile deviation) available; simi- 
lar norms for other regions by special arrangement 
with publisher ; 7€ per test, postage extra; 60(65) min- 
utes; edited by Walter W. Cook; prepared by a cur- 
riculum committee of high school teachers for use in 
the Minnesota State Board Achievement Examina- 
tions Program; Educational Test Bureau, Educational 
Publishers, Inc. * 


[615] 

*Chemistry: Every Pupil Test. High school; 1920- 
51; new form usually published each April and De- 
cember ; form December 1951; no data on reliability 
and validity; no manual; norms ('51); 216 per test; 
16 per answer key; postpaid; 40(45) mind: Ohio 
Scholarship Tests, Ohio State Department o! Educa- 
ion. 


[616] 

"Chemistry Test: State High School Tests for 
ndiana. First, second semesters high school; 1934- 
50; 2 levels; mimeographed ; no data on reliability. and 
validity; no manual; 4!4 per test; 156 per specimen 
set; postpaid; 40(45) minutes; Ezra А. Miller; State 
High School Testing Service for Indiana, Purdue Uni- 
versity, * 

а) FIRST SEMESTER, Form B [48] ; 
print; norms [49]. 


Form A out of 


b) SECOND SEMESTER. Form О [48]; Form N out of 
print; norms ['50]. 

For areview by Fred P. Frutchey of an earlier 
form, see 38:931. 


[617] 

XCollege Entrance Examination Board Achieve- 
ment Test in Chemistry. Candidates for college en- 
trancé; 1001—51; available only in College Entrance 
Examination Board Admissions Testing Program (see 
526); 60(70) minutes; prepared by College Entrance 
Examination Board Committee of Examiners in 
Chemistry in cooperation with the Staff of Educa- 
tional Testing Service: 1051 membership: Stuart К. 
Brinkley (Chairman), Otis E. Alley, Nelson F. 
Beeler, Ina M. Granara, and Warren T. Warnecke; 
program administered by Educational Testing Service 
for the College Entrance Examination Board. * 


REFERENCES 
т. Foster, LAUuRENCE F., CHAIRMAN. "Report of the New 
England Association of Chemistry Teachers’ Committee on 


College Entrance Examinations.” J Chem Ed 16:46-9 Ja '39. * 


x "The Chemistry Examination of the College ntrance 
V PR REST TORNA Ed 17:443-5 S '40. * Sch Sci 

3. "The Chemistry Examination of the College Entrance 
Examination Board." J Chem Ed 18:441-3 S '41. * 

4. Weser CHmisriAN О. “Old and New College. Board 
Scores and Grades of College Freshmen." J Am Assn Col Reg 
20:70-5 О '44. * (PA 19:570) 

Evetyn Raskin, Assistant Professor of Psy- 
chology, Brooklyn College, Brooklyn, New 
York. (Review of Form ZAC.) 

Form ZAC, the most recent, and best, edition 
may be considered the prototype of this series of 
achievement tests in chemistry. It consists of 
4 parts: Part I contains 76 items requiring the 
student to classify groups of items according to 
varjous categories such as laws, theories, va- 
lence numbers, etc. ; Part II consists of 40 multi- 
ple choice items, each alternative of which must 
be judged as true or false; Part III consists of 
44 conventional multiple choice items with 5 
options apiece, only one of which is correct ; and 
Part IV consists of 10 equations which the stu- 
dent must complete and balance. 

The earlier forms of the test are essentially 
similar to this one in composition, except that 
the proportion of "true-false" items has been 
reduced in the more recent forms and items of 
the classification type have been substituted. 
This seems to be a desirable revision, since the 
“true-false” items occasionally had stems which 
gave away the fact that only one of the possible 
options could be true. Moreover, the stems to 
these items were so general that no unity could 
be discerned among the various alternatives. 
While the classification items are an improve- 
ment, it seems to the reviewer that they consti- 
tute too large a proportion of the total test. For 
example, there are 15 items alone calling for the 
classification of various substances as natural 
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elements, compounds, mixtures, isotopes, or 
solutions. Some of the information called for in 
this part of the test is, as a matter of fact, dupli- 
cated in other parts of the examination (e.g., 
Items 15 and 113). Although extensive cover- 
age with minimum reading time is gained by 
this testing technique, it does tend to stress iso- 
lated and relatively factual information. 

In this respect, the multiple choice items in 

the examination are superior. They give evi- 
dence of thoughtful and painstaking preparation, 
and most of them demand that the student show 
understanding of basic concepts and principles. 
Occasionally a term is not precise (Avogadro's 
"theory"), or the distractors appear to offer little 
"bait" (“Air is a mixture because....the wind 
blows it around"), but such flaws are few. The 
equations in Part IV are well selected, although 
it may be desirable to test the student’s ability 
to balance separately from his ability to complete 
the equation. Moreover, requiring the student 
to give only one coefficient as evidence that he 
has completed and balanced the equation cor- 
rectly would appear to leave too much to a lucky 
error. : 
In general, the test content stresses basic top- 
ics, with adequate coverage of quantitative 
formulations and problems and with gratifyingly 
few questions calling for highly specialized in- 
formation, such as details of industrial processes. 
There are, however, no questions on laboratory 
techniques or procedures. In this connection, 
the reviewer would be interested in knowing how 
the committee of chemists responsible for the 
preparation of the examination handles the dif- 
ficult problem of drawing up the curricular basis 
for its content. 

One serious objection the reviewer does have 
to the test is its length. A one-hour time limit 
for a test containing 170 items certainly places 
too heavy a premium on speed for a test designed 
to measure achievement. Statistical evidence 
on earlier forms of the test which showed that 
more than 30 per cent of the students failed to 
complete it bears out this contention. Although 
the inclusion of more classification items in the 
recent forms may result in lowering this figure, 
the student must still have the uncomfortable 
feeling that he is racing against time. A test of 
this length can be completed within an hour only 
if it sacrifices the important objectives of testing 
for comprehension, interpretation, and applica- 
tion. It is hoped that future editions of the test 
will move in the direction of a substantial reduc- 
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tion in length which will thereby permit testing 
for these important, although time-consuming, 
objectives. Greater validity should be gained 
with no appreciable loss in reliability. 

The test, as already indicated, is entirely ob- 
jective and may be machine scored. Norms are 
based on. College Entrance Examination Board 
candidates and are expressed in the form of 
standard scores with the mean and standard de- 
viation set at 500 and 100 respectively. Percen- 
tile rank equivalents for these scores are also 
available. Evidence obtained on an earlier form 
of the test indicates that it has a reliability of 
over .9 (by both the odd-even method and the 
Kuder-Richardson formula No. 20). A recent 
study carried out by the Board on a group of 
139 California Institute of Technology fresh- 
men reported a validity coefficient of .48 be- 
tween scores on the test and grades at the end 
of the first year course in college chemistry. 


[618] 

*Cooperative Chemistry Test. High school; 1933- 
51; IBM; Forms X (^47), Y (48), Z (’50) ; Forms 
1933-37, N-Q, S out of print; no data on validity ; no 
specific manual; general Cooperative manual (51); 
norms ['37] ; $2.50 per 25; 2 per specimen set, post- 
paid; separate answer sheets may be used; 80¢ per 25 
IBM answer sheets ; 15€ per stencil for scoring answer 
sheets; cash orders postpaid; 40(45) minutes; Paul J. 
Burke (X) and Joseph F. Castka (Y, Z) ; Cooperative 
Test Division, Educational Testing Service. * 


REFERENCES Аа 

1, Rice, Сесп, L., AND Vinton, Kennetu W, “Ап Experi- 
л їп Teaching CHIEDI for Testing." Sch & Soc 52:4375 

20 '40. 14:5717 FE 3 

2. BiNINOEA, Mant Lax, A Study on the Prediction of o 
cess im am Engineering College. Master's thesis, Clark 
versity Сасе, Mass.), 1045, (Abstracts оў Dissertations. 
194, „ 20—21. s "Ne 

fo КліРН F., AND Sutter, Nancy А. Fredik 
ae $f  Pneineering Students.” J Ed Psychol 41:184 

г ’50, 24:60 

4. Бы Жм RYAN, МівтАм М.; AND BURKE, Ma 
t "Correction for Guessing in the Scoring of Pretests: S 

pon Item Difficulty and tend Validity Indices, P Ee 
In The Seventh Yearbook of the National Council on, Meas the 
ments Used in Education, 1949-1950. Fairmont, W. "Paper, 
Council, Fairmont State College, 1950. Pp. v, 55, XL 
mimeographed. * 


For a review by John Н. Daugherty of Form 
S, see 3:561; for reviews by Charles L. Bickel 
and Louis M. Heil of Form P, see 40:1592; for 
reviews by Edward E. Cureton of Form 1937 
and W. B. Meldrum of Form N, see 38:932. 


Rec- 
Forms 


Forms ERB-R, ERB-S, ERB-T out of print; n den 
on reliability and validity with test (for data ртт 
elsewhere by the publisher, see т—2 seque for a 


, 9); 
ERB-RX norms (’51), Form ERB-SX norms (4 H 
$2.50 per 25; Eun 
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tee on Chemistry Tests of the Educational Records 
Bureau (Charles L. Bickel, Chairman) ; Educational 
Records Bureau. * К 


REFERENCES 


т, Traxter, ARTHUR E. “Some Data on the Difficulty and 
Validity of the Cooperative Tests in Biology, Chemistry, and 
Physics, Form ERB-R,” pp. 64-8. In 1941 ‘Achievement Test- 
ing Program in Independent Schools and Supplementary 
Studies. Educational Records Bulletin, No. 33. New York: 
Educational Records Bureau, June 1941. Pp. x, 76. Paper, 
lithotyped. * (PA 15:4388) 

2. TRAXLER, ARTHUR E. "Some Data on the Difficulty, Re- 
liability, and Validity on the Cooperative Biology, Chemistry, 
and рн Tests, Forms ERB-RX," pp. 74-8. (РА 23:314) 
In 1948 Achievement Testing Program in Tode, endent PEIRA 
and Supplementary Studies. Educational Records Bulletin, No. 
so. New York: Educational Records Bureau, July 1948. Pp. 
xiii, 86. Paper, lithotyped. * 


[620] 
XGraduate Record Examinations Advanced 
Chemistry Test. Senior year college through gradu- 
ate school and candidates for graduate school; 1939- 
51; available only in Graduate Record Examinations 
programs (see 527); 180(220) minutes; prepared by 
the Advanced Chemistry Test Committee appointed by 
Educational Testing Service: Sidney J. French (Chair- 
man), John C. Bailar, Jr., Florence Hooper Forziati, 
and Charles C. Price; Educational Testing Service. * 


[621] 
Iowa Placement Examinations: Chemistry Apti- 
tude, Grades 12-13; 1925-44; IBM; 2 editions; no 
specific manual; G. D. Stoddard, Jacob Cornog, and 
L. W. Miller (New Series) ; Bureau of Educational 
Research and Service, State University of Iowa. * 
a) SERIES CAI, REVISED. 192 26; Forms А (725), B 
(25) ; series manual ['26]; $4.20 per 100, postage ex- 
tra; 50¢ per specimen set (includes the other 10 tests 
in the series), postpaid ; 44(50) minutes; prepared un- 
der the direction of C. E. Seashore and G. M. Ruch. 
b) NEW SERIES CA2, REVISED. 1925-44; also called 
Quick-Scoring Edition; IBM; Form M (42); $1.35 
per 25; 20€ per series manual (744) ; 40€ per specimen 
set (includes the other 4 tests in the series), postpaid ; 
separate answer sheets may be used; $2.50 per 100 
IBM answer sheets; 256 per stencil for scoring answer 
sheets; postage extra; 47(55) minutes; revised by 
Jacob Cornog and D. B. Stuit. 


REFERENCES 

1-15. See 3:566. 

16, STALNAKER, JoHN M. A Statistical Study of Some As- 
кз of the Purdue Orientation Testing Program. Bulletin of 
Purdue University, Vol. 28, No. 6; Studies in Hie Educa- 
tion VIII, Lafayette, Ind.: Division of Educational Reference, 
the University, February 1928. Pp. 68. Paper. * (PA 5:831 

17. BARNETTE, LEsLIE, JR. Occupational Aptitude Pat- 
terns of Counseled Veterans. Doctor's thesis, New York Uni- 
tte, er York, N.Y.), 1949. Pp. viii, 385. * (PA 24:362, 

ту 
8. Оннркоск, Rrowanp STEPHEN, “Construction of а бе- 
ection Test for College Graduates." J General Psychol 41: 
$910 '49. * (PA 24:4874) f 

19. BARNETTE, W. LesLie, JR. Occupational Aptitude Pat- 
rmt of Selected Groups of Counseled Veterans. American 
sychological Association, Psychological Monographs; General 
E Applied, Vol. 6s, No, s, Whole No. 322. Washington, D.C.: 

е Association, Inc., 1951. Pp. у, 49. Paper. * (PA 26: 794) 

1 20. KnarHwoHL, WILLIAM C. “En ingering Norms for the 
owa Chemistry Aptitude Test, Form бг” J Eng Ed 41:607-9 
Је 's1. * (РА 26:3626) 


KennetH E. AwDrERSON, Professor of Educa- 
tion, and Director, Bureau of Educational Re- 
search and Service, University of Kansas, Law- 
rence, Kansas. 

SERIES CAI, REVISED. Each form consists of 
125 items distributed as follows: 15.8 per cent 
sample the simple arithmetic of chemistry, 23-7 


t 
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per cent sample the ability to secure precise data 
from chemical paragraphs, 13.3 per cent sample 
chemistry reading comprehension, and 47.2 per 
cent sample interest in chemistry as measured by 
accuracy of common information. In terms of 
maximum scores these proportions become 33.6 
per cent, 25.2 per cent, 28.6 per cent, and 12.6 
per cent respectively. These proportions are not 
in harmony with time allotments in the usual 
first year courses in chemistry. Too much of the 
examination is devoted to Part 1, and too little 
to Part 2. It is doubtful that the reading test of 
17 items in Part 3 possesses high reliability. In- 
terest in chemistry and the ability to answer fac- 
tual questions are probably not highly related. 
The manual should discuss the validity and 
reliability of the parts as well as the rea- 
sons for weighting the questions. It does not 
do so. 

The two forms of the examination are not 


strictly comparable. Form A is two points easier 


at the roth percentile and five points easier at the 
goth percentile. Form A is reported to have a 
coefficient of reliability of .88 with a probable 
error of a score of 4.0 points. Although validity 
coefficients are not reported separately for the 
various tests, the average correlation with 
grades for the entire series is .50. Thus, the 
standard error of estimate for this test is about 
14 per cent smaller than the standard deviation 
of the criterion scores. 

Hand scoring and the use of correction formu- 
las make the scoring tedious. 

Used alone, the examination does not seem 
designed to perform the job of predicting success 
in chemistry courses. Its usefulness lies in its 
being supplementary to other measures of aca- 
demic success, such as general intelligence and 
grades received in previous courses in science 
and mathematics. 

NEW SERIES CA2, REVISED. Form M consists 
of four parts. Of the 110 items, about 22.7 per 
cent sample the arithmetic and algebra skills com- 
monly used in problems in first year chemistry, 
9.1 per cent sample the ability to change a verbal 
statement into an algebraic expression, 27.3 per 
cent sample the ability to read and comprehend 
facts in the field of chemistry, and 40.9 per cent 
sample range of information in the field of chem- 
istry. In terms of maximum scores these propor- 
tions become 19.2 per cent, 23.1 per cent, 23.1 
per cent, and 34.6 per cent respectively. These 
proportions are not in harmony with time allot- 
ments in first year courses in chemistry. Too 
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much of the examination has been devoted to the 
mathematical aspects of chemistry. The part on 
reading is an improvement over the CAT series 
in that the student must do more thinking about 
the basic principles of chemistry and not just 
answer the questions by a process of spotting 
words or passages. It is doubtful if Part 4, con- 
sisting of 60 true-false questions, adequately 
samples a student's range of information in chem- 
istry. 

The validity and reliability coefficients for the 
new series are higher than for the САт series. 
The examination has a reported reliability coeffi- 
cient of .96 and a probable error of a score of 
3.0 points. It may be inferred from the table of 
percentile equivalents in the manual that the 
probable error of a score is much greater than 
3.0 points near the extremes of the distribution. 
The validity and reliability of the separate parts 
should be reported. They are not. For the most 
part a change of one percentile point is accom- 
panied by a change of one raw score point except 
below the sth percentile and above the goth 
percentile. The progression from the 95th to the 
tooth percentile is accompanied by a change of 
I5 raw score points. 

The use of this examination alone as a predic- 
tor of success in chemistry does not seem war- 
ranted. It must be supplemented by measures of 
general ability and success in previous science 
courses. The examination should be useful in 
dealing more effectively with individual stu- 
dents who are not well oriented to college, and in 
furnishing a basis for the diagnosis of class 
weaknesses. ‘ 


For a review by Theodore A. Ashford, see 
3:566. 


[ 622] 

Iowa Placement Examinations: Chemistry 
Training: Series CT1, Revised. Grades 12-13; 
1925-30; Forms A (725), B (25); Forms X (^30— 
New Series C.T.), Y (3o0—New Series C.T.) out of 
print; no specific manual; series manual [26] ; $5 per 
100, postage extra; per specimen set (includes the 
other то tests in the series), postpaid; 43(50) minutes ; 
prepared under the direction of C. E. Seashore and 
G. M. Ruch; С. D. Stoddard and J. Cornog; Bureau 
gi шан Research and Service, State University 
of Iowa. 


REFERENCES 
ONES 3:567. 
15. Unrsrock, RicHaRD STEPHEN. "Construction of a Se- 


lection Test for College Graduates," J General Psychol 41 1153- 
93 O '49. * (PA 24:4874) 


KeNNETH E. Амревѕом, Professor of Educa- 
tion, and Director, Bureau of Educational Re- 


search and Service, University of Kansas, Law- 
rence, Kansas. 

The purpose of this test is to measure previous 
training which is considered essential prepara- 
tion to the work at hand. It is assumed that the 
student has had a year of high school or a semes- 
ter of college chemistry. 

Each form of the series consists of four parts, 
Of the 152 items, 29.6 per cent sample knowl- 
edge of the fundamentals of chemical processes; 
29.6 per cent sample ability to handle valence, 
formulas, and equations; 32.9 per cent sample 
knowledge of manufacturing processes—applied 
chemistry; and 7.9 per cent sample ability to 
work fundamental chemical problems. In terms 
of maximum scores these proportions become 
23.9 per cent, 23.9 per cent, 26.6 per cent, and 
25.5 per cent respectively. These proportions со- 
incide fairly well with the time allotments in 
first year courses in chemistry. However, too 
few items devoted to fundamentals of chemical 
processes seem designed to measure the basic 
principles and generalizations of chemistry. Тоо 
many items are devoted to applied chemistry. 

The hand scoring and weighting procedures 
make the correction of papers a tedious pro- 
cess. 

The reliability of Form A is reported to be 
.90; the probable error of a score, 6.4 points. 
The validity and the reliability of each of the four 
parts are not discussed in the manual. The ex- 
amination may be used as a supplementary prê- 
dictive device even though the average validity 
coefficient of the entire series is reported to be 
.60. Thus, the standard error of estimate for 
this test is about 20 per cent smaller than the 
standard deviation of the criterion scores. 

Although separate percentile norms are pro 
vided for the two forms, Form A is about 12 to 
14 raw score points easier throughout the scale 
than Form B. 

Although the examination has somewhat bek 
ter predictive power than the Jowa Риси 
Examination: Chemistry Aptitude, it should no 
be used alone for this purpose. The examina in 
should be useful for measuring the achievemen 
of a student after a semester or year's work n 
chemistry, for furnishing a basis for the diagnos! 
of class weaknesses, and for dealing more ette 
tively with students who are not well orient 
in their college work. 


For a review by Theodore A. Ashford of 
Form A, see 3:567. 
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*Cooperative General Science Test. High school ; 
1932-51; IBM; Forms X (747), Y (48), Z (50); 
Forms 1932-37, N-Q out of print; no data on valid- 
ity; no specific manual; general Cooperative manual 
sr) ; norms [37] ; $2.50 per 25; 50€ per specimen 
set, postpaid; separate answer sheets may be used ; 

per 25 IBM answer sheets; 15@ per stencil for scoring 
answer sheets; cash orders postpaid; 40(45) minutes; 
Paul E. Kambly (X, Y) and Carl A. Pearson (Z); 
Cooperative Test Division, Educational Testing Serv- 
ice, 


REFERENCES 
1. MERRILL, RUTH ATHERTON. An Evaluation of Criteria for 


the Selection of Students in the School of, Nursing of the Uni- 
versity of Minnesota. Doctor's thesis, University of Minnesota 
(Minneapolis, Minn.), 1937. (Summaries of Ph.D. Theses... 
1939, pp. 126-31. 

Joun S. Кіснлкрѕом, Associate Professor of 
Education, The Ohio State University, Colum- 
bus, Ohio. 

This test is intended primarily for use as a 
measure of end of course achievement and also 
for use in placement, administrative survey, and 
curriculum study. The number of items in each 
form ranges from 85 in Form Z to тот in Form 
Q. With the exception of a few matching items 
in Form Q, all items are of the multiple choice 
type. Each form is divided into two parts, with 
25 minutes allowed for the first part and 15 min- 
utes for the second. There is no apparent order 
in the listing of the items. 

It is assumed that the general procedures de- 
scribed in the manual for all Cooperative tests 
were carefully followed in the preparation of 
the general science test. These procedures, as 
described, include curriculum and textbook 
analysis, research studies, editorial attention to 
test items, experimental use, revision, and sur- 
veys to obtain norms. 

The teaching objectives which these instru- 
ments actually test merit attention. Study of 
the items reveals considerable dependence upon 
recall of factual information. To a lesser degree, 
principles of science are involved in determining 
the correct response, Tn a few items in each form 
the student has some occasion to do reflective 
thinking. In only one item in all four forms is 
he expected to respond directly in one of the 
aspects of thinking. In Form Q, Item 24 of Part 
I deals directly with the interpretation of data. 
It should be noted that Form Q is the oldest of 
the four forms here examined and that no item 
comparable with that cited is included in the later 
forms, This lack of attention to the method of 
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science per se constitutes a weakness of the se- 
ries, 

An error of fact appears in Item 34 of Form 
X. The key indicates that the correct answer is, 
“Water is a good conductor of electricity." Ac- 
tually, water is a poor conductor of electricity. 
It is the solution of certain compounds in water 
that constitutes a good conductor of electricity. 

Analysis of the items by fields of content 
(chemistry, physics, botany, astronomy, etc.) 
reveals a rather uniform pattern of distribution 
in each of the forms of the test. While such dis- 
tribution among the fields cannot be entirely 
clear-cut in certain instances, there is in each 
of the forms a preponderance of items from the 
field of physics—approximately 30 to 40 per 
cent. The items concerned with health and physi- 
ology constitute from 20 to 35 per cent of all 
items. In contrast, there is but scant attention 
given to the fields of conservation and meteorol- 
ogy. Assuming that the analysis of curricula, 
textbooks, and the like reveals that such is the 
distribution of the content in actual practice, 
some users of the tests may find the distribution 
quite satisfactory. Other users may not find such 
relative emphases upon the various fields of con- 
tent to their liking. 

There is no manual for this particular test. 
The general manual prepared for use with all 
Cooperative tests gives no information on the 
validity and use of the Cooperative General Sci- 
ence Test. Some information on reliability is re- 
ported on the hand scoring keys. The usual 
teacher would, however, find such reliability 
data difficult to interpret. 

Tn spite of the shortcomings noted above, this 
test should serve quite satisfactorily the teacher 
who understands that it measures only a lim- 
ited portion of the range of objectives for the 
teaching of science. The sampling provided by 
the test items is consistently distributed over cer- 
tain academic areas and thus will serve some 
testing situations reasonably well. 


For a review by G. W. Hunter of Form P, 
see 40:1601; for reviews by W. B. Meldrum of 
Form N and Alvin W. Schindler of Form 1937, 
see 39: 1125. 


[624] 
*Cooperative Science Test for Grades 7, 8, and 9. 
Grades 7-9; 1941-51; 4 scores: informational back- 
ground, terms and concepts, comprehension and inter- 
pretation, total; IBM; Forms X C4, Y (48; 
Forms R, S out of print; no data on validity; no spe- 
cific manual; general Cooperative manual (51); 
norms ['48]; $2.50 per 25; 506 per specimen set, post- 
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paid; separate answer sheets may be used; 80¢ per 25 
IBM answer sheets; 156 per stencil for scoring answer 
sheets; cash orders postpaid; 80(85) minutes; Paul E. 
Kambly; Cooperative Test Division, Educational Test- 
ing Service. * 

R. Witt Burnett, Professor of Science Educa- 
tion, University of Illinois, Urbana, Illinois. 
[Review of Forms S, X, and Y.] 

Part I of each form contains 75 multiple 
choice items on informational background in 
general science. The distribution of items appears 
sensible and tends to emphasize areas of func- 
tional value such as health, conservation, biology, 
and so forth along with areas of general interest. 
There was fortunately no attempt to use an 
additive process in providing equal balance to 
items from all relevant fields. There appears to 
be some emphasis on certain types of questions 
that could hardly be supported as significant 
Írom a modern curricular viewpoint. Detailed 
objections can be raised concerning sampling 
and word usage. Of the six questions in meteor- 
ology most are very similar and deal with con- 
densation phenomena. Two of the four ques- 
tions on sound are essentially alike and deal 
with pitch. Item 26 in Form Y misuses the term 
"motor." Item 34 has spiracles “around” rather 
than "along" the sides of the abdomen. Aside 
from such minor points as these, the informa- 
tional items are probably as wisely drawn and 
intelligently constructed as can be expected in 
a standardized instrument to be used generally 
over the country regardless of local objectives 
and emphases. Certainly, Part I avoids the all 
too common tendency in achievement tests of 
finding the apparent most common denominator 
of textbooks and courses of study over the coun- 
try. This latter procedure generally produces an 
instrument testing memoriter learnings of dis- 
crete factual sorts basically unrelated to func- 
tional or situational contexts. The Cooperative 
test is still distinctly superior in the field today 
and is matched or surpassed, in the reviewer's 

: opinion, only by the USAFI test which requires 
almost twice the time for administering. 

Part II of each form purportedly tests knowl- 
edge of both terms and concepts. Clearly, many 
items in Part I also test, directly or indirectly, 
for these knowledges. The 45 multiple choice 
items in Part II are very largely questions of 
terminology. “During the daytime an oak tree 
manufactures starch by the process known as:” 
is a fair example. Very little if any real conceptu- 
alization is required in Part II. “Ап electric cur- 
rent always produces: light, sound, radio waves, 


magnetism, motion" is as near as any item 
comes to it. The title of Part II is therefore 
misnomer and future editions would profi 
greatly from a deemphasis on terminological 
questions and an increase in the number that 
test understanding of principles and concepts, 

Part III, Comprehension and Interpretation, 
is the most valid part of the test from a curricu- 
lar standpoint in conception and probably in 
execution. This part consists of various pas- 
sages, tables, and diagrams that present experi- 
mental results and scientific problems. Follow- 
ing each given situation are a number of 
multiple choice propositions. In each, the student 
is to choose the response supported by the given 
situation or by scientific principles and generali- 
zations. It is quite possible that some items in 
Part III test nothing more than reading compre- 
hension. However most items present predictive 
questions and other sorts that require conceptual 
knowledge for correct answering. 

Norms are given in percentile rank tables. 
The general Cooperative manual fortunately 
presents an excellent cautionary statement 
on the use and interpretation of norms that 
may prevent the abuses of normative data so 
common in the past. No data on statistical va- 
lidity are given. No data are given on what 
numbers and distribution of students were In- 
volved in the normative tables. “Вазей on large 
numbers of students,” a statement appearing 10 
the general manual and referring to all of the 
Cooperative tests, is hardly sufficient. These data 
should be available to all potential users in the 
manual. Reliability coefficients are not given, 
however the scaled score tables provide diagram- 
matic indices of the standard error of measure- 
ment at certain points. : 

From a curriculum standpoint this reviewer 
feels that the entire test is basically sound in bal- 
ance and in the implied objectives for which test 
items have been developed. However he woul 
wish for a greater emphasis in number of items 
and time allocated to Part III which now allows 
but 25 minutes for response and six or seven situ- 
ations for analysis depending on the form. The 
entire test will take 8o to 85 minutes to administer 
but could easily be administered in two РАН, 
useful feature of all Cooperative tests is that the 
raw scores are easily converted to scaled Scores 
so that normative data can be directly compare 
test by test. In addition it is an easy matter (0 
determine a student's growth, as measured by 
reapplication of the test using a different form 
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For reviews by Hans C. Gordon of Form R 
and Herbert A. Thelen of Forms S and R, see 
3:571. 

[625] 

*General Science: Every Pupil Test. High School ; 
1929-51; new form usually published each April and 
December ; form April 1951; no data on reliability and 
validity; no manual; norms (51); 2156 per test; ІФ 
per answer key; postpaid; 40(45) minutes; Ohio 
Scholarship Tests, Ohio State Department of Educa- 
tion. * 


[626] 

*General Science Test: National Achievement 
Tests. Grades 7-9; 1936-50; 7 scores: general con- 
cepts, identifications, men of science, definitions, uses 
of objects, miscellaneous facts, total; Forms А ('50— 
same as test copyrighted in 1939), B ('50—same as 
test copyrighted in 1939) ; no data on reliability and 
validity and no description of normative population in 
manuals; no norms for part scores; manuals ('39) ; $2 
per 25; 35€ per specimen set; postage extra; nontimed 
(30-45) minutes; Robert K. Speer, Lester D. Crow, 
and Samuel Smith; Acorn Publishing Co. * 


For reviews by Francis D. Curtis and G. W. 
Hunter, see 40:1602. 


[ 6271 
*General Science III: Achievement Examina- 
tions for Secondary Schools. High school; 1951; 
1 form; no data on reliability and validity; no manual ; 
Minnesota norms (median and quartile deviation) 
available; similar norms for other regions by special 
arrangement with publisher; 7¢ per test, postage ex- 
tra; 60(65) minutes; edited by Walter W. Cook ; pre- 
pared by a curriculum committee of high school teach- 
ers for use in the Minnesota State Board Achievement 
Examinations Program; Educational Test Bureau, 
Educational Publishers, Inc. * 

[628] 

*Read General Science Test: Evaluation and 
Adjustment Series. Grades 9-13; 1951-52, C1950751 ; 
IBM; Forms AM (з), BM (52); manual ('51); 
$2.50 per 25; separate answer sheets must be used; 
per 25 IBM answer sheets; postage extra; 356 per 
specimen set, postpaid; 40(50) minutes; John G. 
Read; World Book Co. * 
BENJAMIN S. Broow, College Examiner, The 
University of Chicago, Chicago, Illinois. 

Froma statistician’s viewpoint, this is a model 
test. Preliminary forms were administered to 
1,600 students in 12 high schools in 8 states 
near the close of the year. The mean IQ of the 
group was 102, The mean validity index (item- 
total correlations) was computed and found to 
be .42 and .43 for the two forms. Items were 
selected so as to yield two forms of the test, pre- 
cisely balanced in difficulty (both forms had a 
mean difficulty of 53 per cent) and composed of 
items known to be of significant discriminating 
power, Reliability coefficients are .85 and higher. 
The test user is also aided by information about 
the standard error of measurement, which is 4.6 


points for a 75-item test. Both forms of the test 
were administered in a rotation-type experi- 
ment, and it was found that the two forms are 
almost directly comparable at all points along 
the scale, even in terms of raw score. 

From the viewpoint of the present general 
science curriculum, the test is quite good. Ob- 
jectives and content to be tested were drawn 
from an examination of widely used textbooks, 
state curricula and samples of courses of study, 
and the relevant publications of the National So- 
ciety for the Study of Education. As a result, 
an attempt was made to measure a knowledge 
of the basic facts and principles of the sciences 
as well as the ability to apply this knowledge in 
problem solving situations. Approximately one 
half of the test items are of each type. Teachers 
who administered the preliminary forms of the 
test were invited to criticize them, and these criti- 
cisms were taken into account in developing the 
final forms of the test. The items are well con- 
structed with ample diagrams, tables, and illus- 
trations. 

As a test for the present general science pro- 
gram in the high schools, this test can not be 
criticized. It is true that general science courses 
do help the student become acquainted with 
many tidbits of information and do develop some 
slight ability to apply little understood princi- 
ples in general life situations. This test ade- 
quately reflects this emphasis. However, the edi- 
tor claims that the Evaluation and Adjustment 
series of tests attempts “to give recognition to 
the newer objectives and to anticipate curricular 
trends of the next few years.” The yearbook to 
which the test writer referred stresses the im- 
portance of developing competence in the use 
of the scientific method of problem solving and 
of inculcating scientific attitudes as transcend- 
ing in importance other objectives in science in- 
struction. Here it is that this test is lacking. Of 
course, it is here also that the curriculum is lack- 
ing. Such objectives are difficult to teach as well 
as to test. But it is here that our new tests must 
pioneer. М 

Other than improved statistical work and 
more careful construction, this test shows little 
improvement over tests constructed 20 years 
ago in the same field. Ifa test made in 1951 shows 
little or no advance over tests made in 1931, 
there would appear to be equally little reason for 
publishing it. Tests workers appear to be loath 
to assume the opportunities as well as responsi- 
bilities for leadership. Our field must keep pace 
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and perhaps even anticipate thinking about the 
curriculum. The writer would advocate that 
every new standardized test contain at least one 
experimental section in which “newer” objec- 
tives are tested—perhaps with the opportunity 
given the test user to use or reject this section as 
he desires. 


Joun S. RrcnanpsoN, Associate Professor of 
Education, Ohio State University, Columbus, 
Ohio. 

According to the manual, the distribution of 
items by content areas is: physics, 42 per cent; 
biology, 28 per cent ; chemistry, 4 per cent; and 
general (weather and climate, communication, 
transportation, geology, astronomy, space, sci- 
entific method), 26 per cent. The tests are not 
diagnostic in nature; no one area of content is 
sampled to a degree that makes possible diag- 
nosis of academic achievement. 

The choice of the items seems to have been 
carefully made, and the wording is clear and 
exact. There is considerable reliance upon the 
recall of factual information; there is also a 
commendable utilization of principles of sci- 
ence. Several items in each form require the 
student to recall or select data relevant to a given 
problem, to test his hypothesis, and to draw a 
conclusion, There is a question as to whether 
the sampling is adequate, even for the purpose 
of gaining an overall estimate of achievement. 
Admittedly, it is difficult to satisfactorily cover 
with 75 items the broad range of content in gen- 
eral science. Perhaps more time should be 
planned for the test with a corresponding in- 
crease in the sampling. 

In view of the evident care in choice and 
preparation of the items and the efforts toward 
standardization, this test should prove one of 
the better instruments for the measurement of 
achievement in general science, 

[ 629] 
*Test of Application of Principles in General 
Science: General Education Series, High school; 
1940-50; Forms А (’50), B ('50) combined are essen- 
tially the same as Application of Principles in Science: 
Test 1.3b (40) which was a revision of Test 1.3 (' 7); 
original tests published by Evaluation in the Eohi 
Year Study, Progressive Education Association; no 
data on reliability and validity; no norms; administra- 
tion of both forms recommended in manual but not in 
publisher's catalog; manual ('50); $2.50 per 25; sep- 
arate answer sheets must be used; 60f per 25 answer 
sheets ; 40€ per set of scoring stencils ; cash orders post- 
paid; 50¢ per specimen set, postpaid ; nontimed (40) 
minutes; Evaluation Staff (Ralph W. Tyler, Direc- 
tor) of the Eight Year Study of the Progressive Edu- 


cation Association; published in 1950 by Cooperative 
Test Division, Educational Testing Service. * = 


REFERENCES 


1, Ratus, Louis E. Application of Principles, Progressive 
Education Association, Evaluation in the Eight Year Study, 
Bulletin No. s. Chicago, Ill.: the Study, December 1936, Py. 
25. Paper, mimeographed. * F 

2. Evaluation Materials Developed for Various Aspects of 
Thinking. Chicago, Ill.: Evaluation in the Eight Year Study, 
Progressive Education Association, March 1939. Pp, [25] 
Paper, mimeographed. * 

3. SmitH, Evcene R.; Tyrer, RALPH W.; AND THE Були 
TION Starr. Appraising and Recording Student Progress, M. 
venture in Arherican Education, Vol. III. New York: Harper 
& Brothers, 1942. Pp. xxiii, 550. * (РА 16:5033) 


К. Witt Burnett, Professor of Science Edu- 
cation, University of Illinois, Urbana, Illinois. 

Perhaps the most commonly professed ob- 
jective of science teaching is the development of 
critical thinking. The teacher, however, does not 
have adequate and valid instruments with which 
to determine his success in this objective or to 
diagnose his students' weaknesses and his own 
teaching techniques. It is therefore somewhat 
surprising that the essentially pioneering work 
of the Committee on Evaluation and Recording 
of the Progressive Education Association's 
Commission on the Relation of School and Col- 
lege has not been built upon by others or that the 
promising instruments developed in connection 
with the Eight Year Study have not been re- 
fined and made available before this time. 

One of these instruments was the Test oj 
Application of Principles in General Science. 
Two forms of this test are now available in sub- 
stantially the same form as that finally developed 
in 1940 by the aforementioned committee. De- 
tailed information on the procedures used in 
constructing and validating this instrument 15 
available in Appraising and Recording Student 
Progress (3). As the present publishers have no 
additional data and as they are interested in se 
curing from users data that can be used іп es- 
tablishing norms (and, we hope, in improving 
the reliability and usefulness of the test), those 
interested in utilizing the instruments are 
encouraged to study with care the reference 
cited, 

Cooperating teachers in the Eight Year Study 
agreed upon certain principles of natural science 
as being applicable to such areas as person? 
health, public health, conservation, consumer 
ship, and so forth. Problem situations were 06 
cided upon that met the following criteria: (a) 
they were likely to be new to students ; ( b) they 
were likely to occur rather commonly in ac 1 
life; (c) they could be interpreted, explained, d 
resolved through the application of one of t ч 
selected principles; and (d) they were of ЭШ 
nificance to the students in that the solution д 
explanation might be helpful in solving ОТ © 


n d 


plaining similar problems that occur in every- 
day living. 

Students were asked to respond to the prob- 
lem situations in essay fashion and to list their 
reasons for their responses. These essay re- 
sponses were used in framing an objective form 
of the instrument, Experimental groups then 
responded to both forms (essay and objec- 
tive) and a correlation coefficient of ,80 was 
found. The objective form was, therefore, 
adopted. 

In the final form, situations or descriptions 
that provided stated predictions, courses of ac- 
tion, or stated causes were employed. Form A 
and Form B each state four of these problem 
situations, The student is asked to decide 
whether he agrees, disagrees, or is uncertain of 
his agreement with each statement. Only one 
conclusion for each statement is defensible 
through the use of scientific principles. The other 
two, however, are never nonsensical, absurd, or 
preposterous, 

The student is then asked to check from a list 
of reasons why he agreed, disagreed, or was un- 
certain, Both valid and invalid reasons are listed 
for the student's inspection and response. The 
valid reasons can be categorized as: (а) citing 
established scientific facts, principles, generali- 
zations; (b) citing of personal experience that 
is valid and relevant ; (c) citing of authoritative 
references ; and (d) proper use of analogies per- 
tinent to the situation, The invalid reasons can 
be categorized as: (a) teleological; (b) ridi- 
cule; (c) assuming the conclusion; (d) unac- 
ceptable authority; (e) unacceptable analogy ; 
and (f) unacceptable common practice, Each of 
the invalid reasons is stated so as to appear con- 
sistent with correct conclusions. As both the 
valid and invalid reasons were taken from essay 
responses of students to earlier forms of the in- 
strument, one can assume a fair curricular va- 
lidity in the type and form of reasons listed in 
the present instrument. 

Page 515 of the Smith and Tyler reference 
cited above provides information concerning the 
reliability of the instrument in its earlier form. 
The original Forms 1.3b and r.3c (each of which 
was twice the length of a form of the present 
edition) were utilized to obtain the following 
product-moment correlation coefficients for cer- 
tain subparts: accurate conclusions, .48; right 
of reasons, .67; percentage of principles right, 
36; percentage of reasons invalid, .33 ; and per- 
centage of inconsistent responses, .15. The cor- 
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responding reliability coefficients for tests of the 
length of the tests being reviewed would be ap- 
proximately .32, .50, .22, .20, and .08, Obviously, 
the reliability of the instrument is too low to 
allow intelligent use of normative data even if 
they were available. The present value of the 
instrument is as a teaching device through which 
teachers and students can consider certain as- 
pects of critical thinking, The earlier studies of 
the Committee on Evaluation and Recording 
disclosed typical categories of invalid responses. 
These categories as well as valid categories are 
built into the present instrument. Where teachers 
find that students tend to respond by assuming 
the conclusion or by any of the other invalid 
categories, a useful basis for corrective teaching 
is provided. The present test is rather cumber- 
some. It requires considerable time on the stu- 
dent’s part and is somewhat complicated to score 
and analyze. The reliability of the instrument 
can hardly be materially improved until tech- 
niques that will provide for less time per prob- 
lem and for more problems are developed. In the 
reviewer's opinion, the instrument is reasonably 
valid for its purposes and is the best that is now 
available, If the test is widely and experimen- 
tally used, the publishers should soon have 
data upon which to construct sounder instru- 
ments more predictive of real behavior and of 
higher reliability. Norms may be useful at that 
stage. Me 
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*Measurement of Observation and Understand- 
ing of Physical Phenomena Life Processes. 
Grades 9-10; 1952; 1 form ['$2]; mi $ 
data on reliability and validity; no norms ; рег т 
postage extra; test be reproduced by users; 25 
Postage imen set (specimen set must be purchased to 

), postpaid ; gontimed (4 ) minutes ; 
Annette M. Wright and Hugh B. ; University 
of Oregon Press. * 
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[632] 
*The New Air World. High school and adults; 
1944; 3 scores: vocabulary, information, total; 1 form; 
по сва ы теш validity; no norms; ra 
44]; per 30; 25% per specimen set; postpaid; 30 
(35) minutes; Elizabeth Sterry, D. F. Votaw, H. T. 
Manuel; Steck Co. * 


PHYSICS 


[633] 

*College Entrance Examination Board Achieve- 
ment Test in Physics. Candidates for college en- 
trance; 1901-51; available only in College Entrance 
Examination Board Admissions Testing Program (see 
526); 60(70) minutes; prepared by College Entrance 
Examination Board Committee of Examiners in Phys- 
ics in cooperation with the Staff of Educational 
Testing Service: 1951 membership: Paul A. Nor- 
throp (Chairman), Alexander Efron, C. L. Henshaw, 
W. C. H. McQuarrie, and Howard A. Taber; pro- 
gram administered by Educational Testing Service 
for the College Entrance Examination Board. * 


REFERENCES 
1. "Proposed Revision of the Requirements for the College 
IDE Бхзшкана in Physics." Sch Sci & Math 40:686-90 
40. 


2. Weer, СнвіѕттАМ О. “Old and New College Board 
Scores and Grades of College Freshmen." J Ат Assn Col Reg 
20:70-5 o "44. Ae 19:570) 2 p 

3. TURNBULL, WiLLIAM W. “Effect of Sequence of Topics 

in Physics Courses on Scores in the April Physics Test." Col 
Ва R 1:10-1 sp '47. * (PA 22:838) 
Parwzn О. Јонмѕом, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. [Review of Forms VAC2, WAC2, XAC, 
Y AC, and ZAC.] 

These tests are components of a battery of 
tests designed to estimate the achievement of 
secondary school seniors who are applicants for 
admission to certain American colleges. Scores 
on these tests are reported to colleges at the 
request of the applicant and may be used as one 
of a number of means of estimating a candidate's 
preparation for college and predicting his future 
performance with similar material. The scores 
are also sent on request to the secondary schools 
for what value they may have for guidance pur- 
poses. The tests are thus primarily designed to 
estimate the achievement in physics of those 
students planning to enter college. 

The preparation of the test forms is the re- 
sponsibility of the Physics Committee appointed 
by the College Entrance Examination Board, the 
membership consisting usually of three college 
instructors, one public high school and one pri- 
vate secondary school instructor. This committee 
is served both practically and technically by 
regular members of the Educational Testing 
Service staff. 

The tests in their present form are the result 


of a long period of development during which 
systematic experimentation and study of empiri- 
cal results have been used to appraise the ef. 
fectiveness and to improve the quality of the 
tests. The procedures preliminary to the prepa- 
ration of the final form of any test include: (a) 
the construction of test items by the committee 
members with the technical assistance of ETS 
representatives in accordance with the previ- 
ously agreed to specifications, (5) the review of 
all items by the same individuals, (c) the ad- 
ministration of the pretesting items to a repre- 
sentative sample of the regular Board popula- 
tion, and (d) detailed statistical analysis of the 
pretest items to determine optimum properties. 
On the basis of this preliminary trial, the ex- 
aminers committee with the assistance of ETS 
representatives recommend changes, make sub- 
stitutions and revisions. The favored types of 
items in use are a balanced combination of com- 
pletion items, true-false, and 5-choice (classifica- 
tion) items. A highly critical job of editing for 
both technical and grammatical errors is a final 
step. 

An exhaustive study is made of the results 
from the administration of the final form of the 
tests. A test analysis report is prepared for each 
new form of every test. This includes the total 
score distributions indicating the range and 
skewness. An analysis of greater detail is car- 
ried out on a random sample of papers to yield 
information on such properties as the speeded- 
ness of the test and the overlap between earned 
scores and chance scores for the purpose of 
gauging more accurately the difficulty spread 
for the next form of the test and determining the 
optimum number of items and time allotment for 
the test. A detailed item analysis is made on the 
same sample. As an example, the following data 
among others were reported for one form of the 
test: reliability, .928; standard error of meas 
urement, 5.9; speededness of sections, percent 
age completing test, 66.8 ; percentage completing 
75 per cent of test, 94.4 ; number of items reache 
by 80 per cent of the candidates, 142; total num- 
ber of items, 152. ; 

The problem of the committee in developing 
tests that represent accurately the objectives 0 
high school physics for college entrants is a very 
difficult one. The scope of content covered an 
the character of the functions measured shou 
be indicated by an examination of the test items: 
The reviewer classified the items of the five forms 
in accordance with the topics generally covere 
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in the high school physics course. This led to 
the following percentage distribution: mechan- 
ics, 38; heat, 9; sound, 8; light, 16; magnetism 
and electricity, 26; and general, 3. This distri- 
bution, except for its greater emphasis on light 
and its lesser emphasis on heat, is not greatly 
discrepant from that of a recent study of 10 high 
school physics tests and 12 physics text books. 
It may be said, however, that over a period of 
years the Board makes use of subject matter 
checklists to insure that there are no biases in 
subject matter coverage. The appearance of 
questions, though few in number (3 per cent), 
cutting across the various divisions of physics 
would probably be favored by an increasing 
number of high school physics teachers, as 
probably would be a wider integration around 
the principles of the whole area of the physical 
sciences. 

A second classification of items, according to 
the function tested, yielded the following, per- 
centages : informational, 21; interpretive, 42; 
and problem solving, 37. This is an overall ap- 
proximate classification which would be more 
meaningful if the conditions of instruction were 
known to be in agreement with the classification 
principle. Nevertheless, the emphasis upon un- 
derstanding of principles and the ability to apply 
them to new situations, as well as the stress on 
understanding and applying the scientific 
method, reflects very well current points of view. 
The committee could very well extend the ap- 
plication of scientific method since physics pro- 
vides such clear and rigorous examples of sci- 
entific analysis and approach. А 

These tests, then, reflect high standards of 
attainment. If institutions, both higher and sec- 
ondary, succeed in using the information latent 
in the test results to contribute to the solution of 
their own instructional and guidance problems, 
the potentialities of such a well conceived and 
executed measurement program will come to 
fruition. 


[ 634] 

*Cooperative Physics Test. High school; 1932-51; 
IBM; Forms X (’47), Y (48), Z (49) ; Forms 1932- 
37, N-Q, S out of print; no data on validity; no spe- 
cific manual; general Cooperative manual (^51) ; 
norms [37]; $2.50 per 25; 50€ per specimen set, post- 
paid; separate answer sheets may be used; 8o per 25 

M answer sheets; 15€ per stencil for scoring answer 
sheets; cash orders postpaid; 40(45) minutes; Paul J. 
Burke; Cooperative Test Division, Educational Test- 
ing Service. * 

REFERENCES 


Få 
i 

1. Brnincer, Mary LEE. A Study on the Prediction of Suc- 

cess in an Engineering College. Master's thesis, Clark University 


(Worcester, Mass.), 1945. (Abstracts of Dissertations....1945, 


рр. 20-21.) 
F. "The Distribution of Emphasis in Ten 


2. WEAVER, J. 
Physics Tests and in Twelve Physics Textbooks." J Ed Res 


39:42-55 S '4s. * (РА 20:571) 

For a review by G. P. Cahoon of Forms P, 
Q, and S, see 3:581; for reviews by Andrew 
Longacre, Alvin W. Schindler, and Ralph К. 
Watkins of Form P, see 40:1608; for reviews 
by Ernest E. Bayles and A. W. Hurd of Form 
1937 and Form N, see 38:1088. 


[635] 
*Cooperative Physics Test: Educational Rec- 
ords Bureau Edition. High school; 1941-51; Forms 
ERB-RX (48), ERB-SX (49), ERB-TX (о); 
Forms ERB-R, ERB-S, ERB-T out of print; no data 
on reliability and validity with test (for data presented 
elsewhere by the publisher, see 2 below) ; no specific 
manual; general Cooperative manual (’51); Form 
ERB-RX norms (’51), Form ERB-SX norms (49), 
Form ERB-TX norms (о); $2.50 per 25; 25¢ per 
specimen set; postpaid; 80(85) minutes ; the Commit- 
tee on Physics Tests of the Educational Records Bu- 
reau (Lester D. Beers, Chairman) ; Educational Rec- 
ords Bureau. * 
REFERENCES 

т. Traxuer, AnTHUR E. “Some Data оп the Difficulty and 
Validity of the Cooperative Tests in Biology, Chemistry, and 
Physics, Form ERB-R," pp. 64-8. In 1941 "Achievement Test- 
e Program in Independent Schools and Supplementary Studies. 
Educational Records Bulletin, No. 33. New York: Educational 
Records Bureau, June 1941. Pp. x, 76. Paper, lithotyped. * 
(PA 15:4388) 

2. TRAXLER, ARTHUR E. “Some Data on the Difficulty, Relia- 
bility, and Validity on the Cooperative Biology, Chemistry, and 
Physics Tests, Forms —Rx," pj ДЕ (PA 23:314) In 
1948 Achievement Testing Program in Independent Schools and 
Supplementary Studies. Educational Records Bulletin, No. 50. 
New York: Éducational Records Bureau, July 1948. Pp. xiii, 
86. Paper, lithotyped. * 


[636] 5 

*Dunning Physics Test: Evaluation and Adjust- 
ment Series. Grades 11-13; 1951-52, C1950-5I ; IBM; 
Forms AM ('51), BM (32); manual (°51) ; $2.50 per 
25; separate answer sheets must be used; 8o¢ per 25 
IBM answer sheets; postage extra; 356 per specimen 
set, postpaid; 45(50) minutes; Gordon M. Dunning; 
World Book Co. * 

С. P. Canoox, Professor of Education, Ohio 
State University, Columbus, Ohio. 

This test appears to have been carefully con- 
structed “to measure the extent to which stu- 
dents have achieved the important objectives of 
a high school course in physics." The directions 
for administering and scoring appear specific 
and adequate. 

Each of the two parts of the test contains half 
of the items on each topic in order to give *slower 
students an opportunity to attempt items in all 
the different content fields." 

While considerable information is given in the 
manual on the general method of construction 
and the tryout procedures, the actual validation 
procedure is not clearly indicated. The guiding 
principle (no others are indicated) seems to 
have been that “most elements measured may be 
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justified both in terms of frequency of inclusion 
in commonly used textbooks and on the basis 
of expert judgment as to importance.” Little 
or no emphasis seems to have been placed upon 
such aspects as critical thinking or scientific 
methods, the application of physics principles to 
everyday devices, and important recent develop- 
ments. 

In general, however, this test seems to be as 
valid and reliable as any in terms of what is ac- 
tually “covered” in most secondary school phys- 
ics courses, and more functional and more care- 
fully constructed than most available tests in 
physics. 


[ 637] 
*Graduate Record Examinations Advanced 
Physics Test. Senior year college through graduate 
school and candidates for graduate school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527); 180(220) minutes; prepared by the 
Advanced Physics Test Committee appointed by Edu- 
cational Testing Service; J. William Buchta (Chair- 
man), Edward С. Campbell, Louis М. Heil, Claude J: 
Lapp, and Richard M. Sutton; Educational Testing 
ervice.'* 


[638] 

Iowa Placement Examinations: Physics Apti- 
tude, Grades 12-13; 1925-44; IBM ; 2 editions; no 
specific manual; 50(55) minutes; G. D. Stoddard and 
C. J. Lapp; Bureau ot Educational Research and Serv- 
ice, State University of Iowa. * 

а) SERIES PAI, REVISED. 1925-26; series title on Form 
B is Series PA-1, Revised; Forms A (25), B (25); 
series manual ['26] ; $5 per 100, postage extra; 60€ per 
specimen set (includes the other 10 tests in the series), 
postpaid; prepared under the direction of C. Е, Sea- 
shore and G. M. Ruch. 

b) NEW SERIES PA-2, REVISED. 1925-44; also called 
Quick-Scoring Edition; IBM; Form M (41) ; $175 
per 25; 25¢ per series manual ('44) ; 50¢ per specimen 
set (includes the other 4 tests in the series), postpaid ; 
separate answer sheets may be used ; $3.50 per 100 IBM 
answer sheets; 30€ per stencil for Scoring answer 
ШЕ, postage extra ; revised by C. ]. Lapp and D. B. 

tuit. 


REFERENCES 
1-4. See 3:587. 


5. STALNAKER, JOHN M. A Statistical Study of Some Aspects 
of the Purdue Orientation ed Program. Bulletin of Pur- 
due University, Vol. 28, No. 6; Studies in Higher Education 
VIII. Lafayette, Ind.: Division of Educational Reference, 
the University, February 1928. Pp. 68. Paper. * (PA 5:831) 

Unrsrock, RICHARD STEPHEN. “Construction of a Selec- 
tion Test for College Graduates.” J General Psychol 41:153-93 
О 49. * (РА 24:4874) 


Joun W. Frencn, Research Associate, Educa- 
tional Testing Service, Princeton, New Jersey. 
This test consists of four parts: 


a) Twenty-five short free answer items in arith- 
metical and algebraic skills that are essential for 
physics. These are well constructed and of great vari- 
ety. Two rather time-taking items in arithmetical 
Computation are included, which, although they might 
deprive slow computers of sufücient time to finish the 
part, would sefve appropriately to weed out a few very 


slow computers who would have trouble in a physics 
course. y 

b) Three reading paragraphs, each with 10 truc- 
false items. This is the only part that is not highly sat- 
isfactory for a test of this kind. It suffers mainly he. 
cause first-year training in physics would permit the 
student to answer all questions without readin the 
paragraph. This means that the part is so sensitive to 
training that it would measure a student's knowl t 
prior to formal training rather than his ability to com- 
prehend scientific reading matter. It should be pointed 
out that the inclusion of a part on reading is highly 
desirable for providing valid prediction of course 
grades. It is suggested here, however, that the content 
of the passages be selected from material not ordinarily 
covered in elementary physics but at the same level of 
difficulty. - 

c) Twenty series and syllogistic statements (true- 
false). These seem an excellent measure of ability to 
do the reasoning required in physics and to put verbal 
statements on a quantitative or proportional basis. 

d) Fifty general physics information or common- 
sense statements (true-false). With the exception of 
a few items such as "Energy is the capacity to do 
work," all of the information required to attain a high 
score on this part of the test is of the sort that an alert 
high school student interested in physical things is 
likely to pick up outside of any formal training. The 
items have good coverage of the various areas of prac- 
tical physical knowledge. A few of the items, however, 
are ambiguous or would be answered incorrectly by 
the well informed purist. 


The general Series РАт manual is definitely 
wanting. It gives norms, reliabilities, and validi- 
ties without any mention of the population on 
which the figures are based. Validity is not pro- 
vided separately for each test (Physics Aptitude, 
English Aptitude, etc.) ; only an average var 
lidity, .50, for the aptitude series is given. 

The New Series PA-2, Revised, is a test of 
generally high quality, a slight improvement 
over PAr, with about the same virtues and vices. 
The first three parts have five choice answers; 
the last uses true-false items. An answer sheet 
is used for all parts, making scoring easier. In 
the first part, unlike that in PA, precise numer- 
ical computation is not measured by the items 
calling for this activity, since the choices are $0 
widely separate that the items can be answer? 
by estimation. The reasoning part, now Part 2, 
“Formulation,” instead of Part 3, shows jd 
greatest change from the earlier test. This раї 
which involves the selection of a formula » 
represent a verbal statement, is excellent. PES 
3, "Reading," is, like Part 2 in РАт, оуег 
tive to training. There is а bad error in PS 
graph II. In several instances a number of po 
following the paragraphs bear upon the sat 
point. Part 4 is very similar to Part 4 of PA p 

The general Series PA-2 manual is а 87 es 
improvement over the earlier manual. It LAU 
more explanation and indicates the populatio! 
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on which the figures are based; in fact, it does 
everything that a manual reasonably should do, 
with the exception that it gives only a range of 
validity coefficients (.61—72). There are a few 
minor inconsistencies in an explanatory table 
(p. 20). 

This appears to be a very satisfactory test for 
the placement and predictive purposes to which 
it aspires. Physics aptitude uncontaminated by 
previous training is of necessity difficult to meas- 
ure. With the exception of the reading part, this 
test goes about such measurement in a highly 
commendable way. 


For a review by Robert M. W. Travers, see 
3:587. 


[639] 

Iowa Placement Examinations: Physics Train- 
ing: Series PT1, Revised. Grades 12-13; 1925-26; 
Forms A (25), B (25); no specific manual; general 
series manual ['26] ; $5 per 100, postage extra ; 60€ per 
specimen set (includes the other то tests in the series), 
postpaid; 43(50) minutes; C. J. Lapp and G. D. Stod- 
dard; Bureau of Educational Research and Service, 
State University of Iowa. * 


REFERENCES 


G. P. Canoon, Professor of Education, Ohio 
State University, Columbus, Ohio. 

This test consists of 125 test items to be done 

within the usual period of 50 minutes. The test 
is divided into four parts, with time limits of 
from 8 to 17 minutes for each part, totalling 43 
minutes for the whole test. 
, No manual is available for this specific test; 
instead, there is a short 3-page “manual of di- 
rections" for all tests in the series of Iowa Place- 
ment Examinations. The manual presents no 
information regarding the validity of the Physics 
Training test. How the items were determined is 
not suggested. 

While there seems to be no direct statement 
as to the level of students to whom this should be 
given, the test apparently has been planned par- 
ticularly to be administered to college students 
at the beginning of a college course in physics. 
While one may question the statement in the 
manual that “this examination should give at 
the end of two hours as adequate information 
about the student's place and needs in the course 
as the instructor ordinarily acquires at the end 
of the first semester under the traditional meth- 
ods of instruction" it probably does suggest one 


of the major purposes. Another statement in 
the manual that ““Тһе training series measures 
previous training which is considered essential 
preparation to the work at hand" does not seem 
to clarify much the possible usefulness. 

This examination, of course, was one of the 
earliest of the “new type tests" prepared toward 
the beginning of the "testing movement," as 
the copyright date of 1925 suggests. When the 
revision occurred, or just what the nature of 
the revision has been, is not indicated. In the 
light of the present status of the construction 
and uses of tests, this test appears to have con- 
siderable limitations. Of course, the validity of 
the test should be questioned, particularly since 
there is no evident relationship of the test items 
to what is or should be included in a modern 
course in “physics training." The majority of 
the test items seems to be concerned with the 
memorization of selected facts; some are com- 
pletion, and some are true-false. The short sec- 
tion of seven items (Section 3, Part 3) on the 
association of the name of the “discoverer” of so- 
called fundamental laws or principles is particu- 
larly questionable with reference to “training” 
in physics. The sampling of 15 problems in Part 
4 seems quite inadequate and inappropriate; 
their solution depends largely upon the memori- 
zation of key formulae and the substitution of 
given values in them. 

‘There is little concerning the application of 
physics principles to everyday devices and phe- 
nomena. One of two true-false items relating to 
radio at least dates the test with a reference to 
“а man associated with radio is Armstrong.” 
Of course there are no items concerned with 
recent developments and applications of physics 
during the past 10 or 15 years. There also are 
no items dealing with the methods of science, 
critical thinking, or scientific attitudes. 

Decile norms based upon more than 2,000 
cases are given. A reliability coefficient of .85 is 
reported without any indicat as to how it was 
computed. 

While this test pioneered in the field, and per- 
haps was quite appropriate and adequate in “the 
early days,” it does not seem useful for present- 
day courses which are concerned with different 
objectives and changing content. 

[ 640] 
Physics: Achievement Examinations for Sec- 
ondary Schools. High school; 1951; 1 form; no data 
on reliability and validity; no manual; Minnesota 


norms (median and quartile deviation) available ; simi- 
lar norms for other regions by special arrangement 
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with publisher; 7¢ per test, postage extra; 60(65) 
minutes; edited by Walter W. Cook; prepared by a 
curriculum committee of high school teachers for „use 
in the Minnesota State Board Achievement Examina- 
tions Program; Educational Test Bureau, Educational 
Publishers, Inc. * 


[ 641] 

*Physics: Every Pupil Test. High school; 1929- 
51; new form usually published each April and De- 
cember; form December 1951; no data on reliability 
and validity; no manual; norms (’51); 215€ per test ; 
1¢ per answer key; postpaid; 40(45) minutes; Ohio 
Scholarship Tests, Ohio State Department of Educa- 
tion. 


[ 642] 


*Physics Test: State High School Tests for In- 
diana. First, second semesters high school; 1934-49; 


2 levels; mimeographed; no data on reliability and 

validity; no manual; norms ['49]; 4526 per test; 1s 
r specimen set; postpaid ; 40(45) minutes; James Ў 

Mackell; State High School Testing Service for Indi- 

ana, Purdue University. * 

a) FIRST SEMESTER. Form A ['48]. 

b) seconp semester. Form N ['48]. 


For a review by A. W. Hurd of an earlier edi- 
tion, see 38:1092. 


[643] 
*Physics: 20th Century Test. First, second semes- 
ters high school; 1949; 2 levels; Form A; no data on 
reliability and validity; no norms—author recommends 
the use of local norms; 10 or more copies, 5€ each, post- 
age extra; 35€ per specimen set, postpaid ; 40(45) min- 
utes; Benton Review Publishing Co., Inc. * 


SENSORY—MOTOR 


Reviews BY Anna Espenschade, Elsie Murray, William G. Peacher, 


William Sloan, and 


Miles A. Tinker. 


[644] 

Harris Tests of Lateral Dominance. Ages 6 and 
over; 1947; I3 scores: knowledge of right and left, 
hand preferences, simultaneous writing, handwriting, 
tapping, dealing cards, strength of grip (optional), 
total hand dominance, monocular sighting, binocular 
sighting, visual acuity (optional), total eye dominance, 
kicking; individual; 1 form; no data on reliability: 
$2.90 per manual and 25 record forms; $175 per 25 
record forms; $1.50 per manual and sample record 
form; the actual test materials, including A-B-C 
Vision Test for Ocular Dominance (see 654), must be 
purchased or assembled separately ; Postpaid; (10-15) 
minutes; Albert J. Harris; Psychological Corpora- 
tion. * 
WILLIAM С. Peacuer, Assistant Professor of 
Speech Pathology and Audiology, School of 
Speech, and Assistant Professor of Special Edu- 
cation, School of Education, Syracuse Univer- 
sity; Instructor of Surgery (Neurosurgery), 
School of Medicine, State University of New 
York Medical Center; Syracuse, New York. 

This test has E used in speech and read- 
ing clinics for cases of disability, particularly 
when the problem of lateral dominance has been 
involved. A 27-page manual accompanies the 
test form. Five pages only are devoted to theory, 
the remaining being concerned with directions 
for test administration. The meaning and sig- 
nificance of lateral dominance are discussed, to- 
gether with four major theories to demonstrate 
its relationship to language disabilities, It would 
have been more helpful to the average reader to 
have discussed the theories of dominance in 


greater detail and to have provided additional 
references. There are 24 references, the ma- 
jority of which, however, concern reading dis- 
ability. 

It is conceivable that this test might also be 
of value in the determination of laterality in 
cerebral lesions in adults with aphasia. This 
would concern the case of pathology in Broca’s 
area in a right-handed individual in the absence 
of aphasia. Detailed testing, by revealing that 
such an individual may actually be predomi- 
nantly left-handed, may obviate the production 
of aphasia in the anticipated dominant hemi- 
sphere. The correlation of stuttering and hand- 
edness has been deemphasized in recent years. 

Perhaps the greatest need in relation to test- 
ing for lateral dominance would be in the field 
of cerebral palsy in the preschool age group with 
motor handicap. This test would not be applic: 
able in this category as it is geared to an age 
range of six years to maturity. Under ordinary 
circumstances one is not concerned about domi- 
nance in children below three, and handedness 
is usually well established by the age of six. The 
only exceptions in our experience have been In 
those cases with language, mental, and motor 
retardation, and associated emotional implica- 
tions. As the author points out, there has We 
some relationship between dominance and rea 
ing disability, although such emphasis has var! 
in different clinics. Aside from its use in reading 
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disability and also in the aphasic group, lateral- 
ity testing will still be largely in the preschool 
age group. 

In summary, therefore, the Harris Tests of 
Lateral Dominance have been used successfully 
in cases of reading and language disabilities. This 
examination merits value because of its simplic- 
ity, brevity, ease, and quickness of performance. 
Thus far, however, its results have been quali- 
tative and clinical rather than quantitative. 


Mns A. Tiwxzm, Professor of Psychology, 
University of Minnesota, Minneapolis, Minne- 
sota. 

This series of tests was organized to meet the 
practical requirements of clinical practice and 
at the same time to maintain satisfactory stand- 
ards with regard to comprehensiveness, relia- 
bility, and validity. Lateral dominance is in terms 
of composite ratings derived from several de- 
terminations of ocular preferences, manual or 
foot preferences, and relative skill in eye-hand 
coordination. These ratings range along a five 
point scale from strongly right to strongly left 
dominance. 

The tests are interesting and relatively simple. 
Directions for administering and scoring the 
tests and for interpretation of results are clear 
and complete. Although further standardization 
is under way, some relevant information is given. 
Data showing hand and eye dominance in ran- 
dom samplings of the population and in reading 
disability cases are listed in the manual as well 
as the relation between total hand dominance 
ratings and eye dominance ratings. There are 
relatively little data on reliability given. The 
reliability of the A-B-C Vision Test, reviewed 
elsewhere in this volume, is satisfactory. A re- 
test reliability of .88 for the Kaleidoscope Test 
and of .71 for the Hole in Card Test are cited. 
The three parts of the monocular tests are in 
substantial agreement. This also holds for the 
two parts of the binocular tests. The author and 
publisher should provide additional data on re- 
liability and validity as they become available. 
Nothing is said concerning the validity’ of the 
separate tests. Presumably they have face va- 
lidity, 

_ These various tests for measuring lateral dom- 
inance are well chosen. The view that a compos- 
ite score derived from responses in several tests 
is more valid than the score on a single one is 
sound; nevertheless, some of the tests in this 
battery probably contribute little, as, for in- 


utes. 


stance, the strength of grip and the stereoscopic 
tests. This is recognized by the author, who lists 
these two tests as optional. 

This series, which yields composite ratings of 
eye and hand dominance, is adequate for prac- 
tical clinical use. The tests are simple, interest- 
ing, and readily administered. Nevertheless, ad- 
ditional data on reliability should be provided as 
soon as possible. P 


4 


For an excerpt from a review, see 3:466. 


[ 645] 
XPre-Tests of Vision, Hearing, and Motor Co- 
ordination. Grades kgn-1, 1-3, 4-8, 7-10, 9-12 and 
adults; 1936-51; formerly a part of California Test of 
Mental Maturity (see 282) ; 5 levels; 1 form, '51; no 
data on reliability; manuals [51]; 3€ per copy of any 
one level; 5¢ per manual of any one level; 2¢ per scor- 
ing key for any one level; postage extra; 15¢ per speci- 
men set of any one level, postpaid; Elizabeth T. Sulli- 
van, Willis W. Clark, and Ernest W. Tiegs; California 
Test Bureau. * 5 
a) PRE-PRIMARY. Grades kgn-1; (15-20) minutes. ` 
b) PRIMARY. Grades 1-3; (15-20) minutes. 
c) ELEMENTARY. Grades 4-8; (12-17) minutes. 
d) INTERMEDIATE. Grades 7-10; (10-15) minutes. 
е) ADVANCED. Grades 9-12 and adults; (10-15) min- 
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[646] 
*Auditory Tests (Spondaic Word Lists). Grades 
2 and over; 1951; a recorded adaptation of Auditory 
Test No. 9 developed by Psycho-Acoustic Laboratory, 
Harvard University ; individual; 2 tests: each test con- 
sists of 6 different arrangements of the same 36 words ; 
mimeographed manual; $10.50 per set of three 12-inch 
(78 rpm) records for any one complete test; $4 per 
single 12-inch (33% rpm) record containing four lists 
of any one test; $4 per single 12-inch (3316 rpm) rec- 
ord containing two lists of each test; $10.50 per set of 
three 12-inch (33% rpm) records containing all lists of 
both tests; postage extra; (10) minutes; records dis- 
tributed by Technisonic Studios; manual published by 
Central Institute for the Deaf. * 
a) TEST w-1. A “constant level" test in which sound 
intensity must be attenuated by examiner. 
b) test w-2. A “descending level" test in which sound 
intensity has been attenuated by manufacturer in down- 
ward steps of 3 decibels. 


[647] 

*The Massachusetts Hearing Test. Grades 1-16 
and adults; 1948; a method of group screening using 
prescribed forms of scoring, signal presentation, and 
calibration; any pure tone audiometer adapted to group 
testing may be used; the method of testing and instruc- 
tions for scoring and interpretation are presented in 
J Acoust Soc Am 20:697-703 S '48; (17) minutes; 
Philip W. Johnston; standardized instructions for ad- 
ministering may be secured from the Massachusetts 
Department of Public Health, Division of Maternal 
and Child Health. 


[648] 
*Sonotone Screening Audiometer Model 30. Ages 
6 and over; 1949-50; a pure tone instrument (49) 
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equipped with one air conduction receiver available in 
both line and battery operated models; individual; 
manual ('50) ; $222 per line operated audiometer ; $199 
per battery operated audiometer; modified models for 
group screening available on special order; Sonotone 
Corporation. 


REFERENCES 
1. TAYLOR, GLENN J. “An Experimental Study of Tests for 
the Detection of Auditory Malingering." J Speech & Hearing 
Disorders 14:119-30 Je '49. * (PA 23:6377) 


MOTOR 


[ 649] 

*Edmiston Motor Capacity Test. Grades 1-12 and 
adults; 1948; 4 parts; 1 form; $2 per 25 sets of tests, 
postage extra; 35€ per specimen set, postpaid; R. W. 
анод, the Author, Miami University, Oxford, 

hio, * 
.a) HAND-EYE COORDINATION. Grades 1-12; 6(10) min- 
utes. 
b) ESTIMATE OF DISTANCE. Grades 7-12 and adults; 
3(8) minutes. 
C) EYE COORDINATION AND JUDGMENT, I. Grades 7-12 
and adults; 3(8) minutes. 
d) EYE COORDINATION AND JUDGMENT, 2. Grades 7-12 
and adults; 3(8) minutes. 


[650] 


Oseretsky Tests of Motor Proficiency: A Trans- 
lation From the Portuguese Adaptation. Ages 4- 
16; 1946; originally published in Russian in 1923; 
Portuguese adaptation by Maria Irene Leite da Costa 
was published in 1943; individual; $0.25 per set of 
testing materials, 25 record blanks, and manual; book 
and stop watch necessary for administration; $1 per 
25 record blanks; postage extra; (20-30) minutes ; 
English translation by Elizabeth Joan Fosa; sponsored 
and edited by Edgar A. Doll; original test by N. Ose- 
retsky; Educational Test Bureau, Educational Pub- 
lishers, Inc. * 


REFERENCES 


1-6, See 3:472. 

7. Korr, HELENE, “The Relationship of Stuttering to Motor 
Disturbances,” Nerv Child 2:107-16 Ja 4 * (PA 17:2360) 

8. Korr, HELENE. “The Relationship о Stuttering to Motor 
Disturbance." J Consult Psychol затоа Ja-F 48, * 

9. LAssNER, RUDOLF, "Annotated Bibliography on the Oseret- 
sky Tests of Motor Proficiency." J Consult Psychol 12:37-47 
Ja-F '48. * (РА 22:3034) А 

10. SEDAN У/пллАм. The Lincoln Adaptation of the Oseret- 
sky Tests: A Measure of Motor Proficiency. Lincoln, Ill.: Lin- 
coln State School and Colony, 1948. Pp. iv, 68. Paper, mimeo- 
graphed. * (РА 23:1547, title only) 

1r. BALDWIN, BARBARA Jean. A Study of the Oseretsky 
Tests of Motor Proficiency as Correlated. with Mental Age, 
КЕКИ Haas Age, I.Q., and Reading Aptitude of First Grade 
гири. aster's thesis, University of Wisconsin (Madison, 

is.), 1949. 

12. CASSEL, Ковевт Н. The Vineland Adaptation of the 
Oseretsky Tests. Training School Bulletin, Supplement to Vol. 
46, Nos. 3-4, May-June 1949. Monograph Supplement Series, 
Мо; апе, N.J.: Training School, [о Pp. 32. Paper. 

24:15 

13. Casset, Ковевт Н. “The Oseretsky Tests: Vineland 
ETAT Am J Mental Def 55:251-6 О “Ча А 25: 
244 

14. PuELPs, Mary Louise, A Study of the Oseretsky Tests 
of Motor Proficiency at the Four, Five and Six Year Levels, 
Master’s thesis, University of Nebraska (Lincoln, Neb.). 1950. 

15. SLOAN, WittraM. Motor Proficiency and Intelligence. 
Doctor’s thesis, Northwestern University (Evanston, Ill.), 1950, 
нын yf Doctoral Dissertations... une-September 1950, 
1951, p. 611. 

16. StoAN, WiLLIAM, “Motor Proficiency and Intelligence,” 
Am J Mental Def 55:394-406 Ја "т. * (PA 25:5420) 


ANNA ESPENSCHADE, Professor of Physical 
Education, and Research Fellow, Institute of 
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Child Welfare, University of California, Berke. 
ley, California. 

This test is a maturational scale devised to 
measure six different components of motor 
ability in children aged 4—16. One test for each 
component is listed for each year 4 through 10, 
and for the combined years 11-12, 13-14, and 
15—16. At these latter ages, separate tests for 
boys and for girls are included for some com- 
ponents. 

Test number one in each case measures static 
coordination; number two, dynamic coordina 
tion of the hands; number three, dynamic co- 
ordination in general; number four, speed of 
movement ; number five, simultaneous voluntary 
movement; number six, asynkinesia. The Binet 
method of motor age calculation is used, but a 
profile score may be of greater value in diagnosis 

No reports are available on reliabilities of the 
measures, nor is the basis for age placement of 
tests specifically stated. Results from testing sev- 
eral thousand normal Russian children and some 
psychotic patients were used in developing the 
scale. The 1931 publication (German) revised 
an earlier (1923) Russian form and made some 
radical changes in age placement of tests. 

An English translation (1—4) of the test was 
sponsored by Doll, who pointed out specific in- 
adequacies of the test in regard to clarity of e 
planation, objectivity of scoring standards, de 
scriptions of equipment, and the like. Certain of 
these omissions have been rectified in the form 
distributed by the Educational Test Bureau. 

A recent publication (12) by Cassel presents 
a number of changes made on the basis of е 
perimentation with roo subjects. All tests for 
asynkinesia were discarded as impossible to 
objectify; a dynamic coordination test of "s 
ning and jumping onto a chair was eliminat | 
because of danger of injury; and an pe 
has been made to adjust the jumping-for-heig i 
tests to the height of the subject being tested. ^ 
comparison of results obtained on “endogenous 
and “exogenous” mentally deficient subjects "i 
dicates that over half of the retained measure 
which*were within the ability range of this sam 
ple differentiate reliably between groups. i 

Before this test can be useful as a en 
tional or diagnostic scale for children 1" dy 
United States, a great deal of additional We. 
is needed. Experimentation has shown that d 
motor or manual and gross motor perfot n 
show low or zero correlations with each ot ài 
Yet the Oseretsky tests include both type 


( 
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measures of the same component of ability. The 
stunt type test does not correlate highly with 
jumping over a rope. Many stunt tests give sim- 
ilar results for the two sexes up to approximately 
eleven years of age; the jump over a rope shows 
sex differences by seven years of age. The un- 
‘reliability of a single stunt test must be recog- 
nized, and considerable reliance must be placed 
upon the judgment of the examiner. In addi- 
tion, with tests of this nature, a good demonstra- 
tion will make a difference in the results. It is 
improbable that the average examiner would find 
this practicable. The substitution of good pic- 
tures would be an alternative that would aid in 
standardizing procedures. Balance tests with 
eyes open have been shown to measure different 
factors than those with eyes shut. The assump- 
tion that a certain group of tests measures spe- 
cific components of motor ability may be seri- 
ously questioned. 

A complete inventory of motor abilities of 
children cannot neglect the areas of strength and 
speed in gross motor performance. Tests which 
measure these factors cannot be given in a lim- 
ited space, however. In addition, sex differences 
and the influence of height, weight and physio- 
logical maturity must be taken into considera- 
tion, so that an age standard is practically mean- 
ingless. 

It seems clear that this scale for the measure- 
ment of *Motorik" falls far short of the purpose 
of the author and that no simple revision will 
provide a valid measure of the components 
named. It should be possible to use some of the 
tests, especially those listed under “dynamic co- 
ordination of the hands" and "simultaneous vol- 
untary movement," in the development of age 
norms and in diagnosis of marked motor defi- 
ciency. Extensive experimentation and thorough 
statistical study is essential, however. 

To the reviewer, it would seem that additional 
experimentation with available manual manipu- 
lation tests and such tests as the Brace Scale of 
Motor Ability would: be of greater value than 
efforts to revise and refine a scale that has so 
many weaknesses. 


For a related review, see 651; for an excerpt 
from a review, see 3:472. 


[651] 
ike Oseretsky Tests of Motor Proficiency. 
ASSEL, Rosert Н. The Vineland Adaptation o 
the Oseretsky Tests, Training School Bulletin, Sup- 
plement to Vol. 46, Nos. 3-4, May-June 1949. Mono- 
graph Supplement Series, No. 1. Vineland, N.J.: 
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Training School, [1949]. Pp. 32. Paper, Gratis. * (PA 
24:15) 

J Consult Psychol 13:385 O '49. Laurance F. 
Shaffer. The Vineland adaptation simplifies the 
materials needed for the Oseretsky tests of motor 
ability, and makes the tasks more specific and 
objective. * Although the monograph contains 
a report of the performances of 54 feebleminded 
subjects, the test is regarded as not yet stand- 
ardized. 


[652] 
*The Rail-Walking Test. Ages 5 and over; 1941- 
44; a measure of locomotor coordination and general 
motor control; I form, 41; testing apparatus consists 
of three wooden rails: 9 feet by 4 inches, 9 feet by 2 
inches, and 6 feet by 1 inch; specifications for making 
rails and directions for administration, scoring, and 
interpretation are presented in Am J Psychol 57 : 482- 
99 О '44; (5) minutes; S. Roy Heath; norms may be 
obtained from. the Author, Department of Psychology, 
Princeton University, Princeton, N.J. * 
REFERENCES 

т. HrATH, S, Roy, Jr, "Rail-Walking Performance as Re- 
lated to Mental Age and Etiological Type Among the Mentally 
Retarded.” Am J sychol 5 :240-7 Ар '42. * (PA 16:3587) 

2. HeatH, S. Roy, Ja. “The Military Use of the Rail-Walk- 
ing Test as an Index of Locomotor Coordination,” Psychol B 
40:282-4 Ap '43. * (РА 1712664) 3 

3. HEATH, S. Roy, Jr. “Clinical Significance of Motor Defect, 
ye ae Implications.” Am J Psychol 57:482-99 O '44. 

1680. 
: 4. Н FATH, S. Roy, Jr. “А Mental Pattern Found in Motor 
Devas J ‘Abn & Social Psychol 41:223-5 Ap '46. "262. 
20: 
es YKLEBUST, Hetmer К. “Significance of Etiology in 
Motor Performance of Deaf Children With Special Reference 
to Meningitis." 4m J Psychol 59:249-58 Ap 146. * (PA 20: 


3573 ‘Tizarp, J.; IN, COLLABORATION WITH N. O'CoNNOoR AND 
Ji Mc Cien rhe АН 6 06 288-907 д. 
igh-Grade Male Реїѓес e а 9 907 5 
* (PA 25:5421) " 
‚7. O'Connor, N, AND TIZARD, T “Predicting the Occupa- 
tional Adequacy of Certified Mental Defectives. Occupational 

Psychol 25:205-11 Jl '51. * (PA 26:3487) 

WiLLIAM SroAN, Supervising Psychologist, 
Lincoln State School and Colony, Lincoln, Illi- 
nois. 

This test is easy to construct and simple to 
administer. It is not marketed by anyone and no 
separate manual, as such, exists. Instructions 
and directions for its use must be gleaned from 
the various writings of the author. Unfortu- 
nately, these are not always consistent and so 
the user must decide, in some instances, оп his 
own procedures. Two items in point are: (a) 
In the 1942 article (т) the procedure is given 
as, “S, shoes removed, was told to walk to the 
end of Rail I.” In 1944 (3) the procedure reads, 
“Shoes and socks are removed." (b) In 1942 
the rails are “all....1 in. thick.” In 1944, “The 
first rail is about 1 in. high and the others some- 
what higher.” 

Heath claims that the test measures locomotor 
coordination and that it differentiates extremely 
well between endogenous and exogenous mental 


653 


defectives. Later work in the Army has led him 
to postulate five clinical varieties, the other three 
being mixed defectives, exogenous normals, and 
endogenous normals. The rationale for this 
categorization is based upon rail walking scores 
and scores on Wechsler’s Mental Ability Scale 
(т) interpreted in terms of Cattell's concept of 
fluid and crystallized abilities. Evidence from 
sources external to these criteria obviously are 
desirable for validation of such a conceptualiza- 
tion. 

The only published norms on the test are in the 
1944 paper. The standardization group consists 
of a random sample of 1,013 normal Army men. 
The mean motor score on the test was 130.73 
with a standard deviation of 19.63. Since the 
total possible score is 153, a person cannot score 
much higher than one sigma above the mean. 
The distribution is decidedly not normal, with 
poor differentiation at the upper end of the scale. 
Later data on 778 school children ages 6-14 are 
obtainable from the author. The mean scores for 
each age show a general progression although 
the increments for each year are somewhat un- 
even. After age seven, girls are slightly, but 
consistently, inferior to boys. Both Heath and 
the present writer have found that railwalking 
scores correlated about .65 with MA and negligi- 
bly with CA. (This does not hold for the exo- 
genous defectives used in Heath’s 1942 study.) 
As a consequence, norms based upon MA and 
sex would appear to be desirable in any further 
standardization work. 

The test is relatively easy to administer and 
has appeal for young subjects. It takes but a few 
minutes to give and is fairly easy to score. How- 
ever, the large and bulky rails are not easily car- 
tied about. Effect of practice upon performance 
has not been determined. 

A unique measure of locomotor coordination, 
the Rail-Walking Test offers most promise as 
an aid in diagnosis of exogenous mental defect 
if MA is held constant. The test is worthy of 
further studies investigating its possible diag- 
nostic uses. 

ў 


[ 653] 
*V.D.L. Psychomotor Scale for the Measure- 
ment of Manual Ability. Ages 6-16, adults; 1939- 
49; a revision of Un Profil Psychomoteur (’39) and 
Manual Ability Test Series (Psycho-Motor Profile) 
(44); 10 scores: speed-prehension, speed-asyn- 
kinesia, pressure-reproduction, pressure-control, ac- 
curacy-steadiness, accuracy-precision, motor memory- 
direction, motor memory-spatial, coordination-static, 
coordination-dynamic; individual; 2 levels; $58 per 
complete set of testing materials ; $2.50 per 25 complete 
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sets of paper materials; stopwatch and blindfold o 
screening curtain essential for administration; $235 
per manual of any one level; specimen set not avail. 
able; postpaid; nontimed in part (40) minutes; M. 

А. Van Der Lugt; Meed Scientific Apparatus Co, + ` 
а) PSYCHOMOTOR TESTS FOR CHILDREN. Ages 6-6; 


1939-49; І form, '49. 
b) PSYCHOMOTOR TESTS FOR ADULTS. Adults; 1939-48; 
I form, '48. ч 


REFERENCES 
1. VAN Der Lucr, ыл LS “The V.D.L. Psychomotor Scale 
о 


Eu E BED ROI ^E UT. ату 
ANNA EsPENSCHADE, Professor of Physical Ed- 
ucation, and Research Fellow, Institute of Child 
Welfare, University of California, Berkeley, 
California, 

A manual ability test for children was devised 
by M. J. A. Van Der Lugt and standardized on 
Dutch children between the ages of 6 and 12 
during the years 1934-39. In 1944 the test was 
translated into English and adapted and revised 
for adults. The purpose is “to measure the psy- 
chomotor functions, as strictly limited to the 
manual abilities.” Functions measured are speed, 
pressure, accuracy, motor memory, and coordi- 
nation. There are ro tests, of which 5 are said 
to be essentially motor in nature and 5 sensory- 
motor or visual-manual. The majority are per- 
formed with the preferred hand but several are 
bimanual. Movements toward and away from the 
body, to the right and to the left, up, down, and 
around are included, The administration of the 
battery requires about 45 minutes and is done m- 
dividually. 

Tests of speed include preferred-hand removal 
of rings from a vertical rod, movement of rings 
in several directions on a table, and punching 
holes through a paper placed under a metal plate _ 
in which the pattern of holes is fixed. Two-hand — 
speed and coordination tests are simultaneous 
turning of right and left handed screws into? 
plate and simultaneous placement of marble ап 
peg into appropriate depressions in а meta 
square, In the latter test, hands must be cross 
for alternate placements. One pressure test ТЄ 
quires blindfolded reproduction of finger pom 
sures on a calibrated instrument especially de | 
vised for this test. It is unfortunate that the” — 
strument does not have an indicator which wou 
remain fixed at the place where movement it 
as inaccuracies in reading the dial will tend Я 
make the scoring unreliable. According to t м 
author, results on this test are the least discite 
nating of all in regard to age changes, and it wr 
be that the test should not be retained. The e 
pressure measure uses a set of plates like o: 


| 
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‘in the hole punching test. Papers are placed 
under both plates and the subject is instructed 
to punch through the first paper only. Motor 
memory is tested by finger mazes and by re- 
production of geometric figures such as squares 
and triangles. Whether these are tests of manual 
ability may be questioned. One accuracy test 
would seem to be largely a steadiness measure, 
as a marble must be balanced on a plate which 
is to be moved in and out through a pattern of 
rods. Another test consists of two-handed pat- 
tern tracing by means of a pencil attached at 
right angles to the center of a foot-long bar. 

The author claims a retest reliability of .90 
for the battery on 50 cases, but no statement is 
made in regard to age range of subjects used in 
this study. In children, improvement with age 
in all tests except one is reported, and no con- 
sistent sex differences were found. Some start 
has apparently been made on validation studies 
with adults, but the evidence reported is frag- 
mentary. 

It is unfortunate that no studies independent 
of those made by the author of the scale are 
available. This reviewer tried out the scale 
briefly with the assistance of some 20 college 
students, men and women. The tests are interest- 
ing and for the most part challenging. The norms 
given may be questioned since all of this group 
were above the 75th percentile on most meas- 
ures and in many cases were above the goth 
percentile. There also appeared to be a differ- 
ence in favor of the girls. The test directions are 
not clear in some instances, and the apparatus 
for the accuracy-steadiness test would appear to 
be in need of revision. Perhaps the set exam- 
ined was defective in the latter respect, but the 
clearance between plate and rods for several 
pathways in this apparatus was scarcely one 
millimeter, Consequently, all subjects made er- 
rors at these places. A gradation in clearance 
would provide a normal distribution of scores. 
In addition, the marble would scarcely balance 
in the smallest hole of the holder even when the 
latter was stationary. Knowledge of scoring will 
influence results, also, since once a bar is touched 
there is no penalty for repetition of contact with 
this same bar. College students quickly take ad- 
vantage of this. 

Certainly а thorough statistical study of the 
battery would be enlightening and is badly 
needed. A casual examination suggests that it 
has many good features. If the same tests can 
be used for children and adults, surely they 
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should provide an excellent tool for the study of 
development and decline of some manual abili- 
ties. The cost of the equipment may prove a 
deterrent to wide use. However, it is to be hoped 
that а number of investigators will experiment 
with this scale. 


Wittram SroAN, Supervising Psychologist, 
Lincoln State School and Colony, Lincoln, ПИ- 
nois. 

The two forms of this test for children and 
adults use the same set of materials but have 
separate manuals. The testing materials are com- 
pact and well constructed. However, the paper 
forms used with some of the tests suffer from 
indistinctness because they are mimeographed. 
Printing should produce more satisfactory re- 
sults. 

Specific items may be worth some considera- 
tion. In naming the subtests some confusion is 
apparent. Test 1 is “speed-prehension” and con- 
sists of grasping and placing rings. Test 2, 
“speed-asynkinesia,” is considered a “motor- 
sensory” item and consists of punching holes in 
a sheet of paper under a metal sieve. The ra- 
tionale is that vision is a necessary sensory com- 
ponent of the task. On the other hand, the third 
test, “pressure-reproduction,” is considered a 
“motor” item. It consists of pressing on a 
plunger while blindfolded and then reproducing 
the same pressure. Why the kinesthetic sensory 
process is ignored here is not clear. In Test 4, 
“pressure-control,” (a “motor-sensory” test) 
the subject is required to punch through a sieve 
as in the “speed-asynkinesia” test, but here there 
are two sheets of paper underneath the top sieve, 
separated by another sieve. The subject is re- 
quired to punch just hard enough to pierce the 
first layer of paper but not the second. Since no 
preliminary trial is given, the subject never 
knows whether on any one punch he has pierced 
one or both sheets of paper. Test 5, “accuracy- 
steadiness,” (a “motor” test) requires extremely 
good visual and manual coordination, but the 
assembling of the test leaves sufficient margin 
for deviation to make comparison from subject 
to subject of doubtful validity. In this test, too, 
the instructions suffer from adumbration and 
could be reworked to obtain clarity. Test б, “ас- 


curacy-precision," is called “motor-sensory” but 


appears to require less of a "sensory" element 
than the “accuracy-steadiness” test. The in- 
clusion of Test 7 in this battery is curious. This 
is an elevated finger maze which is familiar to 
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many psychologists as a learning test. Just how 
it now becomes a test of “motor-memory” is not 
clear, especially since the author classifies it as 
a "motor" test. Test 8 is a “motor memory- 
spatial" test and is included in the “motor-sen- 
sory" group. It consists of having the subject 
trace a geometric figure after his hand has been 
guided over the figure. АП this is done while he 
is blindfolded. The scoring of this item is the 
most unsatisfactory in the battery. Criteria are 
not clear, illustrations of sample responses for 
adults are in the children's manual, and illustra- 
tions are given for performances of various clini- 
cal types, e.g., schizophrenic, mental defective, 
normal, organic brain damage, blind, etc. The 
author states, “Norms are based on mathemati- 
cal measurements and have been correlated.” 
Correlated with what? She goes on to state, 
“Quantitative scoring may be used if the test is 
given as a separate unit. As part of a battery, 
however, quantitative scoring alone, which 
would include the use of elaborate measurements 
and calculations, for this particular test in its 
present state would be too wasteful in time, if 
valuable at all.” Such statements are not worthy 
of mature psychological research. Tests 9 and 
IO, "coordination-static" and “coordination- 
dynamic," are modifications of several current 
tests of manual dexterity. One consists of turn- 
ing screws into holes and the other of placing 
marbles into hollows and putting pegs into holes 
simultaneously. The former is a “motor” test 
and the latter a “motor-sensory” test. 

As can be seen from the above, the items are 
given labels without apparent justification, scor- 
ing is not always clear, and administration of the 
items may fluctuate. Particularly perplexing is 
the author’s statement, “All material is original” 
(italics hers). Workers in this field will feel 
familiar with a number of the items, except for 
some slight modifications. 

Scores are recorded separately for the 10 sub- 
tests. No overall score is available since each sub- 
test is scored on the basis of a percentile. Means, 
medians, and quartiles are given for each sub- 
test. It is regrettable that standard deviations 
are omitted so that no approximation of the shape 
of the distribution is available, nor can one ob- 
tain standard scores so that scores from one 
test to another may be made more easily com- 
parable. The group on whom these norms were 
established is not clearly defined. The author 
speaks of “several thousand scores...for the 

Motor Memory test.” “The tests have now been 
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administered to more than 700 subjects, some’ 
of them to a considerably larger number + 
Norms have been established on the scores of 
500 adults of various nationalities, predomi- 
nantly North Americans. The latest revision has 
been adjusted to the Tesults obtained for 225 
subjects, the great majority of whom are college 
students * The norms for adults as published in 
a previous edition have been revised whenever 
such revision was necessitated by a more com- 
plex test pattern or refined apparatus.” Finally, 
"In general, since the norms are still considered 
to be provisory, it seems premature fo publish 
detailed statistical information before a fur- 
ther completely satisfactory and especially a 
more representative standardization has been 
achieved.” The reviewer feels that the whole 
test may well have waited. Data comparable to 
the above are given for the norms for the chil- 
dren's scale. In substance, the statistical treat- 
ment is crude and ambiguous. 

The overall impression one gets from this test 
is that a good deal of unsupported inference and 
subjective, intuitive material has gone into its 
construction. Statements appear to be buttressed 
by vague claims and prestige suggestions (cf. 
the appendix of press notices). It is entirely pos- 
sible that this battery of tests may be valid and 
reliable, but the presently available data do not 
support such an assumption. It is to be hoped 
that collection of data on this instrument will 
conform more to objective and experimental 
methods in the future than it has in the past. 


VISION 


[ 654] 

A-B-C Vision Test for Ocular Dominance. Mm 
5 or б and over; 1927-46; individual; 1 form, '4 ide 
data on reliability and validity with test (for data s 
sented elsewhere by the author, see 1-3) ; directions i 
administering (40) ; $2 per set of testing mad 
postpaid; (2-5) minutes; Walter R. Miles; Psy 
logical Corporation. * 


REFERENCES 

1-5. See 3:459. 
Mites A. TINKER, Professor of Psychology 
University of Minnesota, Minneapolis, Minne 
sota. E 

This test is an ingenious device for meast d 
ing ocular dominance uninfluenced by the we 
edness of the subject. The apparatus rd 
simultaneous and balanced use of the two han Fi 
Furthermore, although the subject has the ! и! 
pression that he is using both eyes, he 15 ^ 
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tually employing unilateral sighting. These fac- 
tors constitute advantages for this apparatus in 
comparison with measures of ocular dominance 
such as those obtained by sighting through a hole 
in a card or along a straightedge. Directions for 
giving the test are complete and clearly stated. 

The age ranges at which the test may be used 
are not given in the advertising statement in the 
publisher's catalog or with the instructions for 
giving the test. Ordinarily it is used for indi- 
viduals in the first grade and at all later ages. 
Nevertheless, it has been used successfully with 
children 3, 4, and 5 years of age. 

Omission by the publisher of data on age 
ranges, reliability, and validity is unfortunate 
since many of these data are available (see 1—5). 
Reliability is satisfactory. There is substantial 
agreement from test to retest as follows: 88 per 
cent for children 3, 4, and 5 years old; 95 per 
cent for school children; and 80 per cent for 
adults. Consistency of performance, therefore, 
is high. If one may assume, as is customarily 
done, that the eye preferred for sighting indi- 
cates ocular dominance, the A-B-C Vision Test 
for Ocular Dominance has satisfactory validity. 

This is the most satisfactory single test of 
ocular dominance presently available. It avoids 
the influence of handedness, has satisfactory re- 
liability and (face) validity, and may be used at 
all age levels from 3 years up. 


[655] 
*Eames Eye Test. Grades r-16 and adults; 1938- 
50; 8 pass-fail scores: visual acuity, lens, near vision, 
coürdination, fusion, astigmatic chart (optional), eye 
dominance (optional), total; individual; 1 form, '38; 
revised manual ('50) ; $5.50 per set of testing materials 
needed by examiner, postpaid; hand stereoscope essen- 
tial for administration; 756 per 25 individual record 
cards, postage extra; nontimed T minutes ; Thomas 
H. Eames; World Book Co. * 


REFERENCES 
1-5. See 3:463. 


[656] 
Farnsworth Dichotomous Test for Color Blind- 
ness: Panel D-15. Ages 12 and over; 1947; individ- 
ual; 1 form; $22 per set of testing materials and roo 
analysis sheets ; $2.50 per 100 analysis sheets; post- 
paid; (2-5) minutes; Dean Farnsworth; Psychologi- 
cal Corporation. * 
REFERENCES 

1. FarnswortH, Dean, "The Farnsworth-Munsell 100-Hue 
oud Dichotomous ‘Tests for Color Vision.” J Opt Soc Am 33: 
568-78 O 43, * (PA 18:399) 3 
ta 2. Jupp, бвлме B, “Standard Response Functions for Pro- 
МЛР! and Deuteranopic Vision.” J Opt Soc Am 35:199-221 

т '45. * (PA 19:3284) 


Ers Murray, Research Associate in Psy- 
chology, Cornell University, Ithaca, New York. 
. This test is designed to separate the genu- 
inely “color-blind” (those lacking the red and 


green qualities) from the normal and the merely 
color-weak. In a group of 2,146 males, 5.5 per 
cent failed this test whereas on the usual pseudo- 
isochromatic test, which fails color-weak also, 
7 to 9 per cent of the subjects would have failed. 

The dichotomy in question is not that be- 
tween the old "red" and "green" categories 
(protan and deutan in more recent idiom), but 
that between normal and distinctly anormal or 
grossly deficient. The test is therefore of use in 
industry, military service, and vocational guid- 
ance where this distinction is the focal one. It 
is not intended to measure degree of defect or of 
proficiency. An 8-page manual covers adminis- 
tration, interpretation, and technical aspects. A 
number of typical case diagrams are provided. 

Test material consists of 15 34-inch black 
caps, each exposing a 14-inch disk of color, with 
a single fixed reference cap, in a 14-inch rack. 
The task is to arrange the caps in order by color. 
The colors used are chosen from the 20 stand- 
ard Munsell hues of the color circle, and are of 
medium and uniform chroma and brightness 
(value), thus eliminating the cues ordinarily 
used by color deficients to discriminate and name 
colors. Scoring is facilitated by numbers on the 
reverse of the caps, and analysis blanks are pro- 
vided on which the order of arrangement may be 
plotted on a color circle. Minor transpositions 
are normal, but cross transpositions, such as the 
confusion of red and green, count as serious er- 
rors and score failure. 

According to the location of these cross trans- 
positions on the circle, the subject may be classi- 
fied (according to Farnsworth) as a protan, a 
confuser of a crimson-red and blue-green; a 
deutan, a confuser of purplish red and bluish 
green; ora tritan, a confuser of violet and yellow- 
ish green. Characteristic error diagrams for all 
three are shown. 

When this test was used by Farnsworth (and 
by this reviewer) using all 20 spectral hues, in 
weaker chromas, its reliability was increased. 
The reviewer has occasionally encountered a 
subject with a considerable degree of defect, who 
has been able to effect a correct arrangement, 
apparently by reliance on the degree of chroma 
or saturation of blues (from blue to green) and 
of yellows (from green to purple). 

The test is best used in a battery of color 
vision tests in the clinic. It is of doubtful value 
below high school age, owing to certain con- 
fusions arising from the fact that the color cir- 
cle returns on itself. In the high school, it should 
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be used to replace the familiar American Optical 
Company and the Ishihara pseudo-isochromatic 
plates which provide too great a temptation to 
coaching. The diagram on the analysis sheet 
affords more convincing proof of deviation from 
the normal than do pseudo-isochromatic plates, 
shows the exact location of the defect, and dem- 
onstrates the tendency for complementaries on 
the opposite side of the circle to be affected simul- 
taneously, as maintained by four primary the- 
orists. 

It is unfortunate that the cumbersome old 
Greek derivatives and the misleading caption 
“color-blindness” are retained in this simple and 
useful test. 


For an excerpt from a review, see 3:464. 


[ 657] 

*The Farnsworth-Munsell 100-Hue Test for the 
Examination of Color Discrimination. Mental 
ages 12 and over; 1043-49; formerly called Farns- 
worth-Munsell ro0-Hue Test for Anomalous Color 
Vision; individual; 1 form, '43; manual ('49) ; $60 per 
complete set of testing materials and 25 score sheets; 
$4 per 100 score sheets; postage extra; (10) minutes ; 
Dean Farnsworth; Munsell Color Co., Inc. * 


REFERENCES 
1, FAnNSWORTH, Dean. "The Farnsworth-Munsell roo-Hue 
and Dichotomous Tests for Color Vision." J Opt Soc Am 33: 
568-78 О ’43. * (PA 18:399) 
2. THomas, Савтн J. "Visual Sensitivity to Color: A Com- 
parative Study of Four Tests." Ат J Psychol 56:583-91 O 43, 
(PA 18:422) 


ЕгзтЕ Murray, Research Associate in Psychol- 
оду, Cornell University, Ithaca, New York. 

This test is designed for the detailed study of 
color discrimination, detection of cases of low 
discriminative ability, and location of their points 
of weakness. It is not suitable for the detection 
of superior ability or for drawing a line between 
the merely “color-weak” and the genuinely red- 
green blind. It is an exploratory or research test 
adapted to the college laboratory and the train- 
ing of examiners. А 6-page manual and score 
sheets giving the dominant wavelengths of the 
colors utilized add to the test's value. 

The test is made up of 85 black buttons or 
caps, each displaying a 14-inch disk of color se- 
lected from the Munsell 100-hue experimental 
circuit of a few years ago. The caps are pre- 
sented in four 20-inch black trays, with a fixed 
pilot cap at either end. The task is arrangement 
in color order of the caps in one tray at a time. 
If, by indirect daylight or by artificial filtered 
light of about 6,500° Kelvin color temperature 
(illuminant C), there is uniform illumination 
over the entire length of the tray, the color dif- 
ferences, though small, are well within the dis- 
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criminative range of the normal, and only a few 
transpositions, if any, will be made, 

The test is scored by recording the number 
on the under side of each cap in order and figur- 
ing its displacement. The errors are then plotted 
on the color circle as peaks running out from 
the center. Typical cases are diagrammed in the 
manual, enabling the examiner to identify the 
general type of defect of the individual and the 
mildness or severity of the disability, The oli 
German-type classification into Deutan, Protan, 
and Tritan is followed. 

While the scoring and plotting, which involve 
some eye strain, are time-consuming, the test 
has value as laboratory training apart from its 
diagnostic validity. It familiarizes both subject 
and examiner with the variety and range of pure 
hue differences when chroma and brightness 
(value) are equated. It trains the examiner in 
exact recording and scoring, and the color cir- 
cuit diagrams of error drive home the point that 
physicists reared on the 3-primary theory are 
prone to overlook, that color defects fit the 4 
point hypothesis of the opposites (or 4-color 
theory) most easily. For the error peaks usually 
show on complementary colors at opposite sides 
of the circle, e.g., on red-purple and green, red 
and blue-green, violet and yellowish green. — 

This test should serve not only to train clini- 
cians, but to pick salesmen or industrialists 
whose work involves discrimination of small 
color differences. It will afford better training M 
hue discrimination, moreover, than Dvormes 
so-called "Training Charts." Its use below tlt 
12th year is dubious, however. 


[658] T 
*Glenn Colorule. Ages 7 and over; 1042; а tes ality 
the existence of anomalous color vision and the as re 
of light sources; individual; 1 form; no data on are 
bility and validity; no norms; $5 per testing арра a 
postpaid; test free for educational and researc Blue 
poses; James Glenn and Harold Taylor; Sidney 
menthal & Co., Inc. * 


REFERENCES ; Nou 
1. VAN рем AKKER, J, A.; Topp, J. Еру гопа 
PHILIP; AND INK, WiLLMER А. “Use of a MO Soc 
Colorimeter for the Study of Color Blindness.” J ОР 
37:363-87 My '47. * (PA 21:2909) 


[659] m t. А 
*The Illuminant-Stable Color Vision Tes о 
6 and over; 1949; individual; 1 form; $15 iis Free- 
testing materials, postpaid; (1-3) minutes; I 
man; distributed by Psychological Corporation. · 


” , H ) ‚у a 
Test, L" J Opt Soc Am 38 1535-8 Je M8 т AA An domin. 
Stable Color Vision Test: IL, Experimental Statistica 
tion.” J Opt Soc Am 38:971-6 N ‘48, * (PA 23:2483) 
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Exsre Murray, Research Associate in Psychol- 
ogy, Cornell University, Ithaca, New York. 

This test sets out to eliminate the need for 
regulation of illumination in testing color vision 
by narrowing the range of colors used to yellows 
and oranges, the wavelength stimuli for which 
lie between 575 and 590 millimicrons. (These 
are the hues, by the way, used by Stilling in his 
first experimental plates in 1876.) Only indi- 
rect proof of the failure of changes in illumina- 
tion to affect legibility of the plates is offered. 
While the 12 pseudo-isochromatic plates were 
“selected to yield a dichotomy between the 
normal and the definitely deficient,” the propor- 
tion of the latter is left indeterminate (some- 
where between 4 and то per cent), the test hav- 
ing been standardized on 100 volunteering 
“color-blinds.” The plates are unique, too, in 
introducing, on the basis of Gestalt theory, com- 
plicating white cross-lines which confuse even 
the normal eye and require the lengthening of 
the exposure time for each plate to five seconds 
and the tracing by eye movement of the color 
blocks forming the digit. This transforms the 
task from a color-field test (one of the advan- 
tages of the Ishihara and Stilling) to a test for 
merely foveal color acuity. 

The test comes as a compact 5- by 6-inch 
booklet of 12 stiff black leaves, on which are 
mounted, with cellophane protection, six 6- by 
6-centimeter color charts and six 3- by 6-centi- 
meter charts. The colors are disposed on a uni- 
form checkerboard of 5 millimeter squares 
(marked off by white lines) on which the colors 
forming the digit are superimposed, Configura- 
tional white diagonals and circles also overlay 
the whole. There is a leaf of instructions, and a 
small ro-page manual, largely theoretical. 

The smaller charts are designed for "differ- 
ential diagnosis" to separate the 25 per cent of 
Protanopes and Protanomalous from the 75 per 
cent of Deuteranopes and Deuteranomalous. The 
latter group, it is stated somewhat vaguely, will 
have more difficulty with the smaller plates. As 
in most current tests, with the exception of those 
of Pickford at Glasgow, the fact is ignored that 
the so-called protanopic type in reality repre- 
sents a group with varying shortening of the 
red end of the spectrum, perhaps a shift of all 
the sensitivity curves along the entire spectrum. 

While this test undoubtedly merits being tried 
out on an unselected group by an impartial in- 
vestigator, this reviewer feels that it leans too 
heavily on theoretical suppositions. The con- 


figurational design tends to produce eye-strain 
and is too confusing for young subjects. The 
claim that legibility is unaffected by the quality 
of illumination needs verification ; to the normal 
eye the reds (or oranges) under mazda light 
appear redder and the digit more legible, which 
should be an aid to the ordinary red-green de- 
ficient. There are too few hues used to detect 
all types of lowered sensitivity at a time when, 
according to Pickford, blue-yellow defect is 
gaining more prominence in test results and 
theory. The colors used are all fairly saturated, 
which factor favors the color-weak. A wide lati- 
tude in the misreading of digits (as o for 8) 
complicates diagnosis. The greater range of 
hues and the three degrees of chroma promised 
in the Hardy-Rand-Rittler pseudo-isochromatic 
plates now in preparation, along with their avoid- 
ance of digits, should prove these more useful 
and reliable, even though they demand illumina- 
tion standardized in quality and intensity. 

It is, of course, doubtful that any single ab- 
breviated set of charts can function as an all pur- 
pose screening test, even where two trials are 
possible, Along with the variables in illumina- 
tion and preadaptation, in each examiner’s tech- 
nique, in the mental set of the subject and his 
adjustability to odd shaped digits or geometric 
figures, and the ease of coachability of the test, 
there is the variability in individual hue sensi- 
tivity. This is a problem yet unsolved by science ; 
sometimes, perhaps, it springs from a shift of 
the points of maximal effectiveness for the 
fundamental qualities of red, yellow, green, and 
blue toward either the short-wave (violet) or the 
long-wave (red) end of the spectrum or stimu- 
lus continuum. This latter possibility may ac- 
count for the variations in the Rayleigh equation 
(the proportion of a fixed red to be added to a 
fixed green to match a fixed yellow) and the 
ambiguity of diagnoses by the anomaloscope. All 
indications point to the need of a battery of 
tests for adequate screening. 

[660] 
*The Master Ophthalmograph. Grades 1-16 and 
adults; 1936-47 ; a camera, using 35 mm. standard mo- 
tion picture film, for hotographing eye movements 
while the examinee reads; developed films may be ob- 
tained within 8 minutes; 110 AC, 60 cycles; $400 per 
instrument (Model No. 1522B) [47]; 156 per 25 rec- 
ord cards; (15) minutes; developed by the Bureau of 
Visual Science of the American Optical Company and 
the Research Division of the Eastman Kodak Com- 
pany; American Optical Co. 
REFERENCES 


1-2. See 40:1559. 
3-17. See 3:479. 


ббо 
18. Тлутов, Eart A. Controlled Reading: A Correlation of 
Diagnostic, Teaching, and Corrective Techniques, pp. 


909. 
Chicago, Til.: Chicago University Press, 1937. Pp. xxviii, 367. 
Ў 19, Винни, ғ РБ. "Осие Stability in Deception.” 7 Ару 
6: ^42. 16:23 . 
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B 43:93-120 Mr "46. * (PA 20:2490) 


Reading." Psychol 
Dixon, W. Ковент, Jr. Studies of the Eye-Movements 


E: 

i di Uni: ity Professors and Graduate Students. 
Doctor! vnd Michigan’ State Conve (East Lansing, CH 
AREE RA Microfilm Abstracts 8:81-3 no 2 '48. * (Р. 


hits Preston, Влірн C., AND Turr, EnwiN N. “Тһе Reading 


Habits of Superior College Students." J Exp Ed 16:196-202 
Mr '48, * (РА 20:5126) 

25. Asner, E. y AND Ort, В. S. “Eye Movement as a Com- 
plex Indicator." J General Psychol 45:209-17 О 's1, * 


Mires A. TINKER, Professor of Psychology, 
University of Minnesota, Minneapolis, Minne- 
$оїа. 

The Master Ophthalmograph is a revised and 
greatly improved model of the Ophthalm-O- 
Graph. This portable eye-movement camera may 
be used to obtain binocular records of eye move- 
ments during reading. It is accompanied by sev- 
eral pamphlets which describe the apparatus, 
give instructions for operation, and furnish sug- 
gestions for interpretation of the obtained photo- 
graphic records. This explanatory material is 
precise and clearly stated. Operation of the ap- 
paratus is simple and readily learned. Contrary 
to the impression gained from the pamphlet on 
interpretations, considerable supervised training 
is required to achieve proper interpretation of 
€ye-movement records. The apparatus is pro- 
moted as a precision instrument for both clini- 
cal and research uses. Nevertheless, the appa- 
ratus lacks the flexibility and the precision for 
any but the more simple research investigations 
in the field of reading. Since the apparatus is 
portable and easy to operate, it may be readily 
employed for diagnostic purposes in the reading 
clinic. The degree to which it is used depends 
upon the viewpoint of the clinician. Most of 
the people who have done extensive fundamental 
research on eye movements in reading or who 
have made an adequate analysis of the available 
experimental reports consider that particular 
oculomotor patterns reflect central processes of 
perception and assimilation. Others, particularly 
those with an inadequate knowledge of the psy- 
chology of reading, believe that specific oculo- 
motor patterns represent habits which determine 
proficiency in reading. These individuals con- 
sider the eye-movement camera indispensable in 
the reading clinic, A comprehensive analysis 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 656 


(22) of literature in the field reveals that there 
are no valid data to support the view that 
movements determine reading proficiency, 

The direction and interpretation manuals have 
nothing to say about either reliability or valid. 
ity of eye-movement records. As a matter of 
fact, when proper materials and procedures are 
employed, these measures do have satisfactory 
reliability and validity.! But neither materials 
nor procedures for the Ophthalmograph are sat- 
isfactory as specified by the manufacturer, There 
is, for instance, no practice trial to promote ad- 
justment to the artificial task of reading before 
the camera. And the material to be read consists 
of 25 or 50 words, i.e., not over 5 or 10 lines 
At least 20 lines must be read to yield a measure 
of satisfactory reliability, 

The procedure specified for measuring com- 
prehension of what is read (per cent correct of 
ten true-false statements on the contents of the 
50-word passage) is of doubtful validity. Fur- 
thermore, to force a person to try to remember 
10 details from 25 or 50 words of text will tend 
to produce an undesirable type of reading. А 
related question is concerned with the eye-move- 
ment averages provided as norms. These av- 
erages, derived from a rigid procedure which 
undoubtedly produces an atypical type of read- 
ing, are of limited value, particularly with read- 
ing disability cases. AB 

In the material provided as instruction in 
analysis of the reading graph, great айр is 
placed upon possible evidences of binocuhr 
motor incoordination. If there is enough binoct- 
lar incoordination to be of clinical significance 
that fact will have been discovered and evaluat й 
by the oculist or optometrist. It is standard proe 
tice to check the visual status of all clinical eer 
Although eye-movement records may put 
ment other data in clinical diagnosis, it is dot! 
ful whether such data are essential. Acn 
to many writers, including this reviewer, i 
eye-movement camera is essentially a resear 
apparatus rather than a clinical device. sedi 

Most of the criticisms raised here have i 
made at least ten years earlier (38:1108 cis 
40:1559). Although many of the det) 
pointed out at that time could have been a 
nated by modification of procedures and a d 
rials, it appears that the manufacturer has i i 
no attempt to do so even though the тап 


„Move 
1 Tinker, Miles A. ‘“Reliabilit 


and Validity of Er» "46. 
ment Measures of Reading.” J Exp Psychol 19:732-4 
* (PA 11:1431) 
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have been revised and the apparatus modified. 

The most recent model of the О phthalmograph 
is mechanically a great improvement over earlier 
models. It may be used as a supplementary de- 
vice in clinical diagnosis and for relatively sim- 
ple investigations in reading. Nevertheless, it is 
not indispensable in the clinic. Equivalent infor- 
mation can be obtained by less tedious, less diffi- 
cult, and less expensive means. 


For a review by G. T. Buswell and two ex- 
cerpts from reviews of the 1936 model, see 
40:1559; for reviews by Stella S. Center, David 
Kopel, Marion Monroe, J oseph Tiffin, and Miles 
A. Tinker and one excerpt from a review of the 
1936 model, see 38:1109. 


[ 661] 
*Pseudo-Isochromatic Plates for Testing Color 
Perception. Ages 7 and over; 1940-47; а, test de- 
signed to duplicate the most useful of the Ishihara and 
Stilling plates; individual; postage extra; American 
Optical Со. * 
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2 [46-PLATE EDITION.] 1940; out of print. 

b) REVISED SELECTION. 1947, C1940; no data on validity 
in manual (for data presented elsewhere, see 8) ; $12 
per test; (1-2) minutes. 


REFERENCES 


—9. See 3:471 

10. нтте, M. S. “The Value of the Pseudo-Isochromatic 
Color Vision Test in Original Paprlicents for Military Flying.” 
Mil Surg 89:801-5 N '41. * (РА 17:1323) Reprinted in Eye, 
Ear, Nose Thr 22:140-3 Ap '43. (PA 17:3235, title only) 

п. DUNLAP, Knicut. “Defective Color Vision and Its 
Remedy.” J Comp Psychol 26209-85 Ар '45. * (PA 19:2121 

12, NEUBERT, FRANK R. “Colour Visteon in the Consulting 
Room." Brit J Ophthal 31:275-88 My '47. * (PA 21:2902 

13. CHAPANIS, ALPHONSE. “A Comparative Study of Five 
Tests of Color Vision." J Opt Soc Am 38:626-49 Л ^48. * (PA 
23:530 

v Авру, Le Grann H.; RAND, GERTRUDE; AND RITTLER, 
M. CarnEnINE. "Incidence of Color-Blindness Among Psychotic 
Subjects." J General Psychol 39:229-42 О 748. * (РА 23:2801) 

15. SLOAN, Louise L., AND WOLLACH, LORRAINE. “Com ari- 
son of Tests for Red-Green Color Deficiency.” J Aviat Med 
19:447—55 D '48. * (PA 23:4083) — 

16. CHAPANIS, ALPHONSE, Bee Hern} Types of Color De- 
ficiency by Means of Pseudo-Isochromatic Tests.” J Opt Soc 
Am 39:242-9 Mr '49. * (PA 24:60) 

17. Harpy, Le GRAND H.; RAND, GERTRUDE; AND RITTLER, 
M. Слтневіхе, “The American Optical Company's Compilation 
of Pseudo-Isochromatic Plates as a Means of Detecting and 
Analyzing Defective Color Vision." J General Psychol 40:103-17 
Ја '49. * (PA 23:3587) 


For an excerpt from a review of the 46-Plate 
Edition, see 3:473. 
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*College Entrance Examination Board Achieve- 
ment Test in Social Studies. Candidates for college 
entrance; 1937-51; available only in College Entrance 
Examination Board Admissions Testing Program (see 
526); 60(70) minutes; prepared by College Entrance 
Examination Board Committee of Examiners in 50- 
cial Studies in cooperation with the Staff of Educa- 
tional Testing Service: 1951 membership: Henry W. 
Bragdon (Chairman), William A. Aiken, Harry D. 
Berg, Paul А. Fullam, and Loretta Е. Klee; program 
administered by Educational Testing Service for the 
College Entrance Examination Board. * 


REFERENCES 1 
‚1. Warg, Слкоілхе Е. “The Alternative History Examina- 
tions of the College Entrance Examination Boar ," Soc Ed 
2:399-403 S '38. * 
, 2, PARGELLIS, STANLEY, AND WARI 1 
C.E.E.B. Examinations in History.” Soc Ed 3:478-84 О '39. 
з. Ware, CamoLiNE Е. "New College Entrance Examination 
oard Examinations im History." Proc Middle States Assoc 
Hist & Social Sci Teach 37:3176 "39. * 
4. CHAUNCEY, Henry. "The Social Studies Test of the Col; 
lege Entrance Examination Board." Social Ed 8:253-7 О '44;, 
5. STALNAKER, RuTH C., AND STALNAKER, JOHN М... 
Study of the Social Studies Achievement Test of the College 
ntrance Examination Board Taken by Girls From Inde- 
pendent Schools.” Ed Adm & Sup 30:490-6 N 4; * 
Weser, Curistran О. “Old and New College Board 
Scores and Grades of College Freshmen,” J Am Assn Col Reg 
20:70-5 О '44. * (PA 19:570) 


RosznT L. THORNDIKE, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. ( 
This test is planned for administration to high 
school seniors close to the time of graduation as 
part of the information supplied to colleges to 
which they are applying for admission. For that 
reason, the focus of the test is on American his- 
tory, with a secondary emphasis on general 
world history. However, the test items them- 
selves range far beyond the traditionally taught 
history course. In fact, there was some thought 
on the part of schools surveyed in an appraisal 
of the test that it ranged too far from the stand- 
ard course content, becoming, according to one 
critic, “more an aptitude than an achievement 


test.” 
Including, as it does in some editions, sections 


on relevance of arguments in favor of and against 
a proposition, interpretation of cartoons and 


reading passages, allocation of quotations to pe- 
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riods and authors, as well as a wide range of item 
types covering terminology, chronology, causa- 
tion, and the like, the test does place heavy de- 
mands upon a variety of reading skills, as well 
as upon a wide range of background informa- 
tion. These skills are certainly important for 
college work. Whether the social studies test is 
the point at which they should be evaluated, 
rather than the verbal section of the Scholastic 
Aptitude Test, may, perhaps, be debated. One 
would anticipate that there would be rather sub- 
stantial correlation between these two tests and 
that the social studies test might have, as a re- 
sult, a somewhat limited unique contribution to 
make to the prediction of college success. How- 
ever, the skills, over and above the knowledge, 
represented in the test are certainly important 
and worth representation somewhere in the 
CEEB testing program. 

The statistics with respect to both item and 
test characteristics indicate that the test is well 
suited for the group with which it is being used. 
The test is essentially a power test, and different 
forms all have reliabilities (split half) over .go. 
Item-test correlations average .40 or better, and 
the items seem to include very few “lemons.” 
This is, of course, a reflection of the continuity 
of the Board's testing operation, whereby item 
analysis data are gathered from one testing and 
used as a basis for item selection for later forms 
of the test. 

Though content-minded teachers may be 
somewhat unhappy about this test as an evalua- 
tion of the outcomes of their instruction, it ap- 
pears to provide a technically competent evalua- 
tion of a range of significant knowledges and 
skills in the social studies area. 


[663] у 

`*Соорегаііуе Social Studies Test for Grades 7, 
8, and 9. Grades 7-9; 1041—51; 4 scores: informational 
background, terms and concepts, comprehension and 
interpretation, total; IBM; Forms X C47), Y (48); 
Forms R, S out of print; no data on validity; no spe- 
cific manual; no description of normative ulation ; 
general Cooperative manual ('51) ; norms 0487 ; $2.50 
рег 25; 50€ per specimen set, postpaid; separate an- 
swer sheets may be used; 806 per 25 IBM answer 
sheets; 15€ per stencil for scoring answer sheets; cash 
orders postpaid; 80(90) minutes; Harry Berg (X), 
Elaine Forsyth (X), and Eunice Ann Lloyd (Y) ; Co- 
operative Test Division, Educational Testing Serv- 
1ce. " 
Ніра ТАВА, Professor of Education, San Fran- 
cisco State College, San Francisco, California. 
[Review of Forms S, X, and Y.] 

This test is made ир of multiple choice items 


of a fairly discriminating quality. Each of the 
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forms consists of three parts : information, term; 
and concepts, and comprehension and interpre. 
tation. It is quite difficult to tell what is sam- 
pled in each part. The general manual gives only 
directions for administering and Scoring ап} 
leaves one uninformed as to how the item sam. 
ple was secured or what the validity may be, 

The information items in Forms S and X 
seem to cover a vast range: early American his- 
tory, world history, English and European his- 
tory, general geographic facts and principles, 
civics, and the use of sources. It is even more 
difficult to determine the sampling pattern used 
for terms and concepts in these two forms. The 
items seem to emphasize historic information, 
especially of the kind that requires memory of 
specific information. While the concepts in Form 
X seem more functional, perhaps because this 
form was constructed more recently than Form 
S, many are either specious or self-definitive, 
Perhaps least satisfactory, from a technical point 
of view, not from the point of view of purpose, 
is the section on comprehension and interpreta- 
tion in Forms S and X. Some reading items can 
be answered correctly only with rather specific 
information outside the given data. Other, items 
can be answered by eliminating the obviously 
wrong answers, Still others are so vague that 
the right answer can be chosen only by guessing. 
And there are some in which interpretation 
hinges on an understanding of a single meta- 
phoric expression. 

Form Y seems to be the most adequate form, 
especially for the type of general social studies 
courses now conducted in the junior high school. 
The historic information required is rather gen- 
eral and is addressed to trends instead of very 
specific isolated bits of information. The items 
dealing with skills in locating information seem 
rather functional. The concepts sampled in the 
second part are by and large those that are 1 
quired in a thoughtful understanding of the 80" 
cial scene today. These items also require fairly 
refined judgment rather than knowledge of spe- 
cific definitions acquired from texts. The items 
on comprehension and interpretation requi 
not merely surface reading but also understan® 
ing. The selection of items includes questions 
not only on what is true according to data bu 
also on what can and cannot be concluded x 
the data. This pairing is important, because 
awareness of the limitations of data is almost 2 
important in interpretation as is the recognition 
of truths revealed. Some other items are so cof 


659 | 


structed as to include systematically several con- 
nected and needed aspects of analysis, such as 
comparing, contrasting, predicting, and logi- 
cal reasoning from assumptions to conclusions. 
It is evident that Form Y is constructed on the 
basis of sampling important aspects of behavior 
as well as of content. This does not seem to be 
true for Forms S and X. 


For reviews by Robert A. Davis and Edgar 
B. Wesley of Forms R and S, see 3:592. 


[ 664] 

History and Civics Test: Municipal Tests: Na- 
tional Achievement Tests. Grades 3-6, 6-8; 1938- 
49; a subtest of Municipal Battery (see 20) ; 3 scores: 
lessons of history, miscellaneous facts, total; 2 levels; 
no data on reliability and validity and no description 
of normative population in manuals; no norms for part 
scores; Form A manual (^38), Form B manual (39) 3 
$1.25 per 25; 356 рег specimen set of any one level ; 
postage extra; 15(20) minutes; Robert K. Speer and 
Samuel Smith; Àcorn Publishing Co. * 

a) GRADES 3-6. 1938-49; Forms A ('48—except for 
minor changes, same as test copyrighted in 1938), B 


om 6-8. 1030-40; Forms A ('49—except for 
[253 changes, same as test copyrighted in 1938), B 
Harry D. Brno, Associate Professor, and. Ex-, 
aminer of Social Studies, Michigan State Col- 
lege, East Lansing, Michigan. 

These tests are short form subtests of the 
Municipal Battery of the National Achievement 
Test series. Each test is divided into two sections 
for which subscores may be obtained. Part I is 
devoted to "lessons of history" and Part II to 
“historical” or “miscellaneous facts." The rec- 
ommended testing time is 15 minutes, which is 
evidence that these are truly short forms. In the 
estimation of this reviewer, these tests have little 
to recommend them and might better be passed 
over in favor of such generally superior tests as 
the Cooperative Social Studies Tests for Grades 
7, 8, and о, or the Progressive Tests in Social 
and Related Sciences. The following are some 
of the specific counts on which the tests were 
found to be wanting: (а) Even as short forms 
the tests seem far too short. As a result, relia- 
bilities are low despite the fact that tryouts were 
administered to relatively heterogeneous popu- 
lations, This inadequacy is heightened by the 
fact that many of the items which are included 
are trivial in nature. (b) The tests have little 
diagnostic value as compared with others on the 
market. The two parts of each examination test 
for what is essentially the same thing—the re- 
tention of mere factual information. The differ- 
ence between the two parts lies chiefly in the 
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use of multiple choice items in the one and true- 
false in the other. What could be gained from 
studying subscores under such circumstances is 
problematical. (с) The technique of having stu- 
dents select “lessons of history,” while it sounds 
impressive, in its present application places a 
premium upon the retention of facts in some 
cases and leads to overgeneralizations in others. 
(d) No norms of any consequence are available. 
(e) Data with regard to validation are lacking. 
(f) The tests for grade levels 3-6 and 6-8 seem 
quite undifferentiated insofar as difficulty of con- 
tent, nature of content, and vocabulary are con- 
cerned. From inspection it would seem that they 
could be used interchangeably. There are few 
technical flaws in the construction of individual 
items, so no criticism can be made on this score. 
However, this is faint praise since there are few 
opportunities to make technical errors on a typi- 
cal true-false item such as “Lord Baltimore was 
a Protestant.” Clues and differences between 
apparent and functioning content usually appear 
only when higher level objectives are being 
tested.. 

This reviewer claims no extensive knowledge 
of the elementary school curriculum, but he at 
least hopes that something more is being accom- 
plished there in the area of the social sciences 
than the imparting of such discrete bits of in- 
formation as “Abraham Lincoln was born on 
January 22,” “John Cabot captured Monte- 
zuma,” “the Indians did not eat meat or fish,” 
or “Armistice Day comes once a year.” (All of 
the preceding are from the true-false sections of 
the tests.) But validity is meaningful only in 
relationship to objectives. If there are teachers 
who are interested in testing for such data and 
that alone, then these tests could be considered 
for use. 


[ 665 
Introduction to Social dieses Achievement 
Examinations for Secondary Schools. High 


school; 1951; 1 form; no data on reliability and valid- 
ity; no manual; Minnesota norms (median and quar- 
tile deviation) available; similar norms for other re- 
gions by special arrangement with publisher; 76 per 
test, Me extra; 60(65) minutes; edited by Walter 
W. k; prepared by a curriculum committee of high 
school teachers for use in the Minnesota State Board 
Achievement Examinations Program; Educational 
Test Bureau, Educational Publishers, Inc. * 


[ 666 ] 
*Social Studies Test: Acorn National Achieve- 
ment Tests. Grades 7-9; 1946-50; 5 scores: human 
relations and life situations, products and places, social 
jdeas and facts, application of knowledge, total; Forms 
А (50), В (48 ; mimeographed manual ['46] ; Form 
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A norms ('50), Form B norms 
35€ per specimen set; 
Lester D. Crow and 
Publishing Co. * 
Encar B. Westey, Visiting Professor, Stan- 
ford University, Stanford, California; formerly 
Director of Social Studies, University High 
School, and Professor of Education, University 
of Minnesota, Minneapolis, Minnesota. [Review 
of Form A.] 

This 150-item test consists of four parts: hu- 
man relations and life situations, products and 
places, social ideas and facts, and application of 
knowledge. АП items except those in Part 4 are 
of the multiple choice form with four options in 
each. True-false items are used in Part 4. Since 
very little information is provided with the test, 
the user scarcely knows what the test is or what 
it is supposed to do. 

The categories are neither clear nor discrete. 
Historical items are numerous in Part т and 
predominate in Part 3. Many of the errors which 
appeared in tests during the 1920's appear in 
this test. An examination of a few items in Part 
I will demonstrate the uncritical nature of this 
test. Item 2 uses "an" in the stem of the item, 
thus providing a clue to the correct option. Item 
16 refers to the family as a “type of community.” 
Item 23 uses a foil that is contrary to fact. Item 
25 says “elections enables.” Item 27 refers to 
the characteristics of frontiersmen as “condi- 
tions.” Item 41 indicates that the Declaration of 
Independence is “important” because it is “fa- 
mous.” Item 49 gives the clue by making the 
correct option unusually long and explicit. Part 
4 contains fabricated passages, and the items 
constitute a rather uncritical reading test. 

While the content of many items is important, 
and a few (e.g., Items 18 and 19 in Part т) are 
exceptionally good, the test as a whole is not 
up to a very high standard. 


[ 667 ] 
Social Studies Test: National Achievement 
Tests. Grades 4-6, 7-9; 1937-45; Forms A (45), В 
C45—except for minor changes, same as test copy- 
righted in 1939) ; 2 levels; no data on reliability and 
validity and no description of normative population in 
manuals; no norms for part scores; manuals (45); $2 
per 25 of any one level; 35¢ per specimen set of any 
one level; postage extra; 35(40) minutes; Robert К. 
Speer and Samuel Smith; Acorn Publishing Co. * 
@) GRADES 4-6. 6 scores: human relations, life situa- 
tions, social problems, products and peoples, meaning 
of events, total. 
b) GRADES 7-9. 7 scores: human relations, life situa- 
tions, social interpretations, values of products, social 
ideas, miscellaneous facts, total. 


(48); $2.25 per 25; 
tage extra; 40(45) minutes; 
үеге! Е. Augspurger; Acorn 
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For areview by Ray G. W ood, see 3: 504. 


[ 668 ] 
*A Test of General Proficiency in the Field of 
Social Studies: Cooperative General Achieve. 
ment Tests, Revised Series, Test I. Grades 10-12 
and college entrants; 1940-51; for complete battery 
see 5; 3 scores: terms and concepts, comprehension 
and interpretation, total; IBM; Forms X (47), Y 
(48), Z st) 5 Forms QR-T; out of print; no spe. 
cific manual; no norms for part scores; general bat- 
tery folder (751); general Cooperative manual (51); 
norms ['41]; $2.50 per 25; 50€ per specimen set, post- 
mo separate answer sheets may be used; 80€. per 25 
BM answer sheets; 15€ per stencil for scoring answer 
sheets; postpaid; 40(45) minutes; Jeanne M. Brad- 
ford (X) and Elaine Forsyth Cook (Y, Z); Coop- 
erative Test Division, Educational Testing Service, * 


REFERENCES 


1, Автькү, А. SrERL. A Study of Certain Relationships 
Existing Between General Reading Comprehension and Reading 
Comprehension in a Specific Subject-Matter Area, Doctors 
thesis, Pennsylvania State College (State College, Pa.), 19 

2. Acuamp, Е, +» AND CLARKE, FLoRENCE H, “You Con 
Measure the Probabiilty of Success as a Supervisor," Personne 
33335973 My '45. * y " 

3. Автгкү, A. SrERL. “General and Specific Factors in 
ERA Comprehension.” J Exp Ed 16:181-6 Mr '48. * (PA 
20:5122 


For a review by Paul L. Dressel of Forms 
T, X, Y, and QR of the complete battery, stt 
5; for a review by Harry D. Berg of Forms S 
and T, see 3:596; for a review by John V. Mc 
Quitty of the complete battery, see 3:3. 


[669] 

*Understanding of Basic Social Concepts: nm 
Tests of Educational Development, Test 1. 
Grades 9-13; 1942-51 (first published as a separate In 
1951) ; for complete battery, see 17; Form Y-2 (51— 
same as 1949 edition) ; manual ('51) ; general тапш 

(51) ; $3.75 per 25; separate answer pads or d 
sheets must be used; $1.95 per 25 answer pads; $3 Ls 
100 IBM answer sheets; 50€ per scoring stencil; 25 
рег 25 first semester ('46) or second semester (4 | 
profiles for any one of grades 9-12; $1 per 25 ti ixl 
interpreting profiles for students (’51) ; 256 per sc ); 
summary report ('44) ; 4¢ per pupil score sheet ( ) 
50€ per specimen set; cash orders postpaid; 55 s 
minutes; edited by E. F. Lindquist; J. W. Maucket; 
Science Research Associates, Inc. * 


For a review by Eric F. Gardner of the total 
battery, see 17; for reviews by Henry Chauncey; 
Gustav J. Froehlich, and Lavone A. Hamna 0 
Forms X-1 and У-т of the complete battery, 5% 
3:12. 


ECONOMICS 


[670] i il 
*Economics Test: State High School „Тев Form 
Indiana. High school; 1934-50; Form B [aol ality 
A out of print; mimeographed; no data on re st: 15 
and validity; no manual; norms ['50] ; 4¢ per “Richard 
per specimen set; postpaid; 40(45) minutes; се 
Н. Gemmecke; State High School Testing Serv 
Indiana, Purdue University. * 
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[671] 
XEconomics: 20th Century Test. 1 semester high 
school ; 1950; 1 form; no data on reliability and valid- 
ity; no norms—author recommends the use of local 
norms; 10 or more copies, 5€ each, postage extra; 35€ 
per specimen set, postpaid; 40(45) minutes; Benton 
Review Publishing Co. * 


[672] 


*Graduate Record Examinations Advanced Eco- 
nomics Test. Senior year college through graduate 
school and candidates for graduate school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 180(220) minutes; prepared by the 
Advanced Economics Test Committee а pointed by 
Educational Testing Service: Joseph n Spengler 
(Chairman), Mary Jean Bowman, Clark L. Allen, 
Kenneth E. Boulding, and Ben W. Lewis; Educational 
Testing Service. * 


[673] 


*Hills Economics Test. High school and college; 
1940; Form A; $1.05 per 25; 20¢ per specimen set; 
postpaid; 40(45) minutes; John R. Hills and H. E. 
Schrammel (manual); Bureau of Educational Meas- 
urements, Kansas State Teachers College of Empo- 
ria. 


GEOGRAPHY 
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*Economic Geography: Achievement Examina- 
tions for Secondary Schools. High school; 1951; 
1 form; no data on reliability and validity; no manual; 
Minnesota norms (median and quartile deviation) 
available; similar norms for other regions by special 
arrangement with publisher; 7¢ per test, postage ex- 
tra; 60(65) minutes; edited by Walter W. Cook; pre- 
pared by a curriculum committee of high school 
teachers for use in the Minnesota State Board Achieve- 
ment Examinations Program; Educational Test Bu- 
reau, Educational Publishers, Inc. * 


[ 675] 


*Geography: Every Pupil Test. Grades 4, 5, 6,7; 
1935-51; new form usually published each April and 
December; 4 levels; form December 1951; no data on 
reliability and validity; no manual; norms (51); 215€ 
per test; 1¢ per answer key; stpaid; 40(45) minutes ; 
Ohio Scholarship Tests, Ohio State Department of 
Education. * 


[676] 


*Geography Test: Municipal Tests: National 
Achievement Tests. Grades 3-6, 6-8; 1938-50; а 
subtest of Municipal Battery (see 20) ; 3 scores: geo- 
graphic ideas and comparisons, miscellaneous facts, 
total; 2 levels; no data on reliability and validity and 
no description of normative population in manuals; no 
norms for part scores; $1.25 per 25; 356 per specimen 
set of any one level; postage extra; 20(25) minutes; 
a К. Speer and Samuel Smith; Acorn Publishing 
о. 


а) GRADES 3-6. 1938-49; Forms A ('49—except for 
minor changes, same as test copyrighted in 1938), B 


b) cranes 6-8, 1938-50; Forms А (50), В 39). 
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Epwin H. Reever, Professor of Education, 
University of Illinois, Urbana, Illinois. 

When testmakers attempt to devise an instru- 
ment which will evaluate the results of instruc- 
tion in so broad and complicated a field as geog- 
raphy, they have indeed undertaken a very diffi- 
cult task, When they try to make this evaluation 
with a test containing only 40 test elements to 
be answered by children in 20 minutes, they 
have, in the opinion of the reviewer, attempted 
an impossible task. It might be interesting 
enough to give this test to elementary school 
children, but the reviewer would be at a loss to 
know what conclusions might be drawn from the 
results. Any deductions about the quality of 
geography teaching which were drawn from chil- 
dren's scores on the tests would be of extremely 
doubtful validity. 

The assumption on which the maker of a 
short test in a broad field must rest his case in 
defending its validity is that the individual test 
items are so significant for the total field of the 
test that knowledge of them is correlated very 
highly with the pupils total knowledge of the 
field. When viewed in the light of the above 
statement, many of the items in the tests under 
discussion would be open to serious question. 
One item requires the pupil to know how houses 
are numbered; others, that he know “which 
state produces the most of ready-made cloth- 
ing” ; “what other continent has most nearly the 
same number of people as North America" ; that 
handshaking is more common in the United 
States than in France, China, or India; that in 
1938 Germany was “our greatest competitor for 
South American trade” ; that Greece rather than 
Ireland, Finland, or Palestine has about the same 
area and population as Illinois; that the Kiel 
Canal connects the North and Baltic Seas. One 
item indicates in the given correct answer that 
“men kiss each other when they meet” in France. 
In addition to being of doubtful significance, the 
accuracy of the implications of this item would 
also be challenged by those who had spent some 
time in France. No doubt men do sometimes 
kiss each other in France, but certainly they do 
not do so as often as we—and they—shake hands. 

While it is true that in a long test some of 
the items above might be defended, they become 
highly questionable in a test of only 40 items. 

Some of the items would be objected to by 
any geographer on grounds of accuracy. In Form 
A of the test for grades 3-6, Part I, Item 17 
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indicates that Canada is "part of Great Brit- 
ain"; and in Form B, Item 17, the correct an- 
swer implies that Canada "belongs to a Euro- 
pean power." One wonders what a Canadian 
would say about these items. He would certainly 
raise the question whether the testmaker had any 
conception of the meaning of "Dominion" or 
of "The British Commonwealth of Nations." In 
the test for Grades 3-6, Form A, Part II, Item 
9 reads as follows: “Which of these countries 
has the most of an important product used in 
making automobile tires? Argentina, Brazil, 
China, Australia." The correct answer given in 
the key is, of course, Brazil. But no geographer 
would pay much attention to Brazil as a rub- 
ber producer, because it is at present producing 
less than two per cent of the word's rubber. 

In conclusion, it may be reiterated that the 
giving of the tests in question might be a rather 
interesting exercise, but it is doubtful that valid 
conclusions could be drawn from the results 
with respect to the quality of instruction or 
learning in the field of geography. 


[677] 

*Geography Test: National Achievement Tests. 
Grades 6-8; 1938-49; 6 scores: geographical ideas, 
locating products, uses of products and instruments, 
economic and human relations, miscellaneous problems, 
total; Forms A ('49), B (748) ; no data on reliability 
and validity and no description of normative pula- 
tion in manuals; no norms for part scores; Form А 
manual (49), Form B manual (48); $2 per 25; 35¢ 
per specimen set; postage extra; nontimed (35-45) 
minutes; Robert K. Speer, Lester D. Crow (A), and 
Samuel Smith; Acorn Publishing Co. * 


For a review by Elaine Forsyth, see 3:600. 


[678] 

*Modern Geography and Allied Social Studies, 
Grades 6-10; 1949-50; 11 scores: trade routes and 
their products, causal geography (U.S.), causal geogra- 
phy (world), miscellaneous geographical facts, inven- 
tion- wer-transportation-communication, geographical 
vocabulary, world products (sources and uses), eco- 
nomic and human relations, place geography (U.S. and 
western hemisphere), place 18 %0втарћ А апі 
Eastern hemisphere), total; Form А 749) ; no norms 
for part scores; manual [50] ; 4-page map sheet (49) ; 
$2.50 per 25; separate answer sheets must be used; 2¢ 
per answer sheet; 3¢ per map sheet; postage extra; 
356 per specimen set, postpaid; nontimed (90) min- 
utes; C. A. Gregory; C. A. Gregory Co. * 


Ертн M. Норргѕтом, Supervisor of Test De- 
velopment in Social Studies, Educational Test- 
ing Service, Princeton, New J. ersey. 

A review of this test indicates that it lives up 
to the author's concept of it as a test in social 
science "with a heavily weighted core of Geog- 
raphy.” The emphasis on causal relationships 
and basic principles, in addition to factual knowl- 
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edge, is to be warmly commended. There area 
few individual questions to which the reviewer 
takes exception, but on the whole the material js 
sound. The coverage is well-suited to courses of 
study which emphasize Western culture and 
civilization, but unfortunately there is only one 
item on the Far East—an area which is receiy- 
ing increasing emphasis in current curricula, 

The manual presents a table for the conver- 
sion of raw scores into scaled scores. These data 
are derived from a pooled sample of 812 stu- 
dents in grades 6-10, drawn from seven school 
systems located in four states. The population 
is not described in further detail, and there is 
no indication of the number of students taken 
from each grade level. According to the manual, 
a particular scaled score “means the same for a 
sixth grade student that it does for a tenth 
grade student." This statement may be mislead- 
ing, since the scaling does not correct for differ- 
ences between grades. An identical scaled score 
for a sixth-grade and a tenth-grade student can 
mean the same for the two students only in the 
sense that it represents a comparable number of 
items answered correctly. 

Raw-score means and standard deviations are 
given for grades 6-A, 6-B, 7-A, 7-B, 8-A, 8-В, 
9-A, and 10-A. The grade means are quite close 
together, ranging from 57.61 to 71.44, with 
standard deviations varying between 18 and 
19. Since no information is given as to the type 
of geography instruction which the students re- 
ceived in each grade, the grade norms are diffi 
cult to interpret. It would seem advisable for 
schools using the test to create their own norms, 
interpreting them in the light of their own 
courses of study. 

The statements on correction for guessing are 
theoretically in error. The author makes the 
assumption that since there are 140 4-choice 
questions, a student may answer one fourth of 
the questions correctly by chance (the author 
computes one fourth of 140 as 36!) ; therefore, 
he states that the quantity 36 should be sub- 
tracted from all students’ scores. The test user 
should be warned that according to any tenable 
theory, correction for guessing should be base 
on the number of wrong responses and the num- 
ber of items attempted by the student rather than 
the total number of items in the test. The author 
does not state what use is to be made of the cot- 
rected scores, once they are obtained. Appat 
ently he has not incorporated them in the con- 
version table. 


— 
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The reliability of the test is reported as .92, 
computed on the basis of odd vs. even scores. 
'The manual does not name the population used 
in obtaining the reliability, but presumably it is 
the above-discussed sample of students in grades 
6-10. If this is true, the reliability could be ex- 
pected to be lower within a single grade level. 

The manual claims validity for the test on the 
basis of the wide usage of the textbooks con- 
sulted. If the test material was reviewed by ex- 
perts in the field, the manual does not so state, 
and no validation on the basis of external cri- 
teria is reported. 

Despite the limitations in statistical treatment, 
however, the test contains good material which 
individual teachers or principals may wish to 
evaluate in terms of local courses of study. 


Epwin Н. Reever, Professor of Education, 
University of Illinois, Urbana, Illinois. 

This test is a praiseworthy attempt to make a 
comprehensive evaluative instrument in an im- 
portant field of the school curriculum. 'The areas 
covered by the ten parts of the test indicate that 
the author has a broad conception of the curricu- 
lum content in the social studies. 

While to some public school people the 9o 
minutes required to administer this test may 
seem excessive, the reviewer agrees with the au- 
thor that any estimate of pupils' progress in so 
comprehensive a field as the social studies can- 
not be based on a shorter test. 

A common criticism of most tests in the social 
studies is that they are purely factual and require 
mere memorization of data. This is not true of 
the test under discussion. While there are parts 
of the test that are highly factual, other parts 
require reasoning and judgment. A teacher who 
required only memorization of facts would be 
likely to find that his pupils would not do well on 
several parts of this test. Tt is, incidentally, un- 
fortunate that no norms are given for scores on 
the individual parts of the test; such norms 
would greatly enhance its diagnostic value. 

A reviewer can probably always find in any 
test elements which he believes to be worthless 
or positively objectionable. This is true in the 
present instance, For example, the reviewer 
doubts whether it is worthwhile for school pu- 
pils to know that the approximate capital invest- 
ment of railroad companies to provide a job for 
each employee is $24,000; that Japan catches 
and consumes more fish than does Norway, Rus- 
sia, or the United States ; or that Texas is known 


as the Lone Star state. He seriously objects to 
the use of the word “race” in referring to such 
peoples as Mongolians or Caucasians. No rep- 
utable anthropologist today accepts any such 
racial divisions. The most objectionable test 
element is Item 2, Part 3. The correct answer as- 
signed to the question why farming is not done 
extensively in the Amazon basin is that “white 
men cannot endure the climate and compete with 
the dense vegetation and the natives are too 
ignorant and lazy to farm extensively." This re- 
viewer doubts the geographic accuracy of the 
reason assigned in the first place, and categori- 


' cally denies its human implications in the second. 


But the percentage of unimportant or objec- 
tionable elements in the test is, in the opinion of 
the reviewer, small. 

The most serious defect in the test is that the 
testmaker in his multiple choice statements has 
made the correct answer the longest one in far 
too many of the elements. In 71 of the multiple 
choice items, the answers consist of more than 
one or two words. In 33 of these 71, the longest 
response is the correct one. Thus a testwise pupil 
could make a fairly good score simply by count- 
ing the words in the responses and marking the 
longest. 

The last two parts deal with place geography 
and require the use of maps on which locations 
are designated. Without research on these two 
parts, the reviewer cannot be sure that his im- 
pressions are correct; but he believes that the 
maps are too indistinct and cluttered to be satis- 
factory testing instruments. He believes that the 
results on these two parts will prove to be a 
combination of the evaluation of the visual acuity, 
intelligence, and knowledge of place locations of 
the pupils ; and he doubts that anyone will know 
without further research in what proportion 
these three elements operated in yielding a score. 

In spite of the defects cited above, this re- 
viewer believes that the test is well worth using 
and that it will bean evaluative instrument which 
will prove helpful to teachers and supervisors in 
improving instructional techniques. 


HISTORY 


[679] 3 
xAmerican History: Achievement Examinations 
for Secondary Schools. High school; 1951; 1 form; 
no data on reliability and validity; no manual; Minne- 
sota norms (median and quartile deviation) available ; 
similar norms for other regions by special arrange- 
ment with publisher; 76 per test, postage extra; 60 
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(65) minutes; edited by Walter W. Cook ; prepared by 
a curriculum committee of high school teachers for use 
in the Minnesota State Board Achievement Examina- 


tions Program; Educational Test Bureau, Educational 
Publishers, Inc. * 


[680] 
*American History: Every Pupil Test. Grades 7- 
8, 11-12; 1931-51; new form usually published each 
April and December; 2 levels; no data on reliability 
and validity; no manual ; norms pes ; 216€ per test; 
1@ per answer key; fpi; 0(45) minutes; Ohio 
Scholarship Tests, Ohio State Department of Educa- 
tion. 
4) GRADES 7-8. 1935-51; 3 tests: To 1770 A.D. (form 
December 1951 and April 1952), 1770-1860 (form 
April 1951 and December 1951), 1860 and on (form 
December 1951 and April 1952). | 
b) GRADES 11-12. 1031—51; form April 1951. 


REFERENCES 
1, Brown, NrLsow. A Critical Analysis of the Ohio Every 
Pupil Tests in American History for the Years 1929-1941. 
Master's thesis, Ohio University Ad ens, Ohio), 1941. Pp. 79. 


(Abstracts оў Masters" Theses...., 1941 


. 6. 

2. FiNCH, Joun F. A Study of the Effects of the Ohio Eve: 
Pupil Tests Upon the Teaching of American History. Master's 
thesis, Ohio University (Athens, Ohio), 1950. Pp. 159. (Ab- 
stracts of Masters’ Theses...., 1950, pp. 34-5.) 


[681] 
*American History Test: National Achievement 
Tests. Grades 7-8; 1 37-49; 5 scores: lessons of his- 
tory, time concepts, historical associations, miscella- 
neous problems, total; Forms A (49), В (45); no 
data on reliability and validity and no description of 
normative population in manuals; no norms for part 
Scores; manuals ('45); $2 per 25; 35¢ per specimen 
set; postage extra; 40(45) minutes; Robert K. S er, 
rex D. Crow, and Samuel Smith; Acorn Publishing 


For reviews by Jacob S. Orleans and Wallace 
Taylor, see 40:1630. 


[ 682 ] 

*American History Test: State High School 
Tests for Indiana. First, second semesters high 
school; 1934-51; 2 levels; mimeographed; no data on 
reliability and validity; no manual; norms ['51]; 46 
per test; 15@ per specimen set; postpaid; 50(55) min- 
utes; Easdale Pickett; State High School Testing 
Service for Indiana, Purdue University. * 

а) FIRST SEMESTER. Form B ['48]; Form A out of 
print. 

b) SECOND SEMESTER. Form О ['48]; Form N out of 
print. 


[683] 

* American History: 20th Century Test. First, sec- 
ond semesters high school; 1938-48; 2 levels; Forn A 
(48) ; no data on reliability and validity; no norms — 
author recommends the use of local norms; manuals 
(48) ; то or more copies, 5€ each, postage extra; 35¢ 
рег specimen set, postpaid; 40(45) minutes; Gale 
Smith; Benton Review Publishing Co., Inc. * 

а) TEST I, FROM DISCOVERY OF AMERICA TO 1860. First 
semester. 

b) TEST п, 1860 TO PRESENT. Second semester. 


2 [684] 
*Cooperative American History Test. High school 
and college; 1932-51; IBM; Forms X C47), Y C5o— 
same as test copyrighted in 1948), Z (49); Forms 
1932-37, N-T out of print; no data on validity; no 
specific manual; general Cooperative manual (51); 
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high school norms [37]; college norms ['44] ; $250 
per 25; 50¢ per specimen set, postpaid; separate an. 
swer.sheets may be used; 80f per 25 IBM answer 
sheets; cash orders postpaid; 40(45) minutes; Harry 
D. Berg; Cooperative Test Division, Educational 
Testing Service. * 


REFERENCES 

e P i AND MurixHEAD, Јоѕерн V. “Age of Mili 

i Enlisted Men "as Factors in Tesis of General шн 
Development and American History." J Ed Кез 43:303-6 D 
i Man 26:2506) AND MUIRHEAD, JoserH V. “School Grade 
Last Completed by Military Enlisted Men as Factors in Tests 
of General Educational Development and American History," 
J Ed Res 43:221-4 N '49. * (PA 24:3877) 
Donornv C. Apkiws, Professor of Psychology 
and Chairman of the Department, The Uni- 
versity of North Carolina, Chapel Hill, North 
Carolina. [Review of Forms X, Y, and Z.] 

This test is designed to test achievement in 
American history at the end of a high school or 
elementary college course. It is divided into two 
parts with time allowances of 25 and r5 minutes. 
However, a subject who completes the first part 
before the time is up can go on to the second 
part, and one who finishes the second part can 
return to the first. Thus, the purpose of the 
two-part division is not to insure that all sub- 
jects spend the same amount of time on either 
part. The general nature of the content of the 
two parts reveals no striking difference, all the 
items are five-choice items, part scores are not 
obtained, and the same scoring formula is ap- 
plicable to both parts. Thus, the reason for the 
separation, with the inconvenience in adminis- 
tration of the test, is too subtle for this reviewer 
to grasp. Я 

The technical quality of the items is high. This 
reviewer is not competent to judge the content, 
but the test appears to be up to date and compre 
hensive. 

The materials accompanying the test do not 
contain information on the estimated reliability 
coefficient of the test, but they do give the stand- 
ard error of measurement in terms of 
scores, based on the parallel forms method. 

The test can be used with separate answer 
sheets, either hand scored or machine scored, of 
the answers can be recorded in the test booklets. 
It is curious that the directions for scoring the 
Cooperative tests when the answers are recorde 
on the booklets and when separate answer sheets 
are used are not consistent. The directions for 
scoring when answers are recorded on the book 
lets contain the following provision : “When ВЯ 
answers are given for any item, the item sho 
be marked as an omission, provided that опе 4 
the answers given is the correct answer; but 
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both answers are wrong, the item should be 
marked and counted as a wrong answer." The 
scorer is left in doubt as to the correct procedure 
if more than two answers are marked. Perhaps, 
however, this contingency is so rare when an- 
swers are marked on the booklets that no in- 
struction is needed. A subject who marked only 
the right answers for 50 items, only a single 
wrong answer for each of 15 items, two answers 
including one right and one wrong for 10 items, 
and two wrong answers for 10 items would get 
a score of 50 — 25/4 — 44. The directions for 
scoring separate answer sheets, however, call 
for counting all marks even where there is more 
than one for an item, counting the number of 
blackened spaces appearing through the rights 
key, and subtracting the latter (the number 
right) from the former to get the number of 
wrong answers. The subject above, who marked 
105 answers, including бо right answers, now 
gets a score of 60 — 45/4 — 49. (By the more 
common scoring procedure of counting as wrong 
any item in which more than one answer is 
marked, the subjects score would be 50— 
35/4 — 41.) This reviewer fails to see any sound 
basis for the variant scoring instructions in the 
two instances. 

As is done for most Cooperative tests, tables 
of scaled scores for the several forms of the 
American history test, and percentile norms are 
available. 


Martua E. Layman, Associate Professor, 
Board of Examiners, Michigan State College, 
East Lansing, Michigan. [Review of Forms X, 
Y, and Z.] 

Each of the three forms consists of 85 five- 
option multiple choice items to be administered 
in 40 minutes of working time. Though the items 
are not classified according to objectives, the ex- 
perienced teacher of American history can easily 
recognize that the authors have included items 
designed to test knowledge of historical facts; 
understanding of cause and effect relationships, 
trends, and developments; and ability to rec- 
ognize chronological relationships, interpret 
Rie maps, and locate historical informa- 
ion. 

The major emphasis is on political and diplo- 
matic history. Economic and social history are 
covered to a lesser extent, but intellectual and 
cultural history have been given very little at- 
tention. For example, approximately 60 per cent 
of the items in Form X deal with political and 
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diplomatic developments, about 35 per cent are 
designed to test the student's understanding of 
economic and social events and trends, while the 
remaining 5 per cent are devoted to cultural and 
intellectual history and to testing the ability of 
the student to locate historical information. 
Though the sampling is probably an accurate 
reflection of the amount of attention given these 
factors in the average American history course, 
this test would not meet the needs of teachers 
who consider the history of American ideas as 
meriting a considerable amount of emphasis. 

About one half of the items deal with events 
preceding 1860, while the remaining items are 
about equally divided between the period 1860 
to 1900 and developments of the twentieth cen- 
tury. It is surprising that though Form Z was 
published in 1950 only two of the items deal with 
the post-World War II era. Schools stressing 
current trends and developments in American 
history classes would find it necessary to supple- 
ment this examination with a test of contempo- 
rary affairs. 

This examination shows evidence of unusual 
care and technical skill in item construction. It 
is undoubtedly one of the best, if not the best of 
the commercial tests in American history. 


For a review by Margaret Willis of Form P, 
see 40:1633; for a review by Edgar B. Wesley 
of Form 1937, see 38:1014. 


[685] 

*Cooperative Ancient History Test. High school; 
1933-51; 3 scores: historical facts, historical judgment, 
total; IBM; Forms О (’50—same as test copyrighted 
in 1938), Р (’51—same as test coyrighted in 1939) ; 
Forms 1933-37 out of print; no data on validity ; no 
specific manual; general Coo, rative manual (51); 
norms ('37) ; $2.25 per 25; eng per specimen set, post- 
paid; separate answer sheets may be used; 80€ per 25 
TBM answer sheets; 15¢ per scoring stencil; cash or- 
ders postpaid; 40(45) minutes; Howard R. Anderson, 
E. F. Lindquist, Wallace Taylor Œ), and Charlotte W. 
Croon (P); Cooperative Test ivision, Educational 
Testing Service. * 


For a review by S. P. McCutchen of Form р; 
see 40:1634; for a review by Wilbur F. Murra 
of Form 1937, see 38:1015. 


[686 ] 

*Cooperative Modern European History Test. 
High school and college; 1932-51; IBM; Forms х 
(47), Y C49 ; Forms 1932-37, N-Q out of print; no 
data on validity ; no specific manual ; general Coopera- 
tive manual ('51) ; norms (37) ; $2.50 per 25; 50€ per 
specimen set, postpaid; separate answer sheets may be 
used; 80¢ per 25 IBM answer sheets; 15¢ per stencil 
for scoring answer sheets; cash orders postpaid; 40 
(45) minutes ; Frederick H. Stutz; Cooperative "Test 
Division, Educational Testing Service. * 
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For a review by Lavone A. Hanna of Form 
O, see 40:1635; for reviews by A. C. Krey and 
S. P. McCutchen of Form 1937, see 38:1016. 


[687] 
*Cooperative World History Test. High school ; 
1934-51; IBM; Forms X (47), Y ('48), Z (49); 
Forms 1934-37 out of print; no data on validity; no 
specific manual; no description of normative popula- 
tion; general Cooperative manual (51) ; revised norms 
(740); $2.50 per 25; 50€ per specimen set, postpaid ; 
separate answer sheets may be used; 80€ per 25 IBM 
answer sheets; 15¢ per stencil for scoring answer 
sheets; cash orders postpaid; 40(45) minutes; Wal- 
lace Taylor (X, Y) and Frederick Н. Stutz (7); Co- 
operative Test Division, Educational Testing Service, * 
For a review by Kenneth E. Gell of Form 
1937, See 40:1636; for a review by R. M. Tryon 

of Form 1037, see 38:1017. 


[688] 

*Crary American History Test: Evaluation and 
Adjustment Series. Grades 9-13; 1951-52, с1950-52; 
IBM; Forms AM (’s1), BM ('s2) ; manual (51); $2 
per 25; separate answer sheets must be used; 80¢ per 
25 IBM answer sheets; postage extra; 356 per speci- 
men set, postpaid; 40(50) minutes; Ryland w Crary; 
World Book Со. * 

Epcar B. WEsLEY, Visiting Professor, Stan- 
ford University, Stanford, California; formerly 
Director of Social Studies, Ù niversity High 
School, and Professor of Education, University 
of Minnesota, Minneapolis, Minnesota. 

This test consists of two 9o-item forms. The 
manual provides rather complete information — 
nature and purpose, development, validity, re- 
liability, standardization, item difficulty, direc- 
tions for giving, scoring, and interpreting, and 
suggestions on individual and group guidance. 
АП the customary data are furnished and the 
user can proceed with a clear understanding of 
what to do and what to expect. 

This test is excellent, perhaps the best one 
available on American history at the secondary 
level. In fact, its merits are numerous and its 
faults curable. Only by a narrow margin does 
it fail to be outstanding. 

The author consulted most of the obvious 
Sources for determining content, and he has dem- 
onstrated how to measure information, skills, 
interpretations, understandings, and reasoned 
inferences. The user is thus assured that he has 
a test which measures more. than information 
alone, for it emphasizes knowledge and synthe- 
sized understandings rather than minute details. 

The test deserves some criticisms. One sen- 
tence in the directions for the first section is 
confusing, in fact, almost: unintelligible. *Notice 
that all letters (a, b, c, d, e, f, g, and h) ‘аге not 


à 


given for each item." The testee will look down 
the page and see that all the items do contain 
all these letters. If he is a detective he will dis-, 
cover that the shortage of letters is on the an- 
swer sheet and not in the test. This confusi, 
and needless distraction can be easily cured. This 
sentence and the one that follows are utterly un- 
necessary. Delete them. 

The author lists the five aspects of history 
with which the test deals and states the number 
of items under each aspect: 28 factual, 16 skill, 
8 interpretation, 26 understanding, and 12 in- 
ference. Unfortunately the reader can not figure 
out, except approximately, which items the au- 
thor thinks belong to each of these categories, 
Thus the classification becomes merely an asser- 
tion, the correctness of which the reader can not 
check, 

The author cites five sources from which the 
content was derived. They are worthy and useful 
sources, deserving attention апа utilization. 
There is one curious omission. One can only 
speculate as to why he ignored American His- 
tory in Schools and Colleges.* This report is the 
only one ever written in the field of the social 
studies that had the combined support of na- 
tional historical associations and the National 
Council for the Social Studies. It contains the 
only official list of specific minima in American 
history that was ever agreed upon by a national 
committee. Furthermore, it contains the test 
which was administered all over the country 
to all kinds of groups to demonstrate the differ- 
ence between ancillary facts and permanent out- 
comes. Surely this report deserved at least 4 
casual examination by the author of this test. 

A few items deserve comments. Item I, Form 
AM, declares that the first representative m 
sembly in America met in 1619. Spanish age? 
cies, representative in nature, had been meeting 
for years before this date. Changing Nes 
to "English colonies in America" would теше 
the inaccuracy. Item 42c, Form AM, says thatt 
public school system was accomplished by pe 
control and support. State aid and legis 
controls make a mockery of this popular ш 
boleth. Item 55e, Form BM, is so neat as to s 
serve special praise. The association of the Cr ү 
of Gold speech with the election of Melon 
is a touch of zetetic genius. This whole sect ; 

rt o! 
це Committe on American ge. е Staale and Cal 
of the American Historical Association, the Mississippi Social 
Historical iation, and the National Council for 


ociatit . 
Studies (Edgar B. Wesley, Director of the Committee) 
York: Macmillan Co., 1944. Pp. xv, 148. * 
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(Items 53-57 and in Form AM, Items 47—50), 
is admirable in its searching use of association. 
Item 67, Form BM, asking for the fur-trading 
capital of the United States, is comparatively 
picayunish. . 

The section in each form made up of Items 

9-66 in Form AM and Items 68-77 in Form 
BM consists of a reading test. In the opinion of 
this reviewer, this section is the weakest in the 
test, for it requires little discrimination or alert- 
ness. The fault is not in the idea but in the se- 
lection of the passages. 

Items 67-87, Form AM, and Items 78-104, 
Form BM, are multiple choice items, the correct 
answers to which are incorrectly referred to as 
best. When an item deals with information there 
is a correct answer but not a best answer, for a 
best implies degrees of correctness. 

Consider a few of the specific items. Item 68d, 
Form AM, asserts that the "industrial re- 
sources" contributed most to the Northern vic- 
tory, which is about as logical as saying the hind 
wheels are more important than the front ones. 
Item 71, Form AM, and Item 82, Form BM, 
deal with groups of Presidents who increased or 
did not increase the "influence" of the executive 
branch. Harding, who was among the "least," 
followed Wilson who was among the "most." 
Why not be more precise and talk about strong 
and weak executives instead of a nebulous and 
disconnected “influence”? Item 79, Form AM, 
deals with the alleged reason why Theodore 
Roosevelt opposed Wilson's administration. АП 
the options are unrealistic. Questions that in- 
volve motives are seldom desirable in a test. 

Items 80 and 84, Form BM, are thinly dis- 
guised questions of dates. A. simple completion 
would be a more economical method of testing. 
Item 83, Form BM, asks why the United States 
abandoned isolationism. The answer is pseudo- 
philosophical, nebulous, and unconvincing. Item 
85, Form BM, deals with Lincoln's stand on 
slavery. When, as authorities agree, the candi- 
date himself was equivocal, why ask high school 
students to be clear? Item 87, Form BM, should 
include the saving phrase "principal effect of 
those listed." Item 88, Form BM, is trivial and 
erroneous. The significance of the Kentucky and 
Virginia Resolutions is utterly obscured by tying 
them to the question of freedom of the press. 
Item 94, Form BM, states that a third party was 
"significant" in 1924. While this is possibly true, 
it is a fine point. Item 98, Form BM, deals with 
mercantilism. Should not this old skeleton be 


left to dangle in the musty past? Item 103, Form 
BM, asks what "territory" was last acquired. 
The word "area" should for obvious reasons be 
substituted. 

The last section of the test (i.e., Items 88-90, 
Form AM, and Items 105-7, Form BM) is con- 
fusing. The items look like puzzles. Surely less 
complicated techniques could be used to achieve 
results. 

The author is least successful with the multi- 
ple choice form. It should never be used to ascer- 
tain the presence or absence of specific informa- 
tion. Obviously there are more economical forms 
for that purpose. The multiple choice form is 
most useful when recognition, or still better, dis- 
crimination, is required. It can be most appro- 
priately used when the situation allows shades or 
degrees of truth in all the options. 

The virtues of the test far outweigh its defects. 
It is unfortunate that a description of its short- 
comings requires more space than the enumera- 
tion of its good qualities. The reviewer regrets 
this inevitable disproportion, but it should not 
lead any reader to conclude that the final evalua- 
tion is unfavorable. It is, as stated at the begin- 
ning of this review, a challenging, reasonable, 
and practical test—one that deals with ideas 
rather than catalogic facts. 


[ 689 ] 

*Cummings World History Test: Evaluation 
and Adjustment Series. Condes 9-13; 1951-52, 
c1950-51; IBM; Forms AM (’s51), BM (52); man- 
ual (51); $2.50 per 25; separate answer sheets must 
be used; 80@ per 25 IBM answer sheets; postage ex- 
tra; 35€ per specimen set, postpaid; 40(50) minutes; 
Howard H. Cummings; World Book Co. * 
Dorotuy C. Арктхв, Professor of Psychology 
and Chairman of the Department, The Univer- 
sity of North Carolina, Chapel Hill, North Caro- 
lina. 

This test is one of a series intended for high 
school use and referred to as the Evaluation and 
Adjustment Series. The test is available in two 
forms, each of which contains 8o items. 

This reviewer, who is not a historian, will con- 
fine herself largely to technical aspects of the 
test. 

The test uses separate answer sheets which 
may be scored either by hand or by machine. The 
front page of the test booklet is devoted to di- 
rections and sample items. The general format 
is good, with one exception wort y of comment. - 
Instead of following the usual procedure of des- 
ignating the answers to the items (all of which 
have five choices) by 1,2; $ 4, 5 or A, B, C, D, 
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E, the author alternates the first five letters of 
the alphabet with the second five in the matching 
exercises in Part A and the first five numbers 
with the second five in the items of Part B. Thus 
the responses to Item 46 are designated by r, 
2, 3, 4, 5, and those for Item 47 by 6, 7, 8, 9, то. 
Such a practice forces one who wishes to use 
the test with answer sheets to purchase the spe- 
cially printed answer sheets rather than to use 
the standard ones that are doubtless less ex- 
pensive. It would also seem to have the disad- 
vantage of leading to confusion on the part of 
the subject. 

The manual accompanying the test reports 
that the objectives and the content of the test 
were based upon representative textbooks, 
courses of study from representative schools, 
statements of objectives by teachers in curricu- 
lum studies, and the Twentieth Yearbook of the 
National Council for the Social Studies.* Part 
A consists of 15 sets of 3 items each, each of 
the 3 items to be matched with one of 5 alterna- 
tives. The sets consist of factual items on his- 
torical events, dates, places, and leaders. Part 
B consists of 35 five-choice items distributed as 
follows: ancient history, 5; medieval Europe, 
5; modern Europe, 12; World War I, 7; World 
War II, 6. 

The 8o items for each form were selected from 
preliminary forms of 100 items each. It might 
have been advisable to start with a somewhat 
larger number of tryout items. The experimental 
forms were administered in 1949 to about nine- 
teen hundred students in 17 high schools in 7 
states. Item difficulty values and item-test cor- 
relations were approximated by averaging the 
percentages passing each item in upper and lower 
27 per cents and by means of the Flanagan tables 
based on upper and lower 27 per cents, respec- 
tively. The size of the sample on which the item- 
analysis data were obtained does not appear to 
be stated in the manual. It is reported that the 
mean difficulty index for each form is 44 and 
that the items were so selected that the two forms 
are precisely balanced with respect to distribu- 
tion of item difficulties, As a matter of fact, the 
two distributions of difficulty values are not 
identical, as this statement would seem to imply; 
on the other hand, there is no reason why they 
should be exactly the same. It is also reported 
that "the items in the two forms were balanced 


1 National Council for the Social Studies. I й 
но yeh eor ten aed 
y Edi: C.: il, . Pp. xii 
275. Paper, $2.50; cloth, $3.00. * du аге 


with respect to validity indices" (item-test сог. 
relations), although elsewhere it is stated that 
the mean validity index of the items in Form AM 
is .50 and in Form BM, .41. One may wonder 
why it was not possible to select the items in 
such a way that these means would have been 
much closer. 

The manual reports corrected split half re 
liability coefficients of .gt and .94 for samples 
of 160 and 161 students, but does not specify 
which form was used. With further respect to 
the equivalence of the two forms, the manual re- 
ports that they are comparable in content in the 
sense that their items cover the various aspects 
of the field in about equal proportions. It is also 
reported that as an additional check on equiva- 
lence, both forms of the test were given to the 
same group in counterbalanced order. It was 
found that the two forms were almost directly 
comparable at all points along the scale. At this 
stage the reader can legitimately expect to learn 
the correlation between the two forms, since 
they had been given to the same group, as the 
clinching argument for their equivalence. But 
the manual neglects this point. 

The manual contains sections on “General 
Directions to the Examiner," “Specific Direc- 
tions for Administering,” “Directions for Scor- 
ing,” “Interpretation of Results,” and “Using 
the Test Results,” all of which seem adequate. 
The norms were based on 3,587 students in 47 
schools distributed over 24 states. The median 
chronological age of this group was 16 years an 
I month and their median IQ, based on the 
Terman-McNemar Test of Mental Ability, was 
104. The raw scores on the Cummings test arè 
converted to a scale of normalized standa 
scores with a mean of 104 and a standard devia- 
tion equal to that of the IQ of the standardization 
sample, 13.5. Percentile norms are also pro 
vided. Я 

Brief consideration will be given to the quality 
of the items as revealed by inspection ка 
technical standpoint. Some of the sets of mate 
ing items suffer from the disparate character 0 
the alternatives from which choice is to be ma Ў 
Thus, опе is to match “the head of а Mer 
which consists of a number of parishes of. t ) 
Roman Catholic Church" with one of the fol 
ing alternatives : bishop, Brahman, hoplite, put 
darin, and tribune. The student surely would H j 
have to be well acquainted with the Бета 
of the Roman Catholic Church to make the p 
rect choice. Some of the items have undesir? 
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overlapping in that a part of one item may help 
to indicate the answer to another (e.g., Items 22 
and 74 and Items 46 and 51 in Form AM). In 
Form BM, one may note that Items 41 and 61 
both depend on understanding that the term 
bourgeoisie refers to the middle class in Euro- 
pean society. It may also be noted that some of 
the items in one form are essentially duplicated 
in the other, so that there may be some direct 
memory effect from one to the other. Some of 
the negative items reflect the problem of the 
item constructor who can think of four related 
statements that are true or four results that did 
stem from one cause but cannot think of a plausi- 
ble but unacceptable alternative to include as the 
answer. Thus, in some cases the answers to the 
negative items seem to be so far afield that they 
would stand out at once as the answers, even for 
persons with little discriminatory knowledge of 
the subject matter in question. On the whole, 
however, the quality of the individual items is 
probably above the average for that of most 
standardized tests in the field of history. 


Howarp R. ANvERSON, Chief for Social Sci- 
ences, Division of Higher Education, Office of 
Education, Federal Security Agency, Washing- 
ton, D.C. 

A 7-page manual contains information about 
the nature and content of the test and its devel- 
opment, the reliability and equivalence of the 
two forms, general directions to the examiner 
and specific directions for administering the test, 
and directions for scoring, interpreting, and 
using the test results. 

The manual states that this test "measures not 
only factual knowledge of world history, but 
also understanding of the great movements and 
social trends..in civilization." Part A in- 
cludes 45 matching-type items (events, dates, 
places, leaders), and Part B, 35 five-response 
multiple choice items. The 80 items were se- 
lected from a total of тоо included in a prelimi- 
nary test administered to students in 17 high 
schools in 7 states. Difficulty and validity indices 
were computed for the items in the two prelimi- 
nary forms. The mean validity index of items 
in Form A is .50, and in Form B is .41. The 
mean difficulty of items in both forms is 44. Cor- 
rected split half reliability coefficients of .91 and 
-94 are reported for the two forms, and the 
standard error of measurement is given as 3.0. 
Many precautions have been taken to insure the 
equivalence of the two forms. 
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End-of-year percentile norms, derived from 
the administration of the test to 3,587 students 
in 47 schools representing 24 states are pro- 
vided. The median age and median IQ of these 
students are stated. Raw scores on this test are 
converted to a scale of normalized standard 
scores which relate them to scores on the Ter- 
man-McNemar Test of Mental. Ability. Sug- 
gestions are provided for using test results in 
evaluating individual and group achievement, 
and in guidance. 

The matching-type items generally are more 
narrowly factual and in other ways inferior to 
the multiple choice items. In Form A, for ex- 
ample, Item 6 in Part A requires the student to 
select “а great Russian novelist of the 19th Cen- 
tury" among Alexander, Cromwell, Demos- 
thenes, Pitt, and Tolstoy. In Form B, a similar 
item calls for the student to identify in this list 
(Ebert, Gandhi, Lenin, Marx, Smuts) the man 
who “led the movement for independence in In- 
dia.” There is a need to include in the test a 
few items dealing with important developments 
since World War II. 

This world history test, as the manual admits, 
cannot be used to furnish "analytical measures 
of....mastery of various aspects of the subject.” 
It does have value in providing “a valid, objec- 
tive measure of achievement....for the individual 
student” and as a “predictor of what he is likely 
to do in the future, particularly in closely related 
fields.” 


[ 690] 

*Graduate Record Examinations Advanced His- 
tory Test. Senior year college through graduate 
school and candidates for graduate school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 180(220) minutes; prepared by the 
Advanced History Test Committee аро by Edu- 
cational Testing Service: Edward C. Kirkland (Chair- 
man), Philip Davidson, Edgar E. Robinson, Joseph R. 
Strayer, and Alice Felt Tyler; Educational Testing 
Service. * 


[691] 
xXIndiana History. Grades 7-8; 1950; I form; no 
data on reliability and validity ; no manual; no norms 
—publisher recommends use of local norms; 4 or more 
copies, 5€ each, postage extra; 35€ per specimen set, 
postpaid ; 40045) minutes; M. Vigil Schneider; Ben- 
ton Review Publishing Co., Inc. * , 


[692] 
xModern World History: Achievement Exami- 
nations for Secondary Schools. High school ; 1951; 
1 form; no data on reliability and validity ; no manual ; 
Minnesota norms (median and quartile deviation) 
available; similar norms for other regions by special 
arrangement with publisher; 7€ per test, postage ex- 
tra; 60(65) minutes; edited by Walter W. Cook; pre- 
pared by a curriculum committee of high school teach- 
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ers for use in the Minnesota State Board Achievement 
Examinations Program; Educational Test Bureau, 
Educational Publishers, Inc. * 

[ 693 ] 
*Understanding of American History, 1940 Re- 
vision. Grades 8-12; 1922-40; 5 scores: character 
judgment, historical vocabulary, sequence of events, 
cause and effect relationships, total; 1 form, "40; man- 
ual ('40) ; $2.40 per 100, postage extra; 126 per speci- 
men set, postpaid; nontimed (25-40) minutes; Luella 
Cole and R. C. Richards; Public School Publishing 
Co. * 


ErrzABETH C. Apams, formerly of the Social 
Studies Section, Test Development Department, 
Educational Testing Service, Princeton, New 
Jersey. 

There is little to commend in this test. Its 
title is deceptive (at least, no historian or his- 
tory teacher of the reviewer's acquaintance 
would concede that most of the items measure 
anything but memorization) ; its appearance is 
displeasing; and it is a time-waster for the 
teacher. 

On each of the four 514 X 9 inch pages are 
crowded 26 multiple choice items. The student 
underlines the correct choices in the booklet. 
Since the choices are not aligned vertically, scor- 
ing is laborious. Answers are printed in four 
columns in the manual—the columns cannot be 
separated into strips for convenient use with- 
out destroying the class record sheet on the re- 
verse side! Few teachers can afford the time 
which scoring this test requires ; and if they have 
the time, they may better spend it in analyzing 
their students’ work on a better test which is 
more easily scored. 

The manual contains the customary directions 
for administration and statements on reliability 
and validity. The validity is based on correla- 
tions with teachers' grades and with other tests, 
and on a textbook survey. The date of the survey 
is not mentioned, but the date of original publi- 
cation of the test and the type of test item in- 
cluded lead one to suspect that it was made 
around 1920. Norms consist only of median 
scores for October and May testing in grade 8 
and in high school. There are no other percen- 
tiles and no system of standard or scaled scores. 
Few teachers will consider these norms adequate. 

In evaluating a subject matter test, one must 
necessarily consider the points just mentioned as 
secondary in importance to the subject matter 
coverage. The content of this test may have been 
appropriate when the test was first published, 
but it bears no recognizable relation to the con- 
temporary history curriculum as discussed, for 
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example, іп the Seventeenth Yearbook of the 
National Council for the Social Studies, Litte 
attention is given to understanding, and the 
facts tested are those which can be put into pat 
phrases. Large-scale historical developments are 
ignored in favor of the Battle of Buena Vista ay! 
the Ironclad Oath. 

Test 1, Character Judgment, deserves to be 
condemned on both educational and scholarly 
grounds as perpetuating the tendency to label 
a character with an adjective and dismiss him, 
In one item, the student must judge whether 
Benjamin Franklin was emotional, retiring 
prudent, or blunt. Is “prudent” the only word 
which can be found to describe one of our mo 
original and farsighted founding fathers? When 
a test can devote so little attention to any one 
figure, let the recognition be consistent with his 
stature. Unfortunately, this item is typical of the 
items in Test 1. In Test 2, Historical Уос. 
lary, the items are much more significant. But it 
Item 26 the student must select “corporation 
as the word most similar in meaning to “trust”! 
We may hope that no teacher will allow his stt- 
dents to retain such an indefensible misconcep- 
tion. 

Each item in Test 3, Sequence of Events, re 
quires the student to select which one of four 
events came first. Test 4, Cause and Effect Re 
lationships, requires the student to select from 
four events the one which is the result of tht 
other three. It is sound in conception, and à 
number of the items do involve some exercis? 
of reason. Other items, however, require a 
that the student call up such things as the a 
verbial "three causes of the Revolution a 
"three reasons for the growth of sectionalism 
which used to be, and in some cases still até 
standard textbook phrases. “he 

Certain items contain outdated phrases— i 
World War,” for instance. On the side of ze 
sions, the test, most recently revised in d 
perforce includes no items dealing with 9 M 
War II or with American politics since v 
There is almost nothing on race problems, ү 
relations, geographical influences on Ha 
the whole question of the government 1n [ 
nomic life. Questions on sectionalism an log" 
Civil War abound; questions on the dev 
ment of the Constitution are totally mere 

The reviewer sees no reason why this 7. 
should be used. The Cooperative Aner ath 
tory Test has much better constructed ! я 
both it and the Crary American. History 
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have better factual coverage. These two tests 
approximate the contemporary course of study 
much more closely than does the Cole-Richards 
test ; they have better norms ; and they are more 
economical of the teacher’s time. 


[ 694] 

*World History: Every Pupil Test. High school; 
1933-51; new form usually published each April and 
December ; form December 1951; no data on reliability 
and validity; no manual; norms (’51) ; 210 per test; 
1¢ per answer key; postpaid ; 40(45) minutes; Ohio 
Scholarship Tests, Ohio State Department of Educa- 
tion. 


[ 695] 

xWorld History Test: Acorn National Achieve- 
ment Tests. High school and college; 1948; 6 scores: 
social studies terms, world geography, contributions of 
world peoples to civilization, political history, eco- 
nomic-social-cultural history, total; Form A; no norms 
for part scores; $2.50 per 25; 35¢ Жел specimen set; 
postage extra; 40(45) minutes; Vincent McGarrett 
and Edward H. Merrill; Acorn Publishing Co. * 


[ 696 ] 
*World History Test: State High School Tests 
for Indiana. First, second semesters high school; 
1034-51; 2 levels; mimeographed; no data on relia- 
bility and validity; no ЕДВ 46 per test; 15 per 
specimen set; postpaid; 40(45) minutes; Donald P. 
Knott (C), Paul C. Baker (C), Lawrie F. Davis (О), 
Harold Hargrave (O), and Frankie Jones (O) ; State 
High School Testing Service for Indiana, Purdue Uni- 
versity. * 
а) FIRST SEMESTER. Form C [50] ; Forms A, B out of 
print; no norms. 
b) sEcowp ЅЕМЕЅТЕВ. Form О [48]; Form N out of 
print; norms [’51]. 


[697 ] 

*World History: 20th Century Test. First, second 
semesters high school; 1937-49; 2 levels; Form 
('48) ; no data on reliability and validity; no norms— 
author recommends the use of local norms; manuals 
(49) ; 10 or more copies, s¢ each, postage extra; 35€ 
per specimen set, postpaid; 40(45) minutes; Gale 
Smith; Benton Review Publishing Co., Inc. * 


POLITICAL SCIENCE 


[698 ] 
*American Civics and Government Tests for 
High Schools and Colleges. High school and col- 
lege; 1930-49; Forms A [49], В ['49]; 90¢ per 25, 
postage extra; 12¢ per specimen set, postpaid; 40(50) 
minutes; F. A. Magruder, R. J. Clinton, and M. М. 
Chambers; Public School Publishing Co. * 


[ 699] 

*American Government and Citizenship: Every 
Pupil Test. Grades 11-12; 1935-51 ; new form usually 
published each April and December; form April 1951; 
no data on reliability and validity; no manual; norms 
(751) ; 216€ per test; 16 per answer key; postpaid; 40 
(45) minutes; Ohio Scholarship Tests, Ohio State De- 
partment of Education. * 


Етл2АВЕТН C. Арлмз, formerly of the Social 
Studies Section, Test Development Department, 
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Educational Testing Service, Princeton, New 
Jersey. 

This test is unprepossessing in appearance, 
uninspiring in content. Whether it was care- 
fully standardized cannot be determined from 
the brief statement on construction of the test ; 
the point is more or less irrelevant, since it 
would hardly be good economics to spend much 
on a standardization program for a test so medi- 
ocre in conception. 

The items in Part I are multiple choice; the 
choices are strung across the page and the items 
are crowded together. Part II contains matching 
items. Answers for both parts are written in 
parentheses in a column on each page, so the 
scoring is reasonably easy. Booklets are expend- 
able, since no provision is made for using sepa- 
rate answer sheets. The manual consists of one 
sheet bearing instructions, descriptive informa- 
tion, and norms. The latter are in the form of 
end-of-course percentiles for high school and 
college. They are stated in the reverse of the 
customary fashion; ie. the highest score is 
listed as “1% equals or exceeds score" instead 
of “99 percentile.” This will make for con- 
fusion when the students' ratings are entered on 
cumulative record cards. The college scores are 
very little higher than the high school scores. 
This may only indicate, of course, that the high 
school government course teaches as much as 
does the college course. However, the high school 
population is so much more heterogeneous in 
ability that one would expect more than 3 raw 
score points difference in median scores. 

The test is strictly factual despite the authors’ 
statement that an effort was made to select items 
which test understanding. In each form there 
are one or two "thought" questions, but that is 
all. Now let it be agreed that a knowledge of the 
facts of American government is vital to effec- 
tive citizenship and that such knowledge should 
be tested. But the kind of facts tested here are 
nearly all of the definition type or the “five func- 
tions of the legislative branch" type—the kind 
of knowledge which can be put into test questions 
by any teacher at the rate of perhaps 30 an hour. 
Why buy a standardized test for the purpose? 

More serious than what the test includes is 
what it neglects: many processes which are of 
tremendous importance in American political 
life but which cannot be put into one-word an- 
swers. For example, there is no mention of the 
dilemma posed by the need to preserve civil 
liberties and at the same time protect the state 
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against those who would destroy it. The con- 
cepts of administrative legislation and judicial 
review—the wide power of agencies to make 
rulings and of courts to determine the meaning 
of the Constitution for any given generation— 
are not dealt with. Nor is there any emphasis on 
the responsibility of the individual for securing 
good government. The basic differences between 
the philosophy of democracy and the philosophy 
of totalitarianism are not covered, and there is 
no mention of the questions brought up by Amer- 
ican participation in international organizations. 
These are points which alert teachers think im- 
portant. They are also topics which make the 
course come to life for the student. In effect, I 
should say, this test is a partner of certain text- 
books in rendering the study of American gov- 
ernment dry, dead, and repulsive to countless 
students. Can anything be more subversive? 
Teachers may better use the Cooperative Ameri- 
can Government Test, which contains much 
more penetrating items and puts life into the 
subject. 


[700] 
*Civic Vocabulary Test. Secondary school (high 
school) ; 1951; 1 form; no norms ; preliminary mimeo- 
graphed manual; 2s. 6d. per 10; 3d. per single copy; 
9d. per manual ; cash orders postpaid within Australia ; 
nontimed (10-30) minutes; S. A. Rayner; Australian 
Council for Educational Research. * 
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1. Rayner, S. A. The Special Vocabula: of Civics. Fore- 
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[701] 

*Civics: 20th Century Test. First, second semesters 
high school; 1935-49; formerly called The 20th Cen- 
tury High School Civics Test; 2 levels; Form A 
(48) ; no data on reliability and validity; no norms— 
author recommends the use of local norms; manuals 
(49) ; 10 or more copies, 5¢ each, postage extra; 35¢ 
рег, specimen set, postpaid; 40(45) minutes; Gale 
Smith; Benton Review Publishing Co., Inc. * 


[702] 

PES deta American Government Test, High 
school; 1947-51; IBM; Forms X C47), Y (48); no 
data on validity; no specific manual; general Coopera- 
tive manual (’51) ; norms ('48) ; $2.50 per 25; 50€ per 
specimen set, postpaid; separate answer sheets may be 
used ; per 25 IBM answer sheets; 15€ per scoring 
stencil; cash orders postpaid; 40(45) minutes; John 
Haefner ; Cooperative Test Division, Educational Test- 
ing Service. * 


FREDERIC L. AYER, Head, Division of Evalua- 
tion and Research, Citizenship Education Proj- 
ect, Teachers College, Columbia University, 


x. York, New York. [Review of Forms X, 


This test is a comprehensive examination ot 
the ability to learn and retain facts about the 
structure, mechanics, and history of government 
in the United States. It covers adequately the 
content of most textbooks on American govern- 
ment and should be particularly useful to those 
who are interested in determining how well stu- 
dents have assimilated the atoms of truth which 
make up the factually oriented course. 

Although the tests are listed by the publishers 
as high school forms, the norms supplied are 
based on data for twelfth grade students only. - 
Students of grades 9 through 12 to whom this 
reviewer administered the tests found great 
difficulty with the vocabulary used in many of 
the items ("negotiate," “significant weakness of 
a unicameral..legislature," “His Superfluous 
Excellency," to mention a few). 

Occasional items are introduced which re- 
quire the selection of the poorest choice (eg, 
“Which of these is not essential to a democ- 
racy ?"), a technique that in the context of items 
calling for the best choice often throws respond- 
ents off the track. Mixed in with straight recall 
items are many which appear to require a high 
discriminative judgment, often based on much 
more complete information and wider expert 
ence than students might be expected to have. 
("Which is the most serious governmental prob- 
lem facing metropolitan areas like New York 
and Chicago ?" “Which is the most serious prac 
tical objection to the principle of 'separation “ 
powers’ ?" “What is considered [sic/] the great 
est advantage of the ‘office column’ type of bal- 
lot?") { 

Other items commendably require the appli- 
cation of facts to a specific situation—the meth- 
ods of practical politics—rather than a T 
knowledge of facts (“Sam Spellbinder, the lead- 
ing orator of his party, is most likely to be give 
what job in the national nominating conver 
tion?" “The job of a typical precinct captat.. 
is to ‘deliver the vote’ * How is he usualy 
paid?" In connection with a map of a gem 

mandered state, “In whose favor has the imag" 
nary state of Pandemonium been gerry 
dered?”) There are, however, only about fou 
or five items of this type in each form. m 
Two professors of political science at pk. 
bia University who checked the key voiced se E 
ous disagreement with the choices given as © 
rect. On some items they disagreed both M 
each other and with the key; on others y d 
opinion was that none of the choices гер 
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sented a correct response. For a few items they 
stated flatly that the "correct" response was 
wrong. Part of this confusion occurs with those 
items which deal with problems concerning jury 
trials and other situations in which the law is 
not the same in all states; part is accounted for 
by “dated” items—problems which were serious 
in 1947-48 and have now either been solved or 
are overshadowed by still more serious prob- 
lems of government. It would probably be ad- 
visable for those using this test to prepare their 
own keys to be consistent with the state laws 
and text materials applying in the local situa- 
tion. 

No specific manual is issued for these tests ; 
the manual supplied gives only general instruc- 
tions for test administration and interpretation 
of scaled scores. No data are given as to relia- 
bility or comparability of forms, or to specific 
statistical validity studies. 

The authors have made an enviable attempt 
to cover the field of course material in American 
government. They have succeeded in sampling 
adequately the body of knowledge contained in 
many different kinds of government courses. A 
more effective approach might have been to 
limit the coverage to either knowledge back- 
ground or political reasoning based on knowl- 
edge, or to divide the forms into subtests cover- 
ing these areas. This would permit teachers to 
measure directly the success of either the his- 
torical or the problems type of course. 

The reviewer would recommend the test for 
use in classes where the emphasis is on the teach- 
ing of facts about American government. It does 
not appear to be applicable, however, to courses 
such as those titled Problems of Democracy and 
is not recommended for students below the 
twelfth grade unless they have exceptional read- 
ing ability. $ 


[703] 
*Graduate Record Examinations Advanced Gov- 
ernment Test. Senior year college through graduate 
School and candidates for graduate school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527); 180(220) minutes; prepared by the 
Advanced Government Test Committee appointed by 
Educational Testing Service: V. O. Key, Jr. (Chair- 
man), Albert Lepawsky, Carl B. Swisher, John A. 
View, and Ruth G. Weintraub; Educational Testing 
ervice. 


[704] 
*Junior High School Civics Test: State High 
School Tests for Indiana. Grades 7-9; 1934-46; 2 
levels ; mimeographed; no data on reliability and va- 
lidity; no manual; norms [47] ; 414€ per test; 156 per 
specimen set; postpaid; Florise Hunsucker; State 
High School Testing Service for Indiana, Purdue Uni- 
versity. * 
a) First SEMESTER. Form А ['46]. 
b) SECOND SEMESTER. Form N ['46]. 


[705] 

XPatterson's Tests on the Federal Constitution. 
Grades 914i 1927-37; 3 tests (labeled Forms A, B, 
and С); 1 form, '37; no data on reliability and valid- 
ity; no norms; manual ['27]; 10 or more copies, 5e 
each; 25€ per specimen set; cash orders postpaid; Ray- 
mond G. Patterson; Palmer Co. * 

a) FORM A, FUNDAMENTAL FACT TEST. Nontimed (25) 
minutes. 

b) FORM B, COMPLETION TEST. Nontimed (35) minutes. 
с) FORM C, MATHEMATICAL TEST. Nontimed (25) min- 


utes. 
[706] 

*Senior High School Civics Test: For a One- 
Semester Course: State High School Tests for 
Indiana. r semester high school; 1949; Form B; 
mimeographed; no data on reliability and validity ; no 
manual; norms [49]; 4%%# per test; 15 per specimen 
set; postpaid; 40(45) minutes; Meribah Clark; State 
High School Testing Service for Indiana, Purdue Uni- 


versity. * 

[707 ] 
*Senior High School Civics Test: State High 
School Tests for Indiana. 1, 2 semesters high 
school; 1934-47; 2 levels; mimeographed; no data on 
reliability and validity; no manual; norms [47]; 4€ 
per test; 15@ per specimen set; postpaid; 40(45) min- 
utes; Meribah Clark (A) and Olis G. Jamison (N); 
State High School Testing Service for Indiana, Pur- 
due University. * 
a) FIRST SEMESTER. Form A [’47]. 
b) SECOND SEMESTER. Form N [47]. 


SOCIOLOGY 


[708] 
*Black-Schrammel Sociology Test. High school ; 
1936-37 ; Forms А (36), В (^37) ; manual [37] ; $1.05 
per 25; 20€ per specimen set; puis; 40(45) min- 
utes; William A. Black and H. E. Schrammel; Bureau 
of Educational Measurements, Kansas State Teachers 
College of Emporia. * 

[ 709 ] 
xGraduate Record Examinations Advanced So- 
ciology Test. Senior year college through graduate 
school and candidates for gaia school; 1939-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 180(220) minutes; prepared by the 
Advanced Sociology Test Committee appointed by 
Educational Testing Service: Wilbert E. Moore 
(Chairman), Theodore Abel, Everett C. Hughes, Ed- 
ward P. Hutchinson, and Logan Wilson; Ed ucational 
Testing Service. * 
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VOCATIONS 


Reviews nv Anne Anastasi, Dwight L. Arnold, Brent Baxter, Harold Bechtoldt, George K. Ben- 
nett, Ralph F. Berdie, Reign H. Bittner, Milton L. Blum, Edward S. Bordin, Arthur H. Brayfield, 
Leo J. Brueckner, John В. Carroll, Harold D. Carter, E. С. Chambers, Stuart A. Courtis, Lee J. 
Cronbach, Edward E. Cureton, Louise Witner Cureton, Jerome E. Doppelt, George A. Ferguson, 
Н. M. Fowler, Norman Frederiksen, Frank S. Freeman, John W. French, Clifford P. Froehlich, 
Edward B. Greene, Milton E. Hahn, George W. Hartmann, Edward N, Hay, Kenneth L. Heaton, 
Elmer D. Hinckley, Lloyd G. Humphreys, Clifford E. Jurgensen, Harry W. Karn, Raymond A. 


i Katzell, Willard А. Kerr, Albert К. Kurtz, Charles R. Langmuir, Wilbur L. Layton, D. Welty 


Lefever, D. W. McElwain, Milton M. Mandell, Ross W. Matteson, Joseph E. Moore, М. W. Mor- 

ton, Jay L. Otis, C. Robert Pace, Donald G. Paterson, E. A. Peel, Harry N. Rivlin, Alec Rodger, 

Floyd L. Ruch, G. A. Satter, Douglas G. Schultz, May V. Seagoe, I. Macfarlane Smith, Julian 

C Stanley, Dewey B. Stuit, Donald E. Super, Howard R. Taylor, Albert S. Thompson, Robert 

L. Thorndike, Robert M. W. Travers, Arthur E. Traxler, P. E. Vernon, William W. Waite, 

S. Rains Wallace, Jr., Edwin Wandt, Morey J. Wantman, F. W. Warburton, A. T. Welford, 
Alexander G. Wesman, D. A. Worcester, and Dale Yoder. 


[710] 
*Aptitude Tests for Occupations. Grades 9-13; 
1951; IBM; 6 tests; Form A; 35€ per specimen set of 
any one test, postpaid; separate answer sheets may be 
used; 4¢ per IBM answer sheet for any one test; 40€ 
per stencil for machine scoring of answer sheets of any 
one test; 40€ per stencil for hand scoring of answer 
sheets of any one test; 5€ per profile chart; postage 
extra; I07(135) minutes; Wesley S. Roeder and Her- 
bert B. Graham; California Test Bureau. * 
а) PERSONAL-SOCIAL APTITUDE. $1.50 per 25; 20(25) 
minutes. 
a MECHANICAL APTITUDE. $2 per 25; 20(25) minutes. 
с) GENERAL SALES APTITUDE. $1.50 per 25; 20(25) min- 
utes, 
d) CLERICAL ROUTINE APTITUDE. $1.25 per 25; 12(15) 
minutes, 
е) COMPUTATIONAL APTITUDE. $1.25 per 25; 15(20) 
minutes, 
f) SCIENTIFIC APTITUDE. $1.50 per 25; 20(25) minutes. 
Стлвғовр P. FgoEnricH, Associate Professor 
of Education, University of California, Berkeley, 
California. 

This battery is made up of six parts, each la- 
beled as a test of one of the following aptitudes: 
personal-social, mechanical, general sales, cleri- 
cal routine, computational, and scientific. The 
parts contain from 45 to 60 items and require 
from 12 to 20 minutes of testing time, The bat- 
tery contains 300 items and requires 107 minutes 
of testing time. The length of this battery seems 
very short in view of its scope and the experi- 
ence of such test development programs as that 
of the United States Air Force. 

'The manual states that these "tests were con- 
structed for the purpose of obtaining a quick but 
valid general vocational profile or pictures of an 
individual" (italics mine). This emphasis on vo- 
cational application of the test is reiterated 
throughout the manual and in descriptive litera- 


ture. The publisher's catalog, for example, states 
that these tests “аге intended to serve as aids in 
the analysis of the individual's aptitudes in order 
to determine whether or not he possesses the 
potentialities necessary for success in a given 
field." The name Aptitude Tests for Occupa- 
tions implies the vocational significance of the 
tests. But the manual does not report test scores 
obtained by persons actively engaged in any 
occupation. The closest it comes is to present 
correlation coefficients of subtests of the battery 
with criterion tests. For example, Clerical Rou- 
tine Aptitude was correlated with the Minne- 
sota Clerical Test, the clerical test of the Differ- 
ential Aptitude Tests, and SRA Clerical Apti- 
tudes. The correlations were based on groups 
of students in grades 9 to 13 (usually two grades 
together), ranging in number from 26 to 78. Of 
the ten coefficients reported, four are in the 
forties, three are in the fifties, two are in the 
sixties, and one is in the seventies. A comment 
made in the manual in connection with the inter- 
correlations among the subtests of the battery !5 
pertinent here: “In evaluating these intercorre- 
lations, it must be considered that a correlation 
of .50 accounts for 25 per cent of the variance, 
and that 75 per cent of the variance thus remains 
unaccounted for by the relationship between the 
tests." While these correlations with so call 
“criterion tests" yield interesting data, they pt 
vide little information about the vocational sig- 
nificance of the Aptitude Tests for Occupation’. 
Additional statistics concerning the test 21 
described in the manual in these words: 


The following cross-validation data were obtained by 
determining the raw scores at standard deviation !n 
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vals of +2, +1, 0, —1, —2 for the Aptitude Tests for 
Occupations and for certain criterion tests, and then 
converting these scores to percentiles using the norms 
rovided for the tests. For example, for the Personal- 
оса! Test in grade 9, the +2 S.D. percentiles were 98 
{ог the Personal-Social Test and 99 for the Test of 
Social Comprehension; for the same grade and tests, 
the —1 S.D. percentiles were 10 and 10, respectively. 

The two or three comparisons presented for 
each of the subtests indicate the school grade 
and sex of the group tested, the names of the 
tests compared, and the percentile ranks for each 
test at the five standard deviation points indi- 
cated above. Neither the data nor the procedures 
are explained further in the manual. This re- 
viewer, in correspondence with the Director of 
Research and Technical Services of the Cali- 
fornia Test Bureau, raised the following ques- 
tions: "What do such data reveal about the va- 
lidity of the test? Could I not, by chance, find 
that a test of musical aptitude and Personal- 
Social Aptitude would yield comparable stand- 
ard deviation comparisons, even though the tests 
were uncorrelated ?" The prompt and courteous 
reply to this query pointed out that the data were 
obtained by testing “groups available in the 
public schools in various parts of the nation.” 
The standard deviation points “were derived 
from the original plots, wherein significant cor- 
relations were found.” Certainly the size of the 
group and the correlation coefficient obtained 
between a test in the battery and a criterion test 
should be reported in the manual if one is to 
make meaningful interpretations. 

The reviewer was informed that the technique 
of standard deviation percentile rank compari- 
sons was developed by Alice Horn “in connec- 
tion with an examination of the comparability of 
five well-known intelligence tests * It was her 
feeling at the time that straight correlation would 
not be entirely satisfactory for her purposes. 
Therefore, she devised a technique which would 
enable her to get comparability of the data all 
along the scale.” But, what is revealed about 
baci, by comparing percentile ranks equiva- 
^ to certain standard deviation points? How, 
?r example, does the fact that a group of grade 
9 boys obtained a mean score equivalent to a 
pas iud rank of 90 on General Sales Aptitude 
3 hs mean score equivalent toa percentile rank 
Te $ on the George Washington University 

st of Ability to Sell reveal validity? The —2 
eee ee percentile ranks for the same 

р and tests are 40 and 50 respectively. The 


‘foremen; " ^ i 
nical S tioned Director of Research and Tech 


ervices writes, “We inserted these data ' 


in the manual in order that the test user might 
see that comparability holds up even with such 
groups." 

Perhaps a more important finding from the 
comparisons that are reported is the distribution 
of percentile ranks equivalent to the mean score 
in the 17 groups tested. This shows one mean 
equivalent to a percentile rank of 9o, four to 
80, three to 70, five to 60, one to 50, one to 40, 
and two to 30. While the reviewer recognizes 
that the mean score is not necessarily equivalent 
to a percentile rank of 50, he feels that this dis- 
tribution of equivalent percentile ranks hints 
that the groups tested are atypical or that the 
published norms are inadequate. 

The "occupational patterns" presented in the 
manual "include not only a list of occupations 
related to each of the six major fields of aptitude 
measured by the tests but also include occupa- 
tions which are related to more than one field." 
The manual presents no information concerning 
the basis upon which these patterns were pre- 
pared. The basis was described, however, by the 
senior author of the test in a letter to this re- 
viewer in these words: 


Among the materials we found helpful in devising 
the list of occupational uu were the Dictionary 
of Occupational Titles, Handbook of Job Facts and 
Occupational Monographs in "The American Job Se- 
ries,” and Occupational Briefs on America's Major 
Job Fields. We have done considerable work in collat- 
ing the list with this information, and especially by ob- 
serving individuals on the job. The four years spent in 
the army as Port classification officer, and 
personal counselor gave unlimited opportunity to study 
the various individuals on their respective Jobs. 

The manual instructs the test user that “re- 
gardless of the nature of an individual's profile, 
the highest percentile ranks should be investi- 
gated"; this despite the fact that the typical 
standard error of measurement is 3 raw score 
points, which on some subtests is equivalent to 
a large percentile rank range. For example, the 
norms for Personal-Social Aptitude show that 
the raw scores one standard error of measure- 
ment below and one standard error of measure- 
ment above the median are equivalent to per- 
centile ranks of 30 and 70 respectively for ninth 
grade girls and 20 and 8o for ninth grade boys. 
To this reviewer, the failure to explain the haz- 
ards of differential prediction is a grave over- 
sight. Test users need more sophistication in 
interpreting the profile than the manual's state- 
ment that “the important feature of an exami- 
nee's profile is the comparison of his standing in 
each field with his standing in other fields. The 
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standard error of measurement may render these 
comparisons meaningless. 

This reviewer is of the opinion that not enough 
is known about this test to recommend its use 
for occupational guidance, selection, or classifi- 
cation. He recommends that the publishers cor- 
rect errors and eliminate the unsupported claims 
in the present manual and that they collect and 
report sufficient data to provide a basis for 
judging the test’s value. 


-J Consult Psychol 15:515-6 D ’51. Laurance 
F. Shaffer. * In the main, these tests fulfill the 
current expectations of psychologists for a series 
to measure differentiated abilities. They fall de- 
cidedly short of ultimate ideals for test develop- 
ment in two respects. First, there is no evidence 
that they measure “aptitude” as distinguished 
from achievement, and the magical word is likely 
to deceive the unwary. Second, although the tests 
are recommended mainly for occupational guid- 
ance, they are not accompanied by evidence to 
show their power to predict successful adjust- 
ment to any vocation. The latter fault can be 
remedied by further research. 


[711] 
*Differential Aptitude Tests. Grades 8-12; 1947- 
50; IBM; 8 tests in 7 booklets; Forms A (47), B 
(^47) ; manual (47) ; manual supplements (^48, '49, '50, 
"51, '51) ; $21.50 per 25 sets of the 7 booklets; separate 
answer sheets must be used; $1.85 per 50 IBM answer 
sheets for any one booklet; $1.25 per set of machine 
scoring stencils; $1.25 per set of hand scoring stencils ; 
$2 per manual; $3 per specimen set; postpaid; 186 
(240) minutes; George К. Bennett, Harold С. Sea- 
shore, and Alexander G. Wesman; Psychological Cor- 
poration, * 
а) VERBAL REASONING. $3 per 25; 30(40) minutes. 
b) NUMERICAL ABILITY. $2.25 per 25; 30(35) minutes. 
с) ABSTRACT REASONING. $3 per 25; 25(30) minutes. 
d) SPACE RELATIONS. $3.50 per 25; 30(40) minutes. 
e) MECHANICAL REASONING. $3.75 per 25; 30(35) min- 
utes. 
f) CLERICAL SPEED AND ACCURACY. $3 per 25; 6(15) 
minutes. 
g) LANGUAGE USAGE. 2 scores: spelling, sentences; $3 
per 25; 35(45) minutes. 
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One of the more valuable tools for sound pi 


cational and educational guidance available 
day is represented by the Differential Aptitude 
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Tests. The basic hypothesis used in the develop- 
ment of this battery is that the appraisal of each 
of several "abilities" will enable vocational and 
educational counselors to form realistic judg- 
ments as to the educational curricula appropriate 
to the skills of students and reasonable judg- 
ments as to which students should take each 
course. 

The eight tests, presented. in seven booklets, 
are available in two forms. Separate answer 
sheets are provided so that repeated use of the 
test booklets is possible. High editorial stand- 
ards are reflected in the pleasing appearance of 
each booklet; the content of the items similarly 
reflects competent test construction work. The 
test items are similar in content and form to the 
verbal-numerical and pictorial items found in 
other scholastic aptitude tests. Norms, perhaps 
the most adequate special aptitude or ability test 
norms currently available, are based upon over 
47,000 cases secured from nearly тоо schools se- 
lected to represent the major geographical areas 
of the United States and, to some extent the 
various socio-economic groups characteristic of 
urban areas. The norms are further distinguished 
by the reporting by sexes and by grade levels 
from grades 8 to 12 inclusive the major percen- 
tile points together with the means, standard 
deviations, and number of cases for each of the 
eight tests. The usual sex differences in score 
distributions are noted for the spatial and nu- 
merical tasks. Although the score distributions 
Suggest somewhat inadequate discrimination at 
the lower 5 per cent of the eighth grade popula- 
tion on a few tests, the published results indicate 
an acceptable degree of differentiation between 
individuals in the populations sampled. The use 
of the Scoring formula R-kW for all but two tests 
18 apparently a concession to tradition. No evi- 
dence is presented to the effect that weighting 
the wrong responses, W, is of any value in the 
Non power tests. An interesting feature of the 
пиа ee cAbilir test is the use of a fifth choice, 
a of these, with each of the 40 items. A unique 
s very valuable adjunct of the Differential 

Ptitude Tests is the loose-leaf form of the test 
manta], . 

M. ow authors have not only provided the 
eu S 1 ап extensive set of norms, but they 
Wed ni included in the very extensive manual 
dis pe materials and very valuable 
and "ua ing with the concepts of test reliability 

validity, and of errors of estimate. The man- 


ual i i 3j 
itself presents an interesting contrast between 


extreme naiveté (statements about correlation 
coefficients on page E-50 of the manual) and con- 
siderable sophistication in test theory (extensive 
tables of validity coefficients, reliability coeffi- 
cients, and expectancy data). How vocational 
counselors can use these, or any tests, adroitly 
without knowing a great deal about test analysis 
is difficult for the reviewer to understand. Coun- 
selors administering these tests will probably ac- 
quire a “felt need" for more detailed theoretical 
and empirical knowledge of the relations be- 
tween performance measures. 

Reliability coefficients determined by a split 
half technique for the seven power tests are pro- 
vided for each form of each test separately for 
the two sexes and for grades 8 to 12 inclusive. 
These coefficients (averages of the several grade 
coefficients) range from .86 to .93 for six of the 
tests. The coefficient for the seventh test, Me- 
chanical Reasoning, is somewhat lower for girls. 
For Clerical Speed and Accuracy, an average 
reliability of .87 for similar groups was deter- 
mined by the alternate forms method, this 
method being considered more appropriate for 
a highly speeded test. Corresponding reliability 
coefficients are also presented for the separate 
grades by form and sex. In addition, the test 
authors have provided in tabular form the esti- 
mates of the standard errors of measurement as- 
sociated with these correlation coefficients. The 
use of the standard error of measurement in the 
evaluation of test scores is definitely recom- 
mended by the reviewer to all test publishers 
and consumers. 

Since the several tests were designed so that 
each would provide a unique contribution to the 
counseling prediction solution, the test intercor- 
relations presented in detail by grade, sex, and 
form, together with the average correlation for 
similar samples, is a most admirable feature of 
the manual. The authors properly indicate that 
both high reliability and relatively independent 
tests are necessary for differential prediction. 
They might have emphasized the same require- 
ments for efficient prediction in any one area of 
activity. The test intercorrelations suggest that 
the multiple correlation with their criteria of 
success may not be appreciably higher than the 
zero-order correlations ; therefore, the magni- 
tude of the unique contribution of some of the 
tests may be too small for practical use. The 
unique contribution required for differential pre- 
diction is analyzed in terms of Kelley’s technique 
for estimating the proportion of differences in 
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excess of the chance proportion for pairs of 
tests. The results are interpreted by the authors 
as evidence warranting the inclusion of all eight 
tests in the battery (page C-2 of the manual). 
The reviewer would suggest that these analyses 
constitute the first step in the solution of the 
differential prediction problem. The presenta- 
tion for each academic area of study and/or each 
occupational job family of the linear regression 
equations or discriminant function weights to- 
gether with their standard errors would seem to 
provide more relevant data as to the necessity for 
inclusion of all eight tests of the battery. This 
problem, which has not been discussed in the 
manual, is definitely complicated by the unsatis- 
factory nature of the criterion—the critical phase 
of the validity problem. 

The relationships between the scores on the 
Differential Aptitude Tests and a variety of other 
tests for both men and women and for grades 
9 to 13 inclusive are presented in some detail. 
The other tests used in the intercorrelations in- 
clude the American Council on Education Psy- 
chological Examination, the Primary Mental 
Ability Tests, and the General Aptitude Test 
Battery, as well as objective achievement exami- 
nations in English, reading, mathematics, and 
social studies. These data together with the in- 
tercorrelations of the Differential Aptitude Tests 
and the intercorrelations of the other aptitude 
and achievement tests available to the test au- 
thors may provide eventually a practical answer 
as to the number of different measures (from 
among those studied) to be used for both selec- 
tion and differential prediction problems. Cor- 
relations with the Kuder Preference Record 
covering three grades in one school system are 
uniformly low. Especially commendable in this 
connection, is the warning, on page E-85 of the 
manual, to vocational counselors who rely 
mainly upon indications of verbally expressed 
interest. Perhaps, from the extensive studies 
carried out by the authors of the Differential 
Aptitude Test, vocational counselors can be 
given at a later date some positive suggestions 
based on empirical evidence as to the recom- 
mended techniques of prediction and the asso- 
ciated errors of estimate. Since the test authors 
express concern about what the counselor can 
learn from the data in the manual, the reviewer 
would like to add the suggestion that the coun- 
selor can learn to ask himself and the test pub- 
lishers, “What is the evidence for the statement 
or prediction ?" 
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The manual includes rather detailed discus- 
sions of the problem of validity and of the neces. 
sity for accurate estimates of predictions. The 
positive correlations with school grades are con- 
sistent with the general literature on the predic- 
tion of educational success. It is very encourag- 
ing to find the authors cautioning the test user 
about “validity by inference" (based on similari- 
ties of test form) since thé valuable and exten. 
sive data on the correlations of the eight aptitude 
tests with course grades often are not in line 
with "expectations". so frequently expressed in 
the vocational counseling literature. The spec- 
ificity of the relationships between the predictor 
and criterion measures is emphasized repeatedly 
in the manual. The authors may be doing them- 
selves an injustice in this case since a cursory 
reading of these tables of validity coefficients 
might lead to the conclusion that the tests are of 
little value. 

То the reviewer, the crux of the educational 
guidance and differential prediction problem is 
the complete inadequacy of school grades as cri- 
terion measures (the problem is one of the ex- 
traneous sources of variance rather than the re- 
liability of the school grades). The problem is: 
one that cannot be handled in this review or in 
a manual aimed at a test user who has to be in- 
formed tliat a correlation coefficient is an index 
of association ranging in magnitude from —1.00 
to +1.00. Briefly, the evaluation techniques cur- 
rently used in our high schools are such that 
little, if any, differential prediction can be ob- 
tained. Similar unsatisfactory or inadequate 
definitions of “success on the job" may well lead 
to the conclusion that differential prediction is 
not attained in either educational or vocational 
activities with current criteria. 

The reviewer would like to commend the au- 
thors of the Differential Aptitude Tests on their 
work to date, and to recommend these tests to 
vocational counselors for use in educational guid- 
ance or educational research programs. The re 
sults of further work by the test authors on the 
problem of the criterion and on the problem 0 
differential prediction of success in educational 
or vocational activities will be awaited with in- 
terest. These results may indicate that the 
prediction (differential or simple) of the 
available criterion measures can be accom 
plished as well by composite scores based of 
two or three clusters or groupings on thé 
eight tests as by the scores on the separate 
tests. 


y" due 
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Ёлрн F. BERDIE, Professor of Psychology, and 
Director, Student Counseling Bureau, Univer- 
sity of Minnesota, Minneapolis, Minnesota. 

The adequacy of the factorial theory upon 
which differential aptitude testing is based de- 
termines the ultimate usefulness of differential 
tests. The authors of the Differential Aptitude 
Tests, after considering carefully the advantages 
and disadvantages of developing a differential 
battery of specific tests identified in factor analy- 
ses, decided instead to attempt to select tests that 
would fill the practical needs of personnel and 
guidance workers, They describe this battery as 
а compromise “between the desire to measure 
‘pure’ mental abilities that emerge from factor 
analysis and the practical necessities continually 
encountered by personnel and guidance workers 
through the years.” Several of the tests in this 
battery are similar to tests of “pure” abilities, 
as defined by factor analyses. The'clerical speed 
and accuracy test and the language usage tests, 
however, purposefully present compounded pic- 
tures of two or more pure aptitudes. 

Included in the battery are eight tests. With 
the exception of the clerical speed and accuracy 
tests, these are power tests with items graded in 
order of difficulty and with time limits that per- 
mit most people to attempt all items. For each 
form of the test, the authors present coefficients 
of correlations between each of the tests and the 
others, for each sex and for each grade from 8 
through 12. The averages of these intercorrela- 
tions range from .06 to .67. A study conducted 
at the University of Minnesota found that the 
intercorrelations between tests for each of three 
groups of freshmen engineering students were 
no higher and in some cases lower than the inter- 
correlations presented by the test authors. The 
two language usage tests are relatively closely 
related both to each other and to the verbal 
reasoning test, but, in general, it appears that 
they successfully measure relatively discrete fac- 
tors. It was found at Minnesota that the ceilings 
of the tests were too low for selected groups of 
college freshmen, 

he test items show signs of rare ingenuity. 
hi: items in the verbal reasoning test are double 
к ra We where the first term of the analogy is 
ud Selected from four alternatives, and the 
Bub, of the analogy is to be selected from a 
ability group of four alternatives. The numerical 
these ou ranges from simple arithmetic 
ru ae more complex numerical problems in- 

ing cube root and number reasoning. The 


abstract reasoning test involves the detection of 
a principle of change occurring in four succes- 
sive figures and then the selection of an appro- 
priate fifth figure to be chosen from a group of 
five alternatives. The items in the space rela- 
tions test involve visualizing a constructed ob- 
ject from a picture of a pattern of its surfaces. 
The authors support the choice of this type of 
item by saying that "the test requires mental 
manipulation of objects in three-dimensional 
space. Item forms which refer to only two di- 
mensions are less useful since there are relatively 
few occasions when perception of two-dimen- 
sional space alone is important." The items in 
the mechanical reasoning test present pictures of 
mechanical situations together with simply 
worded questions. The clerical speed and ac- 
curacy test requires the testee to select combina- 
tions of letters and numerals marked in the test 
booklet and then to identify from a group of 
similar combinations the identical combination 
on the answer sheet. The first language usage 
test, the spelling test, requires the testee to 
identify from a list of words those words spelled 
incorrectly. The second test presents a series of 
sentences, each divided into five sections, and re- 
quires the testee to identity those sections con- 
taining errors. 

The test items have intrinsic interest for stu- 
dents, and the tests are easy and convenient to 
administer. The printing is good, the paper stock 
superior, and there is sufficient white space on 
each page. 

The most impressive feature of the Differ- 
ential Aptitude Tests is the test manual. This is 
unquestionably the best test manual published, 
and its authors received recognition from the 
Council of Guidance and Personnel Associations 
in 1951 for the superior job they have done in 
presenting information about the tests. There is 
complete information concerning the purposes of 
the tests, procedures to be used in administering 
the tests, statistics concerning the tests and their 
standardization, principles to be considered in 
interpreting test scores, and normative frames 
of reference. 

The manual’s section on statistical informa- 
tion is particularly pertinent. Not only are reli- 
ability coefficients for each test presented by 
grade and by sex, but standard errors of meas- 
urement are also systematically presented, Too 
many of the tests have reliability coefficients in 
the .80’s, the mechanical reasoning test and the 
clerical speed and accuracy test having somewhat 
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lower reliabilities than the others. The reliabili- 
ties of all of these tests, however, are adequate 
for use in counseling individual students. 

The manual contains, at present, 94 pages di- 
rectly relating to the validity of these tests. Prac- 
tically all of these data are in terms of correla- 
tions between test scores and course grades of 
high school students. Tables added to the man- 
ual summarize this mass of validity data and bear 
out the authors’ contention that a test does not 
have validity but rather validities. The available 
information suggests that for counseling high 
school students the tests may have useful validi- 
ties. This is a statement that, on the basis of 
existing data, can be made about no other exist- 
ing aptitude tests. 

A useful section in the manual contains eight 
brief case histories. These histories and ones 
similar to them should be useful to high school 
teachers and counselors attempting to gain fa- 
miliarity with these tests. Another useful section 
describes various methods of plotting individual 
report forms; a convenient graphic method is 
also described to assist in determining the sig- 
nificance of differences between scores on a 
profile. 

In summary, the Differential Aptitude Tests 
have been carefully developed and standardized 
by competent authors who have done an excep- 
tionally good job in making information about 
these tests available to the public. The tests have 
some validity in predicting success in high school 
courses, and some evidence has appeared con- 
cerning their validity in predicting vocational 
success. The tests are presented primarily for 
use with boys and girls in grades 8 through 12, 
although the authors indicate that they may be 
useful with young adults. Because of the rela- 
tively low ceilings of the tests, young adults or 
college students of high ability can not be differ- 
entiated adequately by means of these tests. 
Finally, information concerning the relationships 
existing between these test scores and other 
types of psychologically meaningful data should 
be gathered as soon as possible. 


Lrovp G. Humpureys, Associate Professor of 
Psychology, Stanford University, Stanford, 
California. 

The factor analysis research concerning the 
organization of abilities and the subsequent use 
of tests measuring separate abilities in industrial 
and military psychology led directly to the de- 
velopment of the Differential Aptitude Tests 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 6% 


(DAT). This battery was designed аза unit 
and has been standardized and validated as such, 
Users will normally want, therefore, to use the 
entire battery. е 

It will be noted from the list of test names that 
the final selection of materials represented a 
compromise between a straight "primary mental 
abilities" battery and the more familiar kinds of 
tests, such as clerical speed and accuracy, 

The publisher has done and is continuing to 
do a careful professional job with these tests, 
There are many assets. In only one place in the 
manual does the desire to inform become over- 
powered by other motives. In presenting case 
reports for which IQ’s and DAT scores were 
available, many of the cases were selected for 
the discrepancy between the IQ and the mean 
battery score. The authors must have searched 
for some time, for example, for the girl with an 
IQ of 97, supposedly the result of repeated ex- 
aminations, and with superior scores on every 
test of the DAT. Only rarely would a descrep- 
ancy this large occur by chance if the intelligence 
tests were standard verbal tests and were given 
by trained examiners. More importantly, such 
cases do not illustrate the advantages to be gained 
from testing separate aptitudes. 

The manual is complete and detailed and in 
loose-leaf form so that new material can readily 
be added. It is gratifying to report that new ma- 
terial is constantly being added. The reliabilities 
on individual tests are satisfactory and deter- 
mined in a reasonably conservative fashion. If 
individual tests are weighted into composites, the 
reported reliabilities are more than adequate. 
Reliabilities are also reported for each grade 
level in which use of the test is recommended. 

Many validity data, mostly follow-up, have 
now been reported. The publishers have pre- 
sented the zero correlations along with the high 
ones for the schools where data were obtained 
and have consolidated the information obtained 
from many small samples, giving the test use 
more stable values for the various validity co 
efficients. 

The norms are far better than those available 
for most tests. The numbers are large and we 
distributed by grade, sex, and school. Thirty 
different schools from several geographical sec 
tions and from communities of different sizes 416 
represented. (Norms with a broader geographi- 
cal basis are promised for the near future.) Per- 
centile ranks are provided, but the profile form 
places these ranks as they would fall on a norma 
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curve. A rule is also provided, when this profile 
form is used, for deciding on the probable de- 
gree of significance of differences between scores, 

There are a few things to report on the liability 
side of the ledger. For efficiency in differential 
prediction, it is desirable to have a battery with 
low intercorrelations. While the intercorrela- 
tions among the tests in the present battery are 
low enough to make the battery useful, they are 
not as low as possible. Purer tests than several of 
those used here are available. 

The reviewer likes to see correlations re- 
ported with other tests, particularly with com- 
peting tests. For this battery, for example, one 
would like to see the correlations with the 
Thurstone Primary Mental Abilities. No corre- 
lations of this type appear in the manual. (An 
announcement arrived as this was written con- 
cerning the early appearance of these correla- 
tions. ) 

A more important omission is the failure to 
come to grips with the problem of interpreting 
tests from a profile. How does the guidance 
worker make use of the information supplied on 
the significance of differences between separate 
scores? It is not obvious that with information 
concerning significance of differences and knowl- 
edge of validity coefficients of the individual 
tests the counselor can reach valid decisions 
from the battery as a whole. Perhaps the next 
step is to supply regression equations for these 
tests where the data permit. (A discussion of 
this problem has recently been announced for 
early publication. ) 


For an excerpt from a review, see 3:620. 
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manne Aptitude Series. Adults; 1047-50; 14 
ot 55 battery manual (’48) ; norms (750) ; $4.25 per 50 
id ay test; $2 per specimen set; $1 per specimen 
trial ру Опе test; postpaid; Joseph E. King; Indus- 
n sychology, Inc. * 
Grice TERMS, 1947-50; I form, '47; 5(8) minutes. 
z Mus TERMS. 1948-50; І form, ^48; 5(8) minutes. 
TOM: 1948-50; 1 form, '48; 5(8) minutes. 
4 тон 1947-50; І form, '47; 5(8) minutes. 
NUMBERS. 1947-50; 1 form, '47; 5(8) minutes. 
9 Yay HON. 1947-50; 1 form, '47; 5(8) minutes. 
һ Ын. 1948-50; 1 form, '48; 5(8) minutes. 
i) УТ 1947-50; І form, '47; 6(9) minutes. 
i) o: 1948-50; т form, '48; 5(8) minutes. 
Eire wae 1948-50; 1 form, '48; 5(8) minutes. 
1) NC 1949-50; I form, '49; 5(8) minutes. 
m) md 1949-50; 1 form, 49; 5(8) minutes. 
n) мотор 1948. 249 50: 95: form, 49; 3(5) mine. 
2 0; I form, '48; $2 r set of testin 
Apparatus; 6( тоў minutes. ' BUT у 


Te REFERENCES 
Testing.” ApOSEPH E. “The Perception Factor in Industrial 
title oniy) stract. Am Psychol 5:331 Jl ’50. * (PA 25:1300, 
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D. Мїшлү LzrrvER, Professor of Education, 
University of Southern California, Los Angeles, 
California. 

$ The Factored Aptitude Series is, as the name 
implies, a collection of tests based on the find- 
ings of factor analysis. With two exceptions, 
each test requires five minutes for administra- 
tion. Eight major factors of intelligence were 
selected as the basic framework for the battery. 
These include comprehension, reasoning, sys- 
tems, perception, fluency, memory, space rela- 
tions, and coordination. The eight factors are 
sampled by means of 14 separate measures, each 
of which is a 2-page paper and pencil test (the 
Motor test requires special apparatus). 

The series of tests is intended to be used in 
different combinations to predict success in a 
variety of job areas. For example, for junior 
clerk, the applicant takes the tests Office Terms, 
Numbers, and Perception; a candidate for a 
job as an inspector is given these “factored apti- 
tude” tests: Tools, Precision, Parts, Blocks, and 
Dimension. 

Raw scores are converted to stanines and are 
recorded on a "qualification grid" designed for 
each job area. The grid indicates the weights for 
predicting success on the job to be assigned to 
the several possible stanine values for each test. 
These weights are not always directly propor- 
tional to the test score since some applicants may 
show too much aptitude for the job assignment 
under consideration. 

The author states that the aptitude series will 
aid management in making decisions about hir- 
ing new workers, transferring employees to job 
assignments for which they may be better fitted, 
planning the training program, assigning job 
duties, checking on the quality of work done in 
relation to aptitude or work potential, promoting 
employees to more complex jobs requiring 
greater and possibly different aptitudes, and de- 
termining staff reductions. 

STRONG POINTS. (a) The author provides a 
broad base for operations by recognizing that 
aptitude tests must be used as “опе of the many 
tools and sources of information for personnel 
planning. Aptitude tests are by no means ad- 
vocated to replace the interview, personal history 
check, supervisor judgment, other personnel 
methods." (b) The listing of the names and pub- 
lishers of other well known standardized apti- 
tude tests indicates a less narrowly competitive · 
attitude than characterizes some authors. (c) 
Several tests in the series employ pictorial rather 
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than verbalized contents on the assumption that 
where language is not essential to job success 
it should not dominate the aptitude measure. 
Greater face validity is gained by the use of pic- 
tures of tools, photographs of personnel, and 
Sketches of plant operations. (d) The need for 
each applicant to study his profile of test scores 
in relation to his own pattern of success is em- 
phasized. Intrapersonal comparisons sometimes 
produce better mental hygiene and more effec- 
tive vocational guidance than does the usual in- 
terpersonal analysis. (e) Considerable attention 
is given to the problem of the “overqualified” 
applicant whose potential may be too high for 
the particular job for which he is being tested. 
Too much ability can be the source not only 
of dissatisfaction but of serious inefficiency. (f) 
The design and printing of the tests and manuals 
are outstanding. (g) The general explanation of 
basic theory of the testing instruments and of the 
techniques of interpretation and use is excep- 
tionally clear. The author has succeeded in sell- 
ing his aptitude series with unusual effectiveness. 

WEAK POINTS. (a) The manual does not pre- 
sent an adequate statement concerning the pres- 
ent limitations of aptitude tests as predictors of 
job success. The fact that the manual was written 
in a semipopular style for the administrative and 
personnel staff of business and industry does not 
excuse the author for rather vague statements 
implying a high order of predictive precision for 
the series. In the 1948 edition of the handbook 
he states that the study of aptitude test predic- 
tion "has quite conclusively shown that....job 
performance can be predicted by knowing an 
employee’s scores on the pertinent aptitudes.” 
Definite indications of the large errors frequently 
found in prediction studies should accompany 
such statements. It is not fair to present a few 
exceptionally favorable examples of aptitude 
test validation. This “sin of omission” is par- 
tially offset by the following quotation from the 
1948 handbook : “The tools and methods of per- 
sonnel measurement admittedly require a good 
deal of further development. They are by no 
means a complete answer to questions about 
personnel, nor will they ever be." (b) More di- 
rect research evidence should be presented in 
support of the assertions regarding the relation- 
ship of test data to the pattern of performance 
on the job. In the judgment of this reviewer, 
the general literature of industrial psychology 
does not support the sweeping implications of 
the author’s discussion. For example, how were 
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the weights determined that were assigned to 
the several stanine values in the “qualification 
grid," and what is the order of precision in. 
volved in their use? (c) A strong research 
foundation should be offered for such broad gen- 
eralizations as: "The Q, (well-qualified) em- 
ployee pays off double (his salary and overhead) 
for the company." The Q, (average qualified) 
shows "productivity roughly one and one-half 
times his salary and overhead expense." In the 
absence of any supporting data, it seems highly 
probable to this reviewer that such ratios of 
productivity to cost would vary greatly with 
different job areas, kinds of businesses, and 
management policies. (d) The composition of 
the “total employed population” on which norms 
were established seems rather arbitrary, and the 
proportions of job areas represented somewhat 
fortuitous. (е) The handbook should report 
more specifically the data on the reliability of the 
tests. In the 1948 edition complete information is 
given for the Sales Terms test, but only one 
measure of reliability for each of the other tests 
is presented as the diagonal values of an inter- 
correlation matrix. Presumably, these are со- 
efficients derived by the odd-even procedure. 
Since it seems quite probable (from personal 
observation) that most of the tests in the Fac- 
tored Aptitude Series will show a fairly large 
portion of unattempted items for the average 
applicant, the reliability measures should con- 
sist of test-retest correlations or coefficients com- 
puted between alternate forms. Speeded tests аге 
very likely to produce odd-even correlations 
that are spuriously high. The only complete data 
(for the Sales Terms test) show the test-retest 
correlation to be .81 compared with .91 for the 
odd-even determination. ; 
SUMMARY. The Factored Aptitude Series 
gives evidence of careful design and thoughtful 
development. The test booklets, the manual, an 
the technical notes are exceptionally aon 
and readable and produce a decided effect 0 
face validity. Apparently, a considerable amount 
of thorough research has gone into the project: 
The general pattern of factors employed as the 
guiding framework for the series seems to fut- 
nish a worthwhile basis for a first class battery ; 
of aptitude measures. у 
In the judgment of this reviewer, the nl 
mism of the handbook and brochures should be 
tempered by more definite and frequent T efer- 
ence to actual research and to the inherent limit?" 
tions of aptitude tests for predicting job succe* 
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The few validation studies cited appear to in- 
clude the more favorable examples of predictive 
accuracy. The aptitude series shows excellent 
possibilities and deserves a more complete and 
openly frank exposition in the manuals designed 
for general use. 


J Consult Psychol 13:66—7 F ’49. Laurance 
F. Shaffer. * No data on validity are given in 
the manual, but figures supplied by the author 
indicate some differentiations between jobs in 
a hierarchy, and between employees rated as suc- 
cessful and unsuccessful. * With their attractive 
formats and ingenious conveniences of adminis- 
tration and scoring, these tests will appeal to 
industrial clients. A critical evaluation, however, 
must await additional evidence of their validity. 
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Fife Tests of Ability. Entrants to secondary schools 
(England and Scotland); 1947; 4 tests (also listed 
separately) ; 1 form ['47]; no data on reliability; no 
data on validity in manuals (for data presented else- 
where by the author, see 7-2) ; manuals ['47]; 5d. per 
single copy of any one test; 4d. per manual of any one 
test; specimen set not available; postage extra; 165 
(185) minutes; Frank M. Earle; University of Lon- 
don Press Ltd. * 

а) TEST I, ABILITY FOR ENGLISH (LANGUAGE). Ós. per 
25; 30(35) minutes. 

) TEST 2, ABILITY FOR SCIENCE. 65. 25; 
vinis per 25; 30(35) 
С) TEST 3, ABILITY FOR ALGEBRA. 55. 6d. ; 
ae 5з. 6d. per 25; 60(65) 
d) TEST 4, ABILITY FOR GEOMETRY. 55. 6d. per 25; 45 
(50) minutes, 

REFERENCES 
т. EARLE, FRANK М. Tests of Abilit for Secondary School 


Courses. Publications of the Scottish Council for Research in 


lucat { > Uat 
Рр, iii, did ce 10. London: University of London Press, 1936. 


loup bo RANK M. Reconstruction in the Secondary School. 
3. WILSON, тену. ог ondon Press Howe Pe S pijit 
Algebra Shown by Secondary $ Eka n cred Bachelor's del. 
Glasgow University (Glas; pes ‘Scotl жө iH . Abstract: Brit 
J Bd Psychol 20:65-6 F Pro. * (PA 25 1570, title only) 
I. Macrartane Ѕмітн, Offcer-in-Charge of 
the Tests Division, National Foundation for 
Educational Research in England and Wales, 
London, England. 
aie now nearly twenty years since the Scot- 
hob ouncil for Research in Education spon- 
i an investigation into tests of ability for 
Condary school courses. Its report, by Frank 
gues (т), published in 1936, contained four 
i Phe Tj appendix. These were: Test of Ability 
Тарр у; of Words, Test of Ability in Science, 
in б bility in Algebra, and Test of Ability 
niri Тһе Fife Tests of Ability are sub- 
ШЫ the same as these. Some of the more 
а words have been omitted from the vo- 
Ty section of the Test of Ability in the Use 
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of Words, and the test has been renamed Test of 
Ability for English (Language). Some minor 
alterations have also been made in the Test of 
Ability for Geometry, but the form now pub- 
lished is essentially the same as that printed in 
the 1936 report. 

, These tests were devised for use with pupils 
just commencing a secondary course, in the 
hope that they might help the school administra- 
tor to differentiate between those who would 
probably achieve a satisfactory standard of at- 
tainment in subjects cognate to the tests and 
those who would be unlikely to reach the re- 
quired standard. It was assumed that the abili- 
ties required for the various school subjects de- 
pend not only on the presence of a general factor 
but also on certain group factors. In the construc- 
tion of the tests, 22 draft tests were prepared 
from material commonly used in intelligence 
tests and given to about 400 children. The scores 
were then correlated with marks in various 
school subjects. The Fife tests were prepared 
from this material by putting together subtests 
which correlated most highly with individual 
subjects. The extent to which Dr. Earle's aim 
has been achieved can be gauged from the fol- 
low-up data accumulated over a period of nearly 
10 years and published in an appendix to his 
book, Reconstruction in the Secondary School 

2). 

The tests were validated by being adminis- 
tered to some 1,500 Scottish pupils early in the 
first year of the secondary school course. Scores 
were also obtained for a smaller number of Eng- 
lish pupils. The criterion adopted was the stand- 
ard of attainment expected for the Junior Cer- 
tificate (three years) in Scotland, and the School 
Certificate (four years) in England. Thus, for 
the Test of Ability in the Use of Words, fol- 
low-up data are given for six different years 
in one school in Scotland, and for 10 different 
schools in one year in England. From an analy- 
sis of the performance of these pupils in the Cer- 
tificate Examinations, estimates were made of 
the probability that а pupil who gained a par- 
ticular score would pass the Junior Certificate 
Examination in Scotland or the School Certifi- 
cate Examination in England. It was also pos- 
sible to find a “critical score,” by means of which 
the number of errors made in forecasting suc- 
cess or failure was reduced to a minimum. The 
availability of extensive follow-up data is one of 
the great merits of the Fife tests. Few tests have 
been so systematically followed up over such a 
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long period. The figures show, however, that 
the four tests are not of equal predictive value. 
When the Test of Ability for English (Lan- 
guage) was administered to pupils who had been 
selected for a secondary course and whose in- 
tention to complete the course remained un- 
changed, a critical score was found which, over 
six years, gave an average number of errors of 
only то per cent of the forecasts. For the Tests 
of Ability for Geometry and for Science, how- 
ever, the figures showed a higher percentage of 
errors. Because of recent changes which have 
taken place in the character of the School Certifi- 
cate Examinations, the tables showing probabili- 
ties of passing or failing certain examinations 
are now of less practical value to teachers than 
they were some years ago. 

There are several defects which are common 
to all four tests. In none of them are the sec- 
tions timed separately, and there is a strong 
probability that many pupils will not reach the 
later sections in the time allowed. No attempt 
has been made to arrange the layout of the mark- 
ing keys in a form convenient for marking. Most 
answers are arranged closely in rows instead of 
in columns, and in some sections they are not 
even numbered to correspond with the items. 

The norms for all four tests show marked ir- 
regularities in the relation between score and 
age. The upper and lower quartile scores of 
Scottish children in the Ability for Geometry 
Test are 87 and 60 for the age 11—6, while the 
corresponding figures for the age 12-9 are 68 
and 42. It would appear that older children in 
the sample of first year pupils tested tend to be 
less intelligent than younger pupils. There is a 
similar tendency in the norms for the other tests, 
but it is much less marked in the norms for Eng- 
lish children. A peculiar feature of the norms 
for the Ability for Geometry Test is the fact that 
English girls obtain very much higher scores 
than Scottish girls of the same age. It is difficult 
to account for this anomaly, except by supposing 
that different standards of marking have been 
employed. Information on the reliability coeffi- 
cients of the tests would be of interest and 
should be included in future editions of the man- 
uals. Apparently, the only available data on re- 
liability are figures based on small groups, re- 
ported in Earle's first publication. Two of the 
tests were administered again after 16 months 
and the test-retest correlations were found to be : 
Test of Ability for Algebra, r = .785 (N = 48); 
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and Test of Ability in the Use of Words, = 7 
(N = 31), and r — .72 (N = 36). 

TEST I, ABILITY FOR ENGLISH (LANGUAGE), 
This test contains five sections: Vocabulary, 
Word Pairing, Sentence Completion, Paragraph 
Completion, and Sentence Construction. The 
possible score is 225, and the time limit is 30 
minutes. The test appears to be unduly difficult, 
since the norms show that the upper quartile 
score is usually less than half the total possible 
score. The Word Pairing section is open to 
criticism on the ground that many of the items 
are ambiguous. For example, the pupil is re- 
quired to underline the word which makes the 
best pair with “hazardous” from voyage, adven- 
ture, attempt, storm. The answer given is “ad- 
venture," but surely a voyage might also be 
hazardous. The marking of the items in the Sen- 
tence Construction section is subjective, and the 
manual does not give examples to help the 
marker to decide between satisfactory and un- 
satisfactory responses. - 

TEST 2, ABILITY FOR SCIENCE. This test is in six 
sections. The first involves arrangement in order 
of size, the second and third involve classifica- 
tion, the fourth consists of analogies, the fifth 
of multiple choice sentence completion, and the 
sixth of logical reasoning. The possible score is 
180, and the time limit is 30 minutes. The stand- 
ard of difficulty is appropriate, the median 
scores being somewhat above half the possible 
score. Most of the items are standard types used 
in intelligence tests, and all are objectively 
marked. Apart from the fact that the first section 
lays emphasis on a knowledge of tables (of 
length, capacity, etc.), it is difficult to find any- 
thing in the content of the test to lead one to ex 
pect it to have specific predictive value for 
courses in science. 

TEST 3, ABILITY FOR ALGEBRA. This test con- 
sists of two sections: Number Series Comple- 
tion and Problems involving arithmetical proc 
esses, substitution in formulae, and symbolic ех- 
pression. The possible score is 120, and the time 
limit is one hour. The manual recommends that 
this test, like the others, be given between the 
second week and the fifth week of the first term 
of a grammar school course; yet many of 2i 
examples are decidedly too difficult for pup! 
who have only just begun the study of algebra 
Some of those on substitution or symbolic repre 
sentation would be more suitable for second year 
pupils. That the test as a whole is too difficult i 
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evident from the norms, the upper quartile scores 
being considerably less than half the total pos- 
sible score. Since formal knowledge of algebra 
is required, pupils who have received some in- 
troduction to algebra before leaving the primary 
school would certainly have an advantage in 
working the latter part of the test. 

TEST 4, ABILITY FOR GEOMETRY. This test con- 
tains six sections based on diagrammatic ma- 
terial and originally described by Earle as tests 
of Form Perception and Form Analysis. The 
possible score is 160, and the time limit is 45 
minutes. Some of the sections are what would 
now be termed spatial tests. The last section, for 
example, is a Fitting Shapes test of the creative 
response type and has, probably, a considerable 
spatial loading. The first two sections, however, 
probably require very little spatial ability but 
have the advantage that the marking is com- 
pletely objective. There is considerable subjec- 
tivity in the marking of the last four sections, 
and the marking key gives little or no guidance 
to the marker. Many items require answers in 
the form of sketches or of lines drawn on dia- 
grams, and most of these have alternative solu- 
tions. For three of the sections the only instruc- 
tions given are that each figure is evaluated ac- 
cording to the conditions prescribed and that 
any solution which adequately satisfies the con- 
ditions is accepted. The lack of precision of 
these instructions is a serious weakness of the 
test. The provision of a key containing possible 
alternative solutions and showing samples of 
acceptable and unacceptable responses would 
have increased the reliability of the test as well 
as greatly facilitated the marking. 

SUMMARY. In assessing the value of the tests 
asa whole, it must be borne in mind that they 
Were originally prepared at a time when factor 
analysis as now understood was in its infancy, 
m methods of test construction were less 
B vanced than they are now, and when many 
a psychologists minimised the importance 
ot HP sette Subsequent research in the field 
eos s ability has tended to j ustify Earle's at- 
of а 5 assess aptitude for geometry at the age 
A "i there is less certainty about the ex- 
ко k abilities additional to “g” for algebra 
fedeli bug Thus, the Fife tests should be re- 
educational е КО of pioneering research on 
gained Mee ilities. No doubt, the experience 
е the development and follow-up of the 

ests has been fully utilised by Earle in the 


preparation of his more recent Duplex Series of 
Ability Tests. The latter appear to be based on 
a somewhat different classification of abilities, 
and the results are intended to be used in a differ- 
ent way, but it is clear that the Duplex tests 
represent a very considerable advance on the 
Fife tests. It would be of great interest to have 
the results of a factor analysis of a comprehensive 
battery containing both the Fife and the Duplex 
tests. 


For a review by William G. Mollenkopf of 
Test 3, see 380; for a review by James Maxwell, 
see 3:8. 
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*General Aptitude Test Battery. Ages 16 and 
over ; 1046-47; a test battery developed in conjunction 
with the occupational counseling services of the United 
States Employment Service and released for general 
use in 1948; titles on test booklets are GATB, Book I 
and GATB, Book II; 10 scores: intelligence, verbal 
aptitude, numerical aptitude, spatial aptitude, form per- 
ception, clerical perception, eye-hand coordination, mo- 
tor speed, finger dexterity, manual dexterity; 15 tests: 
11 tests in 2 booklets and 4 performance tests; 1 form, 
*46; manual in 2 booklets (’47) ; tests available to non- 
profit institutions for counseling purposes; subtests 
available to employers as separates in specific aptitude 
batteries for selection and placement of their own em- 
ployees; testing services free of charge when program 
is conducted through the facilities of the U.S.E.S.; in- 
stitutions and employers using their own facilities must 
purchase tests and employ testing supervisors trained 
by U.S.E.S.; details may be secured from local and 
state offices, through which all orders must be cleared ; 
(135) minutes; United States Employment Service, 
Bureau of Employment Security, U.S. Department of 
Labor. * lj 
a) BOOK I. $15 per 100, postpaid; U.S. Government 
Printing Office. А 
b) воок п. $12.50 per 100, postpaid ; U.S. Government 
Printing Office. 1 3 
c) PEGBOARD. $18.50 per set of testing materials, post- 
age extra; distributed by Specialty Case Manufactur- 
ing Со. and Warwick Products Co. } 
d) FINGER DEXTERITY BOARD. $8.65 per set of testing 
materials, postage extra ; distributed by Specialty Case 
Manufacturing Co. and Warwick Products Co. 


1. War MANPOWER COMMISSION, DIVISION OF OCCUPATIONAL 
AxaLYsis, STAFF. “Factor Analysis of Occupational 
T ; 


.D K, BEATRICE J. 
Test Battery." J Appl Psychol g ал25 Ag a i + 2211930) 
. Dvorak, Beatrice J. "The New U.S.E.S. eneral pti- 
tide Test Battery.” Occupations 26:42-4 О '47. (PA 22: 
js RR, BERNARD. “The General Aptitude Test Battery.” 
xs oe AE DE Diviston оғ OCCUPATIONAL 


Ч “Aptitude Bat- 

AL ND INDUSTRIAL Services DIVISION. ‘Aptitude 

= So Oria Tee Вт бет Иня р College 
0) h 

prp абу Utah. Master's thesis, Uni- 


Warsow, С. Е., А 
Counseling for Rural Youth." 


cn i the General 
‚ Снлвілѕ MANLEY. An Evaluation of the 
po Тен Battery as an. Instrument for Predicting Success 
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| (Salt 
Lake City, Utah), 1949. и 

13. Raven, Rav B. The Predictive Efficiency of the Gen- 
eral Aptitude Test Battery in the Colle, е of Medicine. Master's 
thesis, University of Utah (Salt Lake City, Utah), 1949. 

14. ROBINSON, LesLie Grorce М. An Appraisal of the 
United States Employment Service General Aptitude Test Bat- 
tery B-1001 for the Grade Ten Population of Minnesota, Doc- 
tor's thesis, University of Minnesota (Minneapolis, Minn.), 
І . 

УЕ: Szirz, Marcarer J. A Follow-Up Study of the Use of 
the General Aptitude Test Battery of the United States Employ- 
ment Service in the Placement of Ph School Seniors. Master's 
thesis, University of Delaware (Newark, Del), 1949. (РА 
26:1814, title only) Se ! А 

16. Super, DoNALD E, Appraising Vocational Fitness by 
Means of Psychological Tests, pp. 358-68. New York: Harper 
& Brothers, 1949. Pp. xxiii 727. * (PA 24:2130) bW 

17. THURMAN, C. G. A Statistical Analysis of the Predictive 
Efficiency of the General Aptitude Test Battery With Biologi- 
cal and Physical Science Students. Master’s thesis, University 
of Utah (Salt Lake City, Utah), 1949. h ҳ 

18. Wzsrwoop, DALE. Predicting Academic Achievement of 
Senior Engineers With the General Aptitude Test Battery. 
Master's thesis, University of Utah (Salt Lake City, Utah), 
1949. 

i Woopuzap, MARJORIE J. The Predictive Efficiency of the 
General Aptitude Test Battery in Social Sciences. Master's 
thesis, University of Utah (Salt Lake City, Utah), 1949. a 

20. Birp, Rozert С. The United States Employment Service 
General Aptitude Test Battery as a Predictor of Academic Suc- 
cess Ghee у. Master's thesis, Pennsylvania State College 
(State College, Pa.), 1950. 1 j 

21. BiscHor, Leprorp Junius. Relationships of General Ap- 
titude Test Battery Scores With Scores on the ACE Psycho- 
logical Examination ee College Freshmen. Doctor's thesis, In- 
diana University (Bloomington, Ind.), 1950. (Thesis Abstract 
Series, 1951.) (PA 25:7103, title only) А 

22. GORDAN, OAKLEY J. A Factor Analysis of Aptitude 
Achievement, and Studiousness Scores of 233 University Col- 
iie iride Master's thesis, University of Utah (Salt Lake 

ity, Utah), 1950. 

23. GRANT, ERU Vance, Jr. The Comparative Value of 
the General Aptitude Test Battery and the American Council 
on Education Psychological Examination in Predicting the 

uality Point Averages of a Group of Florida State University 
жле. Master's thesis, University of Florida (Gainesville, 

а.), 1950. 

24. RALPH, Ray B., AND Тлүгов, Carvin W, “А Compara- 
tive Evaluation of the Professional Aptitude Test and the Gen- 
eral Aptitude Test Battery." J Assn Am Med Col 25:33-40 Ja 


50. 
25. Rerran, Henry МАСА”РІЕ, Predictive Value of the Gen- 
eral Aptitude Test Battery for Student Success in Engineering. 
Doctor’s thesis, University of North Dakota (Grand Forks, 
-)) 1950, Review: Hermann Е. Buegel. Sch Ed Rec Univ 
N Dak 36:248-5т My 'sr. * 

26. SMITH, WiLLIAM В. The Construction of a Clerical Apti- 
tude Test Battery for the Mountain States Tele hone and 
Telegraph с ок, Master's thesis, University of Utah (Salt 
Lake City, Utah), 1950. д 

27. THoMPsoN, ]онх W., Jr. An Evaluation of the General 
Aptitude Test Battery as an Aid in Selecting Architectural 
ОЛИМ Master's thesis, University of Florida (Gainesville, 

а.), 1950. 

28. TIZARD, J.; IN COLLABORATION WITH N. O’Connor AND 
J, M. Свлуғовр. “The Abilities of Adolescent and Adult High- 
yp Male нет J Mental Sci 96:888-907 О "ko, * 

25:5421 

29. COLMEN, les G.; FIEDLER, GorrHELF O.; AND BLACK- 
BURN, James R. “Identification of Administrative Talent 
DES à Government Department." Abstract. 4m Psychol 

1393-4 51. 

30. Ствѕох, ANNE Marte. A Study of the Relationship of 
the General Aptitude Test Battery and. Academic Achievement 
in the School of Business, Indiana University. Doctor's thesis, 
Indiana, University (Bloomington, Ind.), 1951. (Thesis Ab- 
stract Series....1951, 1952, рр. 43-9.) 

. 31. O’Connor, N., AND Tizarp, J. “Predicting the Occupa- 

tional Adequacy of Certified Mentai Defectives.” Occupational 

Psychol 25:205-1ї Jl ’51. * (PA 26:3487) 
-1 Dvorak, BEATRICE J.; AND MEICH 

. "Role of Testing in Defense Mobilization.” Empl 


“General Aptitude 
Test Battery: Patterns for College Areas.” Occupations 29: 


Mitton L. BLUM, Associate Professor of Psy- 
chology, The City College, New York, New 
York. 

According to the United States Employment 
Service, the General Aptitude Test Battery 
(hereafter referred to as GATB) “is designed to 
measure several aptitudes which have been found 
important to success in many occupations," The 
battery is used primarily as an aid in the more 
effective placement of applicants who have in- 
sufficient job experiences. Various state agencies 
in cooperation with local school authorities have 
been extending the application of the battery to 
the counseling of high school youth. 

The battery consists of 15 tests, 11 pencil and 
paper and 4 apparatus tests. It requires approxi- 
mately 244 to 2% hours for completion. The ac- 
tual tests are named A, Tool Matching ; B, Name 
Comparison ; C, H Markings ; D, Computation; 
F, Two-Dimensional Space; G, Speed; H, 
Three-Dimensional Space; I, Arithmetic Rea- 
son; J, Vocabulary ; K, Mark Making; L, Form 
Matching; M, Place; N, Turn; O, Assemble; 
and P, Disassemble. m 

Most of the tests in the battery are not original. 
They resemble, with modifications, parts of such 
familiar tests as the Minnesota Clerical Test, 
the MacQuarrie Test for Mechanical Ability, 
the Revised Minnesota Paper Formboard Test, 
O'Rourke's Survey Tests of Vocabulary, and 
assorted items as usually found in group intelli- 
gence tests such as arithmetic computation and 
reasoning, etc. 

It does appear desirable to encourage the use 
of test batteries in the form of a single test. This 
procedure has the advantage, at least, of prevent- 
ing examiners from selecting inadequate single 
tests and also tends to prevent them from weight- 
ing intelligence too heavily in vocational consid- 
erations. A similar approach is found in the Dif- 
ferential Aptitude Tests. |, 

The reviewer is impressed with the consider- 
able amount of research that must have accom 
panied the development of the GATB. He 55 
annoyed with the lack of availability of the ev 
dence. To the best of his knowledge a complete 
story has never been told. The published mate 
rial is spotty, sporadic, overenthusiastic, aps 
decidedly incomplete. It may well be that a ad 
siderable portion of the research has not hen 
reduced to reporting according to expected 5e 
entific standards. The GATB suffers as a rest 
and compares very unfavorably with the D d 
(Differential Aptitude Tests). The manua 
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the Differential Aptitude Tests is a masterpiece 
in comparison with the floundering and incom- 
plete GATB material. 

If this review encourages the USES to recog- 
nize that it has the same responsibilities as pri- 
vate test publishers, then it will not be in vain. 
That USES limits its tests to its own group is 
no excuse. A private counseling outfit, regard- 
less of, its reputation, would justifiably be at- 
tacked if it administered its own battery and of- 
fered little evidence of its scientific acceptability. 

Another annoyance in reviewing the GATB 
is the matter of finding changes without any ref- 
erence to why such changes occurred. For ex- 
ample, test booklet I contains a test, Test E, that 
is no longer used. The test is not mentioned in 
the guide or, for that matter, anywhere else. 
Obviously, there must have been some reason 
for its being dropped, but the reviewer can only 
guess at the reason since no explanation is of- 
fered. The test is the usual letter sequence test 
sometimes found among items in intelligence 
tests, Whether the directions were inadequate 
or the test too difficult, whether the test was not 
related to any of the factors in the GATB or 
found not to be important to success in any occu- 
pation is simply never mentioned. 

р Another irritation is reference in an early pub- 

lished study (т) to a factor “L” but to which 
no further reference is ever made. Apparently 
this factor was dropped for good and sufficient 
reasons, but no indication is ever offered as to 
why the “L” dropped out. 

The GATB emerged after a factor analysis 
Study of 59 different tests administered to 9 
ш groups in batteries ranging from 15-29 
ests. These groups varied; the smallest con- 
sisted of 98 persons and the largest comprised 
1079. In all, 2,156 persons in 12 different geo- 
eraphie regions (cities) took the tests. They 
Ni all applicants or trainees in different train- 
ee Hid One may immediately criticize the 
тыт. not only because of the relatively homo- 
EN s nature of the group (students in certain 

ng courses) but also because no females 
were included. 
фсе factor analysis methods were ap- 
rad ле test results of the above groups, and 
(Bid emerged: They are G, Intelligence 
Ve ^s in Tests H, J and parts of I); V; 
parts of T a J); N, Numerical (Tests D and 
Pius ); S, Spatial (Tests F, Н); P, Form 
tion С. (Tests L, А); Q, Clerical Petcep- 

(Test В); A, Aiming—Eye Hand Coordi- 
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nation ( Tests C, K) ; T, Motor Speed (Tests K, 
С); Е, Finger Dexterity (Tests P, О); and M, 
Manual Dexterity (Tests M, N). 

Although the standardization process’ of a 
test battery is more difficult than that for a single 
test, the same rigid standards must be met. Ex- 
ami the GATB for evidences of validity, re- 
liability, and norms was not too rewarding. An- 
swers to the most simple basic questions are not 
readily obtained. 

The “Guide to the Use of GATB" consists of 
two booklets and totals 155 pages. It is really a 
manual describing the clerical tasks involved in 
administration, scoring, and interpretation of re- 
sults. It presents no data on reliability and va- 
lidity and does not resemble the usual test man- 
ual. It categorically states : “This battery is com- 
posed of fifteen tests selected because they are 
good measures of ten aptitudes that have been 
found to be important to successful job per- 
formance on a large number of jobs." It presents 
tables to convert raw scores to aptitude scores 
and introduces the concept of Occupational Ap- 
titude Patterns. These OAP's are defined as 
“The combination or pattern of aptitudes that 
is required to perform satisfactorily the major 
tasks of the occupation identified with each pat- 
tern." For both aptitude scores and patterns, 
there is again no evidence either in the guide or 
elsewhere. 

The guide appears to be written for glorified 
clerks who are expected to follow directions and 
not ask questions. Nowhere in either of the parts 
of the guide is there any reference to validity 
and reliability or even a description of the size 
of the sample of the population which serves as 
a basis for the conversion tables. 

The guide or manual is merely a collection of 
directions, directives, and claims without evi- 
dence. This would be excusable if a supplement 
were available concerning the evidence of stand- 
ardization, but such a compendium is not known 
to the reviewer and his requests for one were not 
successful. Furthermore, very little concerning 
this test battery has been published in profes- 
sional publications. Apparently quite a bit has 
been published in mimeographed form by gov- 
ernmental agencies. Such reports as issued by 
the Bureau of Employment Security of the 
United States Department of Labor are often 
labeled “restricted” and this is puzzling to the 
reviewer. It does seem far fetched to assume 
that there is a relationship between such research 
studies and national defense. Insofar as a sam- 
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ple of such studies was furnished to the reviewer, 
he is assuming that no national secrets are being 
divulged by his reference to them. On the other 
hand, an hypothesis may be offered that these 
studies are purely experimental, and the use of 
these tests must be restricted until such time as 
much needed evidence is available. If this is 
true, then it does not fit at all with the very wide 
usage of the test within the agency. and state 
offices, the enthusiastic claims, and the various 
directives issued. 

The few articles published in the literature are 
brief and leave the reader with many gaps of 
knowledge. An early publication (r) describes 
the population from which data were obtained 
for the factor analysis as well as the determina- 
tion of the factors. The 2,156 males in the sample 
were applicants or trainees enrolled in Voca- 
tional Education National Defense Training 
Courses. No information concerning the known 
occupational success and the extent of the range 
of occupations of these subjects is reported. A 
later article states that the Battery is standard- 
ized on samples of workers employed in various 
occupations. It is to be noted that this is quite 
different from the apparent standardization on 
the 2,156 trainees. Tt describes norms developed 
for 20 fields of work representing approximately 
2,000 occupations, but not even a reference to 
the size of the population in any of the fields of 
work is presented. The norms are expressed as 
Occupational Aptitude Patterns and consist of 
"minimum aptitude scores required for occupa- 
tions grouped according to the Part IV classifi- 
cation code structure of the Dictionary of Occu- 
pational Titles. It is claimed that each occupa- 
tional aptitude pattern consists of minimum 
scores for only the most significant aptitudes 
required for the group of occupations covered 
by that pattern. Minimum scores, ranging from 
85 to 130, were established to eliminate approxi- 
mately the lowest third of the employed sample, 
but no evidence is presented that these cutoff 
scores are related to the lowest production group 
of workers. An example of a pattern is pattern 
4, GNSF, with critical scores of 100 on intelli- 
gence, numerical ability, and spatial ability and 
85 on finger dexterity. Occupations covered by 
this pattern are those on all-round metal ma- 
chining and all-round mechanical repairing. 
Such results in such round numbers are almost 
too wonderful to believe. 

More evidence allowing one to evaluate the 
establishment of any or all of the occupational ap- 


titude patterns should be made available, The 
reviewer has seen no such evidence. The state. 
ment, “The USES Batteries are comprised of 
critical scores on only those aptitudes which are 
significant for a particular field of work” is in 
need of substantiation, especially with reference 
to the word "significant." 

Test reliability is relatively easy to establish, 
Nevertheless the evidence of reliability of the 
GATB is paltry. A report based upon 156 cases 
establishes the test-retest reliability for each fac- 
tor. The highest reported is .93 for the numerical 
factor, and the lowest is .8r for the finger dex- 
terity factor. No description of the characteris- 
tics of the sample or the length of the time inter- 
val between test and retest is offered. 

Another formal publication is a study report- 
ing the use of the GATB with high school sen- 
iors (тт). The battery was given to 439 high 
school seniors in five different schools. Senior 
students in commercial courses were found to 
be highest in P (form perception) and second 
highest in Q (clerical perception). Senior stu- 
dents in industrial arts and technical courses 
have S (spatial) as their highest aptitude. Sen- 
iors in academic courses are highest in G (in- 
telligence) ; their V (verbal) is higher than the 
V of the other two groups. This study can be 
construed as offering indirect evidence of test 
battery validity. Another published study with 
indirect relevance to validity was the use of 
the GATB with 49 sophomores in a medical 
college (24). The verbal, spatial, and numerical 
factors were found to be correlated with grades. 
Incidentally, the intelligence factor in this study 
was correlated with grades to a higher degree 
than the others but was ruled out because tt 
overlapped to a great extent with the other fac- 
tors. Neither the statistical nor the logical reason 
for this is clear to the reviewer. It does, how- 
ever, raise the question of the uniqueness of the 
factors and their contributions to the various 
Occupational Aptitude Patterns established. 

At least 13 master’s theses and 3 doctors 
dissertations have been written on various 4S 
pects of the GATB. In view of the complexity 
of this test and much that is unknown, it is easy 
to predict that this test battery can become “a 
happy hunting ground of graduate degree see T 
ers for many years to come. According to à lis y 
ing of thesis titles, most are on the prede 
aspects and the evaluation of the САТВ. Ж 
reviewer has read only one such study. It yc 
written by Margaret Jane Seitz of the University 
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of Delaware (15). The subjects in the study 
were 353 high school seniors all of whom vol- 
untarily took the GATB. A follow-up approxi- 
mately one year later determined that 212 were 
working. Of this number, job descriptions were 
available on. 117. They were classified as fol- 
lowers if they were working in one of the fields 
indicated by the OAP, or nonfollowers if not 
working in the fields. Of this number, 87 were 
found to be followers and 30 were nonfollowers. 
The author of the thesis states, "The greater 
number of followers can in part be explained by 
the fact that the employment service counselors 
tried to place the graduates in follower jobs. In 
addition, each graduate was informed that his 
chances of success might be more likely in an 
occupational pattern shown by his GATB." The 
reviewer believes that this shows conclusively 
that if you encourage people to be “followers,” 
they will be. 

A significant contribution of the thesis indi- 
cates that the followers had higher average scores 
on the Hoppock Job Satisfaction Blank than the 
nonfollowers. This difference is reported as 
being statistically significant. The followers were 
also rated more highly by employers. In certain 
respects, this study can be regarded as evidence 
for the validity of the GATB as a counseling 
instrument. i 

Norms for the GATB are reported in a mime- 
ographed leaflet of four pages. Actually, the 
total population consists of only 519 регзопз (70 
males and 449 females) ; their mean age is 30.4, 
and their mean education is 11 years. The state- 
ment “the sample used is large and varied enough 
to serve as a general population” is typical of the 
overenthusiastic claims made in connection with 
the GATB. It is extremely doubtful whether 
this sample is large enough, and the fact that 
it is varied, while admittedly true, only compli- 
cates matters insofar as its projection to a gen- 
ега] population is concerned. The 519 subjects 
were drawn from 22 samples. The two largest 
ей of 56 “laborers, process (drug ргер.)” 
tom Berkeley, California and 47 grocery check- 
ers from Detroit, Michigan. The two smallest 
samples consisted of three tabulating machine 
Operators from Los Angeles and five electrician 
анса from Newport, Rhode Island. The 
mA aem empling size was 25. А statement 15 
шы The norms shown in Table II should be 

: аз the base population for interpretation of 
A neral Aptitude Test Battery data until a more 

Presentative population can be obtained." 


The criticism is obvious. This sample is to- 
tally inadequate for nationwide testing and such 
directives are certainly out of order. It is possible 
that field office interpretations based upon such 
inadequate norms are given too much weight. 
This reviewer has also seen seven technical re- 
ports all labeled “restricted.” These reports have 
the dual purpose of developing a specific aptitude 
test battery and securing additional data for 
standardizing the GATB. Apparently employers 
who use the USES facilities to help them solve 
their selection problems are given the oppor- 
tunity to have some of their employees take parts 
of the GATB. These studies report the relation 
between test results and certain job criteria and 
result in still another concept known as Spe- 
cific Aptitude Test Batteries. These are always 
parts of the GATB. All reports are rather posi- 
tive in their recommendations. They indicate 
that certain parts of GATB would have elimi- 
nated a high percentage of failures and included 
a high percentage of successes. Some of these 
technical reports can not be considered as sci- 
entific despite the statistical analysis conducted. 
“Feeling” apparently enters into the selection of 
tests. A brief quote from one of the reports is as 
follows: “Although D had a relatively high mean 
and significant correlation, it was not selected 
as it was felt that test I reflected numerical abil- 
ity to а higher degree and also measured intelli- 
gence.” There is no standardized procedure in 
the selection of tests for the variety of aptitude 
test batteries obtained. Sometimes it appears as 
if tests are selected on the basis of job descrip- 
tions. That is, the job analysis requirements are 
compared with what the test is supposedly meas- 
uring and, for example, if the clerical compo- 
nent is high, the clerical perception factor or 
name checking test is selected. 

Other evidence of a statistical nature is the 
correlation between the test results and the job 
criterion; or, sometimes, the comparison be- 
tween the mean of the test results on the tested 
sample and the mean of the test results on the 
general population. 'These technical reports are 
written by people with wide differences in train- 
ing and background in testing, and consequently 
some are rather poor. Characteristic of virtually 
all of these reports is an enthusiastic positivism 
not warranted by the evidence. 

In summary, the reviewer believes that the 
GATB is a test battery with some promise. It 
must be regarded as purely in the experimental 
stage, and during this period the enthusiasm of 
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those who work with the instrument should be 
tempered by the understanding that there is 
great need for much work to be done. Efforts 
should be made to publish in clear and certain 
terms the basis for the Occupational Aptitude 
Patterns, A better and more realistic set of norms 
based upon a truly general population should be 
made available. Time should be taken to present 
adequate evidence on the reliability of this in- 
strument. Considerable evidence is needed on 
the validation processes both with respect to 
occupational groups and with respect to the use 
of this battery as a counseling instrument. In 
all sincerity, the reviewer, adopting the enthusi- 
asm of those who are working with this instru- 
ment, has the “feeling” that this may very well 
be a good test battery. However, he is certain 
that he has not seen the evidence. 


Epwarp B. Greene, Educational Supervisor, 
Department of Supervisory Training, Chrysler 
Corporation, Detroit, Michigan. 

Eleven of the tests are paper and pencil tests 
with written and oral directions. Their range of 
difficulty makes them applicable to all adults 
who can read and understand English and han- 
dle paper and pencil situations. While no age or 
grade equivalents have been issued, the tests 
seem unsuitable for individuals with less than 
fifth grade accomplishment. Four of the tests 
require manipulation of pegs or small washers 
and rivets; language is a small factor in under- 
standing the test directions. No attempt was 
made to have these tests look like worksamples, 
but they were designed to indicate the aptitudes 
likely to be required in successful job perform- 
ance. Since all the tests are timed so that very 
few in a group are able to finish a test in the time 
allowed, speed is an important factor. 

The GATB is now administered without cost 
to applicants who might benefit from it at the 
larger state employment offices throughout the 
country. The test forms have been made avail- 
able at cost to private employers who have 
agreed to use only well trained examiners, to 
keep the test forms closely restricted, and to 
give the U.S. Employment Service the results 
of test programs. About 300 employers have 
already used the GATB under these conditions, 
A larger number of employers have requested 
the Employment Service to screen applicants by 
tests before referring them for employment. 

The writer has compared the General Apti- 
tude Test Battery with two other similarly pre- 
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pared tests, the Chicago Tests of Primary 
Mental Abilities and the Guilford-Zimmermay 
Aptitude Survey. The three have fairly com. 
parable sections on vocabulary, arithmetic 
computations, reasoning (represented by arith- 
metic reasoning in the GATB and the Guilford. 
Zimmerman), and two-dimensional Space rela- 
tions. The GATB and the Guilford-Zimmerman 
test both contain tests of form perception and 
three-dimensional space. Each battery has some 
unique parts: the PMA has word fluency and 
immediate memory tests, the Guilford-Zim- 
merman includes a measure of mechanical 
knowledge, and the GATB contains clerical 
perception, motor speed, eye-hand coordination, 
and dexterity tests. The overall testing time, in- 
cluding time for administration and practice, is 
about 2 hours for the PMA, 214 hours for the 
GA'TB, and 314 hours for the Guilford-Zimmer- 
man. The Guilford-Zimmerman is thus longer 
than the other two and not so dependent on 
speed. No careful studies comparing these bat- 
teries have yet appeared, but it is probable that 
the similar parts will give similar results. 

Only a few validation studies of the GATB 
have appeared. Ralph and Taylor (24) report 
correlations, based on 49 students, between total 
grade-point averages for five quarters of medi- 
cal school work and the scores on the GATB 
taken near the end of the sophomore year. 
Four aptitudes—verbal, numerical, spatial, and 
form perception—were selected because the 
medical students had means on these tests at 
least one standard deviation above those for the 
adult sample and because these tests showed the 
highest correlations, about .40, with grade-point 
averages. Using the first three of these four tests, 
a multiple correlation (Wherry-Doolittle) of 
55 with a standard error of .11 was found. The 
standard error of estimate was .4 grade points, 
and about 30 per cent of the variance in the сп- 
terion was accounted for by the three aptitude 
measures. These figures were somewhat higher 
than those from the Medical College Admission 
Test, given a year and a half earlier and used in 
part for the selection of students. The effect 0 
the time interval here may be important. 

A USES report gives validation figures ЮГ 
134 survey workers (clerical) of the Bureau 0 
Labor Statistics. The criteria were combine 
rank order ratings of various supervisors 0 
quality and quantity of work and overall Pi 
ficiency. Four aptitudes showed the highest 
tetrachoric correlations with the criteria: ™ 
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telligence, verbal, numerical, and clerical per- 
ception. A total score which had a tetrachoric 
correlation of .56 with the criteria was secured. 

Another USES report compares test scores 
with supervisor’s ratings on a 4-point scale of 
overall proficiency for 81 experienced and 27 
inexperienced insurance underwriters. A final 
battery of three tests—Name Comparison, 
Arithmetic Reasoning, and Vocabulary—yields 
a tetrachoric correlation of .57 with the criterion 
among experienced workers and .76 among the 
inexperienced. 

Another USES study reports a tetrachoric 
correlation of .45 with a standard error of .17 
between average piece-rate earnings for two 
weeks of gt mounters of filaments for electric 
lamps and a battery of four aptitudes, Form Per- 
ception, Aiming, Finger Dexterity, and Manual 
Dexterity. 

Several criticisms of the General Aptitude Test 
Battery can be made; all of them can be reme- 
died by the designers or by future studies. (a) 
The test forms are expensive: at present no an- 
swer sheets are used, and each complete test re- 
quires 72 pages. (b) There are too many tests. 
Similar results would probably be secured from 
about 9 good tests instead of the present I1 short 
paper and pencil tests and 4 performance tests. 
The administration of so many short speed tests 
is expensive and leads to errors. (c) Some im- 
portant skills, such as mechanical relations, 
reasoning, and fluency, as well as tests of tech- 
nical knowledge, are omitted. The test of lit- 
terary vocabulary should probably be made more 
general. (d) The scoring and conversion of 
scores to standard scores is unreasonably time 
consuming. It could easily be done in profile 
tables, (e) The designation of both tests and 
aptitudes by capital letters is confusing ; in fact, 
the use of both terms seems unnecessary. (f) 
The selection of test batteries for particular oc- 
cupations or Occupational Aptitude Profiles 
Seems to have been rather arbitrarily made by in- 
spection of job analyses and rough validity cor- 
relations. It could be improved by more analyti- 
kd po analyses and more adequate statistical 
ШО. ation. (g) The distribution of 2,000 job 

es under 20 OAP's leaves one wondering 
Noo: how the classification was made. The 
* ection of workers to be used in establishing 
í ec ia norms is a persistent problem, and 
à € 1s no evidence at hand to show how ade- 
quately this has been done. Specifically, evi- 

ence should be secured and published to show 
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what proportion of the workers used in estab- 
lishing norms have skills or knowledge consider- 
ably in excess of that needed for satisfactory 
performance of their work. For instance, the 
writer has found a considerable proportion of 
sales clerks who have vocabulary, grammar, and 
number skills far in excess of that needed on the 
job. In almost any occupational group there are 
persons who have skills not needed or used on 
the job. To include these persons in groups used 
to establish norms will raise the critical or cut- 
off score above a reasonable level. This seems 
to the writer to have occurred in quite a number 
of OAP's. (h) The use of the 33rd centile of a 
group as the cutoff score is questionable. Evi- 
dence should be found to show what propor- 
tion of the tested group was doing unsatisfactory 
work because of inability. In many instances 
workers are rated as unsatisfactory because of 
emotional or adjustment difficulties, not because 
of mental or physical ability. To set a cutoff 
score at the 33rd centile may result in eliminat- 
ing the steadiest and most useful workers, par- 
ticularly among routine jobs. (7) The use of 
the battery for counseling applicants or high 
school students is not yet supported by good 
norms or validation studies. Much more work is 
needed to show how well preemployment tests 
will predict success on a job a year or more 
later. (j) There should be a very determined 
effort to avoid the use of overall ratings of per- 
formance for criteria when it is obvious that 
quite unrelated skills and attitudes are com- 
bined in unknown quantities in such ratings. 
More analytical ratings can and must be secured 
if such an analytical battery is ever to be vali- 
dated. The use of general or overall criteria will 
lead to excluding very important aspects in rat- 
ing and in corresponding aptitude tests. (k) The 
use of the battery for selection of experienced 
workers has justification, particularly when the 
battery is pretested on a group of carefully se- 
lected, experienced workers. However, the use 
of the battery on inexperienced applicants is cer- 
tainly justified at this time only as an experi- 
ment. The harm that can be done to applicants 
and employers by use of such tests for selection 
of inexperienced workers is probably very great, 
since it may result in exclusion of a variety of 
workers who are needed for routine work as well 
as for elasticity and growth of an organization. 
There are several points on the credit side of 
the ledger: (a) The GATB is essentially sound 
in its rationale and planning, except as noted 
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above. (b) It is probably as valid as most other 
tests now used for counseling and selection, few 
of which are well validated. (c) It has the sup- 
port of a large Federal agency, and it should 
therefore have a continuous growth toward more 
accuracy and economy and better prediction. 
There are probably in existence today a con- 
siderable number of validation studies of the 
GATB and additional norms for high school 
and college groups which are not yet available 
in print. 


Howanp К. Тлүгов, Professor of Psychology 
and Head of the Department, University of Ore- 
gon, Eugene, Oregon. 

То have a battery of 15 tests, various combi- 
nations of which will measure то basic aptitudes 
for 20 fields of work including approximately 
2,000 of the occupations classified in the Dic- 
tionary of Occupational Titles (DOT), is more 
than most psychologists concerned with place- 
ment or vocational guidance would have dared 
to hope for. This does not imply, of course, that 
organizations using tests to hire employees will 
no longer need to validate these tests against 
their own specific criteria of success on the job. 
But in the general situation of searching for the 
best available person to fill a job as typically de- 
fined, or of finding the type of work for which 
an applicant is best qualified, the General Apti- 
tude Test Battery (GATB) gives promise of 
exceptional versatility and dependability. 

Occupational success in such diverse job fami- 
lies as creative writing ; structural, mechanical, 
and electrical engineering; metal machining; 
mechanical repairing; routine recording work; 
plumbing; and carpentry; etc., can be forecast 
to advantage from a pattern of two to four apti- 
tude scores. Predictions of grades in medical 
school based on the verbal, numerical, and spa- 
tial aptitudes have been shown to be more ac- 
curate than those based on the Moss Scholastic 
Aptitude Test or the new Medical College Ad- 
mission. Test, formerly called the Professional 
Aptitude Test, both of which were specifically 
designed for such use. The test battery and the 
aptitude scores derived from it are outcomes of 
extensive factor analysis studies. In all, 59 dif- 
ferent tests were tried out in 9 combinations with 
groups of male subjects ranging in size from 98 
to 1,079, in age from 17 to 39 years, and with an 
average of 11 years of schooling. Most of the 
subjects were from vocational education train- 
ing courses in 12 cities located in various regions 
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of the United States. Approximately s per cent 
were Negroes. The minimum number of factors 
which would explain the obtained correlations 
between the tests in each group were determined 
by the centroid (Thurstone) method. An or. 
thogonal structure was imposed on each group; 
the resulting factors proved to be consistent be- 
tween groups as well as with the results of pre- 
vious investigations. The final selection of 10 
aptitudes was made in terms of occupational va- 
lidity. 

By means of conversion tables in the manual, 
raw scores are converted to standard scores so 
scaled that the general population average is 
100 and the standard deviation 20. Minimum 
scores for occupations in each of the 20 job fami- 
lies have been set to eliminate the lowest third 
of the employed sample. 

All 15 tests can be administered in little over 
2 hours. It is, obviously, assumed that the ad- 
ministration and use of the tests will be restricted 
to technically trained personnel. Given reason- 
able familiarity with the occupational classifica- 
tions of the DOT they will find the manuals 
comprehensive, well organized and clearly writ- 
ten. The standardization of Occupational Apti- 
tude Patterns in a form corresponding closely to 
the Entry Occupational Classifications of Part 
IV of the Dictionary of Occupational Titles has 
many advantages for practical use. : 

The test-retest reliability of the separate apti- 
tudes has been reported by the USES with no 
identification of their source except that the 
coefficients were computed from local office ap- 
plicant populations (N = 156). These соећ- 
cients are G, Intelligence, .89; V, Verbal, .90; 
N, Numerical, .93; S, Spatial, .87; P, Form 
Perception, .82; Q, Clerical Perception, .91; A, 
Aiming, .88; T, Motor Speed, .01; F, Finger 
Dexterity, .81; and M, Manual Dexterity, 86. 
Only five of the aptitudes are measured with o 
precision (r = .90) usually considered essentia 
for appraisals of individual capacity, two 2 
definitely lower, and the other three are perhaps 
borderline. Moreover, since these are test-retest 
correlations, they are to some extent overesti- 
mates of actual reliability; and, if there are ap- 
preciable correlations between aptitudes, the 
accuracy with which an individual's relative 
strengths and weaknesses can be diagnosed V! 
be still further restricted. " , 

Unfortunately, no table of the correlations ү 
tween factors has been published. It might д 
presumed, of course, that the factor analysis 0 
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which they were based would insure their inde- 
pendence; but several have such heavy factor 
loadings from the same tests that considerable 
commonality seems certain. In a study of 216 
student nurses, the intercorrelations of factors 
G, V, N, and О are given; they are all positive 
and range from .29 to .69. Hence, the assump- 
tion that the battery provides measurements of 
unique traits is questionable. Nevertheless, the 
reviewer believes that these deficiencies in the 
reliability of diagnostic scores on the GATB are 
less serious than those found in any comparable 
group of tests now available. 

Much more significant and reassuring is the 
evidence of the GATB’s validity in comparison 
with other tests similarly constructed, e.g., the 
Differential Aptitude Tests or the Factored Ap- 
titude Series. Many detailed technical reports of 
validation studies by state and national employ- 
ment services, universities, etc., have already ap- 
peared, and a continuous program of research, 
revision, and follow-up investigations is evi- 
dently under way. Undoubtedly there are basic 
occupational aptitudes which the present battery 
does not measure at all. It also makes no attempt 
to sample occupational interests or other per- 
sonality characteristics. These limitations and 
their importance are fully recognized. It is, there- 
fore, all the more remarkable that a single bat- 
tery of this sort will do so much, so well. 
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*The Guilford-Zimmerman Aptitude Survey. 
Grades 9-16 and adults; 1947-50; IBM for Parts 1-П, 
V-VII; 7 parts; Form А (%47); 256 per manual 
(47) ; 2¢ per tentative high school norm sheet (48); 
2¢ per college profile chart ('50) ; $1.50 per specimen 
set, postpaid; separate answer sheets may be used for 
Parts I-II, V-VII; 2¢ per IBM answer sheet for апу 
one part; $1 per hand scoring key for any one part (no 
key for Part VI) ; 50¢ per stencil for machine scoring 
of answer sheets for any one of Parts I-II, V-VII; 
postage extra; 143(188) minutes; J. P. Guilford and 
Wayne S. Zimmerman; Sheridan Supply Co. * 
а) PART Т, VERBAL COMPREHENSION. IBM; $2 per 25; 
25(30) minutes. 

) PART II, GENERAL REASONING. IBM; $2 per 25; 35 
(40) minutes. 

C) PART IIT, NUMERICAL OPERATIONS. $2 per 25; 8(13) 
minutes, 

ie IV, PERCEPTUAL SPEED. $2 per 25; 5(10) min- 
с) PART V, SPATIAL ORIENTATION. IBM; $3.50 per 255 
neo minutes, 

) PART VI, SPATIAL VISUALIZATION. IBM; $5 per 25; 
30(40) minutes. 
б , PART уп, MECHANICAL KNOWLEDGE. IBM ; $3.50 per 
5; 30(35) minutes. 
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Anne АмАѕтАЅІ, Professor of Psychology, 
Fordham University, New York, New York. 
This test battery is an outgrowth of the re- 
search conducted by Guilford and his associates 
during World War II on the selection of aircraft 
personnel. The authors state that they plan ulti- 
mately to develop tests of approximately 20 
"primary abilities" The seven parts already 
published, each in a separate booklet, are de- 
scribed as covering the leading factors in three 
traditional areas: abstract intelligence, clerical 
aptitude, and mechanical aptitude. The tests can 
be used according to these traditional groupings 
or recombined in different ways for specific pur- 
poses. For personnel classification or vocational 
guidance, the authors recommend the use of the 
entire battery which requires approximately 


three hours. The tests are virtually self-adminis- 


tering, written instructions and sample items 
All items are of 


being provided for each test. t 
the multiple choice type, most having five al- 
ternatives. The battery is designed for use with 
older children and adults,” the instructions being 
said to require sixth grade reading ability. 
Parts I and П, covering the area of abstract 
intelligence, include the tests Verbal Compre- 
hension and General Reasoning. The former is 


Company, 
O., Septem- 
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а vocabulary test, in which the level of difficulty 
and the part of speech of all alternative responses 
are approximately equivalent to those of the 
word defined. General Reasoning is an arithme- 
tic reasoning test in which the problems are 
stated in rather intricate verbal terms. In order 
to reduce the contribution of numerical computa- 
tion in this test, items which correlated signifi- 
cantly with the total score on the Numerical 
Operations in the preliminary testing were elimi- 
nated, 

Parts III and IV, designed to measure two 
important components of clerical aptitude, con- 
sist of the tests Numerical Operations and Per- 
ceptual Speed. The first involves addition, sub- 
traction, and multiplication. A novel feature of 
this test is that six alternative responses are 
provided for each pair of items, thus saving 
space and probably reducing reading time on 
the part of the examinee. Part IV requires the 
matching of identical drawings of common ob- 
jects. 

‚ Ког the measurement of abilities involved in 
mechanical pursuits, three tests are provided: 
Part V, Spatial Orientation; Part VI, Spatial 
Visualization ; and Part VII, Mechanical Knowl- 
edge. The distinction between the two spatial 
factors measured by Parts V and VI was first 
made by Guilford on the basis of his Air Force 
research. The former refers primarily to an ap- 
preciation of the spatial relations of objects with 
reference to the individual’s own body, an ability 
which proved important in learning to pilot a 
plane and which may be involved in many jobs 
requiring machine operation. The second spa- 
tial factor involves the ability to imagine move- 
ment or transformations of visual objects. All 
the items in Part V show two positions of the 
prow of a motor boat in which the examinee is 
supposed to be riding. The task is to determine 
the change in the direction in which the boat 
is heading from the first to the second picture. 
In Part VI, each item contains a drawing of a 
clock in an initial position, followed by verbal 
instructions to turn the clock in a prescribed 
direction over a given number of degrees. The 
resulting position of the clock is then to be se- 
lected from the five alternatives provided. It 
would seem that in both Parts V and VI the uni- 
formity of item content might make the test 
monotonous for many subjects and might create 
a problem of sustaining interest. One wonders 
to what extent personality factors may be re- 
flected in achievement on these two tests. Part 
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VII, like most available mechanical comprehen- 
sion tests, measures knowledge of common tools 
or mechanical objects and their functions, The 
first 20 items of this test employ a combination 
of pictorial and verbal presentation, the remain- 
ing 35 are entirely verbal. The latter are some- 
what reminiscent of the oral trade tests devel- 
oped by the United States Employment Service 
and by the armed services. The authors point 
out that the mechanical knowledge test can be 
meaningfully applied only to adolescent or adult 
males in the United States culture, and should 
be interpreted in the light of the individual's 
mechanical experience. 

Parts ПІ, IV, and V are speed tests. The rest 
are power tests, although speed is appreciably 
involved in Part VI. Norms are now provided 
on a separate profile chart, which includes per- 
centile, T-score, and C-score equivalents of raw 
scores on each part for college men and women. 
There is no appended explanation of these 
norms, nor are the sizes and other characteris- 
tics of the normative samples indicated. Pre- 
liminary estimates of reliability for each part 
range from .88 to .92 on groups of approximately 
100 to 200 college men. These coefficients were 
obtained by the odd-even technique for the power 
tests and by separate timing of equivalent halves 
for the speed tests (4). 

The mimeographed manual, which is almost 
identical with an article appearing in the Jour- 
nal of Applied Psychology (4), includes rough 
estimates of the factorial validity of each part, 
based upon the “known factorial composition of 
very similar tests.” In a subsequently published 
Psychometric Note (8), reference is made to 
three factor analyses of these tests, on college 
men, high school boys, and high school girls, 1n 
which the obtained principal factor loadings of 
each part were fairly close to the estimated val 
ues. Part VII was not included in these analyses, 
and the other parts were curtailed in length 
when given to the college group. As is generally 
found in factorially designed tests, the factoria 
validities indicate that the tests still fall far 
short of factorial purity, the loadings ranging 
from .52 to .89. There is also considerable over” 
lap, the two spatial tests, for example, correlat- 
ing almost as highly with each other as they d? 
with their respective factors, and the gener? 
reasoning test having high verbal loadings ^ 
some groups. Finally, it should be remembere 
that factorial validity is not a substitute for € 
pirical validity. The factorial composition 0 
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most practical criteria is unknown. Hence only 
empirical validity can adequately indicate the 
accuracy of prediction possible with a given test. 
Nor is there as yet universal agreement as to the 
“factors” or categories to be measured. This is 
especially true in the areas of spatial abilities and 
reasoning. 

Some initial effort toward empirical valida- 
tion is indicated in another separately published 
Psychometric Note (9), which reports the cor- 
relations between grades in college courses and 
scores on Parts I to VI of the battery. As might 
be expected from earlier studies with similar 
tests, Verbal Comprehension tended to yield the 
highest correlations with grades, even in sci- 
ence and mathematics courses. The highest cor- 
relation in the table, however, is only .54, be- 
tween Verbal Comprehension and English 
composition. On the whole, these correlations 
are no higher than, if as high as, those generally 
found between grades and the more crudely de- 
signed "intelligence tests." In a study on 84 
supervisors in an industrial plant, the highest 
correlation with an index of job success was 
again found for Verbal Comprehension (:52); 
the only other significant correlation being ob- 
tained in the case of General Reasoning (.29). 

All in all, this battery, although embodying 
many refinements and promising innovations in 
test construction, appears to be in a preliminary 
or experimental stage. There is, especially, need 
for more normative data and for further empiri- 
cal validation. Some revision of content may be 
indicated as more information on different popu- 
lations is accumulated. There would seem to be 
à real question regarding the advisability of re- 
leasing such a battery in its present stage of 
development without some qualifying designa- 
tion such as "experimental form." It is quite 
likely that many test users may rely unduly on 
factorial labels, assuming, for example, that а 
student who is poor in numerical operations and 
in the two spatial tests should be discouraged 
from taking courses in geometry, trigonometry, 
or advanced mathematics. The lack of signifi- 
Cant correlation between mathematics grades 
and scores in such tests, as found both with the 
Present battery and with other similar tests, 
certainly offers no support for such a conclusion. 
But itis very easy for the unwary tester to slip 
into the habit of matching “factors” with school 
Courses, occupations, or other practical criteria. 
The inclusion in the present manual of a list of 

Occupations in Which the Survey Factors Are 
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Probably Important" certainly does nothing to 
discourage this tendency. In a similar vein, the 
test authors, in a section entitled “Enlightened 
Selection of Batteries," state: “Ап inspectional 
job analysis made with the primary-ability cate- 
gories in mind is the most effective preliminary 
approach to this problem * The better-known 
factors are so dependable that one would rarely 
go astray in the choice of selective tests following 
the approach here recommended." The available 
data fail to justify such optimism. 


Навор Brcurorpr, Associate Professor of 
Psychology, State University of Iowa, Iowa 
City, Iowa. 

A supposedly new approach to aptitude test- 
ing is implied by the authors in their manual of 
instructions. This approach may be summarized 
as the appraisal of the several sources of variance 
which the authors (and nearly all other investi- 
gators in the field today) are convinced enter 
into the definitions of success in various activi- 
ties of our society. It is the test authors’ hope 
or conviction that the several measures of suc- 
cess eventually can be shown to be expressible 
in terms of a set of 20 performance measures or 
“aptitude tests.” i 

The seven (out of 20 planned) published 
parts of this survey appear to be the praduct of 
а very competent test construction staff. The 
printing format is pleasing, the paper is heavy, 
and the range of difficulty on the “power” parts 
of the survey seems to be adequate. The complex 
directions utilized for Spatial Orientation (Part 
V) and Spatial Visualization (Part VI) may 
present a serious problem to students who lack 
considerable verbal comprehension ability. (The 
instructions are supposedly written for 6th grade 
educational level.) Marked sex differences in 
favor of the males exist on four parts of the sur- 
vey. The distributions of test scores presented 
on the profile chart indicate a desirable amount 
of discrimination among the members of the 
normative group ; but the absence of a definition 
of the standardization samples, at least in terms 
of geographical distribution, age ranges, educa- 
tional levels, and number of cases, seems almost 
inexcusable in a test offered to the public for 
sale, The retention of the traditional scoring 
formulas (R — kW) for two of the so-called 
power tests (Parts I, II, and VII) is inconsist- 
ent with the authors’ sophistication in test 
theory. The temporary status of these scoring 
formulas, however, is indicated by the authors 
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warning to the user to be alert for possible fu- 
ture changes. 

The authors’ report on the reliability of five 
of the seven parts consists of a brief paragraph 
giving two "internal-consistency" coefficients 
(formulas not stated) of .89 and .96 and three 
“split-half, separately timed" coefficients of .88, 
.92, and .92. The failure to provide more of the 
test score statistics precludes any more accurate 
evaluation of the consistency of students' be- 
havior to these parts. Even rough estimates of 
reliability coefficients are lacking for Parts VI 
and VII; surely in the current test market, pub- 
lishers assume some little technical competence 
on the part of the prospective buyer and provide 
more data than that given in the present manual 
and Psychometric Notes. 

The problem of validity is presented to the 
prospective test user in two apparently unrelated 
topics, (a) applications of the survey and (b) 
factorial and practical validity. The reviewer can- 
not separate these two topics so conveniently ; 
the greater number of applications would seem 
to be based on the "practical validities" of the 
tests, while the research worker or test techni- 
cian would be concerned with the factorial data. 
The manual itself provides no “practical valid- 
ity" data to support the "convictions" and “Ъе- 
liefs" of the test authors as to the occupations 
"in which the survey factors are probably im- 
portant." The occupational listings reflect many 
similarly unsubstantiated statements in the vo- 
cational guidance literature. 

The possibility of an eventual rational ap- 
proach to both job analysis and test construction 
is accepted by many personnel psychologists to- 
day. Whether, as indicated in the manual, *an 
inspectional job analysis made with the primary- 
ability categories in mind" is, or is not, the most 
effective preliminary approach at our present 
stage of ignorance would seem to be debatable 
(for example, who is to do the inspection?). 
However, a point that the reviewer contends is 
strictly empirical and not debatable is the rela- 
tion of the separate test scores singly and in 
combination to the criterion used to define suc- 
cess in each occupation. 

Two master's theses are summarized briefly 
(9) as providing evidence of “the possibilities 
of differential prediction" in educational and 
industrial applications. The results are consistent 
with most such attempts to predict college grades 
and supervisory success (manufacturing aircraft 
parts) ; they are positive correlations of verbal 
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and arithmetic reasoning scores with grades and 
with a supervisory performance criterion, The 
data required for the evaluation of the actual. 
ity of differential prediction in the first study 
was not presented. The multiple correlation be- 
tween the supervisory criterion and four parts 
of the survey was increased from .52 to .57 for 
Part I alone. No indication of the expected 
shrinkage of this multiple correlation was pre- 
sented. 

The problem of factorial validity is, of course, 
related to that of practical validity. The eventual 
empirical analysis of sets of criterion measures 
with sets of test scores may provide the evi- 
dence needed for the development of efficient test 
batteries to be used in selection and differential 
prediction studies. The “factorial validities" 
should assist in the development of a battery 
covering the several factors with a minimal 
number of tests ; the “practical validities” would 
indicate which of the many factors should be 
represented in the prediction equations. 

In the vocational guidance field at present, the 
job family classifications of clerical and me- 
chanical skills have considerable status. For edu- 
cational counseling, academic or scholastic ap- 
titude tests have a similar standing. The con- 
tinued analysis of the criteria of success in these 
areas should provide the empirical evidence as 
to the sources of variance associated with these 
success measures. It is not clear to the reviewer, 
without such empirical evidence, that “the three 
traditional aptitude areas [verbal intelligence, 
clerical aptitude, and mechanical aptitude] have 
been broken down into their chief fundamental 
components" if the “breaking down" means that 
the common factors in the various criteria of 
success on the job have been determined. 

The reviewer, in conclusion, cannot recom- 
mend these tests for use in vocational counseling 
or guidance at this time. The lack of evidence to 
support the claims presented in the manual is, 
unfortunately, a common characteristic of many 
such tests offered for sale today. The tests are 
recommended, on the other hand, to research 
workers interested in a series of relatively lin- 
early independent measures of test behavior, al- 
though the intercorrelations among the parts 
are apparently higher than the test authors had 
expected. Eight of the 15 correlations based 00 
a restricted (college) sample (8) are significant 
at the 1 per cent level. In this study these tests 
were curtailed in length or time of administra- 
tion; the intercorrelations of the complete bat- 
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tery might be higher if a more heterogeneous 
and representative sample were used. 


Joun B. CARROLL, Lecturer on Education, Har- 
vard University, Cambridge, Massachusetts. 

Believing that “aptitudes of individuals can 

be evaluated most adequately, economically, and 
meaningfully by using a series of tests each of 
which measures a unique ability,” the authors 
have made a very deliberate attempt to construct 
such tests. They plan eventually to publish tests 
of approximately 20 unique factors ; the tests re- 
viewed here constitute only the first installment 
and are intended to cover the chief abilities in 
three “traditional” areas—abstract intelligence, 
clerical aptitude, and mechanical aptitude. The 
basic theory is that of Thurstone’s multiple fac- 
tor analysis; there is no attempt to measure a 
general factor, even аз a second-order composite 
of the primary abilities. The authors have at- 
tempted to build tests as factorially “pure” as 
possible ; in some cases they have excluded items 
from a test because they correlate too highly 
with a different factor. Nevertheless, they sug- 
gest that where a perfectly pure factor test can- 
not be devised, “univocal factor scores” can be 
computed by appropriate linear combinations of 
raw scores. 
В The authors seem to have succeeded quite well 
in devising a series of pure factor tests. Prelimi- 
nary results on rather small samples at least 
show that the tests have high factorial validities 
(ie, have high correlations with the statistical 
factors they are supposed to measure). The in- 
tercorrelations of the tests are lower, in general, 
than in the case of Thurstone's primary mental 
abilities tests, although one must make such a 
comparison with considerable caution because of 
the lack of comparability of the samples. A dis- 
turbingly high correlation (r — .55) occurs only 
between Parts V and VI. 

Although these tests have some technical 
flaws, considerable care has gone into their con- 
Struction. High reliabilities are reported, but no 
Standard errors of measurement are given. 

orms are now available for college men and 
Women (based on large groups of freshmen or 
Sophomores at three universities), and, in a pro- 
Visional form, for high school boys and girls. 
aan is the case with other primary mental 

Mnties batteries, the amount of evidence on the 
сове validities of these tests is disappoint- 

Agly small. The few studies which have been 
Published indicate that the tests offer some pos- 


sibilities for differential prediction, but the most 
predictive parts of the battery are those which 
correspond to the usual omnibus tests of in- 
telligence. Nevertheless, it should be remem- 
bered that tests highly similar to those of the 
present battery were quite useful in differential 
classification of aircrew personnel in the Army 
Air Forces during World War II. It is largely 
on the basis of indirect evidence—evidence 
which has accumulated with respect to other 
factor tests—that the authors have drawn up a 
table of occupations for which each test in the 
battery is “probably” important. Unfortunately, 
these lists go beyond existing knowledge and 
reflect in some cases an inadequate job analysis 
of the occupation in question. For example, the 
occupation “mathematician” is listed in connec- 
tion with Part III, Numerical Operations. A 
mathematician, however, seldom has to make 
rapid mental computations, if he has to do them 
at all. While the authors’ lists of occupations 
may have some value as a guide to hypotheses 
which should be tested in further research, most 
guidance counselors are probably wary enough 
to avoid taking them at face value. 

The format of the tests needs improvement ; 
in several tests (Parts ТУ and V) the material 
for each item is arranged in such a way that a 
right-handed examinee will continually find his 
hand in the way when he tries to scrutinize the 
choices. The scoring mechanics have not been 
sufficiently simplified. 

Part I, Verbal Comprehension, appears to be 
a quite homogeneous vocabulary test ; the time 
limit is such that it is virtually a power test. 
Part II, General Reasoning, could better be 
termed a “quantitative reasoning test" ; the items 
have been constructed and selected so as to mini- 
mize the role of the number factor. The time 
limit for this test probably needs to be longer if 
it is to be a pure power test. Part III, Numerical 
Operations, is constructed along the lines of 
other number factor tests; one wonders, how- 
ever, why the addition, subtraction, multiplica- 
tion, and division sections of the test are not 
separately timed. Part IV, Perceptual Speed, re- 
quires the examinee to find matching pictures of 
yarious objects by comparing fine details. In 
addition to the defect in format mentioned ear- 
lier, the test suffers because the items are not 
numbered and the subject is not told in what 
sequence to perform them. Part V, Spatial Ori- 
entation, is an exceedingly clever test, requiring 
the examinee to indicate changes in the position 
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of the prow of a motorboat with reference to a 
landscape in the background. A subject with a 
vivid sense of kinesthetic imagery will do well 
on this test, though he may find himself getting 
a trifle seasick! Part VI, Spatial Visualization, 
has to do with visualized rotations of an object 
on its three axes. Good performance on the test 
would possibly depend too much on the compre- 
hension of the special instructions and defini- 
tions which are used. Part VII, Mechanical 
Knowledge, could be said to measure a “pure 
factor” only in a statistical sense; the items are 
a hodgepodge of facts about plumbing, auto me- 
chanics, etc. There is only one item which con- 
cerns radio repair. The authors observe that 
scores on the test are valid only if the examinee 
has “grown up in the United States culture or 
a similar culture.” Perhaps the same thing, of 
course, could be said of other subtests in the bat- 
tery. 

The Guilford-Zimmerman Aptitude Survey 
must therefore be evaluated as a series of re- 
search tests which succeed reasonably well in 
measuring, over a certain range of ability, a 
number of unitary traits. Guidance personnel 
and industrial psychologists will look forward 
to the extension of the battery and publication 
of results on its validity, for the complete bat- 
tery will undoubtedly measure a somewhat dif- 
ferent sampling of abilities from those measured 
by other primary mental abilities batteries. 


P. E. Vernon, Professor of Educational Psy- 
chology, Institute of Education, University of 
London, London, England. 

This battery is said to be suitable for high 
school, adult, and college populations ; but norms 
are provided only for college men and women 
(numbers not stated). Most of the tests, except 
Perceptual Speed, will probably be too difficult 
for other groups, especially female, and distribu- 
tions may be badly skewed. 

In reviewing the Primary Mental Abilities, 
the writer has objected to the use of one test as 
a measure of a factor and the assumption that 
it will predict any educational or vocational 
ability that looks as though it might be related to 
that factor. The same criticism holds here, 
though to a lesser extent, because some of the 
tests are a good deal more thorough and because 
special care has been taken to purify their con- 
tent. Refactorisations have yielded average load- 
ings on the appropriate factors of .66 (variance 
44 per cent) and on the largest inappropriate 


factors of .21. Nevertheless, the first two and 
the last three tests tend to show considerable 
overlapping, and it is doubtful whether the dis- 
tinction between orientation (awareness of spa- 
tial directions) and visualization (of spatial 
manipulations) can be maintained. The authors 
hope to extend the battery to cover many more 
of the 20 or so factors extracted in Army Air 
Forces researches on pilot selection. But, as the 
reviewer has pointed out elsewhere,’ numerous 
tiny group factors are likely to emerge from in- 
vestigations of populations which have been 
highly selected on qualifying tests—factors 
which fail to differentiate in more heterogeneous 
groups. So far the validation data against ex- 
ternal criteria are singularly unpromising, 
though it is greatly to the authors’ credit that 
they are publishing these in a series of psycho- 
metric Notes (8, 9), so that the user can judge 
for himself. 

The tests are largely selí-administering, 
though Spatial Orientation and Spatial Visuali- 
zation seem likely to cause difficulties, both be- 
cause of the elaborateness of the instructions 
and because they are none too clearly printed. 
Verbal Comprehension is of particular interest 
in that the correct answer and the distractors 
are all of the same level of difficulty as the word 
to be defined. Mechanical Knowledge has not 
been item analysed, and the items are clearly not 
arranged in order of difficulty ; however, there 
should be sufficient time for almost all testees to 
try all items. 

In conclusion, the battery should be of value 
for experimental use, particularly among male 
college students. The low interest correlations 
give promise of high multiple correlations with 
educational and vocational performances, but 50 
far there is little evidence that much more 1$ 
measured than the g and V abilities measured by 
any equally thorough verbal intelligence. test. 
The most useful section, after Verbal Compre- 
hension and General Reasoning, is likely to be 
Mechanical Knowledge. 
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*[Primary Mental Abilities]. Ages 5-7, 7-1 17 
17; 1938-50; 3 levels; cash orders postpaid; "Be 
Thurstone and Thelma Gwinn Thurstone; Science 
search Associates, Inc. * s5- 
а) SRA PRIMARY MENTAL ABILITIES : PRIMARY. Ages ү, 
7; 1946-48; also called PMA; formerly called Tes А, 
Primary Mental Abilities for Ages 5 and 6; 6 Ents 
verbal-meaning, perceptual speed, motor, space, 99 25; 
tative, total; 1 form, ’46; manual ('48); $2.90 pe Hn 

1 Vernon, Philip E. The Structure of Human Abilis john 
don: Methuen & бо. Ltd., 1950. Pp. xii, 160. (New York: 
Wiley & Sons, Inc., 1951.) * (РА 25:6026) 
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оё per specimen set; for accompanying instructional 
БАКЫЙ, see 62, 71, 84 below; (60-80) minutes. 
D) SRA PRIMARY MENTAL ABILITIES: ELEMENTARY. 
Ages 7-11; 1048-50; also called РМА; 11 scores: 
words, pictures, yerbal-meaning, space, word-group- 
' ing, figure-grouping, reasoning, perception, number, 
total non-reading, total; Form AH (’48) ; preliminary 
manual ('48); 49¢ per test and answer pad; separate 
answer pads must be used; $1.90 per 25 answer pads; 
$$@ per 25 percentile profile sheets (48) ; 55€ per 25 
age and quotient score profile sheets (^50); 756 per 
specimen set; (60) minutes. 
с) SRA PRIMARY MENTAL ABILITIES: INTERMEDIATE. 
Ages 11-17; 1047-49; also called PMA; 6 scores: 
verbal meaning, space, reasoning, number, word- 
fluency, total; IBM; 2 editions; 1 form; revised man- 
ual (740) ; separate answer pads or answer sheets must 
be used; 556 per 25 revised percentile profile sheets 
(49); 75€ per specimen set of any one edition; for 
an explanatory booklet, see 64 below; 26(45) minutes. 
1) Form AH (Hand Scoring Edition). 1947; 40€ 
per test and answer pad; $1.90 per 25 answer pads. 
2) Form AM (Machine Scoring Edition). 1948; 
30¢ per test; $2.90 per roo IBM answer sheets; $2 
per set of scoring stencils. 
d) THE CHICAGO TESTS OF PRIMARY MENTAL ABILITIES. 
Ages 11-17; 1041-43; 7 scores: number, verbal mean- 
ing, space, word fluency, reasoning, memory, total; 2 
ом, ЫР ы 
1) Six Booklets Edition. 1038-41; originally pub- 
lished by the American Council on Education Kith 
the title Tests of Primary Mental Abilities; IBM ; 6 
tests in individual booklets and 5 practice booklets ; 
I form, '41; manual ('41) ; $6 per 25 sets of 6 test- 
answer booklets and 5 practice booklets; $2.15 per 
25 of any one of the following tests: Number, Verbal 
Meaning, Space, Word Fluency, and Reasoning; 
$1.10 per 25 of the test Memory; machine scoring 
stencils: Number—206, Verbal Meaning—3o¢, Space 
—6o¢, Reasoning—3o¢, Memory—20€, no stencil for 
Word Fluency; $2 per set of hand scoring stencils; 
$1 per set of memory cards; $3.75 per specimen set; 
50@ per specimen set of any one test; (240) minutes. 
2) Single Booklet Edition. 1943; an abbreviated 
form of Six Booklets Edition; 1 form; $3.75 per 25; 
75€ per set of scoring stencils; $1 per set of memory 
cards; 50¢ per specimen set; (120) minutes. 
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ANNE ANasTast, Professor of Psychology, 
Fordham University, New York, New York. 

Beginning in 1938 with the first experimental 
edition of the form for high school and college 
students, the Thurstone tests of primary mental 
abilities have undergone various metamorphoses. 
The original edition was the direct result of 
Thurstone’s factorial analysis of 56 tests on a 
college sample, together with supplementary and 
corroborative studies of narrower scope on sev- 
eral high school samples. A subsequent factorial 
analysis of 60 tests on a sample of eighth grade 
children led to the development of the Chicago 
PMA tests for ages 11 to 17, which first ap- 
peared in a separate booklet edition requiring 
six testing sessions and, two years later, in a 
single booklet edition requiring two hours of 
testing time. The last stage in this abridgment 
process to date is provided by the edition called 
the SRA Primary Mental Abilities for ages II 
to 17 (Intermediate), which can be adminis- 
tered in from 40 to 50 minutes. This edition, 
available in both hand and machine scoring 
forms, contains only five short tests, each re- 
quiring from 4 to 6 minutes of actual working 
time. Another parallel development has been the 
extension of the series to cover still younger age 
levels. The SRA Primary Mental Abilities for 
ages 7 to 11 (Elementary) consists of seven tests 
grouped under five factors, the total testing time 
being approximately one. hour. Finally, a form 
for ages 5 to 7 (Primary), also published by 
SRA, is approximately as long as that for ages 
7 to 11 and also yields five factor scores, the 
number of tests for each factor varying from one 
to five. 

The various forms, editions, and levels of this 
series cover different combinations of eight 
group factors, or “primary mental abilities,” 
identified by Thurstone and his associates. The 
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entire PMA list includes : Verbal-Meaning (V), 
Space (S), Reasoning (R), Perceptual Speed 
(P), Number (N), Word-Fluency (W), Mem- 
ory (M), and Motor (Mo). In the original ex- 
perimental form (1938), reasoning was repre- 
sented by two factors, Induction and Deduction, 
but subsequent research failed to support sucha 
distinction and a single R factor was substituted, 
Memory tests were excluded from all SRA 
forms, at all age levels, “because they take too 
long to administer” (sic). Word-Fluency does 
not appear in the two earlier age batteries, be- 
low age 11, since group tests of this ability among 
younger children are said to depend to a large 
extent upon speed of writing. Motor ability is 
measured only at ages 5 to 7, the test employed 
for this purpose being concerned primarily with 
the young child's ability to handle a pencil. The 
form for ages 5 to 7 is also characterized by the 
presence of a Quantitative (Q) score in place 
of either the N or R scores found in the older- 
age forms, since the two latter factors could not 
be clearly identified among younger children. 
All SRA editions yield only a single score for 
each factor, In the form for ages 7 to 11, how- 
ever, a word and a picture test are provided for 
both the V and R factors, so that readers and 
nonreaders may be differentiated by a compari- 
son of the two sets of scores. 

In general, the format is well designed for ex- 
pediency of administration and scoring. Both the 
11 to 17 and 7 to 11 SRA forms employ reusable 
booklets with step-down pages and detachable 
carbonized answer pads with automatic Scoring 
grids inside. (A machine scorable form is also 
available at the intermediate level.) One won- 
ders why, in the Word-Fluency test for ages 11 
to 17, the subject is directed to print his name 
and other personal data on the answer sheet after 
the signal to stop at the end of the test. The pro 
cedure would be better controlled if such infor- 
mation were requested before the specific instruc- 
tions for the Word-Fluency test are given, when 
cheating would be impossible. The 1948 revision 
of the 5 to 7 year form is identical with the first 
edition except that all items have been printe 
in larger size and on a larger page, the booklet 
now being the same size as those for older sub- 
jects. For an unexplained reason, however, this 
revision is designated as the form for ages 5 (0 
7, while the first edition was marked for age 
5 and 6. 

Norms are based on samples which, on 
whole, appear satisfactory in size but which are 
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inadequately described with reference to type 
of school, part of the country, proportion of ur- 
ban and rural representation, national and eth- 
nic origin, and socio-economic level. Little or no 
information is provided in the manuals to indi- 
cate the representativeness of the normative sam- 
ples or to define the population that was sam- 
pled. No separate sex norms are reported, nor 
are any data on sex differences included in the 
manuals, 

In most of the forms, percentile norms for each 
factor are provided at each year or half-year 
level. Individual profiles are constructed from 
these percentiles. The SRA form for ages 11 to 
17 provides the most satisfactory treatment of 
scores, since the profiles are plotted on a sliding 
percentile scale in which the units are adjusted 
to conform to a normal distribution. Such a 
procedure eliminates the inequality of percentile 
units in the graph, but not, of course, in the 
numerically reported scores. The same form 
provides for total scores “IQ equivalents” which 
actually correspond to normalized standard 
scores with a mean of 100 and a standard devia- 
tion of 16. Despite the obvious technical advan- 
tages of the latter measure, the authors recom- 
mend the use of percentiles in preference to 
such an IQ “for almost all uses.” It is difficult 
to find the justification for such a recommenda- 
tion. In the form for ages 7 to тт, a sliding scale 
of percentiles had formerly been employed in 
plotting the profiles, but, in the 1950 revision of 
the profile sheet, a mental age unit has been sub- 
stituted. This is a clear example of deterioration 
rather than progress in test construction. The 
present MA units are all treated as equal, the 
rise from ages 13 to 14, for example, being as 
large as that from 6 to 7 on the scale. Moreover, 
ability quotients are computed by dividing test 
age for each factor by CA, with no indication 
Whether or not the SD of such quotients remains 
Constant with age. The same quotient may thus 
have very different meanings at different ages 
or in different abilities. In the 5 to 7 year form, 
only MA units and quotients are provided, the 
Scores thus being subject to the same inaccura- 
cles as those found in the latest revision of the 
7 to 11 profile sheet. 
is iris common to most of the PMA tests 
um i undue emphasis upon speed, even in the 
EE ne tests as Verbal Meaning and Rea- 
b. 8, designed to measure power. Speed is 
х Pecially prominent in all forms for ages II to 

7, including the two longer Chicago versions 


and the SRA abridgment. In the tests for ages 
7 to 11, some effort was made to estimate the 
influence of speed, but unfortunately a very 
crude speed index was employed, based essen- 
tially on the proportion of individuals who at- 
tempt the last item in each test. On this basis, 
Perceptual Speed and Number appear to be 
primarily speed tests, while the remaining tests 
depend to a much smaller, though not negligible, 
degree upon speed. It would be desirable (a) to 
apply a more dependable and precise speed- 
power index, such as H?, and (b) to reduce still 
further the contribution of speed to the sup- 
posedly power tests in the battery. In the form 
for ages 5 to 7, only the P and Mo tests are speed 
tests, the remaining tests being administered 
with no time limits. 

A special weakness of the entire PMA series 
is the treatment of test reliability. In tests such 
as these, designed for intra-individual compari- 
sons and profile analysis, the need for proper de- 
termination and reporting of reliability is par- 
ticularly urgent. Yet in the various forms of the 
PMA tests, reliability coefficients are either in- 
adequately reported, incorrectly computed, or 
completely omitted. Odd-even and Kuder-Rich- 
ardson techniques have been repeatedly em- 
ployed in finding the reliability of speeded tests. 
In several forms, no recognition is given to this 
problem at all, spurious and meaningless relia- 
bilities as high as .98 being reported without 
comment, except to say that the reliabilities 
would probably be higher in more heterogeneous 
samples. In the manual of the form for ages 7 to 
11, it is admitted that for highly speeded tests 
such as P and N, the usual internal consistency 
techniques are unsuitable. But this is followed by 
the astounding and erroneous conclusion that 
Kuder-Richardson formula 21 is applicable in 
such cases, and the “reliabilities” thus computed 
are accordingly reported for these tests! The 
manual of the form for ages 5 to 7 contains no 
reliability coefficients. In a personal communica- 
tion to the present reviewer, the reliabilities of 
four of the five scores on this form were re- 
ported as follows : V, 77; Q, 90; P, .96; S, .87. 
No mention was made of the size or nature of 
the samples on which these reliabilities were 
computed nor of the techniques employed to 
determine reliability. In view of the suspiciously 
high reliability of the Perceptual Speed score, 
however, it appears likely that internal con- 
sistency measures were employed throughout. 

It is interesting to note the implications of 
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.test reliability for the evaluation of individual 
profiles. As an illustration, the Space scores on 
the PMA for ages 7 to 11 may be considered. 
With a reliability coefficient of .79, the stand- 
ard error of a score on this test is .46 SD; a 
minimum deviation of r.19 SD is therefore re- 
quired for significance at the .o1 level. Refer- 
ence to the percentile profile sheet shows, how- 
ever, that deviations above the 75th or below 
the 25th percentile on any test are interpreted 
as representing "high" or “low” standing, re- 
spectively, in the given ability. Such a deviation 
corresponds to .67 SD, not 1.19 SD. Many such 
deviations therefore fall well within the error of 
measurement of the test. In the profile sheets 
plotted in terms of МА units, no attempt has 
even been made by the authors to indicate how 
large a deviation from the norm is required for 
the meaningful interpretation of profile points. 
Another important factor in the interpretation 
of intra-individual variations is the degree of 
intercorrelations among the tests. In the PMA 
batteries, such intercorrelations are consider- 
able, especially at the lower age levels. At the 
II to 17 level, most of the intercorrelations 
among ability scores are in the .30's, those be- 
tween V and R being in the .50’s. At the 7 to 
11 level, intercorrelations on one sample of nine- 
year-olds ranged from .41 to .70; at the 5 to 
7 level, correlations from .46 to .67 are reported. 
At least some of these correlations are probably 
as high as the correlations between different 
tests designed to measure the same ability. Such 
a situation casts doubt upon the value of finding 
separate scores in these abilities for young sub- 
jects. Moreover, in the light of the probable re- 
liability of the various tests (when reliabilities 
are properly computed), it would seem that a 
large majority of the intra-individual differences 
between scores on the various primary mental 
abilities are the result of chance errors. For ex- 
ample, with a mean reliability of .80 and a cor- 
relation of .50 between two tests, only about 22 
per cent of the obtained differences between 
scores on the two tests are in excess of chance. 
The validation of the PMA tests has been 
limited principally to an analysis of their fac- 
torial validity. In the manuals for the two forms 
of the Chicago PMA batteries, correlations of 
the composite score in each of the six abilities 
with each factor are reported. It may be noted 
that the two tables, for the six-booklet and the 
single-booklet edition, are identical. Obviously 
the original factorial validities of the longer bat- 


tery were simply reproduced as though they ap- 
plied to the shorter battery, without recheckin, 

or estimating the effect which a reduction in 
number and variety of items would have upon 
the factorial validities of the tests. No factorial 
validities are reported for any of the SRA forms 
at any age level. Attempts toward the estima- 
tion of empirical validity appear to have been 
meager and unsystematic. A few correlations 
with other psychological tests, including Ameri- 
can Council on Education Psychological Exami- 
nation, Kuhlmann-Anderson Intelligence Test, 
General Aptitude Test Battery, Minnesota Vo- 
cational Test for Clerical Workers, and Revised 
Minnesota Paper Formboard Test, are reported 
for the 11 to 17 and 7 to 11 SRA forms. The last 
three tests mentioned, incidentally, are incor- 
rectly included in a section on measures of 
“achievement.” Correlations of the 11 to 17 form 
with tests of educational achievement are like- 
wise reported. The predominant role of the V 
and R factors in achievement in all school sub- 
jects is suggested by these correlations. 

No evidence of validity, whether factorial or 
empirical, is given for the 5 to 7 year form; nor 
is there any published report of the factor analy- 
sis from which either this battery or that for ages 
7 to II were presumably constructed. The man- 
ual for the 5 to 7 year form contains an extensive 
list of references of general psychological in- 
terest, ranging from Carmichael’s Manual of 
Child Psychology to Bernardine Schmidt's mon- 
ograph on the rehabilitation of the feebleminded; 
but one searches in vain for the relevant sources 
giving the details of construction of the particular 
test. In 1948 and again in 1951, the present Ie 
viewer learned through personal communication 
with the publishers that the report of the fac 
torial investigation from which the 5 to 7 year 
tests were developed was in preparation but 
delayed. It is unfortunate that it has been de 
layed for so many years subsequent to the publi- 
cation and distribution of the tests. ud 

It would seem that the use of the term р!" 
mary mental abilities" connotes the measurement 
of factors which are somewhat more baste, 
permanent, and universal than is warrante y 
the available research, whether conducted y 
the test authors or by other independent investi 
gators. Moreover, both the test manuals ds 
the profile sheets emphasize the readiness Wit” 
which teachers, parents, and students may TA 
terpret the PMA profiles in terms of spect? 
educational and vocational plans. Such recom 


703 ] TESTS & REVIEWS: VOCATIONS 
mendations go far beyond the available evidence 
for empirical validity. 

In a booklet entitled You and Your Mental 
Abilities, addressed to high school students, the 
examinees are again urged to request and to in- 
terpret their own scores on the basis of over- 
simplified schemata. After a brief description of 
the primary mental abilities (as defined in the 
SRA tests), the booklet presents sample items 
for the student to work through, as well as hints 
on how to take psychological tests. The logic 
underlying such a booklet is puzzling. If the use 
of the booklet can effect an improvement in test 
scores, either through specific practice or 
through general test sophistication, then the 
availability of such a booklet to some individuals 
and not others reduces the validity of the tests. 
On the other hand, if scores are not appreciably 
raised, then it would seem that many of the 
statements in the booklet create misleading ex- 
pectations in the reader. On the basis of what 
is known regarding test wiseness and the effects 
of practice оп psychological test performance, 
it appears likely that the use of such a booklet 
would in fact influence scores. 

„The problem of test sophistication and spe- 
cific practice is also encountered in three other 
booklets constituting a Learning to Think series 
and designed for kindergarten and first grade 
children. Covering a daily 20-minute lesson for 
three semesters, these booklets provide practice 
materials in each of the PMA areas. The simi- 
larity between these practice materials and the 
test items is quite close, many of the same pic- 
tures being employed, albeit in different ways. 
It would be helpful to know the extent to which 
practice on these materials leads to improvement 
in (a) scores on the РМА tests, (Р) scores on 
Other psychological tests, and (c) the child's 
general intellectual functioning in school work 
and other "criterion" situations. Only thus would 
T be possible to determine whether such ma- 
terials provide primarily coaching, test sophisti- 
cation, or education. No data bearing on any of 
these points, however, are cited. 

In summary, the early forms of ће PMA tests 
Were the result of extensive research and repre- 
Sented an important and promising new step 
in test construction. But subsequent develop- 
Ment, rather than providing the needed refine- 
шу and empirical validation of such an experi- 
i ental instrument, has proceeded downward in 
д € direction of abridgment and crude populari- 

ation. As for the supplementary booklets and 
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practice materials, perhaps their chief useful- 
ness will be to provide problems for discussion 
by committees engaged in formulating codes of 
professional ethics for psychologists. 


Ratru Е. BrnprE, Professor of Psychology, and 
Director, Student Counseling Bureau, Univer- 
sity of Minnesota, Minneapolis, Minnesota. 

The original Tests for Primary Mental Abili- 
ties were perhaps more substantially founded on 
basic psychological theory than any other tests 
we have available today. During the past то 
years, however, these tests have been revised, 
renamed, and reassembled to such an extent that 
it is difficult to trace the present tests back to 
their ideational sources. 

In 1938, the experimental edition of the Tests 
for Primary Mental Abilities was published by 
the American Council on Education. These orig- 
inal tests measured seven factors that provided 
a total of 16 separate scores. The time limit for 
the practice exercises and for the tests totaled 
222 minutes. In 1941, the Chicago Tests of Pri- 
mary Mental Abilities were presented by the 
authors. These tests provided seven scores and 
required 240 minutes. A single booklet edition 
of the Chicago Tests of Primary Mental Abili- 
ties was published in 1943 and required 120 min- 
utes. 

In 1946, the first of the SRA Primary Mental 
Abilities tests appeared. This series was for ages 
5-7 and had originally been published under the 
title Tests of Primary Mental Abilities for Ages 
5 and 6. Six scores are provided by these tests 
and the time required is between 60 and 80 min- 
utes. The SRA Primary Mental Abilities for 
ages 7—11 appeared in 1947. This series provides 
тт scores and requires 60 minutes. The SRA 
Primary Mental Abilities for ages 1 1-17 ap- 
peared in 1947 and provided six scores. The test- 
ing time was 45 minutes. Thus, the latest de- 
velopment of these tests requires approximately 
one fourth of the time required by the earlier edi- 
tions. No acceptable data indicating the effect of 
this condensation on the reliability of the tests 
are available. ) 

In light of the elaborate experimental and sta- 
tistical developments of these tests, 1t 1s surpris- 
ing that they are not better than they are. In fact, 
one can question whether there is any accepta le 
evidence at all to suggest that the use of these 
tests can be justified in an educational or guid- 


ance program. T . 
The evidence for the reliabilities reported in 
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the manual is insufficient, and since these tests 
are, on the whole, highly speeded, the method 
used for determining reliability was inappropri- 
ate. Furthermore, since these tests are relatively 
pure factorially, any measure of reliability based 
on internal consistency must be high. 

Little evidence concerning the validity of the 
test scores is presented. Tables showing correla- 
tions between the PMA and other tests are pre- 
sented in the manuals. The most indicative in- 
formation should come from correlations 
between PMA scores and achievement, but 
when the PMA scores are correlated with 
achievement test scores, it is a question whether 
or not the resulting correlation is due to the 
achievement tests being heavily loaded with the 
factor contained in the PMA test. Much evi- 
dence that the tests are relatively independent is 
presented, but nothing to show that the tests 
predict either scholastic success or vocational 
performance is available. 

A minor error appears in the manual where 
бо test items are reported to constitute the Num- 
ber Test; actually, 7o numbers are there. 

In general, one would expect these tests to 
be a great contribution to education and guid- 
ance. That they have not been may be due either 
to the test itself or to the inadequate follow-up 
work that the authors or others have done. It 
may be that in attempting to produce a test that 
requires relatively little time or money, the pub- 
lishers have sacrificed those very things that 
made the tests potentially valuable. Tt is too bad 
that after such tests have been available for more 
than 14 years, one must still conclude that their 
principle uses are experimental. 


Joun B. CannoLL, Lecturer on Education, Har- 
vard University, Cambridge, Massachusetts. 

Administrators, teachers, and guidance per- 
sonnel in both elementary and secondary schools 
may consider themselves fortunate in having 
available to them, at last, an integrated series of 
tests of primary mental abilities. This review is 
intended to appraise the major features of the 
series and some of its accompanying materials, 
to offer some thoughts about their proper use 
and interpretation, and to discuss the features of 
each of the separate batteries. 

The theoretical basis for the tests is now well 
known: "intelligence" has been shown to be 
composed of a number of separate and more or 
less independent “abilities” or “factors.” In the 
development of this series, an effort has been 
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made to construct tests which are "pure" meas. 
ures of the several primary (first-order) fac. 
tors. Some of these tests, particularly those of 
the reasoning factor, correlate rather highly with 
a general (second-order) factor; hence, a good 
measure of the general intellective factor may be 
secured, as the authors indicate, by making ap- 
propriate combinations of scores of the primary 
factors. Nowhere in the test manuals is there 
any suggestion that regression techniques should 
be used to obtain measures of primary factors 
from which the effect of a general factor has 
been eliminated. The authors would probably 
maintain (and this reviewer would concur) that 
from the standpoint of pupil guidance, the elimi- 
nation of the general factor from the factor 
scores might give rise to unnecessary difficul- 
ties in interpretation. 

The authors have selected for these tests the 
most prominent and most often isolated factors 
in a long series of studies. Number, Reasoning, 
Verbal-Meaning, and Space are represented in 
all batteries reviewed here, except the primary 
school battery where Number and Reasoning 
coalesce in a single Quantitative factor, since 
young children have not encountered formalized 
instruction in number operations. The Word- 
Fluency (W) factor is included only in batteries 
designed for ages 11 to 17 ; the authors state that 
it cannot conveniently be measured earlier be- 
cause the usual tests for W involve writing. 
Tests of M (Memory) appear only in the longer 
Chicago tests, for the upper age levels. Tests of 
P (Perceptual Speed) occur only in the SRA 
Primary and SRA Elementary tests. The SRA 
Primary battery also includes a test of “Motor 
Ability”; actually, this test could be better de- 
scribed as a measure of ability to manipulate à 
pencil. Apparently, it was the intention of the 
authors to include, at each level, tests of only 
those factors deemed of most importance for 
guidance purposes. Inclusion of other factors 
would have resulted in longer batteries. Most 
teachers would agree that measures O N 
(Verbal), R (Reasoning), S (Space) and i 
(Number) abilities should be included at 2 
levels, but it is to be regretted that measures ^ 
P (Perceptual Speed) were omitted at the ИШ 
levels, especially if one postulates that this a?! 
ity has some relevance for commercial шг 
and consequently for vocational guidance. 
tests of factors W (Word Fluency) and 
(Memory) could easily have been dropped; 
cause thus far we have seen little evidence of th 
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relevance for either educational or vocational 
guidance. The fact that a given dimension of 
ability is clearly disclosed in a series of factor 
studies is of paramount interest in psychological 
research, but it does not necessarily justify the 
inclusion of tests of that dimension in a battery 
intended for practical purposes. On the other 
hand, with further use of the present batteries, 
some interesting research data pertaining to the 
relevance of all the various factors will undoubt- 
edly be obtained. 

In general, the tests appear to be well con- 
structed. For any given factor, the tests used are 
those which have shown most factorial purity, 
highest factor loadings, and highest reliabilities ; 
even in the abbreviated SRA Intermediate bat- 
tery, the reliabilities of the tests are such that 
little accuracy of measurement or factorial va- 
lidity is lost in comparison with the longer Chi- 
cago Tests of Primary Mental Abilities (Six 
Booklets Edition). The test manuals state that 
extensive item analysis procedures have been 
followed in developing the tests, but in some 
cases one feels that the range of item difficulty 
should be expanded. Furthermore, some of the 
tests are so short that, in spite of the high relia- 
bilities which are reported, the standard errors 
of measurement must still be appreciable. This 
is especially true for high scores on the subtests 
of the SRA Primary battery, where the differ- 
ence of a raw score point often corresponds to 
a difference of 6 or 8 months of age placement. 
The normative data are in most cases as ade- 
quate as can be expected at this stage of the test 
development program, although it is unfortu- 
hate that norms are not differentiated by sex. 

The difficulties in interpreting individual 
scores become particularly apparent when the 
guidance counselor tries to decide how the scores 
are relevant to the individual’s potential success 
at higher educational levels or to his choice of 
а career, A guidance counselor who is not fa- 
miliar with the difficult technical and theoreti- 
cal problems involved in making psychological 
interpretation of the factors can easily be misled 

у some of the authors’ statements about the na- 
ture of the factors—statements which in the 
authors’ research publications are offered mainly 
as suggestive hypotheses. For example, in the 
mu for the tests at the 5 to 7 year age level, 
MD measured by the “motor” test is de- 
ev, ed as "the ability to coordinate hand and 
ae movements"; immediately following 1s а 

atement that “a high degree of motor ability 
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is valuable to adults.” A guidance counselor, or 
even more likely a parent, might easily draw the 
erroneous conclusions that (а) the motor test 
in the 5 to 7 year battery measures the same 
kind of motor ability that is called for in ath- 
letics, and (b) this motor test predicts adult 
motor ability. There is a wealth of research to 
show that motor abilities are highly diverse and 
specific, and that they are probably not very pre- 
dictable over long time spans. Again, the Word- 
Fluency factor is described as the "ability to 
write and talk easily." This reviewer knows of 
little evidence to show that W is anything more 
than a measure of a type of imaginative “flow” 
which occurs under certain experimental con- 
ditions; his own studies do not show any im- 
portant relation of W with the ability to write 
or to talk easily. To cite one final example, the 
memory factor is described by the authors as 
"the ability to recall past experiences." Their 
own research has shown that a number of sepa- 
rate memory factors can be isolated; the usual 
type of memory test measures only a specific 
kind of immediate rote memory and thus may 
have little to do with the way one can recall ex- 
periences from a more remote past. There is 
some evidence to show that the Rote Memory 
factor has something to do with success in learn- 
ing foreign languages where a large number of 
paired associations may have to be established, 
but this factor has not yet been shown to be of 
any importance in such an activity as, say, re- 
membering the plots of novels. 

The authors are undoubtedly on solid ground 
in their discussions of the Verbal-Meaning and 
Reasoning factors. In all probability, the state- 
ments which the authors make about the Num- 
ber and the Space factors and their relevance to 
certain types of curricula and jobs will even- 
tually be substantiated in validity studies, but 
this is only the reviewer's hunch. With regard 
to the other factors included in these batteries, 
the reviewer feels that it is fortunate that the 
tests are now available for wide use in research, 
but their usefulness in guidance and selection 
may not correspond to the usefulness one would 
expect from a literal interpretation of the au- 
thors’ statements. The booklet, You and Your 
Mental Abilities, by Bouthilet and Byrne, is not 
untainted by wishful thinking, when viewed ob- 
jectively, because unfortunately we simply do 
not know enough about the relation of the pri- 
mary factors to success in school subjects and in 


various occupations. 
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tional memory question-response type of items. 
In all tests the exercises involve mental func- 
tioning in action. The record sheet is removable 
so that the test booklets may be used over and 
over again. Percentile norms and self-interpret- 
ing profile graphs are given. There are charts 
and provisions for converting scores into mental 
ages and IQ’s. Many tables and references pro- 
vide reliabilities, correlations, and other back- 
ground material. In other words, the tests and 
manuals might well serve as models for all pub- 
lishers of tests to follow. 

"Fifteen years of research into the nature of 
intelligence" have supplied the “primary mental 
abilities theories" upon which the tests are based. 
“The research projécts involved the use of the 
statistical method called factor analysis and have 
revealed that intelligence is not a unitary trait." 
"Eight abilities have already been identified," 
Those “most commonly tested are called" (my 
italics) by names judged appropriate, and de- 
noted by letters. 

Every person who makes or uses tests has, of 
course, perfect freedom to adopt or follow any 
theories or methods of interpretation he pleases. 
For those who accept conventional mass and 
correlational statistics, the PMA tests can be 
enthusiastically accepted without criticism. 

This reviewer, however, rejects as inappro- 
priate or totally false both the statistical methods 
and the theories of primary mental abilities de- 
rived from their use. More than 40 years of edu- 
cational measurement and interpretation, using 
mass statistics and assumptions borrowed from 
economic and other fields, have proved sterile 
in permanent contributions to educational sci- 
ence. There have been many changes in educa- 
tion since 1910, the beginning of the educational 
measurement movement. But philosophical the- 
orizing led to changes in education before meas- 
urement began. While educational measurement 
has been a great stimulus to speculation, it has 
made no contribution to the kind of objective, 
verifiable, impersonal knowledge that goes by 
the name of science. This reviewer predicts that 
the PMA tests, in spite of their structural and 
functional excellence, will not yield laws or edu- 
cational principles any more than other tests 
have done. 

The authors have almost completely ignored 
the significance of individual differences and 
those recent advances in theory and practice 
that have resulted from an increasing aware- 
ness of the influence of maturation upon test 


scores. Many books are being published about 
researches in child development which Prove 
each child’s score is uniquely significant in terms 
of his own developmental pattern, so that his 
scores may not with impunity be combined with 
those of other children, until the scores have been 
established as reliable by immediate repeated 
retesting and then converted to comparable ma- 
turational units. This the authors of the PMA 
tests have not done. 

The norms for the PMA tests have all the 
characteristics of maturation curves. From these 
norms 18 isochronic equations * have been com- 
puted, one for each test, and these equations 
prove the inadequacies of many of the authors’ 
statements. 

The authors appear to be well aware of the 
fact that tests measure performance only and 
that hundreds of factors contribute to every 
score. Yet they persistently accept single scores 
at their face values. No figures are given on 
test-retest reliability, nor is there any discus- 
sion of the causes of individual variation in 
score on retests. 

Confronted with conflicting theories of edu- 
cational measurement, the ordinary user of tests 
can only wait to see which theory is most pro- 
ductive of useful results. Today new measure- 
ment techniques, individual statistical methods, 
and new theories that supply competing systems 
against which to check the validity of this re 
viewer’s prophecies as well as the authors’ pub- 
lished claims are available. 


P. E. Vernon, Professor of Educational Psy- 
chology, Institute of Education, University 0 
London, London, England. 3 

The Chicago Tests of Primary Mental Abili- 
ties (1941 and 1943) and their factorial back- 
ground are so well known that the review will 
concentrate on the new features presented by the 
SRA edition. 

Each of the three levels covers five factors 
only: (a) Primary (ages 5-7): V, S, QN t 
R), P, and Mo. (b) Elementary (ages 741) 
V, S, N, R, and P. (c) Intermediate (ages 117 
17): V, S, N, R, and W. Me (memory) 5 
omitted on the grounds that testing it takes ү 
long. W (word fluency) is omitted at the tw 
younger levels as depending too much on SP e 
of writing, and Mo (motor) at the two №21 
levels because the motor coordination pr ж 
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for most scholastic purposes has largely matured 
by 7 years. P tests are available only in experi- 
mental form above ІІ years. N and R are 
merged, at ages 5-7, in Q (quantitative) ; this 
is an interesting point of developmental psy- 
chology, about which one would like more in- 
formation. 

There is now only one test per factor, apart 
from certain composite tests in the Primary 
battery and the reading and nonreading tests 
of V and R in the Elementary. While this en- 
ables the testing to be reduced to between 40 
and 75 minutes, depending on the level, it in- 
evitably means that the validity of the tests 
(either in relation to primary trait vectors or 
to educational or vocational criteria) is reduced. 
Judging from the intercorrelations and the re- 
sults of a new analysis of roth and r2th grade 
scores reported in the manuals, the writer esti- 
mates that the typical factor score measures gen- 
eral ability, or g, to about 50 per cent and a dis- 
tinctive primary ability only to about ro per 
cent. 

Thurstone has clearly retreated from his ear- 
lier opposition to "general" intelligence. He not 
only allows total scores to be calculated for each 
battery; but even provides for their conversion 
to IO's. (Mo is omitted from the Primary total ; 
actually its saturation with the general factor 
in this battery is as high as .65). Each compo- 
nent factor is approximately equally weighted in 
these totals. 

That this is a wise step is indicated by the 
average intercorrelation of .55 at the Primary 
level and .50 in a 9-year-age group. The figure 
sinks to .34 and about .30 in grade groups 4B 
and 10-12. There seems to be little evidence, 
then, of the differentiation of abilities with age 
(as expounded by Garrett); the greater ho- 
Mogeneity within the grade groups could fully 
account for the reduced overlapping. 

Earlier editions were criticised because all 
the tests were speeded. This is still true of the 
Intermediate battery, but nob of the others, 
Where only the P, Mo, and N tests are timed. 

The V and P scores at the Primary level are 
claimed to be diagnostic of reading readiness, 
though no evidence is presented. At the Ele- 
Mentary level, the difference between reading 
and nonreading V and R scores is shown to be 
educationally significant; also a separate non- 
reading total score or IQ can be calculated by 
Omitting the two reading tests. 

You and Your Mental Abilities is a pleas- 


antly written pamphlet for students in grades 
9-12. It explains individual differences in in- 
telligence and other abilities, and gives advice 
on taking and interpreting PMA tests. If it were 
true that the tests were valid predictors of edu- 
cational and vocational potentialities, one could 
cordially recommend it. 

The Learning to Think manuals and work- 
books provide a series of daily 20-minute les- 
sons for three half-years—kindergarten and 
first grade. Not having had any opportunity to 
try them out, the reviewer can merely record his 
opinion that they will be thoroughly attractive 
to, and suitable for, 5- to 6-year-olds. The exer- 
cises are all based on pictorial material and cover 
the eight factors in rotatiop. There is no doubt 
that they will improve the ability to do the SRA 
Primary tests. The crucial point is whether they 
will transfer or train any wider manifestations 
of these faculties. Although Mrs. Thurstone be- 
lieves that they will and claims to have research 
evidence, the only relevant references quoted 
are those of the Iowa school. 

As one would expect of the Thurstones and 
of SRA publications, the mechanics of the tests 
are admirable and the directions for administra- 
tion and scoring clear and simple. Each test 
covers the specified range adequately, and the 
norms are based on large numbers. It is a pity 
that separate sex norms or at least some indica- 
tion of the typical profiles of boys and girls are 
not provided. Reliabilities are good for the two 
older levels. The fact that they are not quoted 
for the Primary level increases one's suspicion 
that they are less favorable. There is no informa- 
tion on correlations between tests of the same 
factors at different levels. It is most unlikely 
that the Primary battery gives high predictions 
of scores at later ages, though the user is al- 
lowed to infer this. Still more serious is the im- 
plication that all the tests are valid educational 
and occupational predictors, especially since 
teachers, parents, and even pupils are encour- 
aged to interpret and act on the profiles. A set 
of weighted scores on several tests, including 
suppressors, could indeed have supplied “rela- 
tively pure measures of the factors.” It is most 
misleading to suggest that single short tests can 
do so. There is little enough evidence to show 
that pure measures of, say, W, will predict abil- 
ity as “actor, stewardess, reporter, salesman, 
writer and advertising man.” The validity of a 
single W test is probably much lower and is 
likely to derive more from its general verbal jn- 
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telligence content than from its distinctive flu- 
ency component. In conclusion, the SRA tests 
would be valuable to a trained school psycholo- 
gist who would realise their weaknesses and 
could supplement any diagnostic indications 
from the profiles by more thorough testing ; but 
when applied and interpreted by laymen and 
pupils, they may often do considerable harm. 


For a review by Florence L. Goodenough of 
the Primary test, see 3:264; for a review by 
Cyril Burt of the. Single Booklet and the Six 
Booklets Editions, and a review by James R. 
Hobson of the Single Booklet Edition, and a re- 
view by F. L. Wells of the Six Booklets Edition, 
see 3:225; for reviews by Henry E. Garrett, Tru- 
man L. Kelley, C. Spearman, Godfrey Н. Thom- 
son, and Robert C. Tryon and excerpts from re- 
views of the Six Booklets Edition, see 40:1427; 
for related reviews, see 40:B1099 and. 38:B 503. 
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*[United States Employment Service Special 
Aptitude Tests.] Ages 16 and over; 1935-44; also 
called C Series; a series of tests used by the U.S.E.S. 
as the basis from which specific aptitude test batteries 
were developed for entry jobs in industrial plants; 
-available only in batteries standardized prior to 1945, 
after which time they were superseded by subtests of 
the General Aptitude Test Battery (see 714) in the de- 
velopment of new batteries ; tests Cr, C2, C13-C17, Соз, 
Стоо, Cror adapted from MacQuarrie Test for Me- 
chanical Ability (see 759) and must be purchased from 
California Test Bureau, the publisher; all other tests 
must be purchased from U.S. Government Printing 
Office; 30 tests; 1 form; testing services free of charge 
when program is conducted through the facilities of 
the U.S.E.S.; institutions and employers using their 
own facilities must purchase tests and employ testing 
supervisors trained by U.S.E.S.; details may be se- 
cured from local and state offices, through which all 
orders must be cleared; postpaid; United States Em- 
ployment Service. * 


а) CI [MOTOR SPEED]. 1942; 276 per сору; 14(5) min- 
utes. 
b) c2 [AIMING]. 1942; 2.24 per copy; %(5) minutes. 
с) C9 [NUMBER WRITING]. 1939; $1 per 100; 4(8) min- 
utes. 
d) сїз [EYE-HAND COORDINATION]. 1942; 2.76 per 
copy; 1(5) minutes. 
е) C14 [FIGURE COPYING]. 1942; 2.26 per сору; 2144(7) 
minutes. 
f) c15 [PLOTTING]. 1942; 276 per copy; 207) minutes. 
9) c16 [SPATIAL]. 1942; 2.2¢ per copy; 4(8) minutes. 
h) сїў [MAZE]. 1942; 2.26 per copy; 2 (7) minutes. 
i) CIQ [ARITHMETIC]. 1939; $1.50 per 100; 6(10) min- 
utes. 
5) C29 [NUMBER COMPARISON]. 1939; $2 per 100; 6 
(10) minutes. 
k) c30 [NAME COMPARISON]. 1939; $1.50 per 100; 6 
. (10) minutes. 

1) c32 [SUBSTITUTION]. 1939; $I per тоо; 4(10) min- 
utes. 
m) C33, [PAPER FORM BOARD]. 1941; $675 per 100; 7 
(12) minutes. 
n) c43 [AUTOMOTIVE]. 1944; $2 per тоо; 10(15) min- 
utes. 


ON 


о) c48 [т0015]. 1941; $3.50 per 100; 10(15) minutes, 
b) C55 [SERIES COMPARISON]. 1941; $3.50 Per 100; 4 
(10) minutes. j 
q) C56 [SIZE AND SHAPE DISCRIMINATION], 1941; % 
рег 100; 5(10) minutes. F i 
r) C57 [3-DIMENSIONAL SPATIAL RELATIONS], 101; 
$4.50 per 100; 4(10) minutes, d 
5) CÓI [ARITHMETIC]. 1941; $1.50 per 100; 5(10) min- 
utes. 

t) c67 [IDENTICAL FORMS]. 1941; $3.50 per 100; 4(10) 
minutes, 

и) C70 [SURFACE DEVELOPMENT]. 1041; $6 per 100; 6 
(10) minutes. 

v) CBI [REASONING PROBLEMS]. 
6(10) minutes. 

w) C86 [MECHANICAL INFORMATION]. 1041; $4.50 per 
100; 11(16) minutes, 

х) c87 [MARKING]. 1942; $1.50 per 100; ló(s) min- 
utes. 

у) c88 [PAPER SPATIAL RELATIONS]. 1941; $1.25 per 
100; 5(10) minutes, 

х) C92 [MIRROR IMAGE]. 1942; $1 рег 100; 3(7) min- 
utes. 

aa) соз [BLOCK COUNTING]. 1942; 2.7¢ per copy; 3(7) 
minutes, 

bb) сод [откестіом5]. 1942; $1.50 per 100; 5(10) min- 
. utes. 

сс) CIO0 [3-LINE copvixc]. 1942; 2.26 per copy; 2% 
(7) minutes, k 
dd) ctor [LOCATION]. 1942; 2.2¢ per copy; 2(7) min- 
utes. Y 


1942; $1.50 per 100; 
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Yale Educational Aptitude Test Battery, Grades 
9-16; 1946-47; 7 scores: verbal comprehension, arti- 
ficial language, verbal reasoning, quantitative reason- 
ing, mathematical aptitude, spatial relations, mechani- 
cal ingenuity; IBM ; 7 tests in 2 booklets (tests 1, 2 
5, 6 in Single Booklet Edition—’46: tests 3, 4, 7 i 
Single Booklet Edition, Form B—'47) ; $1.75 per мый 
plete set of testing materials; $2.50 per complete set 0 
testing materials including the scoring and reporting 
of individual results; separate answer sheets must be 
used; 25€ per set of IBM answer sheets; postage €x- 
tra; (45) minutes per test if practice booklets are ad- 
ministered previously, otherwise (бо) minutes per 
test; Albert B. Crawford and Paul S. Burnham; dis- 
tributed by Educational Records Bureau. * 


REFERENCES 


1. HERRMANN, WILLIAM CHamLes. A Follow-Up Study and 
Evaluation of a Pre-College Program of Educational Gut ы 
at an Engineering College. Master's t esis, Clark Unive pp. 
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New Haven, Conn.: Yale University Press, 1946. UA 1 
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[719] 

XA.C.E.R. Speed and Accuracy Tests. Ages 13.5 
and over ; 1942; includes a “seldom used” subtest, Clas- 
sification, which was not used in obtaining norms; 1 
form; no data on reliability and validity; 3s. per 10; 
separate answer sheets must be used; rs. 4d. per 10 
answer sheets; 3d. per scoring stencil; 6d. per man- 
wal; 15. 3d. per specimen set; cash orders postpaid 
within Australia; 12(20) minutes; Australian Council 
for Educational Research. * 


D. W. McErwaiw, Senior Lecturer in Psy- 
chology, University of Melbourne, Victoria, 
Australia. 

This booklet contains three short tests cover- 
ing name checking, number checking, and classi- 
fication. Responses are made on a separate an- 
swer sheet. The three tests are separately timed. 

The tests were validated against the assess- 
ments of training of female students in a busi- 
ness college. The item types selected were found 
to be the best available. Subsequently it was 
found that the classification test was in fact only 
a short intelligence test, and that when a good 
intelligence test was already present in the bat- 
tery, this test added nothing. As a result, the 
classification test is now seldom used. The short 
form, including only name checking and number 
checking, is thus almost identical with the Min- 
nesota Clerical Test. 

Studies carried out by the Australian Council 
for Educational Research and by the Test Con- 
struction Section of the Australian Army Psy- 
chology Service indicate that the test has a high 
validity for a wide variety of clerical type tasks 
and that it adds appreciably to the prediction 
given by intelligence tests for academic type 
learning situations. 

An interesting point shown in the validity 
trials made by the A.C.E.R. when making the 
test, was that the use of the separate answer sheet 
Seemed of itself to add to the validity of this test 
for clerical tasks, 
ise simple quasi-perceptual tests seem 
; ely to be a source of much psychometric in- 
vestigation in the future. Their performance 
apparently taps some orectic component of per- 
dur that is important in a variety of socially 
pane forms of institutionalized behaviour, 
“uch as "desire to do well at examinations.” 
VE tests have been found generally useful; 
uk er, we may speculate about the compo- 

15 for their performance. They were used by 


all the Australian Armed Services during the 
war and have been used in the basic battery of 


the Commonwealth Vocational Guidance Serv- 
ice. А 


[720] 
*Alphabetizing Test. Clerical applicants and high 
School; 1947; Forms A, B; manual ['47]; $2 per 25, 
postage extra; 50f per specimen set, postpaid; 6(9) 
minutes; David J. Chesler; Personnel Research Insti- 
tute, Western Reserve University. * 


[721] 
*Filing Test. Clerical applicants and high school; 
1947; Forms A, B; manual ['47]; $2 per 25, postage 
extra; 50€ per specimen set, postpaid; 10(13) minutes ; 
Jay L. Otis and David J. Chesler; Personnel Research 
Institute, Western Reserve University. * 


[722] 
*General Office Clerical Test (Including Fil- 
ing): National Business Entrance Tests. Grades 
12-16 and clerical applicants; 1948-52; replaces Filing 
Test (see 3:379) ; this series was entitled National 
Clerical Ability Tests from 1938-46 and United- 


МОМА Business Entrance Tests in 1947; for complete 


battery, see 453; Form 1393 (’49); no data on relia- 
bility; general series folder ('50); series manual 
(748) ; norms ('52) ; $5 per 25; $1 per test when scor- 
ing service and certificates for acceptable work are 
desired (tests administered only at National Business 
Entrance Test Centers which may be established in 
any community) ; 50€ per sample test; $2 per specimen 
set (includes all 6 tests in the series); postpaid; 120 
(130) minutes; prepared by the Joint Committee on 
Tests representing the United Business Education As- 
sociation and the National Office Management Associ- 
ation; Joint Committee on Tests. * 


REFERENCES 
Same as for 453. ў 


[723] 
X*Group Test 20 [Checking]: National Institute 
of Industrial Psychology. Ages 15 and over; 1936; 
2 scores: speed, accuracy; 1 form ['36]; no data on 
reliability and validity and no description of normative 
population in manual; data available on request; no 
norms for part scores; mimeographed manual [36] 
4s. 6d. per 12; Is. per single сору; od. per manua 
postage extra; 12(17) minutes; National Institute of 
Industrial Psychology. * 
REFERENCES 


. Kerr, mcr, "Aptitude Testing for Secondary Courses: 
An ium чє АНД nder War-Time Difficulties," Occupa- 


: E gue 
tional Psychol 16:73 8 “Tests of Technical Apti- 


Ap ‘42. 
МОЛЕ Ие ЕНИ Peychol 16:175-83 O a, 
E. G. Снлмвевѕ, Assistant Director of Re- 
search in. Industrial Psychology, University of 
Cambridge, Cambridge, England. А 
Group Test 20 is an adaptation of the Minne- 
sota Clerical Test, British names being substi- 
tuted for American names in the second part. 
The test is very simple and consists of number 
checking and name checking at speed. Separate 
scores for speed and accuracy are obtained. 
Norms for these are provided in the form of per- 
centiles based on an unspecified number of sec- 
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ondary school boys in the two age groups: 15 
and 16 plus. 

The publishers state : 

This clerical test is not intended to give an indica- 
tion of general suitability for work of an office kind. 
Its purpose is to provide a basis for an estimate of an 
individual’s present speed and accuracy in straightfor- 
ward clerical tasks. Any score obtained in it should 
always be considered in the light of the individual's 
intelligence test result, and in the light of his (or her) 
scholastic record, before a recommendation to office 
work is made. It should be recognized that speed and 
accuracy in operations of the types included in the test 
can to some extent be cultivated. Any score above the 
70th percentile may be regarded as a favourable indica- 
tion for an occupation of a clerical type; scores below 
this level should not necessarily be regarded as contra- 
indicated. 

No reliability coefficient is provided nor is 
there any evidence that the test has been vali- 
dated. In the absence of such evidence and in 
view of the statement quoted, it appears to the 
reviewer that this test would not be very valu- 
able by itself. The dual speed-accuracy method 
of scoring is a disadvantage, for in many cleri- 
cal tasks accuracy alone is important. Given time, 
anyone above the rating of a moron should be 
able to do this test with perfect accuracy. 


[ 724] 

*Group Test 25 (Clerical): National Institute 
of Industrial Psychology. Ages 14 and over; 1925- 
44; 7 scores: checking, classification, arithmetic, copy- 
ing, filing, problems, oral instructions; 1 form, '38; no 
data on reliability and validity in manual; data avail- 
able on request; mimeographed manual (’44) ; 4s. 6d. 
per 12; Is. per single copy; 1s. 6d. per manual ; postage 
extra; (35) minutes; National Institute of Industrial 
Psychology. * 


REFERENCES 


1. VERNON, PHiLIP E., AND Parry, Јонм B. Personnel Selec- 
tion in the British Forces. London: University of London Press 
Ltd., 1949. Pp. 324. * (РА 24:776) 


E. G. Cuampers, Assistant Director of Re- 
search in Industrial Psychology, University of 
Cambridge, Cambridge, England. 

This test comprises seven short subtests en- 
titled Checking, Classification, Arithmetic, 
Copying, Filing, Problems, and Oral Instruc- 
tions. Normally only the first five subtests are 
given. The Problems test is regarded as a quick 
intelligence test which may be given in the ab- 
sence of a separate intelligence test, and the Oral 
Instructions test is given only when immediate 
memory for verbal instructions is particularly 
important, as in the case of telephone operators 
or office messengers. 

Five-grade norms for tests 1-5, based on 
4,509 vocational cases from secondary grammar 
schools, are given for the three age groups 1 5, 
16, and 17, as are similar norms for secondary 
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modern school pupils aged 14 based on results 
of 2,168 pupils. In addition, 5-grade norm; 
based on 582 industrial applicants for clerical 
employment, are given for the three age groups 
15, 16, and 17 for tests 1—5, 1-6, and 1-7 sepa- 
rately. 

No form of reliability coefficient is Provided 
by the publishers, and there is no evidence given 
that the test has been validated. However, Ver- 
non and Parry (r) state that while the test was 
found to be useful in selection work, a test re- 
sembling it, known as the Clerical and Instruc- 
tions Test, with a reliability of .92, was probably 
superior to it. 

It seems very probable that Group Test 2; 
measures elements involved in certain types of 
clerical work. Whether or not the short subtests 
(mostly 2% minutes each) measure these ade- 
quately is another matter and one which cannot 
be decided on available evidence. The method of 
scoring, in which one mistake counterbalances 
two and sometimes three correct responses, is 
rather arbitrary. However, among tests of this 
sort, where both speed and accuracy are regarded 
as essentials, the test should fulfil its function 
reasonably adequately. 
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*[Hay Tests for Clerical Aptitude.] Applicants 
for clerical positions ; 1941-50; 4 tests; manuals (80; 
$1.50 per battery manual Tests for Clerical Aptitude: 
A Handbook ('50—ѕее 4 below) ; $2 per specimen sai 
postpaid; 13(28) minutes; Edward N. Нау; Aptitude 
Test Service. * i 
a) HAY TEST I: THE WARM UP. 1945-50; à prac 
exercise to precede administration of battery ; Form i 
('50—a revision of the 1945 form) ; no manual; $I 
per 25; 1(3) minutes, А 
b) HAY NUMBER PERCEPTION TEST. 1947-50; Forms 
(47), В (’50—same as test copyrighted in 1947); 
per 25; 4(7) minutes. 50; 
C) HAY NUMBER SERIES COMPLETION TEST. ДУ, 
Forms A ('41), В ('so—same as test copyright i 
1949); $1.25 per 25; Form А: 3(6) minutes; Form 
4(7) minutes. 
d) HAY NAME FINDING TEST. 1941-50; Form 
C (’50-—same as test co yrighted in 1949); admin- 
exercise, entitled Form (49) must precede ale 
istration of test; $2 per 25; Form A: 3(10 
Form C: 4(11) minutes. 

REFERENCES shine Dok 


1. H. Б 
keeping." J Appl Psychol 27:483-93 D ’43. * (РА 821877) 
2. 


з. Hay, Ер . 
Tests.” J Appl Psychol 34:153-8 Је '5о, * (PA 25:3991) Hont 
N. 38 


26. Paper. * chol 
s. Hav, Ерулар N. "A Warm-Up Test" Personnel P. if t 
3:221-3 su 'so. * (PA E 4 i 
6. Miner. уез B Ae educing the Time Rer igo. j 
Testing Clerical Applicants.” Personnel J 28:364 7 
Om Еа да, Anuixe. "Reducing Typing Costs With AM 
Д EMORE, ARLINE. ucin; А 
tude Tests." Personnel J * (PA él 7746 Sele" 
‚8. Hay, Epwarp N. “< 
tion.” J Appl Psychol 35:250-1 Ag ’51. 
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Fe ee See ee 


Кесх Н. BITTNER, Assistant Director of Per- 
sonnel Research, The Prudential Insurance 
Company of America, N ewark, New Jersey. 

This series of tests is presented for use in 
selecting good clerical workers. In designing the 
test battery, the author apparently assumed that 
clerical aptitude is a composite trait, the com- 
ponents of which are measured by the tests in 
the battery, and that this composite trait is in- 
volved in success in all kinds of clerical work. 

The first test in the battery, The Warm Up, 
is unusual, an easy one-minute test designed to 
help the applicant adjust to the testing situation. 
This may be a useful testing device, although no 
evidence is presented on its effectiveness. The 
Number Perception Test, the second test, con- 
tains common business errors and is similar to 
the number comparison part of the Minnesota 
Clerical Test although it is shorter and its items 
are less difficult. The Number Series Comple- 
tion Test consists of standard items of the num- 
ber series type. The Name Finding Test in- 
volves the finding of key names in a list, a variant 
form of name comparison. 

Reliabilities of the tests are adequate with the 
possible exception of Number Comparison. Co- 
efficients are .86 for Number Perception and .94 
for each of the other two tests. 

The validity of the test battery as a measure 
of general clerical aptitude cannot be assessed 
from the data presented. Studies are cited which 
indicate that certain tests in the battery predict 
with reasonable accuracy success on five jobs: 
beginning typing, machine bookkeeping, key 
Punching, ticket press operation, and ticket in- 
spection. This is a very limited sample of clerical 
jobs and seemingly atypical in that all but the 
last involve certain special skills. These studies 
do not indicate the validity of the battery as 
recommended for use—“passing” all three tests 
—Since the validity of only single tests is given. 
Ns и аге all designed for ease of adminis- 
nities "8 scoring, an obj ective which has been 
aidin ed, The manual is particularly good for 
REE administration of the tests; as a guide for 
m pretation, it appears less valuable. A “pass- 
E esu is given for each test, and it is recom- 
win 5 that employment be limited to persons 
quie pass" all three tests. Presumably this re- 
Pis HH is recommended for applicants for all 
tu UN kai un work. It would seem that the 
e x the battery might be combined in differ- 
dé to predict success in different types of 

Work, but no recognition of this possi- 


bility is given in the manual. The validation data 
presented, however, would seem to point to the 
use of various combinations of the tests for differ- 
ential selection and placement rather than to a 
single combination of the three tests for employ- 
ment for all clerical jobs. 

Norms for the tests are presented in terms of 
the three quartiles for a number of separate 
groups. Generally these groups are defined only 
as male or female "clerical applicants." Other 
data defining the characteristics and aiding in 
the interpretation of these samples are not given. 
It seems likely, too, that norms presented in 
terms of these statistics will have little meaning 
to the lay user of the tests. Presumably, it is in- 
tended that the recommended “passing” scores 
be used as the frame of reference in interpreting 
the tests. 

The battery of tests has certain characteristics 
which recommend it: it has been devised with 
adequate care for the technical aspects of test 
construction, testing time has been cut to a mini- 
mum, and ease of administration and scoring is 
assured. Its content emphasizes speed and per- 
ceptual factors which are generally found to be 
involved in clerical success. It would seem to 
have a good chance of proving to be a valid pre- 
dictor of success in many clerical jobs, particu- 
larly if it were used along with a test of gen- 
eral intellectual ability. 


Epwanp E. Cureton, Professor of Psychology, 
The University of Tennessee, Knoxville, Ten- 
nessee. 

This battery consists of Test 1 (a warm-up 
test), a Number Perception Test, a Number 
Series Completion Test, and a Name Finding 
Test preceded by a special practice exercise. The 
tests are printed as separate sheets or booklets. 
Each of the tests except Test 1 has its own short 
manual, and there is also a 26-page handbook 
entitled Tests for Clerical Aptitude which con- 
tains most of the material in the separate man- 
uals along with some additional data and sug- 
gestions. : 

Test 1 is similar to the Wonderlic Personnel 
Test, but with shortened and simplified direc- 
tions. It contains 20 four-choice items and is 
administered with a one-minute time limit. Its 
use as a preliminary exercise should help set at 
ease examinees who are not test-wise, especially 
when it is followed by a group intelligence test 
such as the Wonderlic ( 5). ; ' 

The Number Perception Test is essentially a 


to 
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revision of the Number Comparison sübtest of 
the Minnesota Clerical Test. The number pairs 
range in length from three to only six digits, the 
errors consist entirely of transpositions and sub- 
stitutions, and all numbers are aligned in col- 
umns at the right. There are 200 items, and the 
time limit is four minutes. Though the criteria 
for revision appear sound, it will be difficult to 
demonstrate statistically that the test is any sub- 
stantial improvement over the Number Compari- 
son subtest of the Minnesota Clerical Test. 

The first form of the Number Series Comple- 
tion Test (Form A), was a reprint of a subtest 
of the Army Alpha Examination: First Ne- 
braska Edition, with 20 items and a three-minute 
time limit. The later forms contain essentially 
similar items, but there are 30 items in each and 
the time limit is four minutes. Factor analysis 
results show that number series completion tests 
are typically found to be among the best tests 
of inductive reasoning ability. 

The Name Finding Test consists of an alpha- 
betical list of full names written in the normal 
‘order (John P. Ahern, e.g.) on one side of a 
sheet, and on the other side a list four times as 
long, including the names of the first list, but 
with each name consisting only of surname and 
initials (Ahern, J. P., eg.). The examinee is to 
check in the second list those names which are 
also in the first list. A special Practice Exercise 
for the Name Finding Test (Form B), has 5 
words in the first list and 20 in the second, with 
no time limit. Form A has 25 words in the first 
list, 100 in the second, and a three-minute time 
limit, Form C has 32 words in the first list, 128 
in the second, and a four-minute time limit. 
From its correlations with other tests (г), it 
would appear that the Name Finding Test meas- 
ures verbal ability and, to a somewhat lesser 
extent, reasoning and perceptual speed. 

The tests of the original battery—Minnesota 
Number Comparison, Number Series Comple- 
tion Form A, and Name Finding Form A— 
were selected from a trial battery of 25 tests ad- 


“ministered to 39 bookkeeping machine opera- 


tors for whom reliable production records were 
obtained as a criterion (т). Since there was a 
high degree of selection from among the trial 
tests, and the N of 39 is not large, the multiple 
correlation of .70 computed from the same sam- 
ple is spuriously high. The battery was adminis- 
tered later to 82 key-punch operators (3), and 
gave a multiple correlation of .38 with a rating 


criterion. Addition to this battery of 
Self-Administering Test of Mental 
minute time limit) in the first study 
W onderlic Personnel Test in a follo 
second study (8), failed in each case to 
its predictive power to any significant d 
The reports on reliability and validit 
manuals, the handbook, and the published 
as well, contain serious statistical errors 
interpretations. Parallel form reliabili 
most useful kind) are reported as tes 
liabilities for the Number Perception Т\ 
even reliabilities are reported for the 
Series Completion Test and the Name: 
Test despite the fact that they are both init 
speed tests. The handbook cites a study showing 
that the same percentages of two groups lie above 
—.6 SD on the Number Perception Test and the 
Minnesota Clerical Test as evidence (along with 
correlations of .83 and .85 between the two! t 
that the four-minute Number Perception Test 
is essentially as good as the 15-minute Mime — 
sota Clerical Test for screening clerical appli- 
cants. It is argued (3) that multiple cutting 
scores are superior to multiple regression scores 
because a higher percentage of a “good” cri- 
terion group is selected by the multiple: 
score technique than by the multiple regression 
technique in the sample used to set the multiple 
cutting scores. үп 
Normative data cover a variety of groups but 
are reported only in terms of medians and quat 
tiles. "Passing scores" are given for the three 
tests; these are apparently derived mainly from 
the multiple cutting score study with 82 key- | 
punch operators. Equivalent Number Реге | 
tion scores and Minnesota Number Comparison _ 
Scores, based on 2,951 and 2,658 cases respeti 
tively (different sets of clerical applicants to А 
same company: 1947-49 and 1944-47), ate 
given for every fifth percentile. j 
In summary, the three tests of this battery ap 
pear to be excellent four-minute tests cover! 
some of the essential aspects of ability to ре 
form routine (as distinct from highly verb Ao | 
highly numerical) clerical operations. E 
companying statistical data do not provide MW 
evidence concerning the reliabilities of two of the 
tests. The data on the validities of these t | 
reported so far are insufficient to support clai p" 
of general validity for clerical selection which arè 
made in the handbook. The normative materi? 
supplied are of little value, 1 
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xMartin Office Aptitude Tests. Clerical employees 
and applicants for clerical positions ; 1947-52; 10 tests ; 
manual, fourth edition ('52) ; $1.50 per set of scoring 
‘stencils; 50€ per scoring stencil for any one test; $1 
per manual ; 10-49 profile charts ('52), 10¢ each; 50- 
00, 9¢ each; 100 or more, 8¢ each; postage extra; 
Howard G. Martin; Martin Publishing Co. * 

a) MARTIN NUMBER CHECKING TEST. 1047-52; Form 
A (747); 10-49 test-answer sheets, rof each; 50-99, 
of each; 100 or more, 8¢ each; 6(10) minutes. 

b) MARTIN NAME CHECKING TEST. 1947-52; Form A 
(47); 10-49 test-answer sheets, 10¢ each; 50-99, 9€ 
each; 100 or more, 8¢ each; 5(10) minutes. Ж 
с) MARTIN NUMBER FACILITY TEST. 1947-52; Form A 
(47) ; 10-49 test-answer sheets, 10€ each; 50-99, 9f 
each; тоо or more, 8¢ each; 8(13) minutes. 

d) MARTIN NUMERICAL OPERATIONS TEST, 1951-52; an 
"alternate" to Martin Number Facility Test. Form A 
(51); 10-49 test-answer sheets, rof each; 50-99, 9¢ 
each; 100 or more, 8¢ each; 8(13) minutes. 

€) MARTIN VOCABULARY TEST. 1947-52; Forms A 
(47), В ('51) ; 10-49 test-answer sheets, 10€ each; 
50-99, 9¢ each; 100 or more, 8€ each; nontimed (10) 
minutes. 

f) MARTIN ARITHMETIC REASONING TEST. 1047—52; 1 
form, '50; 10-49 test-answer sheets, 10¢ each; 50-99, 
of each; 100 or more, 8¢ each; nontimed (20) min- 
utes, 

g) MARTIN ALPHABETIZING TEST, 1951—52; Form A 
(’51) ; 10-49 test-answer sheets, 10¢ each; 50-99, 9¢ 
each; 100 or more, 8¢ each; 6(10) minutes. 

h) MARTIN NUMERICAL ORDER TEST. 1951-52; Form A 
(51); 10-49 test-answer sheets, 10¢ each; 50-99, 9¢ 
each; 100 or more, 8¢ each; 6(10) minutes. 

i) MARTIN STENOGRAPHIC TEST. 1949-52; Forms 60- 
70 А [49], 80-A ['49], 90-A ['49], 100-А ['49] ; $2.50 
per test; 9(15) minutes. 

7) MARTIN TYPING TEST. 1949-52; Form 10-A ['49]; 
à practice exercise must precede administration of test; 
$2.50 per test; 10(20) minutes. 


D. Wetty Lerever, Professor of Education, 
University of Southern California, Los Angeles, 
California. 
‚ DESCRIPTION OF THE TESTS. The basic series 
includes the following measures : Number Check- 
ing, Name Checking, Number Facility, Vocabu- 
lary, and Arithmetic Reasoning. The first two 
named are very similar to the Minnesota Clerical 
Tests. Number Facility is a simple arithmetic 
Computation test including the four basic opera- 
tions for whole numbers only. The subject indi- 
cates whether the answer given is right or wrong. 
he Vocabulary test presents a choice of five 
Words, one of which matches in meaning a group 
9t words, 
ре basic series is supplemented by two pro- 
x tests: Typing, based on the customary 
viue and Steno, requiring the type- 
Ше transcription of four business letters dic- 
ua at 60-70, 80, 9o, and 100 words per min- 
€spectively, 


Three measures of aptitude were added in 
1951 : Alphabetizing, Numerical Order, and Nu- 
merical Operations. The latter presents a series 
of arithmetic operations eating -single digit ` 
numbers. 4 Nec 
STRONG POINTS. (а) The aptitude tests are 
designed in a convenient form for administration 
and scoring. Each test is a single IBM answer 
sheet so that the items appear directly on the 
scoring blank. (b) The general appearance and 
design is clear and attractive. (c) The scoring 
can be effected quickly and easily with overlay 
stencils if mechanical scoring is not available. 
(d) The name and number comparison tests 
should have considerable predictive power for 
selecting clerical employees.. Vocabulary recog- 
nition and arithmetic reasoning have long been 
used as measures of general intelligence. (е) 
The typing work-samples are clearly set up for 
easy copying and quick scoring. (f) The pacing 
of the dictation in the stenographic test appears 
practicable. NOV Mi, 

WEAK POINTS. (a) The battery offers little. 
that is strictly new or original. No definite indi- 
cation is given of fresh research evidence on va- 
lidity. (b) In the number tests (except for 
Arithmetic Reasoning), the possible responses 
are two: the answer given is right or wrong. 
This restricted choice unnecessarily increases the 
hazard of guessing. (c) The directions to the 
test administrator and to the subject occasion- 
ally reveal a careless use of language. Examples 
are: *be sure and make a heavy mark," and "in 
order to determine if." (d) The basis for select- 
ing they ocabulary words is not explained. Since 
the aptitude series was designed for use in busi- 
ness establishments, a considerably greater face 
validity could have been alleged by choosing 
words related to office work. (е) The most 
serious fault, in the judgment of this reviewer, is 
the dearth of information regarding reliability 
and validity. The argument that some test users 
may lack sufficient interest or training to read 
a more complete exposition of the basic research 
evidence does not excuse the author and pub- 
lisher for offering the product to eo por 
chasers in the form of a “pig in a poke 2” (f) The 
manual is extremely brief. Fragments of state- 
ments from the research literature on aptitude 
testing in the clerical field are offered. without 
documentation. Vague and incomplete data on 
validity are suggested. Such norms as "minimum 
aptitude requirements" are arbitrarily defined 
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for several types of office work without in- 
dicatiing the operational basis. 

SUMMARY. These tests appear to possess some 
definite predictive value as judged from their 
resemblance to more completely standardized 
and validated instruments. Potential users, how- 
ever, should demand more complete informa- 
tion before purchasing. 


Ross W. Млттеѕом, Counselor, Michigan State 
College, East Lansing, Michigan. 

This is a series of relatively short tests of those 
abilities required for successful performance of 
various types of office work. The battery was 
designed for use both as an instrument for 
screening job applicants and as a guide in as- 
signing duties to employees. As indicated in the 
manual of instructions, such tests measure only 
the ability, not the “will” to do the work. 

The packaged testing program, all or part of 
which may be given to a particular applicant, 
includes tests of alphabetizing, numerical order, 
number checking, name checking, number fa- 
cility, vocabulary, and arithmetic reasoning. 
There are also tests of stenographic and typing 
skills. Individual record charts, scoring stencils, 
and a manual of instructions are included. 

Of these tests, Alphabetizing is designed to 
measure speed and accuracy in arranging names 
as for filing. Numerical Order presents a similar 
task in noting, for each of 150 pairs of numbers, 
which is the lower number. Number Checking 
and Name Checking involve the familiar task 
of looking at pairs of numbers or of names and 
indicating whether they are the same or differ- 
ent. These skills are considered basic for general 
clerical duties requiring speed and accuracy. 
Number Facility has тоо problems—25 each 
in addition, subtraction, multiplication, and di- 
vision—with answers provided. Instructions are 
to indicate, by marking the appropriate space, 
whether the answers are right or wrong, No 
fractions or decimals are included. Vocabulary 
consists of 50 vocabulary problems in which the 
correct or best word is to be chosen to complete 
each statement. This test should aid in discover- 
ing capacity for assuming more responsible types 
of work. In Arithmetic Reasoning, 12 “story” 
problems are to be worked out as a measure of 
the applicant’s ability to think logically in the 
quantitative area. 

In addition to the above aptitude tests, Steno 
and Typing tests are provided for checking the 
applicant's basic speed:in shorthand and typing. 
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The four letters in Steno are to be read at speeds 
of 60-70, 80, 90, and 100 words per minute re- 
spectively. Although each letter is marked in 1 b 
second intervals, some practice in dictating would 
appear to be necessary for the test administrator, 

Administration of the basic aptitude tests is 
relatively simple. Time limits for the alphabetiz- 
ing, checking, and other speed tests range from 
5 minutes to 8 minutes and must be rigidly ob- 
served. The vocabulary and arithmetic reason- 
ing tests have no time limits. 

Scoring the tests is likewise a relatively simple 
procedure. In the speed tests, wrongs are sub- 
tracted from rights in order to penalize for er- 
rors. The raw scores may readily be converted 
into a profile of comparable percentiles. As an 
aid to effective interpretation of the test scores, 
the manual of instructions gives five standard 
patterns of minimum requirements for typical 
office duties. An applicant’s profile can be com- 
pared with the pattern for duties involving work 
with figures, for general clerical duties, for file 
clerks, for office machine operators, or for ste- 
nographer clerks. 

Test-retest reliability of the Martin O fice Ap- 
titude Tests is claimed to be .90 and above. Satis- 
factory validity is claimed for the tests in con- 
nection with their effectiveness in predicting 
success on the job. Since the manual has been 
written primarily for the office manager or per- 
sonnel director using the tests in a practical situ- 
ation, it includes a minimum of statistical in- 
formation. : 

In addition to the packaged office aptitude 
testing program, a numerical operations test 15 
available. In this test, a series of numerical op- 
erations are to be performed in the order ™ 
which they occur in each problem. Here it would 
seem that the individual who might attempt to 
apply the basic rules presented for such on 
tions in arithmetic and algebra textbooks woul 
be at a complete loss. However, as indicated, the 
current edition of the tests does not include this 
particular instrument in the packaged battery: 

In general, the Martin Office Aptitude Tests 
should prove of practical value in connection 
with the screening of job applicants and r 
assigning of duties to employees. Tests of skills 
as well as tests of aptitudes are provided. As 016 
step in a systematic employment procedure, these 
tests offer the employer a readily available an 
easily administered means of obtaining evidence 
regarding applicants’ qualifications for various 
types of office work. 
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[727] 
Name Comparison Test for Clerical and Indus- 
trial Inspection Operations. Clerical applicants and 
high school; 1945-40; Form A ('45); manual ['46]; 
$2 per 25, postage extra; 50¢ per specimen set, post- 
paid; 7(10) minutes; Jay L. Otis and Louise W. Gar- 
man; Personnel Research Institute, Western Reserve 
University. * 

[ 728] 
xNumber Comparison Test for Clerical and In- 
dustrial Inspection Operations. Clerical applicants 
and high school; 1945-46; Form A (45); manual 
[46]; $2 per 25, postage extra; 50¢ per specimen set, 
postpaid; 9(12) minutes; Jay L. Otis and Louise W. 
Garman; Personnel Research Institute, Western Re- 
serve University. * 


[729] 
*[Personnel Research Institute Clerical Bat- 
tery.] Applicants for clerical positions; 1945-47; 8 
tests (also listed separately) ; & per 25 of any one 
test, postage extra; soc per specimen set, postpaid; 
Personnel Research Institute, Western Reserve Uni- 
versity. * 
d) CLASSIFICATION TEST FOR INDUSTRIAL AND OFFICE 
PERSONNEL, 1043-40; Forms А (’47—a revision of the 
1943 form), B (47); revised manual ['49] ; norms 
(49) ; 15(20) minutes; Jay L. Otis, Evelyn Katz (A), 
Robert W. Henderson (A), Mary Aiken (A), David 
J. Chesler (B), and Gardner E. Lindzey (B). * 
b) NUMBER COMPARISON TEST FOR CLERICAL AND IN- 
DUSTRIAL INSPECTION OPERATIONS. 1945-46; Form А 
(45) ; manual ['46] ; 9(12) minutes; Jay L. Otis and 
Louise W. Garman. 
с) NAME COMPARISON TEST FOR CLERICAL AND INDUS- 
TRIAL INSPECTION OPERATIONS. 1945-46; Form 
(45) ; manual [’46]; 7(10) minutes; Jay L. Otis and 
Louise W. Garman. 
d) TABULATION TEST. 1947; Forms A, В; manual 
[47 ; 10(13) minutes; Jay L. Otis and David J. Ches- 


ё) FILING TEST. 1947; Forms A, B; manual ['47]; 10 
(13) minutes; Jay L. Otis and David J. Chesler. 
) ALPHABETIZING TEST. 1047; Forms A, B; manual 
[47]; 6(9) minutes; David J. Chesler. 
9) ARITHMETIC REASONING TEST. 1948; Forms A, B; 
Tent 48]; 10(13) minutes; Jay L. Otis and.David 
* er. 
y^ SPELLING TEST FOR CLERICAL WORKERS. 1947; Forms 
^ B; manual [’47]; nontimed (15-20) minutes; Jay 
^ Otis, David J. Chesler, and Irene Salmi. 
Louise WrrwR Cureton, 1814 Prospect Place, 
Knoxville, Tennessee. 
These tests are intended to measure most of 
ES important aspects of office and clerical work. 
he Classification Test may have broader usage 
" a brief general intelligence test; and it, com- 
ined with the Name and the Number Compari- 
Son tests, may also be appropriate for certam 
industrial jobs. The tests have been prepared as 
Separate four-page booklets. The type is clear 
ae the format is attractive except in the case 
the Classification Test, whose 100 items have 
en crowded in order to fit on three pages. Ade- 
bae space is provided for figuring in the Arith- 
etic Reasoning and the Tabulation Test book- 


lets, All tests are answered on the booklets and 
are hand scored. Instructions to the examinees 
(on the face sheets of the booklets) are concise 
and clear except for the one ambiguous phrase, 
“There is no penalty for guessing,” and for the 
omission of time limits for four of the eight tests. 

The Classification Test consists of 100 multi- 
ple choice items (40 general information, 20 
vocabulary, 20 arithmetic, and 20 analogy) in 
spiral omnibus form and resembles the familiar 
Otis Self-Administering Tests of Mental Ability 
without the space items. The manual states that 
it is a test of mental ability for the ordinary shop 
or office worker and a “screening test for sifting 
out those very superior individuals who should 
be tested further,” and that an attempt has been 
made to “include items of approximately uni- 
form difficulty....and to make the difficulty level 
fairly low.” However, the vocabulary items 
range in frequency of occurrence * from over 49 
per million running words to less than one, and 
the arithmetic from “How much is 41¢ and 13¢” 
to “Which of the following numbers is the 
smallest? (A) 1% (B) 01% (C) .o1 (D) 
1/100." Also the items are not arranged in order 
of difficulty. This would be a much better test 
of general mental ability if subjected to careful 
item analysis. 

The Spelling Test is intended to measure 
spelling proficiency and is the result of two ex- 
perimental forms and revisions. The examinee 
writes (instructions to print would be prefer- 
able) the correct spelling of 73 commonly mis- 
spelled business words which are each identified 
in the booklet by a simple phonetic spelling and 
a definition. This method avoids oral presenta- 
tion by the examiner, and an examinee of aver- 
age intelligence should have no difficulty in 
recognizing the intended test words. 

The Name and the Number Comparison tests 
each have 200 items closely resembling the two 
parts of the Minnesota Clerical Test except that 
the duplicate items are placed on facing pages 
instead of in adjoining columns. This variation 
is probably more like the typical job task; also, 
it may introduce a slight memory requirement. 
Two scoring measures are recommended, num- 
ber right for speed and number wrong for ac- 
curacy; and reliability, validity, and norms are 
reported for each on small samples. More data 
are needed before this refinement is justified. 


i „ and Lorge, Irving. The Teacher's 

wisi look of POP iri. New уйт Bureau of GE 

tions, Teachers College, Columbia University, 1944. Pp. i 
" 
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The Alphabetizing Test consists of 45 sets of 
5 names to be ordered. The Filing Test consists 
of 150 names to be matched with file folders 
(pictured) labeled Aa-Am, An-Az, etc. 

The Tabulation Test consists of 15 short lists 
of persons or things to be classified and counted 
or multiplied, subtracted, etc., as instructed, for 
example, “Orders received on the first day of 
July were: 27 bbls., 16 boxes, 12 bales, 22 bags. 
Obtain the weekly sales estimate by multiplying 
the above figures by 6. (Ans.) Bbls.—— Boxes 
——— Bales—— Bags——.” This is an interest- 
ing but complicated item type which has face 
validity for certain jobs. Without data it is im- 
possible to judge its actual advantage over sim- 
pler items which can be used in larger quantities 
and hence with greater reliability. 

The Arithmetic Reasoning Test contains 20 
multiple choice verbal-problem items covering 
whole numbers, percentages, and decimals. The 
items vary widely in difficulty and time required 
and are not arranged in order of difficulty. This 
is a serious drawback for a sharply timed test. 

The manuals (one for each test) attempt to 
give the reader all necessary information for 
using the tests. The presentations regarding the 
purposes of the tests, description of items, and di- 
rections for administration and for preparation 
and use of scoring keys are entirely adequate. 
However, several serious inaccuracies and con- 
fusions occur in the data. Reliability coefficients 
range from .89 to .99, but most of these are 
spuriously high because they are obtained by 
the odd-even method on obviously speeded tests. 
The test-retest method was used only for the 
Name and the Number Comparison tests and 
the reported coefficients of .9o and :95 respec- 
tively are higher than usual for this type of test ; 
there is no information as to the amount of time 
intervening between the two administrations. 
For some tests "equivalence of forms" is re- 
ported instead of, or in addition to, reliability. 
These correlations between the two forms of 
the tests (.78 to .96) are also spuriously high 
because the two forms consist of items from 
matched pairs rather than random selections of 
items from the same item universe (Arithmetic 
Reasoning Test, Form A : “At 5 cents each what 
will 2 dozen pencils cost?"; Form B: "At 8 
cents each, what will 3 dozen inkwells cost PY 
Determination of the true reliability of these tests 
would be difficult, but the correlation between 
forms might be taken as the upper limit of re- 
liability. Alternate forms are needed in the prac- 
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tical situation primarily so that a test may be 
readministered in cases where results on the 
first form are invalidated for some reason such 
as illness of the examinee, errors of administra. 
tion, etc., but with such highly similar forms as 
these the scores on the second form might be 
spuriously high unless long time intervals elapse 
between administrations. Visual similarity be- 
tween forms does not guarantee similar levels of 
difficulty, but similar difficulty is implied by the 
fact that the norms tables, showing decile equiva- 
lents for raw scores, apply to both forms for all 
tests having two forms. The differences in mean 
difficulty between forms are reported as .81 and 
:05 raw score points for the Spelling and Arith- 
metic Reasoning tests respectively. “Practice 
effects” are reported as —.04, 2.3, and 1.15 raw 
score points for the Spelling, Alphabetizing, and 
Arithmetic Reasoning tests respectively. Meth- 
ods of determining these two sets of data are not 
stated, and their meaning is not clear. 

Validity coefficients are reported for every 
test but Arithmetic Reasoning. However, the 
criteria are merely high school grades or other 
tests for all but three tests. For the Classifica- 
tion, Tabulation, and Alphabetizing tests, job 
performance criteria are used (r= .21 to 49, 
:25 to .30, .29 to .37, respectively). Many more 
studies must be made before any useful state- 
ment can be made as to the validity of these tests 
in the industrial or business situation. The Clas- 
sification Test correlates substantially with the 
California Test of Mental Maturity, American 
Council on Education Psychological Examina- 
tion, and Otis Self-Administering Test j 
Mental Ability (r's from .62 to .83) for severa 
samples of examinees. 

Norms (decile equivalents for raw scores) are 
provided for every tést. The samples were ks 
on which the validity coefficients were compute 
plus some employed groups for which 7% arè 
not stated. The sample size is adequate only for 
the Classification Test. Geographical location 0 
the samples is seldom mentioned, although this 
is an important variable in studies of other - 
lar tests. There is no separation of norms PY 
sex. While local norms are always best, the user 
of these tests should be provided with many 
more norms from various regions to serve a 
preliminary guides. А 

The published data give no informatio 
the relationships between these eight tests, 4? 
they are nowhere treated as component pr 
a battery. Factor analysis data from other studie 
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would suggest that (a) the Name and the Num- 
ber Comparison tests are highly correlated ; (5) 
the Name Comparison Test, the Spelling Test, 
and the Classification Test are correlated be- 
cause they are all measures in greater or lesser 
degree of the verbal factor; (c) the Arithmetic 
Reasoning Test and the Classification Test both 
contain arithmetic reasoning items and hence 
should show considerable correlation; and (d) 
the Tabulation Test and the Arithmetic Reason- 
ing Test both have loadings on the verbal and 
number factors. The Alphabetizing and Filing 
tests were developed to complement each other 
and may be highly correlated. However, Mosier 
reported on two equally face-valid alphabetizing 
tests which correlated .оо with each other and 
almost zero with supervisors’ ratings? If test 
intercorrelations or factor analysis data were 
available, the test user could select his experi- 
mental batteries much more efficiently. 

These eight tests appear to be intrinsically 
good, but most of them need further editing, item 
analysis, and reliability data before they can be 
compared adequately with the better known tests 
which they closely resemble. Also they require 
many more validity and norms studies before 
they can be used successfully by any but the 
most highly trained personnel psychologist who 
can obtain local validation and normative data. 


Аівевт К. Kurtz, Professor of Psychology, 
The Pennsylvania State College, State College, 
Pennsylvania. 

This battery of eight clerical tests is not very 
useful in its present state. Most of the tests are 
very similar to well known types of tests that 
have been in use for some years; there are few 
muy new ideas incorporated in this battery. A 
ew items in each test would probably be criti- 
bui most persons familiar with the charac- 
E. of good test items. The tests are objec- 
i. no particular difficulties їп test 
[Же р scoring are likely to be encoun- 
ii i eliability coefficients are given for each 
лгы, їп Several cases these coefficients are 
сонын, у high since they are based on odd-even 
add n in which speed is a very important 
ШЫ inally, and most important, validity is 
vali diese ; five of these tests have not been 
titania ina nonacademic situation, and the 

h Xceptions gave low validities. 
€ manuals have the right headings, but the 


? Mosi 
сер et p Charles Т. "A Critical Examination of the Con- 
47. * e Validity." Ed & Psychol Meas 7:191-205 SU 


information supplied is extremely sketchy, 
Nothing at all is said about any item analyses; 
the prospective user may well conclude that 
none were made. 

Most of the tests have alternate forms. By 
looking at them we can deduce that the items 
were probably constructed in pairs. About half 
the time, we are given an indication of the size 
of the difference in difficulty between the two 
forms. In no case are we told anything about 
differences in variability or about differences in 
the shape of the distributions of scores on the 
two forms. 

The directions for administering and scoring 
the tests are satisfactory. Usually the directions 
which the examiner reads match those on the 
test booklet, but there are some discrepancies 
here. In the manuals, one sentence is omitted 
from the directions for the Arithmetic Reason- 
ing Test and two from the directions for the 
Alphabetizing Test. 

Norms are given for all tests, but the group on 
which most of these norms are based is described 
only as “ап employed clerical group." N is usu- 
ally 57 for this group. For some of the tests addi- 
tional norms are given. i 

Three important criticisms may be leveled 
against most of the material in the “reliability” 
sections of the manuals. First, all but one of 
these tests have a time limit which is rather short, 
and hence the speed element in the tests makes 
the usual odd-even reliability coefficient spu- 
riously high. For example, the correlation be- 
tween Forms A and B of the Filing Test is 86 
or .88, but the odd-even reliability is .99. Sec- 
ond, the number of cases on which the reliabili- 
ties are based is sometimes extremely low (35 
for the Number Comparison Test) and is not 
published (why?) for four of the tests. Third, 
these tests probably have satisfactory reliabili- 
ties, but we do not know. If we ignore the spuri- 
ously high odd-even and test-retest figures, we 
find stated reliabilities of .78 to .96 for the four 
tests (е, f, g, and h in the above list) for which 
data are given. We just do not know what the 
reliabilities of the other tests are. ў 

Validity coefficients are conspicuous by their 
absence. One of the test manuals does not even 
mention validity ; for four more of the tests, there 
isa section on validity which does not contain 
on with any industrial or busi- 


a single comparis 
ness criterion; while for the other three tests 


there are extremely brief (never more than three 
lines) statements. These statements simply say 
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that the validities “range between .25 and .30,” 
“ranged from .29 to .37,” and that the test 
“demonstrated low but positive validity.” For 
none of these three tests were the groups de- 
scribed at all; in fact, we are not even told how 
many cases were involved. 

CLASSIFICATION TEST. Items vary widely in 
difficulty, but they are not arranged in order of 
difficulty. Since the time limit is short (то or 
15 minutes for roo items), people who do not 
skip around will not receive credit for many 
items that they could have easily answered. The 
system of classifying items into five subtests is 
most peculiar. For example, “Nippon is another 
name for...Japan" is a “general information" 
item, but “The word Huns is often used to refer 
to the...Germans" is a “vocabulary” item. In 
connection with norms, the following unsub- 
stantiated and highly questionable statement is 
included: "The present sample is representative 
of factory workers found in the State of Ohio." 
Norms for ten different groups are supplied. 

NUMBER COMPARISON AND NAME COMPARI- 
SON TESTS. These tests differ from the well known 
Minnesota Clerical Test in that it is necessary 
to go from one page to another in making the 
comparisons. In a letter to the reviewer, one of 
the authors, Jay L. Otis, said, “We have had 
some additional research on the Number and 
Name Checking Test which indicates that it is 
as good as, if not better than, typical number and 
name checking tests." No satisfactory data on 
either reliability or validity are given in the 
manual, 

TABULATION TEST. Despite its name, several 
of the items in this test require that the appli- 
cant multiply by 4, by 20%, by 14, by 6, etc., or 
divide by such numbers. The manual gives only 
an odd-even reliability and a vague statement 
on validity. Norms are given for one group of 
57 people. 

FILING TEST. This appears to be an almost pure 
speed test as judged by the previously quoted 
spurious odd-even reliability of .99. There are no 
satisfactory validity figures. Norms for two 
groups are given. 

ALPHABETIZING TEST. Data on reliability and 
validity are presented, but there is no descrip- 
tion of the groups upon which the figures are 
based. Norms are based on two very small 
groups (N — 26 and 53). 

ARITHMETIC REASONING TEST. This is a short 
test (20 items) ; most of the items are easy, 
though a few are tricky. They are obviously not 
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in order of difficulty. The only figure given forre 
liability (a correlation of .78 between forms) i 
questioned by the authors as being too low be. 
cause it is based on a highly selected group. 
Norms are given for one group. Validity is not 
even mentioned in the manual. 

SPELLING TEST FOR CLERICAL WORKERS, From 
a purely mechanical standpoint, this test is very 
poorly put together. The printed directions tell 
the applicants to write in the wrong column, and 
the examiner's directions don't match those on 
the test booklet. It is very difficult to do a good 
job of phonetic spelling for a nondictated spelling 
test. In this test a space is inserted between 
syllables of the phonetic spelling and also a brief 
definition is given. Many people would disagree 
with this phonetic spelling. The use of two vow- 
els separated by a consonant within a syllable is 
obviously confusing. Thus sare and nare might 
better be rendered as sair and mair (a method 
which the authors use in some other words). The 
final e in ense, anse, and sorse might well be 
dropped. The use of the letter i for the sound 
of the first e in event is certainly confusing. 
There are many other such deviations from ap- 
proved pronunciations. Finally, teri and eri are 
printed as one but are in reality two syllables (in 
notary and machinery). It must in fairness be 
pointed out that the definitions help the appli- 
cant to understand what word is wanted. The 
reliability is high, but the validity is unknown. 
The test contains many legal and financial terms 

CONCLUSION. The reviewer cannot consclet 
tiously recommend this battery of tests im ү 
present state to an industrial organization m 
is seeking aid in the selection of better clerical 
employees. These tests may be useful for en 
mental purposes, but so are many others. Itt 
test items are analyzed, if the tests are revis 
and if they are found to be valid, the situatio 
will be entirely different. 


[730] al 
*Psychological Corporation General vu 
Test. Grades 9-16 and applicants for clerical Piy 
tions; 1944-50; a revision of General cime speed 
PCI Selection Form го ('42) ; 4 scores: clerical total; 
and accuracy, numerical ability, verbal facility, speci 
т form, '44; manual (750) ; $3.40 рег 25; 350 P (ру. 
men set; also available in two separate booka Booklet 
tial Booklet A—Speed and Number, Partial э, 
B—Verbal) ; $2 per 25 of any one booklet; Pp 
46(50) minutes; Psychological Corporation. 
REFERENCES ) a 
1. Barnette, W. Lesure, Jr. Occupational A нінде Fer 
pa Counseled Veterans. Doctor's thesis, New Y 24:362, tht 
( few York, N.Y.), 1949. Pp. viii, 38s. * (Р. See 
piri Gee WirLrAM JAMES, “A Tested Method for the 
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Psychological Monographs: General and А! 
plied, Vol. 65 No. 5, Whole Мо. 322. Washin on, D.C.: the 
Association, Inc., aper. * (РА 26:2794) 


Epwarp E. Cureton, Professor of Psychology, 
The University of Tennessee, Knoxville, Ten- 
nessee. 

This test consists of nine parts or subtests ar- 
ranged in three groups, Clerical (checking and 
alphabetizing), Numerical (arithmetic compu- 
tation, error location, and arithmetic reasoning), 
and Verbal (spelling, reading comprehension, 
vocabulary, and grammar). It appears to test 
most of the abilities which have been found 
valid for predicting success in clerical occupa- 
tions, with the exception of specific skills such 
as shorthand, typing, and office machine opera- 
tion. The test booklet is well arranged, and the 
instructions for administration are clear and 
reasonably short. Each subtest is timed sepa- 
rately, the time limits ranging from 3 to 8% 
minutes. The examiner must therefore be alert 
throughout the test period of approximately 50 
minutes to be sure that every subtest is timed 
properly. Figures in the margin of the manual 
emphasize the time limits, but an unfortunate 
set of errors confuses the issue slightly ; the sign 
for four minutes, e.g., is printed as 4”, not as 4. 

All parts are scored number right. Part 4 
(error location) is scored with a punched-hole 
mask, the other eight parts with a fan key. No 
instructions are given for folding and using the 
fan key, and persons unfamiliar with this device 
may experience some difficulty in scoring the 
test accurately and rapidly. 

Test-retest reliabilities for the nine parts, the 
three subtotals, and the total score are reported 
in the 1950 manual, based on data from 195 
commercial high school seniors retested after 
one month. Since the test has only one form, and 
Several of its parts are highly speeded, the test- 

. tetest correlations will approximate equivalent 
orm correlations more closely than will split 
thee correlations, They will still be somewhat 
Bud however, than would the equivalent 
i. correlations. The test-retest reliabilities 
ee for the three subtotals and the to- 

Score are .87, .82, .от, and .94 respectively. 
loan standard deviations, interpart correla- 
the Ge шк ethtatal correlations, based on 
cere administration of the test to this group, 

Ported also. 
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The authors state, “Factorial analyses of a 
preliminary form....were made on two similar 
clerical populations of more than one hundred 
cases each. The first showed factors agreeing 
with the three divisions of the test; the second 
substantiated these findings.” Katzell, in his re- 
view of this test in The Third Mental Measure- 
ments Yearbook, states that “С. К. Bennett 
kindly made available to the reviewer a copy of 
the rotated factor loadings. The results of the 
two analyses are not in very close accord, which 
is probably due in part to the relatively small 
number of cases in each sample (108 and 120). 
However, the results generally support the le- 
gitimacy of the three groupings * Parts 1 and 
2 are factorially similar to each other. The same 
can be said of Parts 3, 4, and 5; for Parts 6 and 
9; and for Parts 7 and 8.” It would appear, then, 
that the verbal score might well be broken up 
into two: reading and vocabulary, and spelling 
and grammar. The materials in Parts 3, 4, 5, 7, 
and 8 would correspond quite closely to those 
found in typical group intelligence tests. 

The manual reports a number of correlations 
between the Psychological Corporation General 
Clerical Test and other group intelligence tests 
and clerical aptitude tests ranging from .54 to 
.83. Both of these extreme values represent cor- 
relations with the same group intelligence test in 
different samples. Standard deviations are not 
reported along with these correlations, so that 
it is somewhat difficult to interpret them pre- 
cisely. 

Validity data from three studies are reported. 
The first yielded the following correlations with 
performance ratings: Clerical .45, Numerical 
.22, Verbal .42, and Total .43. Standard devia- 
tions are not given, and no reason is reported 
for the subscale N's ranging from 68 to 71 while 
the N for the total score is 73. A fourfold table 
shows the relation between total scores, cut at 
129-130, and tenure with the firm, cut at 6 
months (N — 116). The manual states, "Here 
again, the test could have been very useful in 
predicting a criterion of job tenure." The four- 
fold-point correlation from this table as com- 
puted by the reviewer, however, 15 only 24. 

The second study shows the relation between 
total scores and supervisors’ ratings of below - 
average, average, and above average. The N’s 
range from 64 to 121, and represent clerical per- 
sonnel in seven branches of the organization. 
Correlations are not given. Phi coefficients com- 
puted by the reviewer from the "outside" ratings 
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(below average and above average) range from 
-16 to .51, and average .32. 

The third study reports the following correla- 
tions with instructors' ratings of 9r clerical 
trainees on completion of a four-week training 
course covering records and accounts: Clerical 
.26, Numerical .54, Verbal .42, and Total .48. 
Standard deviations are not reported. 

The manual emphasizes the specific nature of 
validity by citing the low criterion correlation of 
the Numerical score in the first study (.22) and 
its high correlation in the third (.54), and re- 
commends local validation. 

The manual emphasizes the differences among 
the various sets of norms, stresses the advan- 
tages of local norms, and states that “Until such 
time as he (the test user) has collected a suffi- 
cient number of cases (preferably 100 or more) 
he will find the data in this manual next best for 
his purposes." They will certainly be better than 
the norms supplied with most other published 
clerical tests. 

This test compares very favorably with most 
of the other general clerical tests now available. 
While its use makes somewhat exacting demands 
on the examiner, we must remember that it in- 
cludes its own verbal and numerical intelligence 
test as well as a clerical speed and accuracy test 
and a spelling and grammar test. 


С. A. Satter, Assistant Professor of Psychol- 
оду, University of Michigan, Ann Arbor, Mich- 
igan. 

Four reviewers wrote evaluations of this test 
for The Third Mental Measurements Y carbook. 
They were consistently unenthusiastic. They 
thought the test was too long, that its content 
overlapped with that of intelligence tests, that 
the normative data were inadequate, and that 
some evidence of the predictive value of its scores 
should have been provided. These criticisms 
have, for the most part, been met and dealt with 
in a revision of the test’s manual and in making 
the test available in “partial” forms. 

The Psychological Corporation General Cleri- 
cal Test consists of a battery of nine tests which 
are scored to yield clerical, numerical, and verbal 
scores. These materials are available in a single 
booklet edition which contains all nine tests and 
in a “partial booklet edition” in which the mate- 
tials are divided between two booklets, In the 
latter form, one booklet (A) contains the cleri- 
cal and numerical tests, and another (B), the 


verbal tests. This arrangement permits the or. | 
ganization which already uses an intelligence 
test in its clerical selection procedures to substi- 
tute it for Booklet B, and incidentally it meets 
the criticism that might be raised concerning 
overlap of the verbal test with a test of intelli 
gence. The working time for the tests of Booklet 
A is 2914 minutes, for Dooklet B, 16%, 

The most recent edition of the manual is un- 
usually complete. The prospective user is pro- 
vided with retest reliabilities, normative data 
from quite a variety of sources, and with the re. 
sults of a number of validity studies. There is, 
however, one rather conspicuous omission: the 
authors fail to describe the process by which the 
test materials were originally assembled and the 
process by which they were selected for final 
inclusion. 

Reliability data and intertest correlations are 
reported for a group of 195 seniors in a commer- 
cial high school. The reliability data are in tle 
form of retest coefficients. They range in value 
from .59 to .88 for the subtest scores and from 
82 to .91 for the three part scores; these data 
would suggest that the user of the test prob- 
ably would not want to make decisions on the 
basis of individual subtest scores. Intercorrela- 
tions between scores are repeated for the same 
group; these range from .17 to .54 for the sub- 
test scores and from .42 to .49 for the three part 
scores. 

Enough validity data are provided to enable 
one to say that this collection of tests has the 
potentiality for making a useful contribution to 
a selection battery. These data take the forms 
of correlation coefficients and tables which show 
test performances and job performances. a 
criteria are typically performance ratings, а1 
the correlations are in the 40's and 50's. f 

Norms are presented in the form of percentile 
rank equivalents. They come from a variety 0 
sources—students in business schools, appli- 
cants for clerical positions, and employed cleri- 
cal personnel. A variety of business organiza- 
tions is represented. 1 the 

As it stands today with its revised manual, t $ 
Psychological Corporation General Cler ii 
Test should be able to hold its own among n 
best of the clerical aptitude tests; it most ae 
tainly is worthy of consideration by every di 
ployment office in need of an instrument ў 
selecting or placing clerical personnel. It is Pay 
ably longer than it needs to be, and the separate 
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timed tests may be an annoyance in some em- 
ployment settings. Only the prospective user, 
however, can decide whether these characteris- 
tics are really drawbacks. 


For reviews by Edward N. Hay, Thelma 
Hunt, Raymond А. Katsell, and Е. F. Wonder- 
lic, see 3:630. 

[731] 
*Purdue Clerical Adaptability Test: Purdue 
Personnel Tests. Applicants for clerical positions; 


1949-50; 7 scores: spelling, memory for oral instruc- 
tions, arithmetic computation, checking of names and 


numbers, vocabulary, copying, arithmetical reasoning ; 
Form A (749) ; preliminary manual ('50) ; $5 per 25, 
postage extra; $0@ per specimen set, stpaid; 58(75 

minutes; C. Lawshe, Joseph Tiffin and Herbert 
Moore (test) ; Occupational Research Center, Purdue 
University. * 

Epwarp N. Hav, President, Edward М. Hay 
& Associates, Inc., Philadelphia, Pennsylvania. 

This test has been prepared with some care, 
and its authors are men with considerable ex- 
perience in testing as applied to industry. It is 
surprising, therefore, to find a seven-part test, 
for measuring clerical aptitude, which requires 
nearly an hour of actual testing time. This means 
a total time for administration of close to an 
hour and a half. Public relations will not permit 
the employer today to ask an applicant to take 
so much time. In addition, the cost to the em- 
ployer of administering so long a test is out of 
proportion to the results. It has been established 
that in 15 minutes or less an adequate measure 
of the ability to acquire speed in simple clerical 
tasks can be measured. 

Another fault of this test is that it assumes 
that items such as spelling, word meaning, and 
computation are predictors of clerical success. 
They do not, however, as many experimenters 
have shown, predict low level clerical success as 
well as do tests of speed of perception. 

No validities are furnished in the manual, 
which is very surprising. The manual states “the 
test consists of seven separate sections, each of 
Which measures a skill or ability required on 
Certain clerical jobs.” Instead of submitting va- 
lidity coefficients, the manual states, “а selec- 
tion of the parts to be used can be made from a 
job analysis.” All seven parts are bound together 
ina single booklet, which makes it expensive and 
Inconvenient to use only those parts which may 

ave proven validity in a given situation. 

It is hard to understand why anyone should 
offer a test of this kind for public sale without 


submitting any proof of its ability to do the job 
for which it was designed. In the absence of such 
information, it is impossible to make an adequate 
evaluation. 

In summary, the principal weaknesses of the 
test are that it is prohibitively long, and that no 
experimental evidence is given to show that 
it will predict clerical success. 


Јоѕерн E. Moore, Professor of Psychology and 
Head of the Department, Georgia Institute of 
Technology, Atlanta, Georgia. 

This test includes subtests in seven areas. The 
spelling ability area is tested by the recognition 
method. Memory for oral instruction contains 
such items as telephone exchanges and num- 
bers, time mail goes out, and so forth. The arith- 
metic computation area has 15 problems cover- 
ing simple addition, subtraction, division, and 
percentages. The checking part has material on 
one page to be checked against that on another 
page. The word meaning section contains 35 
words with multiple choice definitions after each. 
The copying portion requires that information 
be copied on particular lines, and that certain 
numbers and symbols be put into the proper 
columns and squares. The last part of the test 
contains typical arithmetic problems. Much of 
the material in this test is similar to that found 
in typical elementary and high school achieve- 
ment tests. 

The directions for administering the test are 
crowded together in the manual in such a way 
that they are both difficult to read and hard to 
keep separated. No information is given about 
the background, training, or experience of the 
norm group except that they were “applicants.” 
A statement that the minimum number of 742 
clerical applicants was used does appear. It 
would be very helpful to have had the group 
classified as to age, sex, and previous work ex- 
perience. 

The only statistical data presented other than 
percentile tables are split half reliability coeffi- 
cients for the seven subtests. These range from 
79 to .97. No data whatsoever are included on 
the validity of the instrument. The statistical - 
data that are presented are incomplete, vague, 
and possibly misleading. This instrument needs 
to have an initial validity study done on it to 
show that it does in fact measure clerical adapta- 
bility. In its present form there is no evidence 
in the manual to show that the instrument can 
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or does differentiate clerical skills or reveal cleri- 
cal adaptability. The authors do not present 
any information to show that their test is as 
good as, or superior, to the numerous clerical 
tests already published and used. 


Avec Корсев, Reader in Psychology, Birkbeck 
College, University of London; and Consulting 
Psychologist to the Ministry of Labour and Na- 
tional Service; London, England. 

Nearly 20 years ago, while on a visit to the 
National Institute of Industrial Psychology, 
Moore became enamoured of the Institute's cler- 
ical test, Group Test 25 (Clerical). Later, he 
produced.a revised version of it for use in the 
United States. Now, in collaboration with 
Lawshe and Tiffin, he has contrived yet another : 
it is on the whole markedly different from the 
original, though it retains a few of the NIIP 
items and instructions. 

Whether it is a better clerical test is difficult 
to judge, for the preliminary manual (which, 
unlike the test booklet, bears the names of 
Lawshe and Tiffin only) offers no validation 
data. Indeed, all the potential user is given is a 
table of percentiles based on the scores of 742 
people who are meagrely designated "clerical 
applicants" and a table of split half reliability 
coefficients. He will derive small comfort from 
these figures. 

Nor will he be reassured by inspection of the 
booklet. The front page contains material that 
is likely to prove puzzling to many "clerical ap- 
plicants" : the names of the compilers, a general 
direction that informs the testee that “this test 
includes a number of sections that measure abili- 
ties that have been found to have value in identi- 
fying successful clerical employees,” and a pro 
forma for raw scores and percentile positions. 
It could certainly not be claimed that the pres- 
ence of such items on the front page is neces- 
sary ; and it might well be argued that it is un- 
desirable. Some of the instructions for the sepa- 
rate sections of the test appear to be similarly 
defective. In two of them, the testee is asked 
to avoid "unnecessary mistakes." What is a 
"necessary" mistake? The type used in the 
arithmetical computation section seems inde- 
fensible. 

In short, although this may in fact be a good 
clerical test, the authors have not yet presented 
any evidence that this is the case. Potential users 
of it will have to depend mainly on their own 
scrutiny of the test booklet, and it is doubtful 
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whether many of them will find it particularly 
attractive. 


[732] 

*SRA Clerical Aptitudes. Grades 9-13 and adults; 
1947-50; 4 scores: office vocabulary, office arithmetic, 
office checking, total; Form AH (47) ; revised manual 
(750); 49¢ per test and answer pad; Separate answer 
pads must be used; $1.90 per 25 answer pads; 55¢ per 
25 profile and norm sheets ('49) ; 75¢ per specimen set; 
cash orders postpaid; 25(35) minutes; prepared by 
Richardson, Bellows, Henry & Co., Inc.; Science Re- 
search Associates, Inc. * 


Epwanp №. Hay, President, Edward М. Hay & 
Associates, Inc., Philadelphia, Pennsylvania, 

This test has been constructed with a great 
deal of care and understanding. It is made up 
of three subtests: office vocabulary, office arith- 
metic, and office checking. These three subtests 
appear to have high face validity, but no infor- 
mation concerning actual validity is presented 
in the manual. In an effort to overcome this lack 
of information, SRA Clerical Aptitudes was ad- 
ministered to 21 employees in a life insurance 
company, along with two test batteries whose 
validities have been repeatedly established in 
various clerical.situations: the Hay Number 
Perception Test and the Hay Name Finding 
Tests, and the L.O.M.A. Clerical Test No. 2. 
Although the sample was small, both batteries 
gave validity coefficients quite similar to those 
obtained in a good many other similar situations. 

АП of the 21 employees were engaged in per- 
forming simple clerical tasks. The measure of 
success as rated by two supervisors was de- 
scribed as “speed of working,” and was intended 
to indicate the “volume of work of acceptable 
accuracy" produced by each individual. The cri- 
terion was the average of the two sets of super- 
visory ratings. The coefficient of reliability of 
the criterion as indicated by the relationship 
between the two sets of ratings was .91. 

The validity coefficients for the Hay Number 
Perception Test and the Hay Name Finding 
Test were .64 and .бо respectively. In contrast 
the arithmetic and vocabulary subtests of Sé 
Clerical Aptitudes had validity coefficients 0 
—.05 and .08 respectively, indicating that these 
subtests apparently do not predict success 1 
simple clerical tasks. The validity coefficient for 
the SRA checking subtest was .55, indicat 
that this 5-minute test is a good predictor 0 
clerical success. Of course, the SRA arithmetic 
and vocabulary tests may have value in aci 
situations, but the authors do not tell us wha 
they are. It is hard to understand how a test ? 
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this kind could have been used for four years 
without information concerning its validity hav- 
ing been assembled. 

The format of the test is good, although some- 
what complex and expensive in style. Scoring is 
about as inconvenient as could be devised. 

The 25 minutes required for administering 
the complete test is longer than is necessary. 
Most of the wasted time occurs in the office 
arithmetic section, which is a series of reasoning 
problems dealing with figures. This and the 
office vocabulary part are really made up of in- 
telligence or mental ability items. Other experi- 
ments have shown repeatedly that low level 
clerical success is predicted more effectively by 
tests involving primarily speed of perception. 

In summary, the serious faults of this test 
аге: (a) two of the parts do not appear to be 
valid as claimed for predicting success in simple 
clerical work; (5) the method of scoring is cum- 
bersome ; and (c) the administration time is too 
long. 


G. A. Satter, Assistant Professor of Psychol- 
ogy, University of Michigan, Ann Arbor, Mich- 
igan. ; 

This battery consists of three short tests which 
are suitable for use with high school students 
and industrial job applicants. The battery con- 
tains an office vocabulary test (48 items cast in 
same-opposites form), an office arithmetic test 
(24 four-alternative multiple choice items), and 
a checking test (144 words, keyed with num- 
bers, which must be paired with one of five num- 
bers). These materials are reproduced in a non- 
expendable spiral binder. Answers are recorded 
on a self-marking answer sheet. 

Inthe authors’ words, materials for these three 
tests were assembled after a “critical survey of 
the Psychological literature on measurement of 
clerical aptitudes.” From the leads provided from 
the survey, experimental forms were constructed 
and administered to high school groups; item 
analyses were made and items were selected in 
terms of their ability to discriminate between 

high” and “low” scoring groups and to yield 
an “even distribution of difficulty." In their 
abled form, the items are scored on a unit 
asis, and the part and total scores may be evalu- 
i against the percentile norms provided in 

* manual. Norms representing the perform- 
ance of students in some 40 high schools are 
Mis. separately by grade and sex ; similar data 

Те provided for industrial clerical applicants. 


The materials in the manual are well or- 
ganized and include data which the prospective 
user of the test may find useful. Subtest means, 
sigmas, and Kuder-Richardson reliabilities are 
provided for groups of boys and girls in grades 
9, 10, 11, and 12. In addition, there are intercor- 
relations between subtest scores (these range 
from .30 to .50) and between subtest scores and 
scores on two intelligence tests and SRA Lan- 
guage Skills. The authors are not so helpful as 
to provide correlations with either job or train- 
ing performance. 

The prospective user will undoubtedly be at- 
tracted by the test layout, its high face validity, 
its short time limit, and the ease with which it 
can be scored. He will be less enthusiastic when 
he fails to find in the manual any statistics con- 
cerning validity; and he may look at other tests 
when he reads that two of this battery of three 
tests correlate from .49 to .75 with measures of 
intelligence. 


[733] 
The Short Employment Tests. Applicants for 
clerical positions; 1951; 3 tests; Forms 1, 2, 3, 4 (dis- 
tribution of Form 1 is restricted to banks which are 
members of the American Bankers Association; Form 
4 is restricted for special uses); preliminar manual 
$1.60 per 25; 35€ per specimen set; postpaid; 15(20) 
minutes; George K. Bennett and Marjorie Gelink; 
Psychological Corporation. * 
a) TEST CA, CLERICAL APTITUDE. 
b) TEST М, ARITHMETIC SKILL. 
c) TEST V, WORD KNOWLEDGE. 


[ 734] 
xTabulation Test. Clerical applicants and high 
school; 1947; Forms A, В; manual ['47]; $2 per 25, 
postage extra; 50¢ per specimen set, postpaid; 10(13 
minutes; Jay L. Otis and David J. Chesler ; Personnel 
Research Institute, Western Reserve University. * 

[ 735] 
*V.G.C. Clerical Indicator. High school; 1950; an 
adaptation of United States Employment Service's 
General Aptitude Test Battery, Part B (see 714); 1 
form; preliminary manual; $1.95 per 25; separate an- 
swer sleeves must be used; 48¢ per 25 answer sleeves ; 
тоё per manual; 45¢ per specimen set; postage extra ; 
available only to Canadian schools; 10(20) minutes; 
Vocational Guidance Centre. * 


Grorce A. Fercuson, Professor of Psychology, 
McGill University, M ontreal, Canada. 

This test is an adaptation of Part B of the 
United States Employment Service’s General 
Aptitude Test Battery. The test material is of 
the name checking type. Pairs of names are pre- 
sented to the subject who is required to indicate 
whether the names in each pair are the same or 
different. Speed and accuracy are important. 
The test material is identical in type with that 
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contained in Test 2 of the Minnesota Clerical 
Test. The test contains 150 items, and the testee 
is allowed 10 minutes working time. 

The distinctive features of this test are its 
format and method of scoring. The test forms are 
printed on heavy cardboard. The testee records 
his answers on a separate cardboard answer 
sheet or sleeve which is fitted over the test form. 
The scoring sheet is subsequently inserted in a 
plastic sleeve for scoring purposes. The format 
and method of scoring are very ingenious. Econ- 
omy may be effected by using the test form 
many times. 

Very little information on the test is pre- 
sented in the manual of instructions. Rather in- 
adequate norms are available based on a sample 
of 480 pupils in grades 9 and то in a commercial 
school. Scores for the grade то pupils are slightly 
higher than those for grade 9. No information 
on the reliability of the predictive capacity of 
the test in various situations is given. We may 
presume, however, that reliability and validity 
data relating to the Minnesota Clerical Test may 
have some degree of relevance to the V.G.C. 
Clerical Indicator since the test material is simi- 
lar in both. The reliability and validity of this 
I50-item test may be expected to be somewhat 
lower than for the Minnesota Clerical which 
contains 400 items in all. 


INTERESTS 


[736] 

*ABC Occupational Inventory. Grades 9-16 and 
adults; 1935-42; a checklist of over 500 occupations 
(classified as to work function) in 10 areas: artistic, 
humanistic, literary, scientific commercial, technical, 
mechanical, industrial, constructional, transporta- 
tional; т form, '42; no data on reliability and validity ; 
no norms; IOÉ per inventory; 256 per specimen set; 
postpaid; N. A. Lufburrow; the Author, 3112 Milford 
Ave., Baltimore 7, Md. * 
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*Brainard Occupational Preference Inventory. 
Grades 9-12 and adults; 1945-49; a revision of Spe- 
cific Interest. Inventory (see 40:1675) by Paul P. 
Brainard and Frances G. Stewart; 4 occupational sec- 
tions in each of 7 fields: commercial, personal service, 
agriculture, mechanical, professional esthetic, scien- 
tific; Form A (’45) ; manual (749) ; $5.50 per 25; sep- 
arate answer sheets must be used; $1.60 per 25 answer 
sheets; 60€ per specimen set; postpaid ; nontimed (30) 
minutes; Paul P. Brainard and Ralph T. Brainard ; 
Psychological Corporation. * 


REFERENCES 
1-4. See 40:1675. 
ES Lo EE ii qa т 
t Я н. omparison о Vi t e 
yentories With Respect to Word Usage” J Col Res 4а 17 8 
"48. * (PA 23:2915 


Ermer D. Hincktey, Professor of Psycholog 
and Head of the Department, U. niversity of Flor. 
ida, Gainesville, Florida. 

This inventory consists of 140 items in which 
the subject rates on a 5-point scale different 
activities in terms of his disliking, liking, or 
neutrality toward these activities. The test js 
easy to administer and score and gives a profile 
for seven occupational fields which are broken 
down into 28 sections, four to each field, 

This inventory is fairly new and has not been 
used as widely as the more popular Strong and 
Kuder. While the reliability coefficient of 81 
quoted by the authors is acceptable for a test 
of this sort, no evidence of validity is offered 
other than logical or face validity. The test asa 
whole may be reliable, but the reliability of the 
field and section scores may be open to question. 
Having only 140 items, the test is too brief to 
yield the amount of information that the authors 
attempt to squeeze from it. 

The norms are inadequate, but the authors 
state that more complete norms will be forth- 
coming. The norms given are for adult men, 
adult women, high school boys, and high school 
girls. Separate field and section norms are given 
for each of these groups with the numerical 
scores expressed in quintile divisions that аге 
designated very low, low, average, high, and 
very high. More cases are needed in establishing 
norms (adult men, N = 95). 


For reviews by Edwin W. Davis and Her- 
schel T. Manuel, see 3:634; for reviews by Jack 
W. Dunlap and M. R. Trabue of the original 
edition, see 40:1675; for a review by Everett В. 
Sackett of the original edition, see 38:1176. 


[738] 
*Edmiston Inventory of Interest. Grades 11-16 
and adults; 1946-48; Forms А (4б), B (40), A 
(748), D (48) ; manual [48]; 736 per 25; 35 
specimen set; postpaid; nontimed (25) minutes; * «v 
ea the Author, Miami University, Oxfor 
io. MD 
а) FORMS A AND B, 8 scores: business, aesthetics, S% 
ence, social, journalism, mechanics, education, 801 
ment. ics, sci- 
b) FORMS С AND р. 9 scores: business, aesthetics, ln 
ence, social, journalism, mechanics, biological scien 
numbers, clerical. \ 
REFERENCES АС 
1, Epmiston, R, W., AND VORDENBERG, WESLEY. The ces 
tionship Between Interests and School Marks of College 
men." Sch & Soc 64:15374 Ag 31 46. * (PA 21:890) 1 
Edw 


ARTHUR E. TRAXLER, Executive Director, 
cational Records Bureau, New York, 
York. n" 
Although several well known inventories 
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interests have been available for a good many 
years, new instruments of this kind continue to 
be published. Perhaps in no other field do meas- 
urement instruments vary so widely in care of 
construction and in extent of research on the 
results. Years of research have been devoted to 
the Strong blanks. There is now a large body of 
data on the Kuder Preference Record, and.sev- 
eral studies of the Cleeton and other interest in- 
ventories have been made available. On the other 
hand, certain inventories have been issued with 
little or no evidence that anything other than the 
judgment of the test author was taken into ac- 
count in either the construction or tryout of 
the instrument. 

The Inventory of Interest is one of the more 
recently published inventories on which there 
is not, at present, much objective information. 
It exists in six forms, a larger number than is 
available for any other instrument designed for 
the measurement of interests. Forms A, B, C, 
and D are intended to measure present interests ; 
the other two forms, designed to measure what 
the test author has called "maintained interests," 
cover vocabularies and the names of personages 
connected with certain occupational fields. 

Тһе test situation presented by the forms de- 
signed to measure present interests is very sim- 
ple. Each form is printed on one side of a letter- 
size sheet. At the top are eight lines of direc- 
tions, below which are three columns, headed 
"Column I, WHICH MAGAZINE; Column Il, 
WHICH coursE; Column III, wHICH тов.” Col- 
umn I contains the names of 40 magazines, either 
actual or fictitious, arranged in 5 groups of 8 
names each. Column II contains the names of 
40 courses, and Column III presents the names 
of 40 jobs; both lists grouped in similar fashion 
to Column I. The procedure in filling out the 
inventory is to place an X before the preferred 
item in each group of eight and to check two 
other items as the next two choices. 

) Forms А and B are planned to provide scores 
in eight fields—business, education, aesthetics, 
Bovernment, science, social, journalism, and me- 
chanics. Forms C and D yield scores in nine 
fields—business, aesthetics, biological sciences, 
Science, social, journalism, mechanics, clerical, 
and numbers. In scoring, the first choice in each 
of the 15 groups is given a weight of 2 and each 
Second choice is counted as 1 point. Thus, a total 
of 60 points is distributed among 8 or 9 different 
fields, The highest possible score in any one 
field is 30 and the lowest is о. The interpreta- 
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tion of the scores is as simple and uncompli- 
cated as the test itself. The individual’s raw 
scores in the eight or nine fields are simply 
ranked in order, and this ranking presumably 
indicates the relative order of one's interests. 
Norms are not used in the interpretation of the 
results ; in fact, no norms appear in the manual. 

At first glance, a test specialist is likely to 
think that this inventory is hardly worth serious 
consideration, especially when it is compared 
with certain other instruments in the field of 
interests which represent a considerable degree 
of statistical sophistication. But, notwithstand- 
ing the extreme simplicity of the scores yielded 
by this blank and of the procedure in interpreta- 
tion, the test author presents in the manual some 
statistical information on reliability and validity 
which does not compare unfavorably with simi- 
lar data on other interest inventories. Seventeen 
correlations between alternate forms of the in- 
ventory of present interests ranged from .66 to 
.92, with a median of .83. Ten reliability coeffi- 
cients for the tests of maintained interests ranged 
from .78 to .91, with a median of .88. 

In order to obtain further data on the relia- 
bility and validity of the inventory, Forms A 
and B were administered to 24 members of the 
staff of the Educational Records Bureau, 17 
women and 7 men. There was an interval of 
10 days between the two forms. About five days 
after the second form was filled out, each person 
was asked to rank the eight fields for which the 
inventory was to be scored in the order of his 
interests as he judged them to be. The inven- 
tories were not scored until these judgments had 
been obtained. 

Rank order correlations were then computed 
for each person between Forms A and B and be- 
tween each form and self-estimates of interests 
with respect to the eight fields. The Spearman 
rank-difference correlations between Forms A 
and B, or the reliabilities, ranged from .05 to 
.98, with a median of .84. Only three of the cor- 
relations were below .70. While it may be only 
fortuitous, it is interesting to note that the me- 
dian is practically the same as the median relia- 
bility reported by the test author. This degree 
of reliability is not unfavorable for a measure- 
ment instrument in this field. The corresponding 
findings for the rank-difference correlations be- 
tween each form and self-estimates of interests 
were as follows : Forms A, range of correlations, 
то to .99, median, .80; Form B, range of cor- 
relations, .43 to .97, median, .74. Similar brief 
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studies of the Kuder Preference Record and 
Cleeton's Vocational Interest Inventory as re- 
ported by this reviewer in earlier Mental Meas- 
urements Y earbooks yielded median correlations 
of .75 for the Kuder and .68 for the Cleeton. 

Obviously, it should not be inferred from these 
data that the Edmiston inventory is necessarily 
as good a measure of interests as the Kuder or 
the Cleeton inventories. It may be that the rela- 
tively simple test situation presented by this in- 
ventory allows a halo effect which carries over 
from one form to the other and from test to 
self-estimate. The mental process required in 
filling out the inventory is quite similar to the 
thinking which takes place when one ranks the 
eight fields in order of interest. Thus, the new in- 
ventory is a less sophisticated measure than an 
instrument such as the Kuder, which is based on 
comparison of activities whose relationship to 
the different fields may not be apparent to the 
subject. Nevertheless, it is noteworthy that some 
positive evidence of reliability and validity was 
obtained in this small and informal study of the 
Edmiston inventory. It should also be noted 
that this inventory was constructed for use at 
the school and college levels and its trial with 
adults may not be an entirely fair test. 

Certain observations may be made about the 
scores and the scoring procedure. The categories 
are broad, and some of them are not precisely 
named. For example, one area is termed “social.” 
It is not clear from this name what kinds of 
social activities are implied. An examination of 
the items contributing to the score in this field 
indicates that the general area of social service 
is sampled. Although the scoring is essentially 
simple, the scoring directions are the most 
nearly unintelligible this reviewer has seen for 
any measurement instrument. These directions 
greatly need revision and improvement. A fan 
key or stencil for scoring could be devised; in 
fact, staff members of the Educational Records 
Bureau formulated a scoring procedure which 
enabled them to score 48 inventories at an aver- 
age rate of 374 minutes an inventory. This is 
very rapid compared with the scoring of most 
other interest inventories. 

Norms are needed for the interpretation of 
the scores on the inventory, and it may be de- 
sirable to investigate the possible need for norms 
differentiated according to sex. 

Everything considered, this reviewer would 
not recommend the use of the Edmiston inven- 
tory in its present form except in an experi- 
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mental way. He does, however, regard it as an 
interesting experiment in setting up a simple, 
easily administered, inexpensive instrument 
which may be valuable after further refinement, 
The fact that the inventory is not at present in 
the hands of a regular test publisher but is pri- 
vately printed suggests that probably the test 
author does not regard it as a final product and 
that further work designed to improve it may 
be contemplated. 


[739] 
*The Guilford-Shneidman-Zimmerman Interest 
Survey. Grades 9-16 and adults; 1048; 18 scores: 
artistic-appreciative, artistic-expressive, linguistic-ap- 
preciative, linguistic-expressive,  scientific-investiga- 
tory, scientific-theoretical, ^ mechanical-manipulative, 
mechanical-designing,  outdoor-natural, outdoor-ath- 
letic, business-political-mercantile, business-political- 
leadership, social activity-persuasive, social activity- 
gregarious, personal  assistance-personal service, 
personal assistance-social welfare, office work-clerical, 
office work-numerical; Form A; no data on validity; 
norms (48); $4 per 25; separate answer sheets must 
be used; 504 per 25 answer sheets; 50€ per 25 profile 
sheets of high school or college norms; 256 per man- 
ual; postage extra; боё per specimen set, postpaid; 
nontimed (50) minutes; J. P. Guilford, Edwin Shneid- 
пад and Wayne S. Zimmerman; Sheridan Supply 


REFERENCES 1 
т, Foster, Кемметн Еосеме. An Intercorrelational Investi 
eu of the Gutiford-Shasidmam Zimmerman Interest Ju 


l; 
aster’s thesis, University of Southern California (Los 
9. 


. P.; SHNEIDMAN, E. S.; AND ZIMMERMAN, 


nS. ford-Shneidman-Zimmerman Interest Survey. 
J Consult Psychol 13:302-6 Ag '49. * (РА 24:875) 


Grorce К. Bennett, President, The Psycho- 
logical Corporation, New Y orb, New York. 
This interest inventory consists of a total of 
360 items to each of which the subject responds 
by encircling on a separate answer sheet “D” as 
indicating disinterested, dislike, or don't know; 
“Н” if the activity has appeal as a hobby; V 
if it would be acceptable as a vocation. The sub 
ject is permitted to encircle both “H” and V 
when the activity is favored in both regards. The 
activities are described in such terms as “Mould 
a statue in clay,” “Write newspaper editorials, 
“Со to dances,” and “Think out the logical de- 
ductions from a scientific discovery.” The items 
fall into nine major categories ; each of these 1$ 
divided into two subcategories which are score 
separately. The answer sheet is so arranged tha 
the individual completes one cycle of the 18 sub- 
categories to a column; this permits horizon 
counting of each row to obtain the “Н” an 
“V” scores for that classification. Thus no key 
is necessary. d 
The items used were selected from a prelim ji 
nary form containing 540 activity descriptions. 
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The criterion for inclusion in the final form was 
internal consistency as determined from the data 
provided by 300 male college sophomores. High 
school normative data have been obtained from 
180 boys and 176 girls entering the eleventh 
grade in a southern California urban school. 
College norms are derived from the scores of 
389 men and 99 women students in a sophomore 
course in a southern California university. The 
manual states: “These norms are regarded as 
provisional and are to be revised as results from 
other parts of the country are obtained.” The 
norms are not presented in tabular form but are 
incorporated into profile sheets, one for each sex 
and educational group. The profile sheet is pre- 
pared by encircling the individual’s raw score, 
these scores being arranged in rows according 
to their positions in a normalized distribution 
resulting from the data of the appropriate norms 
group. C-score, centile, and T-score equivalents 
are shown at the top margin of each profile sheet. 
Data on reliability are provided both in the man- 
ual and in a supplementary issue of “Psycho- 
metric Notes" published in July 1950. The 
reliabilities are of the split half type and fall gen- 
erally in the high .80’s and low .go’s. The supple- 
ment contains a table of tetrachoric intercorrela- 
tion coefficients based upon 300 college men. 
These 778 range from zero to above .8. No data 
are given regarding validity. The authors say: 
“Citations of practical validity data for interest 
scores are typically conspicuous by their absence 
in connection with interest inventories. This 
Survey will be no exception to this respect.” The 
manual (which does not report intercorrela- 
tional data) goes on to say: “Evidences of in- 
ternal or factorial validity are to be found in 
intercorrelation data. * Tt is believed that the 
nine main category scores will intercorrelate 
generally low,” etc. The manual, however, con- 
tains nearly two pages devoted to the vocational 
implications of scores. We find, for example, 
that the person with high outdoor-athletic in- 
terest can be expected to find some of the follow- 
ing vocations attractive: athlete, big game 
hunter, boxer, coach, golfer, gymnast, racing 
driver, wrestler. 

This inventory contains some features which 
are novel and which should increase the ease of 
scoring and profiling. The separation of voca- 
tional and avocational interests would seem to 
have some advantages, and the answer sheet ar- 
rangement diminishes the time and cost of scor- 
ing to a very appreciable extent. It should be 
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noted that each of these virtues involves at- 
tendant detriments. Some of the activities listed 
can hardly be considered appropriate for both 
hobby and vocation. This reviewer found diffi- 
culty in imagining a career based on “Going to 
dances” (Item 68), “Helping friends move their 
belongings to a new home" (Item 69), or “Help- 
ing distribute food at a picnic” (Item 105). Per- 
haps it is feasible to make a hobby of “Ѕирегуіѕ- 


‚ ing a group of professional people" (Item 84) 


or “Being responsible for establishing policies 
and procedures in an industrial organization" 
(Item 102). The systematic arrangement of the 
answer sheet tends to become apparent to the 
examinee and the authors caution that “it is im- 
portant to prepare the examinee to take the 
proper attitude toward the test at the start.” 
Probably neither of these features need be a 
cause of serious concern. 

The most serious defect in this instrument 
would seem to lie in the implication that a test 
is useful with norms obtained from small sam- 
ples drawn from a single community or uni- 
versity. The characteristics of climate, academic 
opportunity, and vocational specialties in the Los 
Angeles area may produce interests representa- 
tive of the country as a whole, but it is more 
likely that these are specific to an area including 
only a small fraction of the nation’s population. 
It may be proper to point out that other interest 
inventories are lacking in validation, although 
this is but partially true of the Strong VB: 
but it does not strike the reviewer as equally 
justifiable to suggest that so tentative an instru- 
ment can provide useful data on the basis of 
which to advise an individual as to his subse- 
quent career. The GSZ Interest Survey may well 
prove at some later date to have significance for 
educational and vocational guidance. The data 
so far submitted do not appear to justify its 
present use except as an experimental instru- 


ment. 


Wirsun L. Layton, Assistant Professor of 
Psychology, and Assistant Director, S tudent 
Counseling Bureau, University of Minnesota, 
Minneapolis, Minnesota. ! a 
This survey purports to measure interests in 
nine general areas, each of which is then broken 
down into a dual classification so that 18 interest 
scores result. For example, in the artistic area, 
appreciative and expressive scores are given, 
distinguishing between those who like art as 
consumers but not as producers. In the mechani- 
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cal area, manipulative and design scores are 
given so that a distinction between the tool op- 
erator and the tool designer or engineer is pos- 
sible. The same sorts of distinctions are made in 
each of the other areas. 

Another unique feature of the survey is the 
dual score for each of the 18 interest fields. On 
the answer sheet there are three possible re- 
sponses for each item: H and V stand for hobby 
and vocation respectively, and D represents all 
categories not specifically denoted by the H and 
V responses—dislike, disinterested, don't know, 
etc. The survey asks for a direct response to the 
type of activities listed as items, and both H and 
V responses may be made to a single item. 

The items were selected from a preliminary 
form of 540 items, which was administered to 
700 subjects—400 high school students, evenly 
divided as to sex at the beginning of the ті 
grade, and 300 male college students in a sopho- 
more course. An internal consistency item analy- 
sis was made on the data from the college group. 
On the basis of this analysis, the 20 most in- 
ternally consistent items in each interest field 
were selected to comprise Form A of the survey. 

In regard to validity the authors say, “Cita- 
tions of practical validity data for interest scores 
are typically conspicuous by their absence in 
connection with interest inventories, This Sur- 
vey will be no exception in this respect. Appro- 
priate external criteria for this purpose have 
never been sufficiently developed." If this form 
of rationalization becomes widespread, we will 
never have appropriate criteria against which 
to validate interest tests. Perhaps we should 
spend more time developing criteria and less 
constructing insufficiently validated tests. 

The authors present evidence for internal or 
factorial validity in a study of intercorrelation 
data. These data give some indication of how 
much uniqueness each score possesses. If a score 
shows high reliability and low intercorrelations 
with the other scores, it tends to measure some- 
thing that none of the other scores measures. 
However, these data do not give one a clue as 
to exactly what the score does measure so 
uniquely, and hence are only a vague indication 
of validity as commonly defined. 

Reliability coefficients computed on first-half 
versus second-half scores are given in the man- 
ual. They are based on the scores of 100 boys 
and roo girls in the high school sample. The 
authors state that the distributions were so seri- 
ously skewed that Pearsonian correlations and 
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standard errors of measurement would have 
been meaningless. Consequently, tetrachoric cor- 
relations were computed. Since the tetrachoric 
correlation coefficient is an estimate of the prod. 
uct-moment correlation coefficient in a bivariate 
normal population and not in a nonnormal popu- 
lation, the obtained coefficients are also mean- 
ingless by the authors' criterion. Nevertheless, 
these correlations were substituted in the Spear- 
man-Brown formula. The resulting coefficients 
range from .72 for the social activity-gregarious 
vocational score for females to .96 for the me- 
chanical-designing vocational score for males, 

The high school norms given on the profile 
sheets were based on 11th grade high school stu- 
dents in a southern California urban school, 180 
boys and 176 girls. The college norm group was 
composed of 389 men and 99 women students 
in a sophomore course in a southern California 
university. The manual states that these norms 
are provisional and are to be revised as results 
from other parts of the country are obtained. 
The test manual was copyrighted in 1948; no 
other normative datà are available. The norms 
given in the manual are inadequate for general 
use. 

As have most interest test authors recently, 
the authors of this test list under each scale à 
set of occupations which might be appropriate 
for people scoring high on the scale. No evidence 
whatsover is presented for the validity of these 
lists. 

In summary, the survey represents а well 
thought out approach. The 36 scores should be 
very useful in counseling. However, until more 
adequate norms and reliability and validity data 
are available, the survey should be used only 
experimentally. 


y [740] di 
*Henderson Analysis of Interest [Second Edi- 
tion]. Grades 9-16 and adults; 1950; occupatione 
preferences in 14 areas: business service, clerical, а 
counting and statistics, persuasive, managerial, Len 
science, physical science, biological science, feng 
ing, art and music, teaching, writing, mechanical, [еу 
ual; 1 form; $9 per 100 sets of test and profile Me 
30 day trial order: $5 per 50 sets of test and Ме 
sheet; 40€ per specimen set; postpaid; nontime 40 
35) minutes; Robert W. Henderson; the Author, 9 
Eighth St., N.E., Massillon, Ohio. * 


Waırgur L. Layton, Assistant Professor of PY 
chology, and Assistant Director, Student Сани 
seling Bureau, University о} Minnesota, Min 
neapolis, Minnesota. Ж 

The 14 scales on this test, which coincide 
somewhat with curricular and occupation? 
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areas, are grouped into business, professional, 
and active areas. Under the business area are 
listed the business service, clerical, accounting 
and statistics, persuasive, and managerial scales. 
Under the professional area are grouped the so- 
cial science, physical science, biological science, 
engineering, art and music, teaching, and writ- 
ing scales. Under the active area are classified the 
mechanical and manual scales. The label “active” 
for this last area seems quite inadequate to de- 
scribe the items scored for the two scales. 

The test items are presented in 40 groups of 
7 items each. The individual is asked to select 
2 items, if possible, from each group of 7, There 
are 20 items for each of the 14 scales, These 
items were taken from the author's College In- 
terest Inventory and his Work Interest Analy- 
sis on the basis of their being "internally signifi- 
cant." From this, although it is not made clear 
in the manual, one assumes that the author con- 
ducted some form of internal-consistency item 
analysis. 

The test author feels that “since a respondent 
is required to select only one or two choices from 
each set of seven possibilities, it is evident that 
his choices will be truer preferences than in such 
tests as the Lee-Thorpe, the new Thurstone, and 
Kuder where the individual is forced to select 
one from a choice of two or three." This, of 
Course, assumes that there are one or two items 
within each group of seven that are attractive 
to the individual, a doubtful assumption ,when 
one looks at the items. 

The items which were finally selected and in- 
cluded in the test are not representative of the 
areas which they are attempting to measure. For 
example, 6 of the 20 items which purport to 
Measure interest in business service are from the 
Occupation of beauty operator or related fields: 
Shampoo customers’ hair, hairdresser, make up 
expert, give facial massages, pluck eyebrows by 
electrolysis, massage expert, and give health 
baths, Five items scored for business service in- 
volve food handling: serve short orders, work 
behind a soda fountain, serve food in a restau- 
Tant, waiter (waitress), and serve soft drinks. 
In each of the other 1 3 areas one finds that the 
tems are not in relative proportion to activities 
°F occupations in the area. 
giae criticism of the items can be made 

the standpoint of duplication of content. 
"i аре, the "waiter (waitress)" item 
with be perceived by most people as identical 
the item "serve food in a restaurant." This 


TESTS & REVIEWS: VOCATIONS—INTERESTS 


lidity and reliability data, this inventory is not 


740 


is also true in the mechanical area where the two 
items “auto mechanic" and “repair automobiles" 


are used. Use of these items gives undue weight 


to that particular activity and tends to determine 
the factors measured by the scale. This is particu- 
larly true when an internal-consistency method 
of item analysis is used. 

A further criticism of the items can be made 
in terms of their readability and understanda- 
bility, particularly when the test is self-adminis- 
tered at the high school level. Our experience at 
the University of Minnesota in giving Strong's 
Vocational Interest Blank to all classes at the 
University High School indicates that the 
younger students do not have sufficient knowl- 
edge of various occupations to make intelligent 
choices without assistance. For such students 
one wonders about the suitability of many spe- 
cialized items: develop aniline dyes, construct 
stamping dies, describe physical events with 
mathematics, investigate ionization problems, 
pluck eyebrows by electrolysis, prepare evolu- 
tion charts, investigate problems of heat transfer, 
do research in immunology, and investigate em- 
bryo development. 

Henderson gives test-retest correlation co- 
efficients based on an N of 40 people as evidence 
of reliability. These coefficents range from .80 
for Business Service to .96 for Physical Science. 
There is no description of the group of 40 people 
in the manual to aid in evaluating these fig- 
ures. 

As partial evidence of validity Henderson re- 
ports correlations between the Кийе" Prefer- 
ence Record and his test on a group of 124 high 
school students. The correlations range from .30 
between the Henderson accounting and statis- 
tics scale and the Kuder computational scale to 
.77 between the two mechanical scales. This 
represents the doubtful procedure of validating 
a newly constructed inventory against an exist- 
ing inventory, the validity of which is still being 
experimentally determined. ter 

Henderson lists a group of representative jobs 
for each scale. These seem to have been classified 
on a speculative basis. He presents no evidence 
that high scores on a particular scale are related 
to any other measure of interest in, or satisfac- 
tion with, a particular profession. 

This inventory appears to be an attempt to 
improve upon the Kuder Preference Record. If 
such was the intent, the author has failed. Be- 
cause of poor items and the lack of adequate va- 


to 
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recommended for use until further research justi- 
fies it. 


Donatp E. Super, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. 

This test consists of 40 seven-item groups, 
some consisting of occupational titles and some 
of occupational activities or duties. Thus, there 
are 20 items per scale. As in the case of the 
Thurstone Interest Schedule this means that 
one change of response can make a sizable change 
of score; in the case of the Business Service 
Scale, for example, a choice of no items puts a 
man at the roth percentile (why not the Ist is 
not explained), a choice of one item puts him 
at the 3oth, and a choice of three items puts him 
at the 7oth. Despite this fact, the one-week re- 
test reliabilities range from .80 to .96, for a 
group of subjects the size and heterogeneity of 
which is not specified. 

APPLICABILITY. The test is said to be applica- 
ble to ninth grade students as well as to college 
students and adults, but no evidence to this effect 
is cited; the fact that the norms are based on 
high school juniors and seniors and college stu- 
dents suggests that there is no such evidence. 
Some of the vocabulary would be difficult for 
the average ninth grader, who probably has no 
idea what “ionization problems" and “geophysi- 
cists" are. Evidence as to the vocabulary level 
would be desirable, with the range as well as 
the average of the grade levels made clear. 

norms. The manual states that "it is believed 
that these norms are fairly representative of 
young people in senior high school and in the 
first year of college," but no evidence is cited to 
show that these 90 high school juniors and sen- 
iors and 150 college students are typical. It is 
not stated whether they all were in one high 
school and one college and from one locality or 
whether the group is cosmopolitan. The group 
being as small as it is, the students are probably 
from one institution. It hardly seems likely that 
students from an academic high school in a priv- 
ileged community are typical of students in a 
trade high school in an underprivileged neigh- 
borhood, or vice versa. Perhaps this is why the 
Mechanical and Manual Scales yield average 
raw scores of two. Obviously, a more adequate 
description of the norm group, and probably 
more adequate norms, are needed. The fact that 
the author cites “clinical experience” with the 
test as evidence of its validity is further evidence 
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of fuzzy thinking and, in fact, of lack of evidence 
concerning the appropriateness of the norms and 
the validity of the scales. 

The norms are described as Permitting the 
comparison of the strength of a person’s interest 
in a field with that of other persons, Assuming 
that the number and representativeness of the 
norm group were sufficient for this, the meaning 
of such data would still be an open question, For 
they do not tell how interested a person should 
be, in order to have a reasonable chance of being 
satisfied, successful, or stable in a given occupa- 
tion. Should an auto mechanic be more inter- 
ested in mechanics than 25, 50, 75, or 9o per 
cent of high school and college students? Occu- 
pational norms are clearly needed. 

A list of representative jobs for each scale is 
included in the manual, with no indication as to 
the validity of the classification of jobs. Exami- 
nation of the grouping shows that it differs in 
certain important respects from that of the Dic- 
tionary of Occupational Titles) Strong's re- 
search, and other empirical classifications. For 
example, policemen are classified as a social 
service occupational group and nurses as a bio- 
logical science occupational group. Even "com. 
mon sense" seems contradicted at certain points 
without supporting data: governesses are à 
teaching group, while nursemaids are a service 
group; drill press operators and weavers belong 
to the machine trades, while automatic machine 
operators are in the manual group. à 

VALIDITY. Аз in the case of many insufficiently 
validated tests, the author cites the nature “ 
the items as evidence of the inventory's validity. 
Correlations with the Kuder Preference Record: 
Vocational, are also cited, ranging from .30 for 
the accounting and computational scales to 7 
for the writing and literary scales, with 5 ° 
the тт r’s below .50. These low correlations at 
interpreted in the manual as evidence that thè 
use of “both tests will yield a better total ate i 
tion of interests than will either alone,” but Jus 
what the Henderson is measuring that the uder 
is not measuring is not explained. ‘sof 

USE. At this stage, the Henderson Analysis | 
Interest can at best be described as suitable a 
for experimental use, in situations in which al? 
can be gathered for the establishment of m 
and for the validation of scales and in which i 
use is to be made of the scores for counseling o 
ux ener Comms quem pee е 


ation, 
р о! ‘itles: V ational к 
ational T tles: Part » S AC tiona 
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Washington, D. nment rinting 
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selection. When data are needed for immediate 
practical purposes, other inventories which are 
supported by far more research, and the meaning 
of which is at least partly understood, are avail- 
able. The Henderson inventory provides further 
evidence that it is far easier to write seemingly 
good test items than it is to develop a usable test. 


[741] 
XInterest Check List. Adults; 1946; an interview- 
ing aid; 8 work areas: artistic, musical, literary, enter- 
tainment, clerical and sales, service, agriculture-marine- 
forestry, mechanical; individual; no data on reliability 
and validity; $1.75 per 100; 5€ per sample copy ; remit- 
tance must accompany order; nontimed (10-30) min- 
utes; prepared by Testing Branch, United States Em- 
porma Service; United States Government Printing 
ce. 


Mitton L. Brum, Associate Professor of Psy- 
chology, The City College, New York, New 
York. 

The Interest Check List is ostensibly an inter- 
viewer's aid. Since it is not scorable, it is not 
intended as an objective instrument. Accord- 
ingly, raising such questions as validity, reli- 
ability, and norms is inappropriate. A critical 
evaluation of the check list must, therefore, be 
equally as subjective as the interpretations of 
the counselors who use the form. 

Items intended to cover eight broad work areas 
are included in the check list. The proportion 
of items to areas is an unexplained variable. For 
example, four of the areas have only five items 
each, but the other four areas have from 16 to 
49 items each. 

There are 25 groupings according to the code 
used in Part IV of the Dictionary of Occupa- 
tional Titles These groupings also vary in the 
number of items; the smallest has one item and 
the largest has ten. Each grouping has the code 
reference printed alongside. One can only guess 
the effect that such hieroglyphics may have on 
the respondent. It may well serve to encourage 
а concentration of check marks and not be a 
reflection of the diversified interests of the sub- 
у Who is trying to make а good (integrated) 
ation, A better format would be to mix the 
{ ei present them in ungrouped fashion and 
ШЕ: үсүе the counselor with a templet that 

u d mechanically reveal the occupational code 
classification, 
өе» are tasks rather than jobs. Some 

ples are: “Going to some trouble to make 
tion, Var Manpower Commisson, Bureau of Manpower Utiliza- 
ird Т: Pre yore, рза Chasen, Re 


Office, sition, Washington. Government Printing 
э 1944. Pp. xiv, 242. Paper. 


foods look attractive" (this is in cooking in the 
area service work) ; "Mixing dough for bak- 
ing" (this is in the group known as processing 
in the mechanical work area) ; and "Driving an 
automobile" (this is in the mechanical work 
area). It does appear as if many of the items may 
be suggestive of other work areas, but no men- 
ki of such possible interpretations is made at^ 
all. 

A Self-Training Handbook, in addition to a 
one-sheet, printed manual of instruction, accom- 
panies the Interest Check List. The first para- 
graph of the handbook is “This is a Self-Train- 
ing Handbook on the Use of the Interest Check 
List. You are your own instructor. The material 
is prepared so that you may go straight through 
the Handbook. For your own sake, don’t look 
on the last page to see ‘how the story ends.’ " 

If the authors are correct in assuming the level 
of the counselor, then it would appear that such 
a check list is inappropriate and that a simple 
and objectively scored check list would be safer. 
If, on the other hand, the authors prefer a “clini- 
cal" rather than an objective inventory and their 
counselors are capable of insightful clinical diag- 
nosis, then the handbook has been written in a 
too indulgent and paternal manner. j 

This check list is not as good as the more 
widely known and previously published ones. Its 
value is limited by its format and by the ex- 
tremely unknown variable of the interpretations 
the counselors are capable of making. 


Howanp R. Taytor, Professor of Psychology 
and Head of the Department, University of 
Oregon, Eugene, Oregon. 

This questionnaire is designed to help orient 
people seeking employment. It is organized 
around the job classifications given in the Dic- 
tionary of Occupational Titles, Subjects check 
the different types of activities they “would be 
interested in doing even if they have never done 
them.” Each of the 135 kinds of work is de- 
scribed in terms of the “action verbs” upon 
which the classification into job families is pri- 
marily based. As a means of securing rapport 
in an interview, counselors report that it is quite 
effective. | 

No attempt is made to summarize the infor- 
mation given in terms of a score nor to distin- 
guish well defined interests based on experience 
from, vague notions of what a person thinks he 
might like to do. The avowed purpose and chief 
value of the form is, therefore, to shorten the 
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time required for interviewing and to secure a 
comprehensive survey of any conscious prefer- 
ences an individual may have with reference to 
eight major fields of work which can ordinarily 
be entered without previous experience. There 
are, of course, no norms. Nor is any objective 
determination of reliability or validity in locat- 
‘ing work interests possible. Since there are many 
research techniques which might materially en- 
‘hance the usefulness of such a tool, or at least 
afford means of checking its adequacy, it is un- 
fortunate that no thoroughgoing standardization 
seems to have been attempted. 

The manual clearly explains how to present 
the list to subjects and to use the information 
obtained. There is also a handbook of typical 
cases which can be used as self training exercises. 

As a psychological measure, however, it has 
all the defects of similar forms of self-report 
without any of the virtues which can be antici- 
pated from multiple sampling of opinions and 
attitudes and their scaling to insure unity or gen- 
eral significance with reference to a criterion. 
No doubt in the hands of a skillful interviewer, 
the check list will provide leads for further ex- 
ploration. It may also afford a client some ink- 
ling of the specific nature of job requirements. 
But if a person has not previously engaged in 
such activities, what does it mean to ask him 
if he likes “creating and composing musical com- 
positions” ; “doing literary research for histori- 
cal publications” ; “developing advertising cam- 
paigns”; “sorting, indexing, and assembling 
papers and other written records” ; “selling in- 
surance by pointing out advantages and disad- 
vantages of various policies” ; “studying social 
and economic conditions in order to help indi- 
viduals or groups solve problems of general wel- 
fare"; or "mixing foods to obtain new flavor" ? 
Verbs, like other words, derive their meaning 
from activities associated with them in the read- 
er's experience. 
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*Kuder Preference Record— Vocational. Grades 
9-16 and adults; 1934—51; IBM; 3 editions; separate 
answer pads or answer sheets must be used; cash or- 
ders postpaid; G. Frederic Kuder; Science Research 
Associates, Inc. * 
а) FORM в (NINE scALE). Grades 9-16 and adults; 
1934-46; a revision of Form А; 9 scores: mechanical, 
computational, scientific, persuasive, artistic, literary, 
musical, social service, clerical; 2 editions; 1 form, 
42; revised manual (746) ; 55¢ per 25 profile sheets for 
either men and women (746) or boys and girls (744); 
75€ per specimen set of any one edition ; nontimed (40- 
50) minutes. 

1) Form BB (Hand Scoring Edition). 49é per test 

and answer pad; $2.15 per 25 answer pads. 


2) Form BM (Machine Scoring Edition), зд 
test; $2.90 per 100 IBM answer Sheets ; $7.50 Fs 
of scoring keys. 
b) FORM BI (SHORT FORM). College and adults; туб. 
48; an adaptation of Nine Scale Form for use in in. 
dustrial situations; 9 scores: same as for Form р 
(Nine Scale) ; IBM; Form Br (748) ; revised manual 
('46—same as for Nine Scale Form); 396 per test: 
$2.90 per 100 IBM answer sheets ; $3 per set of scoring 
stencils ; 49€ per test and answer pad; $2.15 per 25 ap, 
swer pads; 55¢ per 25 profile sheets Саб)? уф pr 
specimen set; nontimed (30) minutes, 
с) FORM C (ELEVEN SCALE). Grades 9-16 and adults; 
1948-51; II scores: same as for Form B (Nine Scale) 
plus outdoor, verification; 2 editions; 1 form, '48; sec- 
ond revised manual ('51) ; ss per 25 revised profile 
sheets for either men and women ('sr) or boys and 
girls ('50) ; 75€ per specimen set of any one edition, 
I) Form CH (Hand Scoring Edition). 49é per test 
and answer pad; $2.15 per 25 answer pads. 
2) Form CM (Machine Scoring Edition). 39¢ pet 
test; $2.00 per 100 IBM answer sheets; $7.50 per 
set of scoring keys. 
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Epwanp S. Bonprx, Associate Professor of Psy- 
chology, University of Michigan, Ann Arbor, 
Michigan. 

For most purposes in testing vocational in- 
terests this inventory and Strong’s Vocational 
Interest Blank (VIB) are the most frequently 
considered choices. Since, of the most recently 
developed instruments, this one comes closest 
to meeting the standards set by Strong, it seems 
appropriate to evaluate this test through com- 
Parison with the VIB. 

Where the Strong approaches the measure- 
ment problem by developing scales representing 
the constellation of preferences which distin- 
guish one occupational group from another, the 

uder purports to isolate the important inde- 
Pendent dimensions of vocational interests. This 


is considered a gradual process: for example, 
the most recent edition had added a new dimen- 
sion, outdoor interests. It is assumed that given 
these dimensions, one can then identify and de- 
rive the constellations which distinguish diverse 
occupational groups. This is the theoretical ori- 
entation of the factor analysis approach in psy- 
chological measurement. Though they approach 
the problem from opposite directions, it would 
appear that the two tests arrive at exactly the 
same point. That they are in fact transmutable 
has not yet been adequately demonstrated. This 
test identifies broad areas of vocational interest 
and proceeds to translate them into specific oc- 
cupations (through mean profiles for occupa- 
tional groups and regression equations). The 
Strong identifies interests of specific occupa- 
tional groups and through factor analysis trans- 
lates them into broad areas. 

The Kuder is an instrument that went through 
a carefully planned and thoroughly executed de- 
velopmental process. It falls short of the Strong 
in the thoroughness of its validation and other 
supporting data. One major set of validation 
data is the presentation in the manual of mean 
profiles for occupational groups largely derived 
from data contributed by test consumers. The 
author’s device of soliciting validation data from 
test users, while a convenient way of solving a 
costly, time-consuming problem, is less than ade- 
quate as a rigorous demonstration that these 
interest scales are indeed relevant to distinguish- 
ing occupational groups in terms of interests. 
First, it is difficult to interpret the significance 
of mean test profiles for a given group without 
information as to the conditions under which 
the test was administered. Was this a study in 
which the subjects had no particular stake? Were 
they taking the test as applicants for a job? Was 
it for counseling? A second, and even more criti- 
cal issue, is the unlikelihood that validation data 
obtained in this manner will represent an un- 
biased estimate of the validity of the test. It is 
hard to imagine anyone ordinarily being moti- 
vated to send to a test author data which demon- 
strates that it does not do what the author hopes 
and believes it does! A further drawback in the 
validation evidence is the very small number of 
cases on which many of the mean profiles are 
based. One important type of supporting data 
about this test that is not now available is de- 
finitive evidence about the stability of test re- 
sults. Nothing approaching the long term follow- 
up studies on the VIB has been reported. In fact, 
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this reviewer is not aware of any follow-up study 
covering a period greater than one year. 

Both authors meet high standards in making 
available important and useful information to 
test users, Two important criticisms of the man- 
ual should be made. First, while data on 
reliability from the point of view of internal con- 
sistency are presented, there is no real considera- 
tion of the allied but different question of sta- 
bility. Users need to know how rapidly, how 
frequently, and under what conditions changes 
in scores can be expected to occur. Second, the 
manual for Form C abandons the practice fol- 
lowed by preceding manuals of identifying which 
of the mean scores for occupational groups repre- 
sent significant deviations from the means of 
the general population. This is the removal of 
a necessary safeguard against overinterpretation 
of the results from this test. 

In summary, the Kuder can be considered an 
acceptable tool for the measurement of interests 
which has advantages over the VIB in terms of 
cost and time absorbed in scoring. However, it 
still cannot be considered as a fully developed 
instrument until further validation studies and 
evidence on stability are available. 


HaRorp D. Carter, Professor of Education, 
University of California, Berkeley, California. 

This well known inventory is now available 
in more than one form. Form C, the new and 
improved product, is designed to replace the 
older Form B. The new inventory yields rr 
scores, including the old 9, plus a new score for 
outdoor interests, and a verification score. The 
verification score is designed to help identify 
those who have not followed directions and those 
who have not answered carefully. 

Another new feature is the glossary of words 
and phrases that may prove difficult for some 
students. The latest manual lists suggested oc- 
cupations for each area of interests, along with 
appropriate references as sources of information 
about each occupation. New sets of norms for 
adult men, extended validity data, and job pro- 
files for both men and women are also included. 
The Kuder Book List, offered by the same pub- 
lishers, is a promising new development. 

The claims made by the publishers are modest 
and reasonable. They suggest that the Kuder 
Preference Record is useful for introducing stu- 
dents to organized study of occupations and 
career selection, and for guiding them into edu- 
cational and vocational activities they will find 


satisfying and enjoyable. In the reviewer's judg. 
ment, the Kuder Preference Record is the most 
useful available instrument for this purpose, It 
is especially well fitted for use with the mass of 
high school students, and the publishers have 
provided stimulating and useful auxiliary ma- 
terials for teachers and counselors. 

Published research has indicated that the vari- 
ous scores provided by the Kuder Preference 
Record are satisfactory in reliability. А digest 
of such studies leads to the judgment that the 
test is one of the best from the standpoint of 
validity. In the opinion of the reviewer, the 
Strong Vocational Interest Blank is often to be 
preferred for investigation of the vocational in- 
terests of college students. However, for use with 
high school students and average adults, es- 
pecially those not preparing for the professions, 
the Kuder Preference Record is a practical and 
valuable instrument. Superior students at the 
seventh grade level can take this inventory with 
the expectation of getting valid and educationally 
significant results. 

The provision of a glossary may be regarded 
as a useful practical service. The test specialist 
is aware, of course, that research must settle 
various questions as to whether the inventory 
is made more effective when such explanations 
are provided. Some studies suggest that scores 
are more reliable and valid when terms are not 
explained. Interests tend to develop along with 
other aspects of mentality and personality; the 
person who is ignorant of a thing can hardly have 
а well developed interest in it. Nevertheless, al- 
though glossaries and other types of explanatory 
material may not be very useful in the measure- 
ment of interests, they may have an important 
function in connection with instruction concern- 
ing interests. э 

In summary, ће Kuder Preference Record 15 
a carefully constructed and well planned instru- 
ment. While full exploration of its values an¢ 
limitations remains as a field for psychologica 
research, it is clearly one of the best available 
instruments of its type, especially for use WP 
high school students. Its practical values just! 
extensive use by high school teachers and Qum 
selors, and by those engaged in counseling repre 
sentative groups of adults. 


H. M. Fowrzn, Associate Professor of PANE 
tional Research, University of Toronto, Тотош 
Ontario, Canada. 
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I. DEVELOPMENT AND PRESENT STATUS 
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THE KUDER PREFERENCE RECORD. Of the various 
interest measures now available, the Kuder 
Preference Record—V ocational is one of the two 
most widely used. It is estimated that in Ontario 
it is used, at least for high school students, more 
than all other interest blanks combined. 

The popularity of this measure as a clinical 
instrument is matched by its appeal to research 
workers who are interested in determining its 
scientific value. The latest manual (March 
1951) has a list of 138 bibliographic references, 
of which 35 are additional to those contained in 
the first revision of the manual in February 1950. 
The reported research includes studies of the 
reliability and validity of the record, compara- 
tive studies designed to assess the particular 
strengths and weaknesses of this instrument in 
competition with other interest inventories, and 
a wide variety of investigations to determine the 
value of the preference scores in special situa- 
tions, for example, in veterans’ advisory bureaus. 
A number of workers have supplemented the 
work being done by the author and his publishers 
by developing local norms for high school sen- 
iors, for college freshmen, for university business 
school seniors, and others. 

One of the reasons that the Kuder Preference 
Record appeals to test users is that the author 
and his publishers are untiring in their efforts 
to improve it, to keep it up to date, and to pro- 
vide necessary interpretative data. The first re- 
search was conducted during the academic year 
1934-35 when the author asked 500 students at 
the Ohio State University to rank in order of 
preference five activities in each of 40 groups. 
Subsequently, much research has been done. 
Even since the publication of The Third Mental 
Measurements Yearbook in 1949, the measure 
has been improved and refined in a number of 
ways. A new scale, the tenth, in an interest area 
related to agricultural, naturalistic, and outdoor 
activities, is now available for use with the C 
forms, It has been called “Outdoor” by the au- 
thor, and is numbered “О” in references to it 
in the tables of occupations. Another develop- 
E has been the provision of a verification 
Ma ui was devised at the request of those 
ҮШ ee it necessary to identify students 
unde filled in the blank carelessly or without 
uh 1 anding. Form Вт, the short form of the 
КУШ appeared in 1948 and was developed for 
tae situations where it is not prac- 

h o use the longer Forms BB and BM. 

© author recommends that, for the present, 


the use of Br be restricted to adults ; later a short 
form may be developed for use with eighth grade 
or high school students. 

П. COMMENTS ON THE Корек PREFERENCE 
RECORD. A. THE KUDER AS A TEST. Is the Kuder 
really a test? According to the manual its spe- 
cific uses are (a) to point out vocations with 
which the student may not be familiar but which 
involve activities of the type for which he has 
expressed preference, and (b) to check on 
whether a person's choice of an occupation is 
consistent with the type of thing he ordinarily 
prefers to do. The name, Kuder Preference 
Record, is itself suggestive. This reviewer looks 
upon the blank as an instrument for testing only 
if one considers that the student is being tested 
with respect to his ability to give a true and com- 
plete picture of his interests through the re- 
sponses that he makes to the items. 

In the administration of this blank it is ex- 
tremely important to obtain the proper rapport 
between student and counselor. Fortunately, stu- 
dents are intrigued by the unusual form and ap- 
pearance of the booklets and enjoy making their 
responses by means of the special pin-prick de- 
vice. These features assist the administrator in 
motivating the students to be honest and thor- 
ough in choosing the responses which provide 
the most satisfactory self-analysis of their in- 
terests or preferences. 

в. FORMAT. The format of the various forms 
of the Kuder Preference Record leaves little to 
be desired. The blanks are printed in clear black 
type. The method of marking the responses, 
both for hand scoring and machine scoring edi- 
tions, is easily followed by most students. Some 
do complain about the pin recording feature, 
saying that the pin should have a wooden rest 
or that it should-have a larger head. The blank 
can be given either individually or to groups and 
the administrator need give very little assistance. 
The directions, which are clear and not too 
wordy, are set up in brief paragraphs which ap- 
pear on the booklets themselves. The special an- 
swer sheet system works well in general, but 
those who sell large quantities complain that in 
some runs of the booklets the pages are not lined 
up properly. ih 

c. ITEMS. According to some authorities, one 
big disadvantage of the Kuder is the forced 
choice. These critics claim that to ask a student 
to say that he prefers one activity to another 
when in truth he may like the two equally well, 
serves only to irritate the conscientious student 
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who would like to give accurate responses and 
to encourage the careless student to become even 
more careless. This disadvantage of the prefer- 
ence item may be more apparent than real. In 
the experience of this reviewer most students 
are able to make choices for most of the items 
without too much difficulty. A check of the blank 
suggests that most of the subitems are “normally 
competitive,” to use Truman L. Kelley's ex- 
pression. For example, “work mathematical 
puzzles" appears with “play checkers,” and 
"study physics" appears with "study public 
speaking." For the few cases in which students 
are not able to give meaningful responses, it may 
be assumed that the scores for the ten scales will 
not be greatly affected since they depend upon 
the responses to large numbers of items. 

Of more concern to some users is the inter- 
pretation placed by students upon some of the 
words and phrases used in the items. Christensen 
(37) completed an experiment designed to test 
the effect of giving systematic instruction in the 
meaning of the items found in the blank ; he con- 
cluded that the results appear to indicate that 
"such instruction probably played a role in caus- 
ing the subjects to change their preferences." 
Clearly, students' responses may vary according 
to what they think the items mean. The latest 
editions of the manual and test booklet include 
a glossary of terms that is intended to minimize 
the variation of interpretation from individual 
to individual. It is questionable, however, 
whether this has much value. It might be better 
to include explanatory material in the items 
themselves, a technique which has been used 
quite effectively by Lee and Thorpe in their 
Occupational Interest Inventory. 

The preference item has definite advantages 
as well as disadvantages. This reviewer believes 
that to ask a student whether he prefers one 
activity over another provides a much more 
meaningful and concrete situation than merely 
to ask him whether he likes or dislikes certain 
activities. Also it makes possible choices between 
two activities, both of which are liked or both 
disliked, thus enabling the counselor to get a 
more complete picture of the student. 

The author is to be commended, also, for 
avoiding in the blank the use of what might be 
called the "million dollar" item. Presentation of 
such unusual or unlikely situations is not apt 
to elicit the type of response that will be helpful 
to the counselor in the eminently matter-of-fact 
task of suggesting suitable vocations. 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK E 


D. RELIABILITY. The Kuder Preference Ro. 
ord was constructed to yield homogeneous 
groups of items, and from the first, the author 
checked the reliability of its scoring categories, 
In view of this, it is rather surprising to find the 
following statement in the report of one study: 
“It is possible to obtain by chance Scores on the 
Kuder Preference Record of a character which, 
if made by high school youngsters, might be re. 
garded as significant." How reliable are the 
Kuder scores? 

The most recent revision of the manual gives 
separate reliability estimates for each of the ten 
scoring categories for each of four different 
groups, labeled as follows: 1,000 men, 10 
women, 100 boys, 100 girls. On the average, the 
reported reliabilities are approximately .90; the 
lowest reliability reported is .84 and the high- 
est is .93. Some studies, such as those by Triggs, 
agree substantially in their estimates of relia- 
bility with those reported in the manual. Others 
such as the study by Tyler (62) have given esti 
mates which are consistently lower than those in 
the manual. On the whole, the evidence suggests 
that the Kuder scales yield scores which are at 
least as reliable as the average personality meas 
ure. However, it is questionable whether scores 
obtained from a single administration of the 
blank should be accepted completely at their face 
value. The reported reliabilities are none too 
high for individual differentiation, and counsel- 
ing must be done on an individual basis. . 

The author might well provide other estimates 
of test reliability. Only estimates of the cor 
sistency of the scales, computed by using the 
Kuder-Richardson Case Four method, are given: 
Test-retest estimates are not given in the lates 
revision of the manual, although they were given 
in earlier editions. The test-retest reliability est" 
mates ranged, generally, somewhat higher "s 
the consistency estimates. Also, the аш Ч 5 
should provide more information about y 
groups used and the situations in which the |, 
liability estimates were obtained. Otherwise H 
unsophisticated reader may think of these ё 
mates as the reliability of the blank. hol 

E. INTERPRETATION OF SCORES. The меа Ё 
information given in the test manual d à 
unusually complete norms, as well as a desc if 
tion of the procedure to be followed in pro ss 
the scores and relating them to suggested 4 on 
pations. The high school norms are base ай 
scores obtained on Form С by 3,418 boys vet 
4,466 girls in high schools well distributed 
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the country. Two sets of norms for male adults 
have been developed, and what should be con- 
sidered tentative norms for female adults are 
provided. Two separate profile sheets, one for 
adult men and women and the other for boys 
and girls, in which raw scores are converted to 
percentiles, serve to center attention upon cer- 
tain broad areas of interest. 

A table given in the manual classifies occupa- 
tions according to the profiled interést areas. 
However, as pointed out by Kuder, the classifi- 
cations must be regarded as tentative. A number 
of counselors have complained about the place- 
ment of occupations. Some of the divisions and 
subdivisions set up to fit the ten interest areas 
appear to be rather fanciful. One suspects that 
they were developed through armchair reason- 
ing rather than by actual study of occupational 
differences. However, if these listings are treated 
as suggestive only, as the manual directs, no 
serious error should result. References are pro- 
vided in the tables to other sources of informa- 
tion about the nature of the various jobs. 

In the past, one of the greatest weaknesses of 
the Kuder Preference Record has been the lack 
of adequate norms developed on sufficiently large 
groups. To some extent this has been corrected. 
The author has developed what might be termed 
national norms based on the scores of 1,296 men 
representative of the major occupations in the 
general population. Another set of norms for 
adult men has been obtained by inviting a group 
of telephone subscribers in a stratified sample 
of 138 cities and towns to fill out the blank. Al- 
though these two sets of, norms agree fairly 
closely for the most part, they differ significantly 
for two of the scoring categories. These differ- 
ences lead one to question the stability of the 
new norms. Because it is difficult to get a truly 
representative sample, genuine national norms 
are very difficult to obtain, but for many purposes 
they are not so valuable as the local norms that 
are now being developed by a number of workers. 

More specific than the national norms are the 
occupational norms which are given in Tables 
ane 3 of the latest edition of the manual. These 
ki id Show the percentile ranks of average scores 
m ie. on the ten scales by more than 15,000 
ili viduals in various occupations. The value of 
Nw Coca for the occupational groups is lim- 
Lo ecause for most of the occupations they 

* based on very small numbers, and only the 
Percentile ranks of average scores are given. 

n summary, the norm groups, although 


greatly increased in number over those reported 
earlier, are still quite small (for adult women 
only roo on the outdoor scale) ; in an effort to 
get a representative sample the author has spread 
his norm group very thinly over a large assort- 
ment of small groups; and the norms cannot 
be employed with confidence by some users, Ca- 
nadians for example, of the test. 

F. VALIDITY, The occupational profiles pre- 
sented in Tables 2 and 3 of the manual offer 
some evidence to support the hypothesis that 
the scores are valid. For example, accountants 
score very high on computational and clerical, 
low on persuasive and mechanical. This type of 
evidence, however, fails to be completely con- 
vincing. One is suspicious of any method of 
demonstrating validity through агтсНаіг rea- 
soning and consequent-antecedent logic. What 
is needed are some genuine follow-up studies in 
which scores obtained by people before they en- 
ter occupations are compared with various meas- 
ures of their vocational success. In the opinion 
of this reviewer, the validity of the Kuder Pref- 
erence Record scores is still very much open to 
question. 

Although no direct demonstration of the va- 
lidity of the blank is forthcoming as yet, some 
information is given in the test manual which 
should help the user to determine how useful 
the scores may be in actual practice. Significant 
differences are reported for students in different 
curricula preparing for various occupations. The 
record scores have, in general, positive but low 
correlations with measures of achievement such 
as school grades and objective test scores, and 
they show only a low relationship with measures 
of ability, which means that counselors must 
consider both abilities and preferences in sug- 
gesting vocational choices. Some evidence re- 
lating to validity is available in outside studies 
not reported in the test manual. For example, 
Brown, who had a number of veterans evaluate 
their Kuder ratings, reports that, on the whole, 
the confidence in the ratings was of high statis- 
tical significance (166). А y 

Of considerable concern to this reviewer is 
the possibility that the responder may have 
marked the blank insincerely or dishonestly, thus 
rendering the scores completely invalid. Cross 
has conducted a study of this possibility, and has 
concluded that “a subject suitably motivated 
may successfully fake the Kuder Preference 
Record" (168). Although most people will give 
honest responses, testers should guard against 
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faking or dishonesty in situations, such as job 
applications or selection testing, in which the 
person being tested has something to gain by 
being dishonest. Under proper conditions of 
administration, faking becomes somewhat diffi- 
cult for any but the most sophisticated, because 
those who are taking the blank for the first time 
are not familiar with the scales that are being 
used. 

The author and his publishers present some 
scanty and inconclusive evidence to suggest that 
it may be possible to differentiate succeessful 
from nonsuccessful people within an occupation. 
The research done by this reviewer along simi- 
lar lines has been discouraging. 

G. MISCELLANEOUS. In line with the publish- 
ers’ efforts to keep the test up to date and to 
provide interpretative data whenever possible, 
is the recent publication of the Kuder Book List 
and the accompanying instructor's guide. This 
list contains over 500 titles classified according 
to the ten interest areas of Form C of the record. 
Each book has been rated on reading difficulty. 
After the student has taken the preference 
record, his profile of scores will help him find 
the types of books that should interest him most. 
We do not yet know whether teachers will find 
this book list of assistance in improving reading, 
but it appears to be based upon the sound prin- 
ciple of appealing to interest to motivate the 
slow reader. 

One reputed advantage of the Kuder Pref- 
erence Record is that it is inexpensive and easily 
scored, Not all will agree. Canadian users find 
the test fairly expensive because they have im- 
port duties to pay over and above the ordinary 
costs. Guidance counselors also complain that 
it takes considerable time to score. Machine 
scoring editions are available, but these can be 
used only by those who have access to test scor- 
ing machines. Moreover, the test is often given 
to individuals or to small groups, and in such 
cases much of the advantage of machine scor- 
ing is lost. There would appear to be a real 
need to produce a shorter, less expensive 
form that can be scored more quickly and eas- 
ily. 

All editions of the manuals suggest the use 
of the 75th percentile in deciding which scores 
are significant. Although one might quarrel with 
the statistics presented by the author in earlier 
editions of the manual in defense of its use, this 
particular critical point appears to be as good 
as any other. As the manual points out, no sig- 
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nificance point would be completely satisfactory 

III. Summary AND GENERAL IMPRESSIONS, 
In summary, it may be said that the Kuder Pre. 
erence Record is one of the best available meas- 
ures of interests, particularly for high scho 
students. Its preference-type items, alth 
originally selected subjectively, have been tested 
by research, and it may be assumed that the 
scores obtained for certain wide areas of interes 
are reasonably objective. The reliability: of the 
scale scores is satisfactory. The author deserves 
commendation for the energetic manner in which 
he has carried out continuous and painstaking 
research on the blank since its first development, 
The C forms, for machine and hand scoring, 
show refinements which make them superior to 
the earlier B forms: a new interest category has 
been added, better norms have been provided, 
and more occupational data are available. In | 
many respects, the Kuder is as good an interest 
measure as Strong’s Vocational Interest Blank, 
its chief rival; in some respects it is a better 
instrument. . j 

The Kuder „Preference Record—V ocationl 
is recommended for use, but in the opinion o 
this reviewer, certain points of caution are worth 
stressing. In the first place, personality meas- 
urement has not yet reached the point of pro- 
ducing completely objective and trustworthy 
scores despite the very considerable progres 
that has been made in the past 30 years. Test 
score interpretation is always hazardous; the 
reader is advised to be particularly careful wi 
the interpretation of interest or preference blan 
scores. Secondly, although the Kuder presents 
a worthwhile effort tó describe subjects in terms 
of general fields of interest (in contrast m 
Strong's test which determines whether st; 
jects mark the test the way successful people j 
various occupations mark it), it should be зе 
ceded that to a large extent the validity of b 
scores has yet to be demonstrated. This 18 a 
surprising in view of the way the test was ha 
structed. The acid test of the blank, how 
whether it is clinically useful. Volume of sae 
although not always a sound criterion, Bes 2 
that it is. The more conservative counselors р 
pear to think that the administration 0 a 
Kuder blank is a good beginning point E 
interview. They look upon it as а usefu pH i 
of suggestive information that may be Us w 
conjunction with data concerning ability i 
other personality factors in exploring the P 
bilities of vocational choice. 
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For reviews by Ralph F. Berdie, E. G. Cham- 
bers and Donald E. Super and an excerpt from 
areview of Forms BB and BM, see 3:640; for 
reviews by A. B. Crawford and Arthur E. Trax- 
ler of an earlier edition, see 40:1671. 
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Occupational Interest Inventory. Grades 7-12 and 
adults, 9-16 and adults; 1943-46; 10 scores grouped in 
3 categories : fields of interests (personal-social, natu- 
ral, mechanical, business, the arts, the sciences), types 
oí interests (verbal, manipulative, computational), 
level of interests; IBM; 2 levels; Form A eei 
lower level manual ('44), upper level manual ('43) ; 
$3 per 25 of any one level; 35€ per specimen set of any 
one level, postpaid; separate answer sheets may be 
used; 26 per IBM answer sheet; боё per set of sten- 
cils for machine scoring of answer sheets; 60€ per set 
of stencils for hand scoring of answer sheets; postage 
extra; nontimed (30-40) minutes; Edwin A. Lee and 
Louis P. Thorpe; California Test Bureau. * 
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AnrHUR Н. BnavrikLp, Professor of Psychol- 
ogy and Head of the Department, Kansas State 
College, Manhattan, Kansas. 

According to the authors, the major purpose 
of this inventory is "to aid in discovering the 
basic occupational interests possessed by an in- 
dividual in order that he may become or remain 
an interested, well-adjusted, and effective per- 
son as well as a profitable employee." They stress 
that it is not a test of occupational abilities or 
skills. This seems a proper orientation to interest 
measurement. 

The inventory purports to identify six basic 
fields of interest: Personal-Social, Natural, Me- 
chanical, Business, Arts, and Sciences; to de- 
termine whether an individual is most interested 
in verbal, manipulative, or computational ac- 
tivities; and finally to identify “the levels on 
which an individual’s basic interests exist.” 

The format of the inventory is good. Each 
section is accompanied by a manual which covers 
in general terms the topics usually considered 
essential in a test manual. Reading difficulty of 
the items has been checked. 

To the uncritical and untrained individual the 
inventory should appear to be a “good buy.” And 
it is just that—nicely packaged pieces of mer- 
chandise unaccompanied by a caveat emptor. 

The authors furnish no information of an em- 
pirical nature which would indicate that (а) 
the interest fields identified by this instrument 
are basic; (b) the items comprising the indi- 
vidual scales were selected by any type of rigor- 
ous internal or external item analysis; or, (c) 
that the inventory meets any of the usual tests 
of validity established for interest measures by 
such workers as Strong, Kuder, Super, Darley, 
and others. : 

Meager data bearing on the validity of the in- 
ventory (Advanced, Form A) is furnished ina 
study by Lindgren (3). He correlated its raw 
scores with raw scores on the "appropriate" 
scales of the Kuder Preference Record for 5o 
male veterans. He found some relationships 
which would be significant. The highest r's were 
Sciences vs. Scientific, .80; Business vs. Cleri- 
cal, .74; Mechanical vs. Mechanical, .72 i and 
Personal-Social vs. Social Service, .60, Business 
also correlated .52 with Persuasive. 


L 
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Given the example of Strong and Kuder in 
devoting time and money to the development of 
their interest inventories with the objective of 
providing an adequate research basis for evalua- 
tion, it seems unfortunate that the authors of 
this inventory were content to release instru- 
ments that are primarily "arm chair" produc- 
tions. 

Lacking further empirical evidence appropri- 
ate to the issues enumerated above, this reviewer 
advises against the use of the Occupational In- 
terest Inventory in any practical situation with 
one exception: The reviewer has used it for 
some years in his measurement courses to dem- 
onstrate that a well packaged, widely adver- 
tised, smartly marketed psychological test or 
inventory may not, on justifiable technical 
grounds, warrant the consumer support which 
it apparently receives. 


For reviews by Edward S. Bordin and Stan- 
ley G. Dulsky, see 3:643. 
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*Self-Administering Vocational Interest Loca- 
tor with Work Interest Picture. Grades 10-16 and 
adults; 1933-49; interest preferences in each of 10 
aspects (creative, working with people, executive, lin- 
gual, mathematical, manual, medical-religious, admin- 
istrative, inter-correlated, general) of то areas (artis- 
tic, commercial, constructional, humanistic, industrial, 
literary, mechanical, scientific, technical, transporta- 
tional); Form 1A ('49); no data on reliability and 
validity; manual ('49); 25€ per set of materials, post- 
paid; N. A. Lufburrow; the Author, 3112 Milford 
Ave., Baltimore 7, Md. * 

Dowarp E. Super, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. 

The Vocational Interest Locator is a collec- 
tion of тоо brief descriptions of presumed types 
of work. These are classified in the summary 
Work Interest Picture (which takes the place 
of a scoring blank) into the areas and aspects of 
work named above. According to the manual, it 
is “not a test,” it “does not present a score,” and 
it is “not the sort of device” that needs to be 
accompanied by evidence of reliability or valid- 
ity. Instead, the manual informs us, it is a “de- 
vice for aiding in locating, analyzing and under- 
standing the individual’s work interests” (is 
the implication that tests do not do this ?), and 
it “leads to a RESULT” (surely an innova- 
tion in the testing field) that “is as valid and re- 
liable as you make it” (but are we told how 
valid and reliable anyone has made it?). The au- 


-thor states that “it is, in truth, a very effective 


interviewing aid,” “certainly it can be helpful,” 


“it is a fascinating device,” there is “no doubt 
to accuracy," and "I have used them (the Class. 
fication of areas of work) constantly in guid- 
ance and placement work, without ever findi 
reason to alter the terminology” (presumably 
because no research has been done оп the re 
liability or validity of the classification), 

The above quotations should make it clar 
that this instrument was developed witho 
benefit of psychological knowledge or ted. 
niques, has not been objectively evaluated, and 
has nothing to contribute to the armamentarium 
of psychologists. To describe and criticize it in 
more detail would be a misuse of space in a psy- 
chological publication and a waste of time for 
users of psychological instruments. 


[745] 

Thurstone Interest Schedule. Grades 9-16 ani 
adults; 1947; replaces Vocational Interest Schedule 
(see 3:653); 10 scores: physical science, biological 
science, computational, business, executive, persuasive, 
e humanitarian, кшш musical fri | 
norms; $1.50 per 25; 356 per specimen set; postpaid; 
(15) minutes; L. L. "Thurstone; Psychological Cor- 
poration, * 


; REFERENCES А 

т. ZWILLING, Vircinta Т. The Prediction of Grades i 
Freshman English From a Battery of Tests of Mental Ability, 
Interests and Aptitudes Administered to Students Entering а 
Liberal Arts College. Doctor’s thesis, Fordham University 
(New York, N.Y.), 1949. (Dissertations..., 1949, рр. 62% 
Norman FREDERIKSEN, Associate Professor | 
Psychology, Princeton University; and Head, 
Research Department, Educational Testing 
Service; Princeton, New Jersey. 

Replacing the older Vocational Interes 
Schedule, the Thurstone Interest Schedule т. 
published іп 1947. The new schedule consists 0 
100 pairs of occupational titles arranged in? 
10 X 10 table. Each cell of the table contains tw? 
occupational titles; all of the upper left entries 
in any one column are related to the same inte 
est area, and all of the lower right entries in 40) 
one row are related to the same interest att 
The task of the subject is to indicate his D^ Я 
erence for one of the two occupational titles 1 
each cell. Each interest area is thus ш 
with every other interest area an equal ntum 
of times, each interest area being represent Um 
job titles which vary from cell to cell in the H 
or column. As is stated clearly in the тал, 
the schedule is appropriate only in аш 
where honest and straightforward response А 
be expected, since the purpose of ће instrum 
is not disguised. ap 

The selection of the ten interest areas bey 
parently not based primarily on results 0 
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tor studies; an attempt was made to include 
Spranger's life interests and other interest cate- 
gories in common use. That the items are in 
general properly allocated in each field is indi- 
cated by the magnitude of the average item va- 
lidities. The lowest average item validity is .64 
(executive) and the highest is .85 (musical) ; 
the item validities are thus on the average con- 
siderably higher than one typically finds for most 
types of tests. 

The intercorrelations among the ten scores 
(for a sample of 200 male high school gradu- 
ates) ranged from .68 to —.37. The negative 
correlations are due at least in part to the method 
of measurement employed; expressing a pref- 
erence for one interest area ordinarily involves 
withholding an expression of preference for an- 
other. The split half reliabilities ranged from .90 
to .96 for a sample of 200 schedules obtained, 
presumably, from the same group of male high 
school graduates; such high reliabilities would 
justify use of the scores in individual guidance. 

It is asserted in the manual that the schedule 
yields comparable scores, and a number of sug- 
gestions regarding interpretation of scores are 
made which assume some kind of comparability. 
It is stated, for example, that “the profile shows 
at a glance whether a man is more interested in 
certain fields than in other fields" and "the pro- 
file may show that he is equally interested in sev- 
eral fields, and not at all in certain others." One 
can think of several ways in which scales can be 
comparable ; they can be comparable in the sense 
that corresponding scores cut off equal propor- 
tions of a population, for example, or in the 
sense that corresponding scores yield the same 
predicted performance on some criterion. The 
Scores on the Thurstone Interest Schedule are 
certainly not intended to be comparable in either 
of these senses. It is difficult to specify in what 
Sense scores must be comparable in order to 
Justify interpretations such as those illustrated 
above, According to the manual, the profile 
Scores are comparable because “each of the oc- 
Cupational fields is represented the same number 
of times in this schedule, and since every occu- 
Pational field is compared with every other field.” 
odd various job titles are used to repre- 
d €ach occupational field, and it is not made 

“ar in the manual how these titles were chosen. 
ое different occupational titles, such 
ei Tummer" for “organist,” would be likely 
Wiki the mean score for a particular area, 

1 Suggests that the particular titles chosen 
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would not necessarily lead to comparability. No 
evidence is presented to show that the interest 
areas are represented by job titles of equal pref- 
erence value or that the job titles chosen are 
representative of the occupations making up the 
interest area, . 

Тһе manual states that а table of norms is 
not necessary because the scores are compara- 
ble. In this reviewer's opinion, even if this 
were so, it would still be desirable to have some 
sort of normative material to aid in interpreta- 
tion of scores. No such materials have yet ap- 
peared in the periodical literature. 

The criticisms noted above will not detract 
seriously from the usefulness of the schedule. 
The categories included in the profile cover the 
important interest areas, at least for clients 
likely to go into higher-level jobs. The schedule 
is unusually easy to administer and score; it 
may be administered in ten or fifteen minutes, 
and it can be scored accurately in two or three 
minutes. The Thurstone Interest Schedule 
should be a valuable addition to the kit of tools 
employed by the vocational counselor. 


Dowarp E. Surer, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. 

This schedule is based on the same assumption 
as its predecessor was, namely, that the coun- 
selee has a good enough understanding of what 
occupational titles mean for the patterning of 
responses to, or choices of, titles to be a mean- 
ingful measure. Thus, in the new schedule the 
examinee indicates which of two occupational 
titles he prefers, or he may mark them so as to 
show equal liking or disliking. In responding 
to the choice “Judge—Tax Specialist,” a delin- 
quent adolescent boy may decide in terms of 
first-hand knowledge of one limited phase of 
the former occupation and a complete lack of 
knowledge of the latter occupation. Ora better 
informed person may feel quite lost in respond- 
ing to the title “Red Cross,” which he knows 
might mean a medical, social work, or adminis- 
trative type of activity, but have a very clear idea 
of what is meant by the other member of the 

air, “Portrait Painter.” Just what effect these 
differences in the meanings of job titles between 
individuals and within individuals have on the 
meaning of scores on the schedule, and indeed 
the mere existence of such differences, has ap- 
parently not been considered by Thurstone. à 
Perhaps they are not as important as the 
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above argument suggests. Some evidence to this 
effect is provided by the analysis of the internal 
consistency of the scales, which Thurstone mis- 
leadingly calls item validity (misleading because 
theterm implies validity of the item against some 
external criterion, such as job success, whereas 
the term here denotes item-test agreement, that 
is, agreement of the item with the total score of 
the scale to which it is assigned). The fact that 
the average correlation between musical items 
and the musical scale is .85 suggests that most 
people in the standardization group understood 
each musical occupational title as an occupation 
involving music more than anything else. But 
the average item-test agreement of the executive 
scale is .64, the range of the correlations being 
from .32 to .86. In this, and in some similar in- 
stances, one might ask whether a significant 
number of respondents did not perceive what 
Thurstone considered to be executive occupa- 
tions as more truly humanitarian or business 
occupations (e.g., hospital and school superin- 
tendent, hotel manager). The low agreement be- 
tween some items and the total score on the scale 
to which they are assigned at least suggests 
that this may be so in some occupational fields. 
And if some types of occupations are better un- 
derstood in some subcultures than in others, the 
scores of persons in those subcultures may be 
consistently distorted by their inadequate per- 
ception of some occupations. Interest inventories 
consisting in whole or in part of items describing 
more familiar everyday activities, such as the 
Strong and the Kuder, are freer from this possi- 
ble defect. 

Further evidence to support Thurstone’s pro- 
cedure lies in the split-half reliabilities of the 20- 
item scales; despite the brevity of the scales, 
which results in one change of response substan- 
tially changing a score, the reliability coefficients 
range from .90 to .96. 

The recommended interpretation of the sched- 
ule emphasizes the importance of intra-individual 
differences in scores and provides no means of 
comparing a person's interests with those of 
others. Thurstone argues with some apparent 
justification that it is "the ups and downs, the 
spread, and the shape of the individual profile 
that count in advising a person" ; it is this, after 
all, that tells what a person is interested in. But 
satisfaction with this logic leads Thurstone to 

neglect altogether the question of external valid- 
ity, an unwarranted neglect which seems to re- 
sult from viewing differential occupational in- 


е 
t3 
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terest patterns as being nothing more than 
norms, whereas they may be evidence of validi 
And it is the lack of external evidence of validity 
which is the most serious defect of the Thur. 
stone Interest Schedule. Do people who, inhi 
school or college, make high physical science 
scores tend to enter, succeed in, and be Satisfied 
with employment in occupations such as engi- 
neering? Do men who make high humanitarian 
scores tend to enter occupations such as social 
work and teaching, do they remain in them, and 
do they like their work? It is questions of this 
type that a counselor must ask about his tests, 
and to which he must have answers, if they are 
to help him in his work. For it is only the an- 
swers to these questions that tell the counselor 
the practical meaning of his instruments, Thur- 
stone has not even sought to provide the answer, 
as someone must before the Thurstone Interest 
Schedule can be more than a research instrument 
or a means of locating fields which might be ex- 
plored by a student. 


[746] 
* Vocational Interest Analyses: А Six-Fold Ana 
lytical Extension of the Occupational Interest 
Inventory. Grades 9-16 and adults; 1951; 6 FUR 
each of 6 areas listed below ; IBM; 6 tests; tests shou! 
be administered only in those areas in which an ex 
aminee obtains high scores on the Occupational ded 
est Inventory (see 743) ; 1 form; no data on гей 1? 
ity; no norms; $2 per 25 of any one test; 756 per spet 
men set, postpaid ; separate answer sheets may be m 
8€ per set of IBM answer sheets for use with any й 
tests; 60€ per set of stencils for machine bero 
answer sheets; боё per set of stencils for hand scort 
of answer sheets; postage extra; nontimed ш 
minutes per test; Edward C. Roeber and Geral 
Prideaux in collaboration with Edwin 
Louis P. Thorpe; California Test Bureau. МУ 
а) PERSONAL-SOCIAL ANALYSIS. б scores: dom 
Service, personal service, social service, t Ed 
related activities, law and law enforcement, h 
medical service. d ero 
b) NATURAL ANALYSIS. 6 scores: general n A 
farming, animal raising and care, garden and g il 
house care, fish-game-domestic fowl, lumbering 
forestry, marine work. = nce and 
с) MECHANICAL ANALYSIS. 6 scores: maintenant н M 
repairing, machine operation and tending, Cons 2. 
designing, bench work and bench crafts, proce ning 
d) BUSINESS ANALYSIS. 6 scores: clerical, Ыл uying 
and distribution, bookkeeping and accounting, э nd 
and selling, training and supervision, managem 
control. ainting 
е) THE ARTS ANALYSIS. 6 scores: art ай р and 
am drawing, decorating and landscaping, dT? | 
radio, literary activities, music. ork, 
) "HE SCIENCES ANALYSIS, 6 scores: laboratory у b 
mineral-petroleum products, applied chemistry, ering 
cal research, biological research, scientific eng 


0 
JULIAN С. Stantey, Associate ЕКИ, D 
Educational Psychology, George Peabody 
lege for Teachers, Nashville, Tennessee 
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In the Third Mental Measurements Year- 
book several major criticisms of the Occupa- 
tional Interest Inventory appeared, and Stan- 
ley G. Dulsky suggested that the “total score 
might mask an important occupational prefer- 
ence that could be detected only by a careful 
analysis of the individual sections making up 
the field.” In line with Dulsky's suggestion, the 
California Test Bureau has devised six Voca- 
tional Interest Analyses. Despite some indica- 
tions of careful construction by Roeber and 
Prideaux, the Analyses seem to the reviewer 
severely limited in usefulness because they lack 
norms, internal consistency statistics, reliability 
coefficients, and evidence of empirical validity. 
In the reviewer's opinion they were released 
prematurely. This becomes especially obvious 
when the two generally unfavorable reviews of 
the parent instrument are considered in con- 
junction with the Analyses, which seem to have 
even more of the same flaws. 

The format of each Vocational Interest Апайу- 
sis is much like that of the Occupational Interest 
Inventory. The Analyses are less cluttered, how- 
ever, and more effectively conceal from the testee 
the names of the categories employed. Each of 
‘the six interest areas of the Inventory is explored 
further with an Analysis that yields six scores 
by means of 120 paired-comparison couplets. 
Thus, for the Personal-Social Analysis 40 short 
statements pertain to domestic service, 40 to 
personal service, 40 to social service, 40 to 
teaching and related activities, 40 to law and law 
enforcement, and 40 to health and medical 
Service, Eight of the domestic service statements 
are paired with eight personal service ones, eight 
more domestic service statements with eight 
Social service ones, etc. Statements pertaining 
to simple activities are paired with other simple 
activity statements, and this balancing is also 
carried out for medium and superior level ac- 
tivities, 

The analyses have a combination armchair 
and Dictionary of Occupational Titles origin; 
each statement is followed by one or more DOT 
code numbers, Though somewhat different from 
other interest inventories, they are sadly defi- 
rs in empirical support. Such statements as 
b awing in the manual are hardly satisfying 
idt 0 the measurement novice, much less to 

rained tester ; 


eaa Чїшїгагу studies indicate that the reliabilities of 
for th the 36 categories....approximate or exceed those 
€ Occupational Interest Inventory]. * Studies 
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are in progress to provide additional evidences of va- 
lidity. * [Not a shred of evidence for any sort of empiri- 
cal validity is mentioned.) The highest interest elds 
for an individual are determined by the percentile ranks 
of the composite norms [both sexes combined] pro- 
vided in the Occupational Interest Inventory manual. * 
A further justification for the use of frequency of 
choice in establishing a differentiation of interests [for 
each Analysis] is the factor of sex differences in the 
vocational interest fields, For example, a girl may ob- 
tain a high pecans rank (or standard score) in the 
Mechanical field when sex norms are used, even though 
she makes relatively few choices in that field, Never- 
theless, the advisability of preparing percentile norms 
for the six Vocational Interest Analyses is being fur- 
ther investigated. : 


Perhaps the reviewer is missing something 
either subtle or obvious in this renunciation of 
sex norms for the Inventory and any norms at 
all for the Analyses, but he can find no evidence 
in the manual that a sheer frequency compari- 
son is meaningful, even though, because of its 
counterbalanced internal structure, each Analy- 
sis must yield a mean of 20 for its six categories 
(120/6). 

In conclusion, because of their extreme lack 
of an empirical foundation the reviewer cannot 
recommend the Vocational Interest Analyses at 
this time. He sincerely hopes that the California 
Test Bureau will take vigorous action to stand- 
ardize these inventories before advertising them 
further as being reliable and valid. 


[747] 

*Vocational Interest Blank for Men, Revised. 
Ages 17 and over; 1927-51; 50 scorable categories: 
masculinity-femininity (38), maturity of interest 
(41), occupational level (зо), 1 occupations, 6 occu- 
pational groups (I, П, V, УШ, IX, and X) ; Group I: 
group scale (738), artist (38), psychologist (49), 
architect (38), physician (38), dentist (738), osteo- 
path (47), veterinarian (4) 3 Group II: group scale 
(^39), mathematician ('38), engineer C3 } chemist 
ER Group III: production manager (3) j; Group 
IV: farmer (38), carpenter (738), aviator (до), 
printer. (738), mathematics-physical science teacher 
(738), policeman (738), forest service man (38); 
Group V: group scale (736), Y.M.C.A. paysia direc- 
tor (°38), personnel manager C38), ри ic administra- 
tor (44), Ү.М.С.А. secretary ), social science 
teacher (738), school superintendent C38), minister 
(38) ; Group VI: musician (38) ; Group VII: MI 
fied public accountant (^38) ; Group VIII: group scale 
('38), senior certified public accountant (49), ac- 
countant ('38), office man (^38), purchasing agent 
(38), banker $359 pharmacist 
(^46) ; Group IX: group scale (738), sales mana er 
(738), realtor (738), life insurance salesman ( i 
Group X: group scale, (738), advertising man GB 
lawyer (38), author-journalist (38) ; Group s Я 
resident of a manufacturing concern (38) ; I ne) 
ditions ; 1 form, '38; manual (751) ; $2 per 25; at per 
report blank [38]; 2€ per global interest chart [43] ; 
$1 per single коно scale j 2-9 stat hg! ae or 

les, 70€ each; 15¢ per spec ; or- 
eaa end ; nontimed (40) minutes; Edward K. 


Strong, Jr.; Stanford University Press. * 


(749), mortician 
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а) FORM M (HAND SCORING EDITION). 

b) FORM MM (MACHINE SCORING EDITION). Separate 
answer sheets must be used; 756 per 25 IBM answer 
sheets; 75€ per 25 Hankes answer sheets for use with 
Engineers Northwest scoring service (see 466). 
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EpwanD S. Bornin, Associate Professor of Psy- 
chology, University of Michigan, Ann Arbor, 
Michigan. 

A reviewer of this interest inventory suffers 
from conflicting feelings. On the one hand, he 
feels that this test stands out among instruments 
of its type in terms of the thoroughness with 
which it was developed. From this point of view, 
criticism seems like perfectionism or unrealistic 
impractical application of standards. On the 
other hand, the reviewer feels that its very emi- 
nence in the field makes it more imperative that 
its defects be pinpointed as a vehicle for the gen- 
eral raising of standards in test development. 

The latest edition of the test manual (1951) 
continues to present the basic information neces- 
sary for use of the test. In place of the results 
of factor studies of the occupational scales, the 
present edition substitutes a table of normative 
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data showing the means and sigmas for the cri- 
terion groups and for men in general. Data on 
the standard score equivalent to a zero raw 
Score and the means and sigmas obtained by 
dice throwing are also presented in this table. 
It would seem that this new table will be more 
useful to the average test user than the previous 
one. However, this reviewer feels it would have 
been best to include both tables, since the factor 
patterns involved are a very important basis 
for test interpretation. One obscurity in the pres- 
ent table arises from the fact that the standard 
score equivalent to a raw score of zero is pre- 
sented in a column ambiguously headed "zero 
score." It took considerable cross checking to 
decipher its meaning. Other readers might not 
be so persistent. 

As in the past, the manual fulfills all of the im- 
portant functions of such a publication, sum- 
marizing information on the instrument's va- 
lidity, reliability, stability of its scores, norms, 
directions for administration, and aid in inter- 
preting scores. Considering the ease with which 
test users fall into the error of interpreting in- 
terest measures as predictions of probable suc- 
cess in training or in an occupation, this re- 
viewer finds the manual quite lacking in safe- 
guards against such interpretations. 

Throughout the manual there are loopholes 
left for interpreting interest scores as predic- 
tors of level of achievement in training or on the 
job. Examples are as follows: (a) The last sen- 
tence on the title page reads, “Seemingly, also, 
he should be more effective there [in an occu- 
pation in which he has a high interest score] 
than somewhere else because he would be en- 
gaged, in the main, in work he liked” ; and (b) 
The results of a study demonstrating a relation- 
ship between productivity and interest ratings 
for life insurance agents is presented as though 
this were a finding typical for other occupations, 
despite the fact that the results have been almost 
uniformly negative for other occupations and 
for prediction of achievement in training. 

The VIB, despite the fact that it is one of the 
most time consuming and costly inventories to 
score and despite the shortcomings just de- 
scribed, remains as the interest test whose use- 
fulness has been most carefully and thoroughly 
demonstrated. 


Егмев D. HINCKLEY, Professor of Psychology 
and Head of the Department, University of Flor- 
ida, Gainesville, Florida: 
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The Strong blank was the forerunner of the 
present-day crop of interest tests and inven. 
tories. While it has lost some of its pre-eminence 
to some of the newcomers, none has a more se- 
cure foundation or offers more possibilities than 
does the Strong. The practice of supplying new 
scales and revising old ones keeps the Strong up 
with the times. 

The impressive array of statistical evidence 
and the methods used in constructing the vari- 
ous scales are too well known to need elabora- 
tion here. It is sufficient to note that the test 
has stood the test of time and the competition of 
new tests. 

The major drawback of the Strong is the 
effort involved in a complete scoring. There are 
41 vocational scales available, plus the interest 
maturity, MF, occupational level, and studious- 
ness scales. The convenience of the Kuder and 
the thoroughness of the Strong would be a most 
desirable combination. 

Another weakness is the ability of the testee 
to fake an interest score if he thinks it is de- 
sirable to do so. This is unimportant if the per- 
son is earnestly in search of vocational counsel- 
ing, but where the Strong may be used as а 
screening device, this factor may be important. 
A “lie” scale would be a useful addition if such 
a thing were possible. This shortcoming of fake- 
ability is shared by tests other than the Strong. 

Interests should not be looked upon as a sepa- 
rate thing from the personality, but should ij 
considered as a part of the total personality an 
judged accordingly. The Strong offers E 
exciting possibilities of research in the study 0 
personality ; more should be done in this m 

The Strong is a valuable instrument in the 
vocational counseling situation as an adjunct | 
other sources of information about the ™ 1 
vidual. The tedious scoring is not always 4 e 
ous problem, for much valuable information @ 
be gained from incomplete scoring. 

Gf all the уеп, this has been the Sf 
demanding in terms of empirical foundation? 
background. It is a real job to use it as t oi А 
be used, but the returns for the effort invo y 
are good and make the use of this test W 
worth the while. 


For excerpts from related reviews se? т 
3:648, 3:650, and 3:652; for reviews by На Ий 
D. Carter, John С. Darley, and N. W. Ун o 
see 40:1680; for a review by John G. Darley 
an earlier edition, see 38:1178. 
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[748] 
x[Re Vocational Interest Blank for Men, Re- 
vised.] ScHWEBEL, Mitton. The Interests of Phar- 
macists. Foreword by Donald E. Super. New York: 
Kings Crown Press, 1951. Pp. xii, 84. Paper, Litho- 
typed. $1.75. * (London: Oxford University Press. 
115, 6d.) (PA 26: 1161) 


J Consult Psychol 15:515 D 51. Laurance F. 
Shaffer. In addition to developing a pharmacist 
scale for the Strong Vocational Interest Blank, 
this study explored the relationship of job satis- 
faction to interest measurement. Its techniques 
and findings are applicable to the measurement 
of interest and satisfaction in other occupations. 

Occupations 30:78 О ' 51. Н. Alan Robinson. 
* Although much of the book may have its great- 
est appeal to persons who are interested in test 
construction and evaluation, an host of practical 
research and counseling procedures. may be 
gleaned from the information presented. Dr. 
Schwebel is especially sensitive to the human 
factor and carries this sensitivity into his con- 
struction and evaluation of interest scales. He 
follows the Strong procedure but excludes all the 
dissatisfied from the weight and norm groups. 
He constructed three scales so he could isolate 
à number of pertinent factors and weight them 
against one another. In this manner he was able 
to add an important scale to the Strong inven- 
tories and also obtain some surprising informa- 
tion which should be of special concern to coun- 
selors. He found that the interests of pharmacists 
resemble those of office and sales workers more 
than they resemble the interests of physicians 
and dentists. Dr. Schwebel was able to make a 
definite distinction between apothecary pharma- 
cists (those devoting 26 per cent or more of their 
timeto prescription work) and business pharma- 
ee (those devoting less than 26 per cent of 
шеше to prescription work). When the in- 
AM a of the two groups are contrasted, there 
Fas arp enough difference in scientific interest 
a is medical students successfully. There- 
EU a р ough he has shown that an occupational 
dislike i à recognizable pattern of likes and 
likes h € has also differentiated between the 
КАТ dislikes within the specialties of the 
КШ, id Information of this kind is impor- 
the bet Rronnaelor who must often counsel on 

Paie А ап average interest pattern when he 
Nd аца! may not always fit that 
unl ds е author has gone a step ahead of 

: à istical measurements; he has refined 

Possibilities. The same type of refinement 
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and emphasis on individual factors could be ap- 
plied profitably to other fields with many special- 
ties, such as education, farming, journalism, law, 
and even personnel work. * If the book has a 
weakness it can be likened to the new college 
student who complains about all his questions 
not being answered. He, as the reader of the 
book, is presented with information to stimulate 
thought. Practical applications of the material 
presented can certainly be made in the areas of 
test construction and evaluation, counseling and 
occupational advisement. Much of what is done 
with pharmacists, as described in this volume, 
can be applied to other occupations by the 
thoughtful practitioner. As Donald E. Super 
states in the foreword of the book, *Dr. Schwebel 
has added significantly to our knowledge of... 
the relationship between inventoried interest 
and work satisfaction...and....the differentia- 
tions of specialties within an occupation." 


MANUAL DEXTERITY 


[749] 
xMartin Peg Board (Finger Dexterity Test). 
High school and adults ; 1947-51; 3 scores; right hand, 
left hand, both hands; mimeographed manual [{'51]; 
no data on validity; $18 per sii outfit, Men ex- 
tra; worklimit (5-10) minutes; Howard G. Martin; 
Martin Publishing Co. * 


[750] 

*Moore Eye-Hand Coordination and Color- 
Matching Test. Ages 2 and over; 1949; 2 scores: 
speed of eye-hand coordination, color-matching; indi- 
vidual; discontinued; no norms for color-matching 
scores of pre-school children; (10-15) minutes; Jo- 
seph E. Moore; Joseph E. Moore & Associates. * (For- 
merly published by California Test Bureau.) 

REFERENCES { 

. Mi озЕРНн E. “А Test of Eye-Hand Coordination.” 
J Ling Papen ar 8672 D él * (РА 1814919) 

2. Moore, Joserg E. “A Comparison of White and Negro 
Children on a Simple Eye-Hand ordination Test.” Abstract. 
Paho Aaris * D Gots 1397, title only) and White 
cider ш Jom Reaction 54 ап Eye-Hand Coordination 


“ ic Psychol 59:225-8 S,'41. * (PA 16:1089 
T MEN Tosen E. aR заранд of Negro and Wi 
Preschool Children on a Vocabulary Test and Eye-Hand 
ordination Test." Child Develop 13:247-52 D '42. * (PA 17: 


2079) 

м Y pu E. “A Comparison of the Moore Eye-Hand 
Се pene den Color Matching Test With Other xterity 
Tests.” Abstract. Am Psychol 2:297-8 Ag '47. (PA 21:4611, 


itp ne ]озЕРн E. "The Standardization of the Moore 


piri edm cerner fe А 
Norman FREDERIKSEN, Associate Professor 
of Psychology, Princeton University; and Head, 
Research Department, Educational Testing 
Service; Princeton, New Jersey. 

The apparatus consists of a flat rectangular 
box; at the left end are four receptacles contain- 


hite 
Co- 
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‘ing colored marbles, and extending to the right 
from each receptacle is a row of eight holes each 
of which will hold one marble. For the Eye- 
Hand Coordination Test, the task is to pick up 
marbles one at a time and place them in the 
holes. The test is repeated three times, and the 
score is the time required to place the 96 mar- 
bles. In the Color-Matching Test, a similar pro- 
cedure is used with the same equipment, but 
each marble must be placed in a hole of similar 
color. The score is the time required for plac- 
ing the 96 marbles plus one second for each er- 
ror in matching colors. The preschool form of 
the test makes use of the same equipment but is 
half as long and does not include the color- 
matching portion. 

The only specific purpose of the test mentioned 
in the manual is selection and placement of em- 
ployees, although mention is made of use of the 
test in clinics and guidance centers. What value 
the preschool form might have is not specified ; 
the supplementary manual for the preschool 
form merely quotes from the regular manual 
that "one of its major purposes is to save valu- 
able time as well as expense in the selection and 
placement of employees." 

With regard to reliability, the publisher's an- 
nouncement states that “The test-retest method 
based on 187 subjects (elementary, high school, 
and college students, and adults) after an inter- 
val of one week, gave a coefficient of .95." Es- 
sentially the same statement is made in the 
manual where the coefficient is attributed to the 
Eye-Hand Coordination Test only, rather than 
to the test as a whole. High reliability certainly 
should be expected from a group with such an 
extreme range of ability. It is much more to the 
point to report reliabilities for groups of greater 
homogeneity, which is done for the Color-Match- 
ing Test in the regular manual (where a retest 
reliability of .82 is reported for 83 men in a 
college psychology class) and in the preschool 
manual (where a retest reliability of .95 is re- 
ported for 81 preschool children). In a later 
article the author again refers to the reliability 
of .82 and says that when corrected for restric- 
tion of range it becomes .955. The application of 
a formula for correction for restriction of range 
seems questionable in this situation, especially 
in view of the failure to state what amount of 
variability is assumed in the correction form- 
ula. 

One line of evidence as to the validity of the 
Eye-Hand Coordination Test is that it differen- 
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tiates between various age groups from 2 to 16 
years. A limited amount of evidence, based o 
very small numbers of cases, suggests that the 
test is predictive of success in industrial jobs; 
studies are reported for ice cream sandwich 
maker (N — 10) and loom operator (N =z), 
For 62 typing students, a correlation of 81 was 
found between the test and speed and accuracy 
scores. As is true for most industrial selection 
tests, the validity of this test should be estab- 
lished for any new situation in which it is to be 
employed before using the scores in selection, 
According to the publisher’s announcement, 
norms are based on over 5,000 cases. This isa 
true statement, but it is somewhat misleading. 
For the preschool form, for example, percentil 
norms for children from 24 to 77 montlis of age 
are presented by six-month age ranges; tht 
norms for the 24-29 month group are based on 
only тї cases, while for the other eight groups 
the N's range from 27 to 119. Girls are said to 
be faster than boys in speed of eye-hand coordi- 
nation, but no sex-differentiated norms are 
given. For adults the N's are in general much 
more adequate ; norms for adult white men (u 
classified) are based on 2,707 cases. Separate 
percentile norms based on white males are givet 
for clerical and sales jobs, textile jobs, skilled 
mechanics, truck drivers, adult men (unclassi- 
fied), and college men. Similarly, norms are prt 
sented for various groups of white women and 
negro men and women. i 
Of even more importance than the number 0 
cases is the adequacy with which the aie 
group represents the population for which ji 
norms are presented. This is of course а pem 
problem for test publishers and is very difficut 
to solve adequately. However, it seems Кү 
larly dangerous to assume that men who A 
to the Guidance Center of the Georgia ШЫ 
of Technology “probably represent а good je 
section of the general population between К 
ages of sixteen and sixty,” as is stated in 
manual. ^. bM 
The Moore Eye-Hand Coordination ie] 
Color-Matching Test is better than its pr 
The test can be administered in 10 or 15 " the 
utes, the instructions seem to be adem 
5 
provision is made for easy return of the mat 4 
to their starting position. If one 
caution of performing a validity study t d how 
mine the predictive value of the test 2" plem 
best to employ the scores in a selection рї0 
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the test would seem to be well worth trying out 
in a situation which seems to require speed of 
eye-hand coordination. 


Jay L. Oris, Professor of Psychology, and Di- 
vector, Research and Service Center for Busi- 
ness and Industry, Western Reserve University, 
Cleveland, Ohio. 

This test was designed to measure the speed 
and accuracy with which an individual can co- 
ordinate small muscle movements involving eye- 
hand activity and the speed with which he can 
select and match four colors. The test requires 
the coordination of the thumb, index finger, and 
the eyes in a constantly changing spatial pattern. 
It is suggested that the test would be useful in 
selecting employees for work requiring speed of 
eye-hand coordination. 

Directions for administration are clear and 
complete. The only criticism is that the examiner 
is not told how he should score the eye-hand 
portion of the test if the subject drops a marble 
on the floor and looks for it before completing 
the test. 

Normative data are available for several 
groups including preschool children, college stu- 
dents, and occupational groups. Several thou- 
sand subjects are included and most of the groups 
are represented by an adequate number of cases. 
Most of the subjects used are from southern 
states. Measures of central tendency and vari- 
ability from two northern colleges are similar to 
those from southern colleges. However, checks 
were not made for other groups. Nevertheless, 
this test should rate excellent from the stand- 
point of normative material. 

Validity studies discussed in the manual show 
that the eye-hand coordination part yielded a 
correlation of .86 (tetrachoric) with hourly 
earnings of 23 textile workers, .81 with speed and 
accuracy of 62 typing students, and .52 with 
production of то ice cream sandwich makers. 
No evidence of the validity of the Color-Match- 
ing Test is given. Correlations with other dex- 
terity tests range from .29 to .67 for the Eye- 
Hand Coordination Test and from .35 to .54 for 
Color-Matching. The validity data are encour- 
aging. However, to one who has used dexterity 
tests in industry, it is surprising that all re- 
Ported validity coefficients are so high. It would 

useful to the industrial psychologist to know 
those situations, if any, in which the test has 
een unsuccessful, as well as those in which it 
as succeeded, 


The Moore Eye-Hand Coordination and 
Color-Matching Test is a new test, and final 
judgment will be made only after more users 
have had an opportunity to report on its useful- 
ness, The amount of work done on the stand- 
ardization of this test is impressive. It is to be 
hoped that similar efforts will be made by the 
author and others to determine the applications 
in which it will have greatest validity and the 
possibilities for its use in individual prediction 
in vocational guidance work. This test merits 
consideration on the part of examiners who 
need a measure of speed of eye-hand coordina- 
tion. 
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*Purdue Pegboard. Grades 9-16 and adults; 1041- 
48; 5 scores: right hand, left hand, both hands, right 
plus left plus both hands, assembly ; 1 form, '41; man- 
ual (48); $16.05 per testing apparatus; $1.30 per 25 
profile sheets; cash orders postpaid ; 215(10) minutes ; 
prepared by Purdue Research Foundation; Science 
Research Associates, Inc. * 


REFERENCES 
1-3. See 3:666. 


4. Вілетт, C. G. “Pictures of the Mind.” J Rehabil 13:4-10 
Је ’47. * (PA 2123630) 

5. STRANGE, РА > AND SARTAIN, А. О. "Veterans" Scores 
on the Purdue Pegboard Test." J Appl Psychol 32:35-40 Е '48. 


озЕРН, AND ASHER, E. J. "The Purdue Feghoards 
'sychol 

32:234-47 Je '48 
7. ERMAN, ] 
Three-Trial Purdue Pegboard Norms." Occupations 27:251-2 


*49. * (РА 23: 
Due iW. Drsur, Jr. Occupational doi Patterns 
Солена eterans. Docior's thesis, New York University 
(New York, N.Y.), 1949. Pp. viii, 385. * (PA 24:362, title 
only) 
Dexterity Tests.” Ed & Psychol Meas 9:765-72 W ?49. 
26:2950. UM ) 
x Henry D. "The Prediction of Veterans’ Suc- 
ast Scores at the University of Oklahoma,” Part 
„ 59-72. In The Sixth Yearbook о] C 
in Education, 1948-1949. Fairmont, 


Gerst, Hanorp. “Тһе Performance of Amputees on Мор 


ў 12130) 
W. ^ A dministration of the Purdue Peg- 


12. Curtis, JAM RU edi’ 


zs W. 
board Test to Blind Individuals.” Ed & Psychol 
р, 


logical Association, Psychological Monographs: General and 
ded, Vol. 6s, No. 5, Whole No. 322. Wa: о, D. 
the Association, Inc., 1951. Fp. 49, Paper. E ү ло 
вр М. . 

a Біо Purdue Pegboard.” J Appl Psychol 35:312-3 о 
"вт? А бон N., AND STACEY, CHALMERS L, “Ма: 
m CANTOR Онча Mental Defectives.”” Am J Mental Def 
56:401-10 O’st. * (PA 26:2252) 


For reviews by Edwin E. Ghiselli, Thomas 
W. Harrell, and. Albert Gibson Packard, see 


3:666. 
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ts Dexterity Test. High. school and 
Mun Piin esae 2 vont a, eie 
; : '46; [revi u ; 
laee а teeing materials, postpaid ; до - 
limit (10-25) minutes; John E. Crawford and Doro- 
thea М. Crawford; Psychological Corporation. 
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Raymonp A. KATzELL, Director and Staff Con- 
sultant, Richardson, Bellows, Henry and Com- 
pany, New York, New York. 

The test consists of a metal plate in which two 
sets of 36 holes (6 rows and 6 columns) each are 
drilled. There is an additional practice row of 
6 holes in each set. The holes of one set are 
threaded, while the holes of the other are smooth. 
The latter set is used for Part I of the test, 
which consists of placing, with the use of for- 
ceps, a pin in each hole and then a collar over 
the pin. Part II involves manually starting a 
small screw in each of the threaded holes and 
then screwing it down with a small screwdriver. 

The score on each part is the time required to 
complete all 36 holes. The time required for 
Part I seems to range from about 3 to about 
9 minutes, while that for Part II ranges from 
about 4 to about 16 minutes. 

The test is designed as a measure of a com- 
plex of perception-dexterity coordinations such 
as may be involved in tasks like the assembly 
and adjustment of small machinery (e.g., sewing 
machines), or the wiring of intricate electrical 
apparatus. 

The authors hypothesize that a composite 
measure, such as this test seems to furnish, is a 
more valid measure of the aptitudes required for 
jobs of the sort mentioned than tests which use 
the fingers or a single tool only. This conjecture 
gains credibility in the light of the evidence that 
a complex motor pattern is not closely related to 
the simpler movements into which it may be log- 
ically segmented. Yet this specific hypothesis re- 
mains to be empirically verified. 

Thus far, evidence regarding the validity of 
the test is scanty. The manual cites two studies 
of the test’s usefulness, one performed on hand 
engravers and etchers and the other on burlers 
and menders in a textile mill, But, as the authors 
recognize, more evidence will be required in a 
wider variety of situations before we can begin 
to generalize about the test’s validity, 

The reliability of the test promises to be ade- 
quate for most purposes. On four samples rang- 
ing in size from 56 to 118, the split half relia- 
bility coefficients (corrected by the Spearman- 
Brown formula) ranged from .8o to :94 for Part 
I and from .90 to .98 for Part II. 

The two parts measure somewhat different 
functions, correlations between the two ranging 
from .35 to .50 in seven samples. Part I corre- 
lates about .38 with the several parts of the Min- 
nesota Rate of Manipulation Test, whereas Part 
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II correlates about .27. One wonders what the 
correlations would be with tests measuring fine 
coordinations, such as O’Connor’s Таеп 
Dexterity Test or the Purdue Pegboard, They 
tests are likely to function more nearly in the 
same areas as the test under review, 

The manual is well written. It Covers ade. 
quately the administration and scoring of the 
test, describes the norm groups in some detail 
and pertinently discusses the available validity 
and reliability data. It also quite correctly em- 
phasizes the desirability of local validation ani 
norms. Percentile norms are furnished for four 
groups of adult males, three groups of high schod 
boys, and two groups of women. 

The physical construction of the test is ex 
cellent. It is convenient to carry and use, ор 
erates smoothly, and seems durable. In the latter 
respects, it rates somewhat higher than what is 
probably one of its major competitors, the Pw: 
due Pegboard, but the latter is considerably les 
expensive and has the benefit of having seen 
more varied trial. The fact that the Purdue Pep 
board isa time limit test and is somewhat shorter, 
is also to its advantage. 

The Small Parts Dexterity Test has the eat- 
marks of a promising instrument in its field, bt 
little more can be said about it until additional 
evaluative data are available. Those who are in 
a position to experiment might do well to look 
into it further, 


For a review by Joseph E. Moore, see 3:61 


[753] e 
*Spatial Test I. Ages iren cuoi e pa 
рег 100; 155. per 12; 15. 40, per single copy; 35. 2 
manual; postage extra; 41(60) minutes; 1. e 
Smith; National Foundation for Educational 
in England and Wales. * 


REFERENCES 


p Saar, Т, Macrartane. “Measuring Spatial 
Бера УЧА" Occupational Psychol 22:150-9 J 
23:1181 


E. A. PEEL, Professor of Education and im 
of the Department, University of Birmmngi 
Birmingham, England. Pee 
This test has been designed for use 1 iL 
junction with other tests to discover шо sth: 
dren who show potential ability for bes dra 
jects as practical mathematics, technica po 
ing, engineering, and building. It is made Test 
six timed subtests: Test 1, fitting shapes ition; 
2, form recognition; Test 3, pattern mm ой; 
Test 4, shape recognition; Test 5, comp! 
and Test 6, form reflection. al space 
The items of all tests are nonverb 


ility i 
| A 4 
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problems in two dimensions. Test 5 is composed 
of the familiar analogy item in space material. 
Presumably the object of the designer of this 
test was to abstract six recognisable aspects of 
tasks requiring spatial ability. If so, it is curious 
that no three-dimensional material, represented 
by perspective drawings or other conventions, 
is included. This omission may have to be reme- 
died by applying the test with other material 
providing “solid” problems, particularly where 
interest is in the prediction of practical subjects. 
Albeit, the test provides suitable problems in- 
volving analytical and synthetic processes in 
space relations. Answers to Tests т, 3, and 6 
require very simple drawing of lines by the 
testee. This makes for alternative responses and 
requires a decision on acceptable and unaccept- 
able answers. Provision in the key for alterna- 
tive answers appears to be adequate, but their 
existence slows up marking to some extent. 

Adequate practice items are provided, the 
only criticism of them being concerned with their 
thoroughness. Since some discussion of the 
items is still necessary in administering the test, 
this may limit the number of children who may 
be tested simultaneously. The manual of instruc- 
tions is competent. 

Norms are provided on the basis of scores 
standardised to a mean of roo and standard de- 
viation of 15 and so are roughly equivalent to 
IQ scores. The test is very discriminative with 
respect to age, as inspection of the table of norms 
will show. At the.mean, for example, there is а 
rise of some 20 points of raw score between the 
ages of 11 and 13.9. Although the test shows 
significant sex differences in favour of boys, 
some five to six points of raw score on mean 
scores of about 50 points, the author has based 
his norms on a mixed population. This incon- 
sistency is perhaps to be regretted, whether the 


test is intended for testing boys or a mixed popu- » 


lation. 

A serious criticism of the present edition of 
this test is that no figures for the validity of the 
test are provided. Possible users of the test may 
Tightly want to know what is meant by “high 
correlation with assessments in technical draw- 
ing” before they commit themselves to purchase 
of this material. Similarly, no figures are given 
With regard to the correlations of the test with 
other tests of spatial ability and “practical” ap- 
titude, and the reader of the test booklet must be 
Satisfied with a brief reference to the original 
Source of inspiration for the material in the work 


` being blue, 


of Koussy and Stephenson. Some results of fac- 
tor analysis should be available to those inter- 
ested, particularly as the author of the test now 
has available promising factorial data which 
show that the test compares favourably with 
other tests, group and individual, which are 
claimed to measure the gk components of intelli- 
gence. 

The reliability of the test by the test-retest 
method is .93, and a figure of .96 has been ob- 
tained by the Kuder-Richardson method. These 
are id satisfactory reliabilities for spatial ma- 
terial. 


[754] 
*Spatial Test II (Three Dimensional). Ages 10- 
11; 1950-51; I form ['51]; manual ['51]; provisional 
norms ['51]; 115$. per 100; ISs. per 12; Is. 4d. per 
single copy; 3s. 6d. per manual; postage extra; 26% 
(45) minutes ; А. F. Watts with the assistance of D. A. 
Pidgeon and M. K. B. Richards; National Foundation 
{ог Educational Research in England and Wales. * 

[755] 
*Stromberg Dexterity Test. Adults; 1945-51; also 
called SDT ; individual; 1 form, '47; preliminary man- 
ual ('51); $30 per set of testing materials needed b 
examiner; 356 per manual; postpaid ; nontimed (8-15) 
minutes ; Eleroy L. Stromberg ; sychological Corpora- 
tion. * 
REFERENCES 

т. STROMBERG, ELrROY L. “Visual Discrimination, мде, 
and Non-Verbal Intelligence Measured by a Single, Short 8 
Minute) Test." Abstract. Am Psychol 4:392 S '49. * 24: 
1675, title only) 

Jurian C. STANLEY, Associate Professor of 
Educational Psychology, George Peabody Col- 
lege for Teachers, Nashville, Tennessee. 

This well built one-hand manipulative test 
gives promise of being reasonably valid for many 
skilled and semiskilled positions, despite several 
shortcomings of the preliminary (1951) manual. 
The rectangular wooden kit (21% by 17% by 
124 inches) opens up to reveal on one side a 
tricolored formboard with 6 rows and 9 col- 
umns of holes 134 inches in diameter into which 
fit 54 flat, cylindrical wooden disks % inches 
thick. Thirty disks are red on one side or the 
other, 6 of these being red on both sides. Like- 
wise, 30 are yellow on one side or the other, and 
30 are blue. Twenty-four are red on neither side, 
and similarly for yellow and blue. At the be- 
ginning of Trial 1, the testee sees in the tray on 
his left 6 rows of 9 blocks each, the top row 
the next yellow, the next red, the 
etc, He must transfer these as quickly 
as possible in a predesignated order to the form- 
board on his right. On Trial 2, the 9 columns 
in the left-hand tray are alternatingly red, yel- 
low, and blue, and the directions for transferring 


next blue, 
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the disks to the formboard holes are somewhat 
different from those for Trial r. Trials 1 and 
2 are for practice only; Trials 3 and 4 are their 
respective counterparts, the total number of 
seconds needed to complete both being the final 
Score. Such measures range from about 2 to 
4% minutes. 

The author and publisher are to be congratu- 
lated for having repeatedly cautioned against 
uncritical use of the test in their preliminary 
manual. They emphasize that “а test may have 
high validity for workers in one situation; but 
in another case, for workers with the same job 
title the validity of the test may be low." Yet 
their own validity data are not thoroughly satis- 
factory. For a test under trial since 1945, the 
evidence offered is quite meager. Their validity 
discussion is based upon a grand total of 69 
foundry moulders in three different plants, the 
criterion being monthly wages dichotomized at 
the median separately for each plant. Thus three 
pairs of score distributions are shown. For each 
of these the “poor” (lowerpaid) workers were 
in general less speedy on the Stromberg Dex- 
terity Test. No biserial r’s were computed or t- 
tests run. The reviewer finds for the plant with 
the greatest difference between the mean scores 
of “good and poor” workers (14 each) a biserial 
r of .56, which is significant between the 1 and 
5 per cent levels. Why the author chose to di- 
chotomize his criterion rather than use an ordi- 
nary product-moment r between wage and score 
is not stated. Certainly he was not trying merely 
to simplify matters for the statistically untutored, 
because nine r’s appear elsewhere in the manual. 
No mention is made of follow-up studies of be- 
ginning employees, even though percentiles for 
77 “female assembler and welder applicants” 
(italics mine) appear in the norms. 

The correlation between scores on Trials 3 
and 4, stepped up with the Spearman-Brown 
formula, was .84 for 70 female assembler and 
welder job applicants, variability unspecified, 
For 80 male trade school students the reliability 
coefficient of the total SDT scores, determined 
in a similar fashion, was 87. Conspicuously 
missing are standard errors of measurement and 
test-retest r’s to reflect stability over an inter- 


vening period of time. If individual scores are . 


to be used for selection, Trials 3 and 4 might be 
given twice in succession (i.e., 3, 4, 3, 4) in 
order to enhance reliability. With this pro- 
cedure a reliability coefficient of about .92 for 
industrial workers similar to those reported in 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK E 


the manual should be obtained with less thay 
15 minutes devoted to each testee, 

One reliability coefficient (.90) and two thirds 
of the r’s with other tests are based Upon the 
inevitable college students, scores for both Sexes 
being combined. It is difficult to conceive of 
practical uses for the SDT with such groups, nor 
does the author suggest any. 

The test manual does not report experimental 
evidence on such points as proper use with left- 
handed persons, relative scores with each hand, 
sex differences, improvement with practice, and 
factorial content. Concerning left-handedness it 
states that studies are needed but advocates the 
same procedure as for right-handers. 

Directions for giving the test are unusually 
clear and adequate, except that the necessity for 
speed in Trials 3 and 4 is not emphasized for 
the subject, who may have forgotten the brie 
mention of speed at the start of untimed Trial 1 
Some testees will be reminded of this if the stop- 
watch is seen and recognized, but more explicit 
instructions seem needed. 

We are not told what to do if the subject 
accidentally disarranges the blocks, turns one 
over, forgets how to proceed, or otherwise be 
comes confused. Presumably, the only allov- 
able plan is to wait until he places a block ir 
correctly on the formboard and then say “No, 
that's wrong,” followed by “No, that's not the 
next one” if he continues to err. This might cause 
either an infinite score or unstandardiz 
prompting. Starting over seems the only ү 
tion, but of course an undetermined amount 0 
practice effect has already resulted. The 5 
aminer must watch very closely to insure tha 
his subject does not become hopelessly enmeshe! 
in an incorrect response pattern before а d 
ing is given. Thus, he can hardly test more th? 
one person at a time. ant 

Researchers employing the SDT may YA 
to investigate the effect of skewed time ШЕ 
upon 7’s and perhaps seek a transformation 
them if curvilinear regression is found. "m 

The noticeable chipping of paint on the "s 
may introduce slight progressive changes ™ be 
testing situation, though this should not be 
portant unless large areas flake off. ` teas 

The Stromberg Dexterity Test seems at € 
as complex as Stromberg (т) judged it kd 
In addition to requiring visual: discriminan 
dexterity, and nonverbal intelligence, it may ittle 
have a spatial component. The manual gives ver 
usable information on these points, howe 


751 ] 


since the 4 r's with related tests (Minnesota Rate 
of Manipulation, .50 for placing and .34 for 
turning; Purdue Pegboard, .30 for placing and 
ai for assembly) are based upon “college men 
and women." The other two 7°, .45 for 484 in- 
dustrial workers and .39 for 9o additional in- 
dustrial workers with the Adaptability Test (a 
35-item, 15-minute measure of mental ability) 
are moderately high if attenuation is taken into 
account. Despite the statement in the manual 
that “the SDT might very possibly add to the 
value of a selection battery,” the reviewer be- 
lieves that it will perhaps be most useful'as a 
single predictor. This is an empirical matter, 
though, and must be decided in terms of multi- 
ple 7’s rather than armchair speculation. 

In summary, the Stromberg Dexterity Test 
is an attractive, sturdily constructed formboard 
type instrument that may perhaps be found to 
be a valid predictor of success in a rather wide 
range of skilled and semiskilled jobs. It is only 
partially standardized, so that the user will need 
to accumulate his own norms and to make ex- 
tensive validity studies in his own situation, This 
must be done to some degree for almost any test, 
but the dearth of data in the preliminary (1951) 
SDT manual or in the published literature makes 
caution imperative. 


MECHANICAL ABILITY 


[756] 
*A.C.E.R. Mechanical Comprehension Test. 
Ages 13.5 and over; 1942; І form; no data on reliabil- 
ity and validity; 3s. rod. per 10; separate answer sheets 
must be used; 1s. per 10 answer sheets; 3d. per scori 
stencil; 6d. per manual; rs. 6d. per specimen set; cas 
Orders postpaid within Australia; 30(40) minutes; 
Australian Council for Educational Research. * 


REFERENCES 
qd OLADE, М. “An Experiment іп the Use of Ps 
Los in the Selection of Women Trainee Telephone 
Bi" Psychol & Personnel Prac 2:26-32 Mr "46. 


chological 
echanics.”” 
* (PA 20: 


D. W. McEtwatn, Senior Lecturer in Psy- 
chology, University of Melbourne, Victoria, 
Australia, 

This test consists of 45 items. Each item is 
based on a drawing of some mechanical device, 
and usually asks what will happen to a given 
Part when some other part is moved in a pre- 
Scribed way. The diagrams are usually not of 

nown or familiar mechanical objects but are 
made-up affairs which require an analysis of 
the particular set-up. 

The test was constructed by the Australian 
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Council for Educational Research during the 
recent war for use as a selective device for choos- 
ing persons suitable for the mechanical trades 
(a term used in a narrower sense in Australia 
than in the United States to apply only to those 
highly skilled metal trades which work with ma- 
chines, e.g., toolmaker, brass fitter, metal ma- 
chinist, but not engine driver, plumber, drafts- 
man, tinsmith). The test was used in all the 
Australian armed services. Probably nearly one 
million persons (mainly men) were given this 
test in the period 1942-47. 

The test developed from the original work of 
Stenquist, as did the Bennett series in the United 
States and the Test N series of the Australian 
Institute of Industrial Psychology in Australia. 
The test differs, however, from the Bennett tests 
in that (a) four-fold selective answers are em- 
ployed; (b) the items are based on drawings of 
unfamiliar mechanical objects; and (c) the test 
has a sharp difficulty gradient. 

The test has a number of very good features, 
among them excellent photolithographic repro- 
duction on good paper, an extended and care- 
fully planned practice, and some useful ingenui- 
ties in matching the lettering of the diagrams to 
the lettering of the answers. 

Several major surveys of the usefulness of 
the test have been made and many suggestions 
have been put forward for its improvement. The 
test has undergone a major revision by the Army 
Psychology Service, which has made a new test 
with these revisions: (a) 3-dimensional repre- 
sentations are used in the drawings instead of 
flat elevations; (b) direct validation is based on 
ratings on the job in the mechanical trades ; and 
(c) items below medium difficulty for the Army 
have been eliminated. 

While the new test is not publicly available, 
it is mentioned here because the revisions point 
up the main criticisms that can be made of the 
original test. The principal criticism is reflected 
in the change in validation criteria, The original 
test was validated against a combined score of 
the Cox Mechanical Models Test, and the Cox 
Mechanical Diagrams Test and its own item trial 
pool score. 1 т 

Тһе test has in practice given correlations of 
the order .3 to .7 for various populations, occu- 
pations, and vocational training facilities. It has 
a high g saturation under almost all conditions. 
It is easy to administer and is superior to the 
usual mechanical assembly test in predicting 
learning rate for mechanical trades. 
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The A.C.E.R. are presently preparing a new 
form of the test more suitable for adolescents in 
high schools and technical schools. It is planned 
to supplement the new form with a test on tool 
information. ^ 

The A.C.E.R. Mechanical Comprehension 
Test is a very good one of its kind, but like the 
others, has essential difficulties. The test is a 
composite and is certainly multidimensional ; it 
is neither a pure factor test on the one hand, nor 
a job analogue on the other, but lies somewhere 
in between. It has, however, proved very useful. 
For particular selection jobs with known offer- 

' ing populations and within a context of a given 
psychometric battery, such tests are almost in- 
dispensable. 
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Form Relations Group Test. Ages 14 and over; 
1926-46; 1 form ['26]; no data on reliability and valid- 
ity and no description of normative population in man- 
ual; data available on request ; mimeographed manual 
[46]; 6s. od. per 12; 15. 6d. per single copy; 1з. per 
manual; postage extra; 19/6 (25) minutes; National 
Institute of Industrial Psychology. * 


REFERENCES 

т. Laycock, S. R., anp Hutcuzon, №. B. “А Prelimina: 
ahha СЕ into the Problem of мез Engineering Apti- 
tude." J Ed Psychol 30:280-8 Ap '39. * (PA 13:5899) 

2. HOLLIDAY, FRANK. “An Investigation Into the Selection 
of Apprentices for the Engineering Industry." Occupational 
Psychol 14:69-81 Ap '4o. * (PA 14:3710) 

3. SLATER, Patrick. “Some Group Tests of Spatial Judg- 
ment, or practical piss, Occupational Psychol 14:40-55 Ja 
40. 14:2644 

f HOLLIDAY, FRANK. “A Further Investigation Into the 
Selection of Apprentices for the Engineering ndustry.” Occu- 
pational Psychol 15173 94 О. "41. * 

5. HOLLIDAY, FRANK. “А Survey of an Investigation Into 
the Selection of Apprentices for the Engineering Industry." 
Occupational Psychol 16:1-19 Ja '42. * 

6. Kerr, GEORGE. “©. AEN Testing for Secondary Courses: 
An Essay in Control Under War-Time Difficulties.” Occupa- 
tional Psychol 16:73-8 Ap '42. * 

7, SHUTTLEWORTH, CiiFrorp W., “Tests of Technical Apti- 
tude.” Occupational Psychol 16:175-82 О '42. * 

8. Harpinc, D. W. “Prognostic Tests for Students of Archi- 
VH Occupational Psychol 17:126-31 Jl '43. * (PA 18: 
290 

9. Но лрлү, FRANK. “The Relation Between Ps chological 
Test Scores and Subsequent Proficiency of A; Pentia in the 
Engineering КАМИ: Occupational Psychol 17:168-85 О 
"43. 18:183 

то. Moorz, В. С. R., AND Pret, E. A. “Predicting Aptitude 
ich рунии Occupational Psychol 25:192-9 Jl ’51. * (PA 
26:3675 


А. T. Wetrorp, University Lecturer in Experi- 
mental Psychology, University of Cambridge, 
Cambridge, England. 

This is a paper and pencil test with eight sec- 
tions of progressively increasing difficulty. In 
the easiest section the subject is required to 
identify one shape from among 15 to complete 
each of 5 incomplete squares. In the most diffi- 
cult he is required to identify 2 shapes from 
among 12, turning them around or over if neces- 
sary, to complete each of 5 incomplete cubes 
drawn in perspective. There is a time limit for 
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each section; the whole test takes about 25 
minutes to administer. 

The manual supplied with the test gives the 
precise wording of instructions, a scoring key 
and norms. Statistical data on reliability, ete, 
are not included but are contained in articles in 
Occupational Psychology (7, 9), and are sup- 
plied on request by the National Institute of In- 
dustrial Psychology, which strictly controls the 
issue of the test material. 

The norms are in the form of five grades: 
A = best 10 per cent, B = next 20 per cent, C= 
middle 40 per cent, D = next 20 per cent, and 
E = worst 10 per cent. They are given for boys 
and men and for girls and women of secondary 
grammar school (high school) education in three 
age ranges—16 and over, 15 to 15.11 and 14 to 
14.11. Separate norms are given for a relatively 
unselected group of boys and men of the same 
age ranges and also of the range 13 to 13.11. The 
norms are based on the scores of 5,999 subjects, 
some from schools in various parts of England, 
and some applicants tested in the course of in- 
dustrial selection work. The scatter of scores 
shown by the norms is satisfactorily wide and 
indicates that the test can discriminate at both 
the top and bottom ends of the scale. 

The norms for the grammar school and the 
relatively unselected groups are surprisingly 
similar, indicating that performance is not closely 
bound up with the capacities upon which selec- 
tion of pupils for grammar schools is made or 
upon the effects of training in these schools. It 
must, however, be remembered that the test was 
produced at a time when the selection of pups 
for grammar schools on qualities likely to be 
measured by, mental tests of this kind was less 
prevalent than it is today. ^ 

Scores have been found to correlate 44 with 
teachers' estimates of ability based on sam | 
records for a group of 109 technical school pup! 
and from .22 to .54 (weighted average 3 
with instructors’ estimates of proficiency for nine 
groups of apprentices ranging in size from 91 ү 
16. If it is borne in mind that these are selecte 
populations, and that the criteria must have 1" 
cluded qualities other than those involved à 
performance at the test, the correlations 50090 
that the test is likely to form a useful compone? 
of a battery. The reliability is given as 87. "m 

The scores, like those for almost all spee a 
tests, fail to distinguish between those who Жо 
slowly but accurately and those who work mo 
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quickly but with more mistakes. Research would 
seem to be well worth while to determine whether 
the value of speeded tests as a means of selection 
could be enhanced by taking account not only 
of the overall measure of achievement as repre- 
sented by the usual type of score but also of the 
way in which this achievement is attained. 

The “mental manipulation” of shapes in a 
form in which they are not tangible objects that 
can be picked up and turned around would seem 
to be an important feature of many practical 
skilled tasks. This is especially so for three-di- 
mensional shapes as in the last two sections of 
the test. It is an aspect of performance upon 
which experimental research is desirable. 


[ 758] 
xGroup Test 81 pace Perception Test]: Na- 
tional Institute of ndustrial Psychology. Ages 
13-15; 1949; I form ['49]; no data on reliability and 
validity and no description of normative population in 
manual; data available on request; mimeographed 
manual ['49]; 4s. 5d. per 12; 15. per single copy; 25. 
per set of manual, key, and norms; ostage extra; I5 
(25) minutes ; National Institute 0 Industrial Psy- 
chology. * 

REFERENCES 
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i HOLLIDAY, FRANK. “А Further Inyestigation Into the 
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the Selection of Apprentices for {һе Engineering Industry." 
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s. HOLLIDAY, FRANK. “The Relation Between Psychologi- 
cal Test Scores and Subsequent Proficiency of AP rentices in 
the, Engineering Industry.’ Occupational Psychol 17:168-85 
О "43. * (PA 18:1835) 


E. G. CHAMBERS, Assistant Director of. Re- 
search in Industrial Psychology, University of 
Cambridge, Cambridge, England. 

This spatial perception test involves two dif- 
ferent principles. In the first part, а small pat- 
tern formed of crosses has to be recognized and 
marked in a larger pattern of crosses. Fifty prob- 
lems are set with a time limit of seven minutes. 
Tn the second part, five small figures, numbered 
1to 5, are given together with some larger shapes 
which are made up of two, three, or four of the 
numbered figures. The subject is required to 
write down the numbers of the figures compos- 
ing each shape. There are 20 two-piece, 12 three- 
piece, and 8 four-piece problems. Ап overall time 
of eight minutes is allowed for this part. Five- 
grade norms (poor, fair, average: good, and 
very good) based on 3,061 cases from secondary 
modern schools are provided for ages 13.0-13-11 
and 14.0-14.11. 
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The test covers some of the principles com- 
mon to all tests which purport to measure the 
ability to appreciate relationships of space and 
form. The problems are entirely two-dimen- 
sional and the test is moderately easy, some of 
the more difficult elements in spatial percep- 
tion not being included. It is unfortunate that 
an error has been made in printing the num- 
bered figures in the second part of this test. The 
long sides of figures 1 and 2 do not match, so 
that figure 1 does not fit into either the second 
or the fourth of the practice examples. Whether 
Group Test 81 has special excellences or defi- 
ciencies compared with other tests of a similar 
nature is impossible to say without validation 
data. It should, however, be a useful tool in the 
hands of the vocational adviser. 
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MacQuarrie Test for Mechanical Ability. Ages 
7 and over; 1925-43; 8 scores: tracing, tapping, dot- 
ting, copying, location, blocks, purstit, total; 1 form, 
'25; manual ('43) ; $2.50 рег 25, postage extra; 35 

per specimen set, postpaid ; Spanish edition available 
at same prices; 12(25) minutes ; T. W. MacQuarrie ; 
California Test Bureau. 
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For an excerpt from a related review, see 760; 
for reviews by John R. Kinzer, C. H. Lawshe, 
Jr., and Alec Rodger, see 3:661. 


[ 760] 

*[Re MacQuarrie Test for Mechanical Ability.] 
Bancock, Harrier. The MacQuarrie Test as a 
Clinical Instrument: Validity of Vocabulary- 
MacQuarrie Deviations as a Measure of Abnor- 
mal Mental Functioning When Scored in Psy- 
çualogical Units. Lancaster, Pa.: Science Press, 
1950. Pp. vii, 72. $2.00. * (РА 25: 5336) 

J Consult Psychol 15:263 Je’51. Laurance Е. 
Shaffer. This monograph presents evidence for 
a new measure of efficiency of mental function- 
ing, obtained by comparing a vocabulary age 
with corresponding age scores obtained from the 
MacQuarrie Test. The latter test is certainly as 
suitable as any for measuring a deteriorating 
mental function because of its dependence on 
speed, skill, and spatial perception. Although the 
monograph does not establish the validity of the 
Vocabulary-MacQuarrie difference beyond all 
doubt, there is considerable confirming evidence 
of a clinical nature. Norms are based on 1000 
cases. No correction for age is provided, and 
the author argues against its use. 


[761] 
Prognostic Test of Mechanical Abilities. Grades 
7-12 and adults; 1946-47; 6 scores: arithmetic com- 
putation, reading drawings and blueprints, identifica- 
tion and use of tools, spatial relationships, checking 
measurements, total; IBM; Form A ('46); manual 
(47) ; $2 рег 25; 35¢ per specimen set, postpaid; sep- 
arate answer sheets may be used; 2¢ per IBM answer 
sheet; 20¢ per stencil for machine scoring of answer 
sheets; 20¢ per stencil for hand scoring of answer 
sheets; postage extra; 38(45) minutes; J. Wayne 
азаи and Charles E. O’Toole; California Test 
ureau. 


REFERENCES 
1. Міснлкі, WiLLIAM B.; ZIMMERMAN, WAYN mn 
Сотғовр, J. P. “Ап Investigation of the Nature. of g Spa. 


Wirramp A. Kerr, Associate Professor of Psy- 
chology, Illinois Institute of Technology, Chi- 
cago, Illinois. 

This test in its five sectional subscores is a 
shrewd and interesting compromise between the 
demands of factorial purity and the traditional 
labels of mechanical trades know-how. Its care- 
fully prepared manual reveals sectional inter- 
correlations ranging from .30 to :70 and sectional 
correlations with total score ranging from .52 
to .77. Each section seems to contribute some 
unique value to the test, and certainly the five 
sectional scores as labeled have quick and easy 
meaning in both educational and industrial train- 
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ing programs as well as in student or Worker 
counseling. From a pure factorial standpoint, 
the intercorrelations reported among the five 
sections suggest that most of the variance in total 
score is accounted for by three factors: (a) 
number factor (arithmetic computation, read- 
ing drawings and blueprints, checking measure- 
ments) ; (b) functional interests factor (identi- 
fication and use of tools) ; and (c) spatial factor 
(spatial relationships). Independent research 
has shown the basic importance of all three 
factors in predicting success in mechanical 
trades. 

Because it recognizes the demand that a good 
prediction instrument should isolate to some ex- 
tent the principal talents which it measures, this 
test represents a distinct technological advance 
over such competing tests as the Bennett-Fry 
Test of Mechanical Comprehension which meas- 
ures but does not separate the functional interests 
factor and the spatial factor. Future research 
probably can further show its superior validity 
because it permits differential predictive weight- 
ing by sections (or factors) and, in addition, it 
measures the number factor. Evidence in the 
manual of its validity is unfortunately limited to 
sectional contingency coefficients (ranging from 
бо to .71) and a total score contingency coeffi- 
cient (.78) with instructors' ratings ina training 
course for the aviation trades. The authors write 
that "since this is a general type of mechanics 
course, it is assumed that similar coefficients 
would be found for other mechanical occupations 
or trades." The difficulty in evaluating this as- 
sumption is increased by the failure of the man- 
ual to mention the number of cases on which the 
study was based. 

Reliability coefficients ( Kuder-Richardson) 
range from .89 to .94 for grade levels 7-12. EX 
tensive norms (5,268 male students in 7 states) 
are provided for the same grade levels. Adult 
norms are absent, but the authors suggest that 
“the norms for grade 12 be used.” | 

Despite present modest validity data, p 
test seems fairly certain to yield gratifying ЎН 
lidity coefficients in occupations such as automo 
tive and aviation mechanics, building trades, 
electric installation and practice, boat building 
and maritime trades, art metal work, foundry 
work, and related metal trades not primarily m 
phasizing repetitive dexterity. In the latter ™ 
stance, the MacQuarrie Test for Mechanica 
Ability is probably a better one. 
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Doucras G. Scnurrz, Research Associate, Ed- 
ucational Testing Service, Princeton, New Jer- 
sey. 
Presumably this test is predictive of success 
in various mechanical trades and in training for 
those trades. The content is based on an analysis 
of training courses in various mechanical occu- 
pations. (The manual does not give the details 
of how that analysis was carried out.) Yet the 
only predictive validity data presented in the 
manual are a set of contingency coefficients re- 
lating the subtest scores and total score to “in- 
structors’ ratings which were gathered very 
systematically and carefully in a training course 
for the aviation trades.” No mention is made of 
the number of students on whom the figures were 
calculated, the number of instructors who did 
the rating, the bases for the ratings, the level and 
specific characteristics of the course, etc. Such 
information is entirely inconclusive. This re- 
viewer has no reason to agree (or to disagree) 
with the manual’s contention that since the men- 
tioned course "is a general type of mechanics 
course, it is assumed that similar coefficients 
would be found for other mechanical occupations 
or trades." 

The Kuder-Richardson reliabilities of the total 
score for each of the individual grades and for 
the total group in grades 7-12 seem entirely sat- 
isfactory. The publisher's catalog (not the man- 
ual) mentions that the reliabilities are based on 
over 4,000 cases in grades 7-12. Even though 
use of subtest scores is stressed at several places 
in the manual, no reliability coefficients are given 
for them. The intercorrelations among the parts 
are moderately high, but here again the number 
and type of subjects used are not given. 

Subtest and total score norms for individual 
grades and for the total group in grades 7-12 are 
available in the manual. Some 5,000 male stu- 
dents were used to establish these norms, but 
We are not told how many came from each grade. 
Generally the discrimination seems good, par- 
ticularly in view of the small numbers of items in 
the individual subtests. In grade 12, the dis- 
crimination is somewhat poorer at the upper 
end of the test. This would suggest caution in 
the administration of the test to adults. No norms 
are included for adults ; this reviewer would not 
recommend use of the grade 12 norms as sug- 
gested in the manual. The test appears to be 
directed toward the lower educational levels, 
and, while it is listed as appropriate for adults, 
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no material concerning its effectiveness at that 
level is presented. 

The test parts represent the rather heteroge- 
neous content usually found in a mechanical ap- 
titude test. The second part has only simple 
drawings, and no blueprints, as the title might 
imply, to be read. The spatial relationships items 
seem to require an undue amount of fine size 
discrimination (up to differences И 6”). 

The test itself could stand a thorough editing. 
For example, Item 30 does not state how the 
circles are to be compared, and Items 39-45 on 
tool usage should be put in terms of “should be 
used” or “is used" instead of “can be used.” The 
print is of a good size, and the arrangement is, 
on the whole, satisfactory. The pictures in Sub- 
test 3 might be confusing to younger students. 
Option designations are not always clearly 
printed ; “с” and “e” are likely to be misread in 
a number of instances. More practice items 
would be advisable. 

The reviewer’s overall impression is that this 
test, properly used, is one which may be of value 
in many situations where some kind of mechani- 
cal aptitude test is sought. The user should make 
his own validity studies. But the test certainly 
would not seem to constitute a major addition 
to the list of available tests. 


For an excerpt from a review, see 3:674. 
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*Purdue Mechanical Adaptability Test: Purdue 
Personnel Tests. Males ages 15 and over; 1945-50; 
Form A—Men (46); Form T—Women discontinued ; 
preliminary manual ('50) ; $2.50 per 25, postage extra; 
25¢ per specimen set, postpaid; 15 (20) minutes ; bi B. 
Lawshe, Jr. and Joseph Tiffin; ccupational Research 
Center, Purdue University. * 


REFERENCES 


. LawsHe, C. H., JR; SEMANEK, IRENE А.; AND TIFFIN 
Josten. "The РИЧ КП Adaptability Test.” J Appl 
В » 3 21: 
DEAE uu PH. Re, AND TIED Joseru, "The Purdue 
Mechanical Adaptability Tee Abstract Am Psychol 2:428 
47. * E tit! $ 2 
QA Donas E. Appraising Vocational Fitness by 
Means of Psycholo ical m pp. LY PEN д Нагрег 
P3 27. 4: 
ы DNE бур Evano, RN, "Selection of Students 
for a Trade and Industrial Education Curriculum,” pp. 9-14. 
In Motives and Aptitudes in Education: Four Studies. Edited 
by Н. Н. Remmers. Purdue University, Division of Educational 
Reference, Studies in Higher Education, No. 74. Datarea 
Ind.: the Division, December 1950. Pp. iii, 63. Paper. * € 
26:3010) " 
F. “Normative and Validity Data for 
the БИНТКЕ Adaptability Test.” Personnel Psychol 
mx 5 $ 
n ees LG. КЕЕ Norms for the Purdue Mechani- 
cal "Adaptability Test. Master's thesis, Purdue University (La- 
fayette, Ind.), 1950. 


For reviews by Jay L. Otis and Dewey B. 
Stuit, see 3:676. 
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*Revised Minnesota Paper Form Board Test. 
Grades 7-12 and adults; 1930-48 ;, IBM; 2 editions; 
revised manual (’48) ; 35¢ per specimen set of any one 
edition; postpaid; 20(25) minutes; original test by 
Donald G. Paterson, Richard M. Elliott, L. Dewey 
Anderson, Herbert А. Toops, and Edna Heidbreder; 
revision by Rensis Likert and William Н. Quasha; 
Psychological Corporation. * 
а) HAND SCORING EDITION. 1930-48; Forms AA (’41), 
BB (41); $1.75 per 25. 
b) MACHINE SCORING EDITION. 1041-48; Forms MA 
(41), MB (41); $275 per 25; $1.85 per 50 IBM an- 
swer sheets. 


REFERENCES 
‚ 1-9. See 40:1673. 


19 506 See 3:677 A 
58. Murruy, Laura Wirrz. "The Relation Between Me- 
chanical Ability Tests and Verbal and Non-Verbal Intelligence 
Tests." J Psychol 2:353-66 °36. * (РА 11:3928) T 
59. Morris, CHARLES M. “An Experimental Analysis of 
Certain Performance Tests." Abstract. Psychol B 34:716-7 N 
'37. * (PA 12:1661, title only) б ә 
бо. Mercer, MARGARET, An Analysis of the Factors of Sci- 
entific Aptitude as Indicated by Success in Engineering Cur- 
ғісиіа. Doctor's thesis, Pennsylvania State College (State Col- 
lege, Pa.), 1938. (Abstracts of Doctoral Dissertations..., 1938, 


. 97-102.) 
shi Hackman, Ray Carter. The Differential Prediction of 
Success in Two Contrasting Vocational Areas. Doctor's thesis, 
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ГАА SHEPARD, 
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Eucene L, Measurements of Certain Nonm- 
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IU University (Washington, НС; 1944. СБ" 
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66. AcHARD, F. H., AND CLARKE, FLORENCE E "Yon Can 
Measure the Probability of Success as a Supervisor." Per- 
sonnel 2135733 My 45. * 

67. JANKE, Leora LONG, AND HAVIGHURST, iE “ве. 
lations Between Ability and Social Status in a Mid-Western 
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sychol 36:499—509 N "4s, 20:19 
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E RR Am Psychol 1:243 Jl *46. * (PA 20:3772, title 
only 
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STROTHER, Grorce В. “Improving the Selection of Linotype 
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Сілғғокр E, JuncENsEN, Personnel Director, 
Minneapolis Gas Company, Minneapolis, Min- 
nesota. d 
The Revised Minnesota Paper Form ied 
Test, first published in 1934, can be considere 
a classic in measurement of mechanical n 
tude. Tt has had a long history, has been use 
widely, and still warrants high respect and fre 
quent use. dei 
The test is too well known to warrant а ‹ s 
tailed description. Suffice to say that it consi 
of 8 practice items and 64 test items пала 
with 2 dimensional space perception. Each ris 
item consists of the disarranged parts of а 8€ 2 
metrical figure, and 5 assembled geometric £ 
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ures from among which the examinee is required 
to pick out the one figure which is made up of 
the correct combination of parts. 

Numerous validity studies have been made, 
and it is the exceptional case where the test is 
not sufficiently valid to warrant its use for se- 
lecting employees in mechanical type jobs such 
as drafting, inspection, linotype operation, ma- 
chine operation, packing, and the like. Also, the 
test has frequently been found to be predictive 
of academic success in courses such as art, avia- 
tion mechanics, dentistry, drafting, engineering, 
and geometry. In employee selection for main- 
tenance positions such as machinist, pipefitter, 
welder, and the like, this reviewer has had more 
favorable experience with the Test of Mechani- 
cal Comprehension than with the Revised Min- 
nesota Paper Form Board Test, However, if 
time permits, he has found the Minnesota test 
an excellent additional test to be used. 

The 16-page manual is excellent. It opens with 
a history of the development of the test, a de- 
scription of the test materials, and a needed ex- 
hortation to follow the specified time limit. There 
follow simple, clear, and sufficiently detailed in- 
structions on administering and scoring the test. 
Validity data are discussed on three pages, and 
cover many diverse studies. 

Reliability is briefly discussed. For Series AA 
and BB reliability was found to be .85 for one 
form and .92 when both forms were administered 
to 290 high school seniors applying for admis- 
sion to New York University. No reliability 
studies have been published for Series MA and 
MB. The manual states that “there is no reason 
to believe that it is any lower than for Series AA 
and BB,” The reviewer concurs with this view, 
and also agrees with the manual that experi- 
mental evidence would be desirable. 

The manual gives a large number of intercor- 
relations with other tests. In general, the studies 
show that the test is correlated only moderately 
with intelligence tests and with other mechanical 
aptitude tests, Several studies show that sex dif- 
ferences are small (and probably unimportant) 
but that males consistently excel females in 
score, 

Extensive tables of norms are given for well- 
defined and large samples, Four tables are in- 
cluded, separate norms being given for hand 
and machine scored editions and for educational 
and industrial groups. A bibliography of 54 items 
(all but four being published data) is appended. 

The excellence of the manual should do much 


to increase further the use of this test in both vo- 
cational guidance and vocational selection. 


Raymonp A. KarzELL, Director and Staff Con- 
sultant, Richardson, Bellows, Henry and Com- 
pany, New York, New York. 

This is a familiar stand-by among tests of 
spatial relations ability. The current revision, 
published in 1934, has been competently re- 
viewed in earlier volumes of this Yearbook. The 
present review will therefore be confined mainly 
to the contents of the new manual. This manual, 
issued in 1948, replaces a 1941 edition used for 
the hand scored forms of the test and a later 
temporary one for the machine scored series. 
The present edition is designed to clarify some 
confusions in procedures for administering and 
scoring the test, and to present improved and 
extended data on norms and validity. 

DIRECTIONS. Instructions for administering 
both the hand and machine scored series are 
clearly and explicitly given in a format which 
can be easily read by the examiner in the testing 
situation. 

Detailed scoring directions are also given sepa- 
rately for the two series. In the directions for 
the hand scored series, where the scoring for- 
mula is R — W/5, the distinction between omis- 
sions and wrong answers is not defined. There 
is the danger that some inexperienced scorers 
may follow the common school practice of re- 
garding omissions as equivalent to wrong an- 
swers, thus arriving at erroneous scores. (Omis- 
sions occur occasionally, even though discour- 
aged by instructions to examinees.) 

VALIDITY AND RELIABILITY DATA. The exten- 
sive use of this test over a number of years has 
produced information on its validity in a variety 
of educational and industrial situations. Twenty- 
five studies which include validity data on the 
test are summarized in the manual, These stud- 
ies cover the following types of personnel: en- 
gineering and technical students, art students, 
dental students, mechanics, sewing machine op- 
erators, packers, linotype operators, inspectors, 
factory supervisors, and several other types of 
factory jobs. The test has shown appreciable 
validity in most of these areas. 

The manual also contains some rather scanty 
data on reliability as determined on groups of 
students, as well as a useful list of correlations 
with about 20 other tests of various kinds. 

The information in these sections of the man- 
ual should prove most helpful in decisions on 
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whether to use the test and its interpretations. 
A desirable addition would be the indication of 
significance levels of the validity coefficients, 
especially since the N is frequently not men- 
tioned. 

NORMS. When compared with most tests, this 
one is replete with normative data. Percentile 
tables are presented for 13 educational and r3 in- 
dustrial groups tested with the hand scored ser- 
ies, and for rr educational and 5 industrial 
groups tested with the machine scored series. 
Means and sigmas are also furnished for most 
of the groups. The samples are usually amply 
large, and represent a wide diversity of popula- 
tions. A particularly commendable feature is 
the inclusion of a relatively detailed description 
of each sample. 

An up-to-date bibliography of 54 entries adds 
to the worth of the manual, as does the fact that 
it is quite readable. Its otherwise good style is, 
however, slightly marred by the appearance, al- 
beit rare, of such expressions as “so much... 
data" (p. 3) and "unusual amount of studies" 
(p. 6) 

Some imperfections in this revised manual 
have already been noted. One might also like to 
see more information on the rationale and con- 
struction of the test, along with more discussion 
of its composition as revealed by factorial studies. 
But, the negative comments that may be made 
about it are trifling in comparison with the many 
cogent virtues which have been mentioned. 

We have here a test of demonstrated validity 
for various kinds of mechanical and technical 
work; its value is enhanced by the comprehen- 
siveness of its new manual. 


For a review by Dewey B. Stuit, see 3:677; 
for a review by Alec Rodger, see 40:1673. 
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*SRA Mechanical Aptitudes. Grades 9-12 and 
adults; 1947-50; 4 scores: mechanical knowledge, 
space relations, shop arithmetic, total; Form AH 
(747) ; second revised manual ('50) ; 498 per test and 
answer pad; separate answer pads must be used; $1.90 
per 25 answer pads; 55€ per 25 profile and norm sheets 
(749) ; 75€ per specimen set; cash orders postpaid; 35 
(45) minutes; prepared by Richardson, Bellows, 
Henry & Co., Inc.; Science Research Associates, Inc. * 


Атес Корсев, Reader in Psychology, Birkbeck 
College, University of London; and C onsulting 
Psychologist to the Ministry of Labour and Na- 
tional Service; London, England. 

This businesslike production derives chiefly 
from wartime experience. Among those who 
shared the authors’ problems during that period 
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there are not likely to be many dissentients from 
their conclusions about the most desirable com- 
ponents of a battery of mechanical aptitude tests, 
In fact, their decision to include tests of me- 
chanical knowledge, space relations, and shop 
arithmetic would probably be echoed by almost 
everyone who has kept abreast of developments, 
in the last 12 years or so, in the selection of per- 
sonnel for engineering occupations. 

But they offer no validation data at all, It js 
true that in the manual they say “only those types 
of items which produced highest validities under 
Army research conditions were included”; and 
it is true that there is no reason whatever to dis- 
pute their claim. However, they should have 
acknowledged explicitly the fact that types of 
items which have high validity in certain circum- 
stances sometimes have surprisingly low validity 
in apparently similar circumstances ; and they 
should have given some indication of the steps 
they are taking to collect appropriate informa- 
tion. Until they have repaired the gap, we shall 
have to be cautious and take the view that this 
is probably a very good test for a variety of se- 
lection and guidance purposes, but that judge- 
ment on it cannot yet be passed firmly. 


Dovaras G. Scnurrz, Research Associate, Edu- 
cational. Testing Service, Princeton, New Jer- 
sey. 

The purpose of this test is to evaluate three 
types of abilities usually measured in mechanical 
aptitude tests. The choice of the content was 
based on a survey of published literature, as wel 
as on unpublished studies conducted by the 
armed services. Although the test is listed as ap- 
propriate for both sexes, the authors express 2 
belief that it is not as suitable for female groups 
as for male groups. d 

The manner in which the test was constructe 
is well described in the manual and appears d 
have been very thorough. Pretesting and M 
analysis preceded the final selection of items, pn 
the answer options in the shop arithmetic te 
grew out of free answer tallies. The genera 
quality of the items reflects the care exercise 
in the test construction process. 7 dot 

Part-score and total-score Kuder-Richar 3 К 
reliability coefficients are given for each 0! oh 
grades 9-12 and for a group of described pei y 
(young men). The total score figures are S " 
erally acceptable except for the ninth grade E Cl * 
Only the mechanical knowledge test is M 
enough for individual counseling. The man! 
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suggests the possibility of using extreme scores 
on the other two parts ; it might also have men- 
tioned that group comparisons could legitimately 
be made on these part scores. 

Norms are available in the manual for the 
group of trainees mentioned above. Means and 
standard deviations, but not complete norm ta- 
bles, are also presented for grades 9-12. The 
expected increases in scores with advancement 
in grade level are found. This reviewer's im- 
pression was that the space relations and shop 
arithmetic tests were a bit difficult; the data con- 
firmed this impression. For both the trainee and 
high school groups the mean scores on these 
parts are somewhat lower than desirable. Also, 
for girls the mechanical knowledge scores are 
down toward the chance range. These facts may 
indicate some speededness. But, on the whole, 
the level and spread of scores are good. 

The manual does not include any material on 
validity. The statement that the user will obtain 
maximum validity from weights developed for 
his own situation is a very reasonable one. But 
it is regrettable that results from administration 
of the test in some typical situations are not 
presented. The authors say they-are developing 
more complete norms; let us hope they will de- 
vote some attention to validity evidence as well. 

The test is well printed and bound in an at- 
tractive cover; the type size is good ; and the 
pictures and diagrams are, with one or two minor 
exceptions, clear and well drawn. The SRA an- 
swer pad is very convenient but has the disad- 
vantage (for certain purposes) that it is quite 
difficult to determine which specific items have 
been. missed and which response was chosen to 
any item. 

‚ Sound construction and excellent presenta- 
tion of materials characterize this test. It is hoped 
that the adequate beginning in accumulating 
normative data will be continued and that va- 
lidity information will be added. While the test 
does not represent a pioneering effort in the 
field, it appears to be an excellent instrument 
to have available when a mechanical aptitude 
test is required. 
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andi AM of Space Relations Ability. Grades 9-16 
and adults; 1044-49; IBM; Forms A (44), В (47); 
paid: s (49); $2 per 25; 356 per specimen set, n 
answe separate answer sheets may be used; 2¢ per IBM 
айу Sheet; 20€ per stencil for machine scoring of 
КОООЙ 40€ per stencil for hand scoring of an- 
eni eets; Spanish edition available at same prices; 
Flo: it extra; 15(20) minutes; Harry W. Case and 

Yd Ruch; California Test Bureau. * 
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*Test of Mechanical Comprehension. Grades 9 and 
over; 1940-51; IBM; 4 editions; $4 per 25 of one 
edition; separate answer sheets must be used; $185 per 
50 IBM answer sheets of any one edition; 35¢ per spec- 
imen set of any one edition; postpaid; nontimed (30) 
minutes; George K. Bennett, Dinah E. Fry (W-1), 
= William A. Owens (CC) ; Psychological Corpora- 

оп. 
а) FORM АА. Men in grades 9 and over; 1940-47; 1 
form, '40; manual (4. 
b) Form вв, Men in grades 13 and over; 1941-51; ‘1 
form, "41, revised manual (’51). 
с) Form cc. Men in engineering schools; 1949; 1 form. 
d) ForM w-r. Women in grades 9 and over; 1942-47 ; 
т form [47] ; manual tek 
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Different Criteria of Leadership." J Psychol 27:377-88 Ap '49. 


30. GORDON, Tuomas. “The Airline rud Job." J Appl Psy- 
:122-; Н зи 
des ата eom рл rhe Prediction of Veterans’ Suc- 
pre T: Pie Sixth Yearbook of the National Council on 
Measurements Used in Education, 1948-1949. 
W.Va.: the Council, Fairmont State College 1949. Pp. v, 140 
(балону numer mp € praising Vocational Fitness by 
Means of Psychological Ten Lo n rip BO Harper 
5 RP TLLA SHE The "Structure of Practical Abilities.” 
Occupational Psychol 23:81-96 Ap '49. * (PA 23:5 13) w. 
WHITLOCK, JOHN, B., JR, AND CRANNELL, CLARKE W. 
“An Analysis о Ertain Factors in Serious Accidents js 4 
Large Steel Plant" J Appl Psychol 33:494-8 О '49. 


prs “ Factor Exist in 
„ Warter_R. “Does a Perceptual Factor Ex 
Ане Abilit DI Ed Res 4440 51 5 Tou a^ (PA agragat) Д 
36. Воко, WALTER R. “Some ‘actors, vere AT School 
SIME Јн Камо. The Ву 


DO БЕУ Appl Psychol 34:9075 Ap '50. * (PA 
25:2076 


jv. Ковевт W., AND FLETCHER, Frank M., Jr. 
«те Relationship of Owens-Bennett Test Scores to First-Year 


lection of Patrolmen. 
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Achievement in an Engineering College." Abstract. Am Psychol 
5:353 Jl 'so. * (PA 25:1278, title only) 

. 39. arr a Кече 9 Male Hy wr H ys Pa- 
tients." епі ci 96:72 "50, ) 25:259! 

40. Narr, RALPH ENNETH Predictive Value Stand“ 
ardized Tests and Inventories in Industrial Arts Teacher Edu- 
cation, Doctor's thesis, University of Missouri (Columbia, Mo.) 
1950. Abstract: Microfilm Abstracts 10:77-8 no 3 '50. * (РА 
25:4862, title only) í 

41. OWENS, ILLIAM A. Jm. “А Difficult New Test of 
Mechanical Comprehension.” J ‘Appl Psychol 34:77-81 Ap 'so. * 


“© (PA 25:1384) 


42. Travers, Rosert M. W., лхо WarLAce, WIMBURN L. 
“Inconsistency in the Predictive Value of a Battery of Tests.” 
J Appl Psychol 34:237-9 Ag 'so. * (PA 25:6480) 

43. Barnette, W. Lestiz, Jm. Occupational Aptitude Pat- 
terns of Selected Groups ‚А Counseled Veterans. American 

sychological 


Psychological | Association, Monographs: Gen- 
eral and Applied, Vol. 6s, No. 5, Whole No. 322. Washington, 
D.C.: the Association, Inc., 1951. Pp. v, 49. Paper. * (РА 


26:2794) 


Me 44. Gitpert, Harry В. An Evaluation of Certain Procedures 
in the 


Selection of Соир Counselors Based оп Objective Test 
Data as Predictive of Practical Performance. Doctor's thesis, 
New York University (New York, N.Y.), 1951. Abstract: 
Microfilm Abstracts 11:983-4 no 4 '$1. 

45. HALLIDAY, ROBERT Wi, FLETCHER, FRANK M., Jr.; AND 
Conen, Rita M. “Validity of the Owens-Bennett Mechanical 
CORREA Denn Test." J Appl Psychol 35:321-4 О '$1. * (PA 

141 
=! HX A ER Автнов R., Jr. "Validity of Employment Tests 
for Gas-Appliance Service Personnel" Personnel Psychol 
4:199-208 su 'sr. * (PA 26:1735) 

47. Worrr, W. M., AND NonTH, A. J. "Selection of Municipal 
Firemen.” J Appl Psychol 35:25-9 Е "s1. * (РА 25:7124) 

N. W. Morton, Director, Operational Research 
Group, Defense Research Board, Ottawa, Can- 
ada. 

Three forms of this test appeared about a 
decade ago. A fourth form was added in 1949 
extending upward the range of difficulty for 
male subjects. 

The test is intended to measure the ability of 
an individual to understand various kinds of 
everyday physical and mechanical relationships. 
Each form consists of 60 items. Each item in- 
cludes a picture exhibiting one or more objects, 
physical situations, or mechanical relationships 
about which a question permitting a categorical 
answer is asked. The principles underlying these 
questions include leverage (perhaps the most 
frequently represented), force and motion, light, 


heat and sound, etc. Obviously, the examples 


used are ones arising out of most people's com- 


mon experience of physical phenomena rather 
than coming necessarily from technical training. 
It is presumably for this reason that previous ' 
exposure to formal instruction in elementary 
physics appears to confer only a slight advantage 
(about 3 or 4 points). 

Of the four forms, three represent different 
levels of difficulty for men, and one is for women 
(who score much lower than men even on the 
easiest form for men). АП forms are unspeeded. 
Reliability (internal consistency) is of the order 
of .8. For purposes of individual assessment, this 
is not high but commonly considered as accept- 
able (reduction of the error by ага would 


presumably require an additional 20 or 25 min- _ 


utes of testing time). 


any special deman: 


The illustrations are clear, well drawn, did | 
well printed. For all but illiterate individuals the 
easier forms of the test would not appear to make. 
on the subject by way of - 
understanding the kind of performance expected 
of him. 

Evidence of validity is given mainly by cor- 
relation with job, academic, or training course 
performance. These range from .3 to .6. Corre- 
lations with other tests are of the same order, 
For the women's form no validity data except 
correlations with other tests are given. А lim- 
ited amount of evidence is available suggesting 


that this test ranks relatively high in correlating 


with criteria in comparison with other tests 
used. While its value will depend in specific situ- 
ations upon the criterion predicted and the de- 
gree of correlation with other tests used, ex- 
perience with the Test of Mechanical Compre- 
hension would indicate generally that it is a 
useful addition to the stock of measuring devices . 
available for vocational guidance and selection. 


For reviews by Charles M. Harsh, Lloyd G. 
Humphreys, and George А. Satter, see 3:663. 


[767] ; 

*V.G.C. Object Visualization Indicator. High 
school; 1950; an adaptation of United States Employ- 
ment Service's General Aptitude Test Battery, Part Ё 
(see 714) ; 1 form; preliminary manual; $1.95 per 25; 
separate answer sleeves must be used; 48€ per 25 an- 
swer sleeves; 10€ per manual; 456 per specimen set; 
postage extra; available only to Canadian schools; 10 
(20) minutes; Vocational Guidance Centre. * 


[768] А 
*V.G.C. Space Relations Ability Indicator. и 
school; 1950; an adaptation of United States Employ 
ment Service's General Aptitude Test Battery, P i 
(see 714) ; 1 form; preliminary manual; $1.95 per a 
separate answer sleeves must be used; 486 per 25 d 
swer sleeves; 10€ per manual; 45¢ per spect d 
postage extra; available only to Canadian schools; 
(20) minutes; Vocational Guidance Centre. 


[769] d 
*The Vincent Mechanical Models Test A al 
dustrial), Ages 14 and over; 1936-46; 1 {ош БЫУ 
same as test copyrighted in 1036]; no data on re iod 
and validity and no description of normative popu he 
in manual; data available on request; тйпео&Г v 
manual ['46]; norms ['46]; 185. per 12; I$. qa 
single сору ; 120s. per set of 8 models; separate Ley tf. 
sheets must be used; 15. 60. per 12 answer "e tional 
per manual; postage extra; 22(30) minutes; Na 
Institute of Industrial Psychology. * 


REFERENCES election 
1. НолрАү, FRANK, “An Investigation Into pe cupational 
of Apprentices for the Engineering Industry. n 
Psychol 14:69-81 Ap '4o. * (РА 14:3710) Spatial Jw 
2. SLATER, PATRICK. "Some Group Tests of Sp 140-55 J^ 
UBRO P ees Ability." Occupational Psychol 14:49 ii 
"40. 14:2 i pu 
Я Aviron, Сыттокр W. a Tents of Technical i 
."" Occupatii 2175-82 42, 
4. НЫ PANE ФА Further Investigation Into 
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Selection of Apprentices for the 
cupational Psychol 15:173-84 ата 

5. Ношлрлу, Frank. “A Survey of an Investigation Into 
the Selection of Apprentices for the Engineering Industry.” 
Gacupational Psychol 16:1719 Ја ‘42. * Y 

6. HorLrpAY, FRANK. “The Relation Between Psychological 
Test Scores and Subsequent MS V of Apprentices in the. 
Engineering Industry." Occupation з: 
* (РА 18:1835) j 

7. LAMBERT, CONSTANCE M. “Symposium on Selection of 
Pupils for Different Types of Secondary Schools: VII, A Sur- 
vey of Ability and Interest at the Stage of Transfer." Brit J 
Ed Psychol 19:67-81 Je '49. * 


A. T. Wetrorp, University Lecturer in Experi- 
mental Psychology, University of Cambridge, 
Cambridge, England. 


This test is intended to assess mechanical ap- - 


titude. Eight “models” are used, each consist- 
ing of an arrangement of levers, pulleys and. 
cords, or pinion wheels. The mechanism of each 
model is hidden behind a board except for two 
or more projecting arms or cords with a bob 
attached. The administrator shows the models 
one at a time to the subjects, works one of the 
projecting portions, and asks the candidates to 
observe the movements produced in the other 
projecting portions. The candidate has a book- 
let in which, for each model, there is a sheet 
showing eight diagrams of mechanisms some of 
which would produce movements similar to that 
of the model, and some of which would not. A 
“Yes” or “No” answer for each diagram has to 
be recorded on an answer sheet. A commendable 
refinement is that the candidates are told the 
correct answers to the first two diagrams (one 
“Yes” and one *No") of the first model which 
thus act as examples. A time limit of 214 or 3 
minutes is allowed for the answers about each 
model. Subjects are warned against guessing and 
told that marks will be deducted for wrong an- 
swers, 

The working of the models by hand makes the 
test somewhat laborious to administer, but it 
would seem possible to arrange for the models to 
be worked by means of an electric motor. 

The manual supplied with the test gives the 
Precise wording of instructions, a scoring key, 
and norms, Statistical data on reliability, etc., 
are not included but are contained in articles in 
Occupational Psychology, and are supplied on 
request by the National Institute of Industrial 
Psychology, which strictly controls the issue of 
the test material. file 

The norms are in the form of five grades: A= 
best 10 per cent, В = next 20 per cent, C = mid- 
dle 40 per cent, D — next 20 per cent and E — 
Worst Io per cent. They are given for а rela- 


ti ; v 
Vely unselected group of boys in three P8". additional information from this and other tests 


Tanges: 16 and over, 15, and 14. Separate norms, 


А 
е 
Psychol 17:168-85 О ү, 
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Engi: Б I n Y Р . un ` 
O'S, Engineering Industry.” Oc which are substantially higher, are given for 


boys of secondary grammar school (high school) 
education. The norms are based on the scores 
of 3,373 subjects, some school pupils tested in. 
the course of vocational guidance work, and 
some apprenticeship candidates tested in the 
course of industrial selection work. The scatter 
of scores indicated by the norms is satisfactorily 
wide, and indicates that the test can discrimi- 
nate at both the top and bottom ends of the scale, 

Correlations with instructors’ estimates of 
proficiency have been found as follows : 91 trade 
apprentices (engineering), .52; 41 trade aj 
prentices (engineering), .33; 45 apprentices 
(toolmakers), .бо ; 34 apprentices (toolmakers), 
.39; 71 engineering apprentices, .27; 30 engi- 
neering apprentices, .42. Bearing in mind that 
these are selected populations and that the cri- 
teria must have included qualities other than 
those involved in performance at the test, the 
correlations suggest that the test is likely to 
form a useful component of a battery. The split 
half reliability is given as .96. 

Although the test is given with a time limit 
for each section, the time allowed has been 
found sufficient for all subjects to record an 
answer for each diagram if they are able to do 
so. It thus appears to avoid the difficulty with 
speeded tests that no distinction is made between 
the people who work quickly but ine.” 
and those who work more slowly but more ac- 
curately. Further research is, however, needed 
to determine in just what ways performance at 
a task is affected by a time limit. It seems very 
possible that time limits exert important effects 
even when they appear fully adequate. The point 
is not advanced as a criticism of the test, but _ 
should be borne in mind when considering the 
validity of any test carried out with a time limit.” 
Real life situations appear to impose speed pres- 
sures which are frequently less and occasionally 
more severe than those of speeded tests. 

The test is almost if not quite unique, and of 
considerable interest in that it requires appre- 
ciation by subjects of relationships between 
movements of mechanical parts which the sub- 
jects may study directly only in an abstract, 
diagrammatic form. While clearly more than 
this is involved in mechanical aptitude defined 
in any realistic sense, it is a type of appreciation 
which is undoubte _ important, and would 
seem worthy: of following up both as a matter 
of research and of practical testing. Valuable 


> 
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of similar type which may be designed in the 
future could perhaps be gained by considering 
not only overall scores, but also item analyses to 
see whether, and in what ways, one type or com- 
plexity of mechanism is more difficult to under- 
stand than another. 


MISCELLANEOUS 


[770] 
*Employee Evaluation Series, Adults; 1948; 6 
tests; 1 form; $2.50 per 25 of any one test; $3 per 
specimen set; $1 per specimen set of any one test; post- 


paid; Joseph E. King; Industrial Psychology, Inc. * 
а) PERFORMANCE: CLERICAL. 

b) PERFORMANCE: MECHANICAL. 

С) PERFORMANCE: SALES. 

d) PERFORMANCE: TECHNICAL. 

€) PERFORMANCE: ADMINISTRATIVE, 

Í) PERFORMANCE: FOREMAN. 


REFERENCES 
1. Kine, Јоѕерн E. “Multiple-Item ANA to Merit Rat- 


ing.” Abstract. Am Psychologist 4:278 Jl '49. * (PA 23:6482, 
title only) 

Brent Baxter, Director, Agencies Research 
Division, The Prudential Insurance С ompany 
of America, Newark, New Jersey. 

This series provides an overall package for 
evaluating most employees in many companies. 
It has many practical aspects and attempts to 
present a simple formula for supervisors in both 
reviewing and interpreting the performance of 
their employees. 

The structure of the series is a list of state- 
ments for each of the six work areas for which a 
form is provided. There are 50 items for each 
type of work (60 in forms are to be published). 
The value of the forms to any given company is 
limited by the degree to which these general 
items apply to the job of the person being rated. 
The company might well prefer to add or sub- 


tract certain items ог put a different emphasis 


on the job components than is included in the 
printed list. The instructions and language also 
will not be appropriate for all companies. Thus, 
a company which likes this multiple item ap- 
proach to rating may choose between the labors 
of tailor-making its own form or the adoption 
of a carefully made form which may not fit its 
situation too well. 

The author emphasizes that this rating ap- 
proach minimizes the halo effect and lays stress 
on evaluating performance. But there is still 
ample opportunity for halo effect. Many items 
concern attitude rather than performance be- 
havior, e.g., “Completely sold that this is the 
‘best place in town’ to work." Responses to 


this kind of item are subject to the halo effect, 

At present, there is no manual to go with the 
series to explain adequately its reliability and 
other evaluating data. The author reports one 
is currently being prepared. He submitted а 
manuscript of an APA (1949) paper indicating 
a corrected split half reliability for the clerical 
series of .92. Results from two raters correlated 
81 (т). It correlated .73 with results from 
a man-to-standard rating scale. From the APA 
paper one may conclude that the clerical series 
has had extensive statistical analysis, Efforts 
have been made to weed out ambiguous items 
and items which correlate highly with the total 
score, 

Considerable effort has been made to design 
a scale which will result in a normal distribution 
of scores. While this achieves a desirable spread 
of scores, there has been some overconcern with 
this aspect. Each rater is “expected” to achieve 
this normal distribution which may not fit his 
group at all. 

The evaluation “system” is tied in with both 
percentiles and stanines. To have both of these 
scales may be confusing to many and is unneces- 
sary. The stanine ranges are not calculated as 
is usually done (1.е., in equal class intervals). 

The present series provides for the rater to 
check a statement if it is true about the employee 
and to leave it blank otherwise. Omissions thus 
may be counted “against” the employee. The 
author reports that a revised series will provide 
for a “not true at present” marking. Neither 
form allows the rater to mark the statement 
“not relevant” or “don’t know.” This may force 
unjust ratings to be made. 

The author claims that by adding the favorable 
replies on the statements one achieves а total 
score "in which the whole is actually greater 
than the sum of its parts.” This statement may 
mislead many readers into thinking that some 
thing special is added in some mysterious Way: 
Apparently what is meant is that the items really 
represent a sample of the total number of "s 
ments that might be made about the employe 
and that conclusions may now be drawn abou 
the total. In view of how the sample was draw? 
it might be much better to limit interpretation 
to the specific statements. ‘ole 

SUMMARY. The series utilizes the Ed 
item rating approach to provide industry EA 
a ready-made rating program. Although ү 
neatly arranged and has many practical ene 
it doesn't live up to some of its marketing claim 
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such as being a basis for "getting away from 
favoritism and influence." It is not a cure-all for 
personnel problems. It will not fit all compariies ; 
a tailor-made instrument is to be preferred. 


[771] 
*Employee Merit Report. Employees; 1949; also 
called Comprehensive Plan for Rating Employees; 13 
ratings: quálity of work, quantity of work, knowledge 
of work, dependability, attitude toward work, adapta- 
bility, cooperation, initiative, personality, judgment, 
supervision, safety, overall merit; no data on reliability 
and validity; no norms for part scores; no description 
of normative population; $6.25 per 100; 256 per speci- 
men set; postpaid; Keith Van Allyn; Van Allyn Insti- 
tute, 

[772] 
*[Employee Selection Forms.] Job applicants; 
1947; 4 parts; specimen set not available; cash orders 
postpaid; developed by Robert N. McMurry & Co.; 
Science Research Associates, Inc. * 
а) APPLICATION FOR EMPLOYMENT, I form in 2 colors 
(pink for women and green for men) ; $1.10 per 25 of 
any one color. 
b) APPLICATION FOR SALES POSITION. $1.70 per 25. 
C) TELEPHONE CHECK WITH PREVIOUS EMPLOYERS. $4.25 
per 100. 
d) STANDARDIZED 
FORM). $1.60 per 25. 


SELECTION INTERVIEW (SHORT 


[773] 
*[Executive, Industrial, and Sales Personnel 
Forms.] Applicants for executive, office, industrial, or 
sales employment; 1949-50; a series of interviewing 
aids to facilitate selection of personnel; postage extra; 
developed by Robert N. McMurry & Co.; Dartnell 
Corporation. * 
а) [EXECUTIVE PERSONNEL FORMS.] 
1) Application for Executive Position. Form EA- 
301 (740); 1-99 copies, 10€ each; 100-249, 8¢ each. 
2) Patterned Interview Form—Executive Position. 
Form EP-302 ('49); 1-99 copies, 30¢ each; 100- 
249, 25¢ each. 
3) Telephone Check on Executive Applicant. Form 
ЕТсзоз (50); 1-99 copies, 8¢ each; 100-249, 7¢ 
4) Selection and Evaluation Summary. Form No. 
им (750); 1-99 copies, бё each; 100-249, 5¢ 


b) [INDUSTRIAL PERSONNEL FORMS.] 
1) Application for Position. Form No. OA-201 
50) ; 1-99 copies, 7¢ each; 100-249, 6¢ each. 
2) Application for Employment. Form No, OC-200 
50) ; 1-99 copies, 66 each; 100-249, 5¢ each. 
3) Patterned Interview (Short Form). Form No. 
GE ad (49) ; 1-99 copies, 10€ each; 100-249, 8¢ 


4) Telephone Check. Form No. OT-203 ('49) ; 1-99 

Copies, 6¢ each; 100-240, 5¢ each. 

5) Telephone Check With Schools. Form No. OS- 

294 (^49) ; 1-99 copies, 66 each; 100-249, 5€ each. 
2 Selection and Evaluation Summary. Same as for 
xecutive Personnel Forms. 

[SALES PERSONNEL FORMS.] 

" Application for Sales Position. Form No, SA- 

3 Á 50) ; 1-99 copies, тоё each; 100-249, 8¢ each 

N atterned Interview Form—Sales Position. Form 

3s а (750); 1-99 copies, 15¢ each; 100-249, 

Ü Telephone Check on Sales Applicant, Form No; 

-103 (40); 1-99 copies, 7¢ each; 100-249, 6¢ 


c 


each, 
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Select 3 

С ча ttd ri, наат 
FLovp L. Rucu, Professor of Psychology, Uni- 
versity of Southern California, Los Angeles, 
California. di Er 

„The use of the Patterned Interview as a pre- 
dictive device is based upon the assumption that 
the best way to predict what a person will do 
in the future is to review carefully what he has 
done in the past. It is designed to cover syste- 
matically six major areas of information about 
the candidate : work history, education and train- 
ing, family background, financial situation, do- 
mestic and social situations, and health record. 
It begins with the work history of the candidate, 
thus getting him “warmed up” so that the “in- 
terviewer can probe successfully into more per- 
sonal matters.” 

The standard form on which the interviewer 
records the answers to the questions includes 
not only the questions to be asked of the candi- 
date, but additional questions (printed in red) 
which are designed to help the interviewer inter- 
pret the answers to the questions. Other stand- 
ard forms available for use in conjunction with 
the Patterned Interview are the application for 
position, the form for telephone check of former 
employers, the form for the telephone check of 
schools, and the selection and evaluation sum- 
mary. All these forms, except the last, are avail- 
able in special editions for the selection of execu- 
tive personnel, industrial personnel, and sales 
personnel. The telephone checks are especially 
highly recommended by the authors, and tech- 
niques for obtaining accurate information from 
these are outlined in the manual, The Step-by- 
Step Selection Program. 

After obtaining “complete information" by 
means of the Patterned Interview, the telephone 
check, and the application blank, the information 
which has been gathered must be interpreted. 
The manual lists eight character traits which are 
usually considered to be essential for any job. 
These are stability, industry, perseverance, loy- 
alty, motivation, and maturity. In addition, two 
more traits, leadership and competitiveness, are 
listed as being important for some jobs. Each of 
these traits is explained in the manual as an 
aid to the interviewer in making the interpreta- 

n. 
“The overall rating is made on the basis of a 
4-point scale. “This rating is not a mathematical 
summing up of points but, rather, a comparison 
of the applicant’s qualifications with those re- 


774 


quired by the particular job." For successful use, 
the Patterned Interview makes a thorough 
knowledge of the requirements of each job man- 
datory on the part of the interviewer. The state 
of the labor market should not influence the rat- 
ing. It is recommended that only the “‘1’s” and 
“2's” be hired; but with a tight labor market, 
“3's” may be included. 

According to thé manual, the Patterned In- 
terview is superior to ordinary interviewing pro- 
cedures in that it provides (a) a systematic plan 
for the interviewer to follow; (b) a technique 
for getting facts; (c) a set of principles to use 
in interpreting the facts; and (d) a method of 
minimizing personal biases and prejudices. 

The largest validation study made of the Pat- 
terned Interview had two criteria: length of 
service (from less than one week to over one 
year), and foreman's ratings. With an N of 
587, the Pearson coefficient was .43 for length 
of service versus interview ratings. With an N 
of 407, the Pearson coefficient against the cri- 
terion of foreman's ratings was .68. These rat- 
ings were made without knowledge of interview 
results. In a study involving 108 truck drivers, 
where the criterion was length of service (on 
the job at the end of 11 weeks), a biserial co- 
efficient of .61 was obtained. In still another 
study, in which the interview ratings were cor- 
related against supervisor's ratings on 84 sub- 
jects, the Pearson coefficient was .бт. 

It should be pointed out that in all the studies 
above, the interviewers were well trained in psy- 
chology and/or interviewing. While these stud- 
ies were certainly better controlled than the 
usual interview study, and while the Patterned 
Interview technique is no doubt an improvement 
over ordinary interviewing methods, the ques- 
tion still remains unanswered as to whether the 
magnitude of these validity coefficients results 
mainly from the use of the standard forms or 
Írom the ability of the interviewers. In the final 
analysis, the value of the procedure lies in the 
interviewer's ability to make a valid subjective 
rating of a comprehensive set of objective facts 
which the Patterned Interview has led him to 
accumulate. 

i [774] 
*How Supervise? Candidates for supervisory train- 
ing; 1943-48; Forms А (43), В (43), M (48); Form 
M consists of items from Forms and B “which 
proved most discriminating at higher management lev- 
els” ; revised manual ('48) ; $2.25 per 25; 356 per speci- 
men set; postpaid; nontimed (15-40) minutes; edited 


by H. H. Remmers; Quentin W. File; Psychological 
Corporat. Ж 
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Mitton M. Махов, Chief, Administrative 
and Management Testing, U.S. Civil Service 
Commission, Washington, D.C. 

Ever since 1945 there has been an awakened 
awareness of the need for better supervisory and 
executive selection programs. Every large or- 
ganization in private industry and government 
pays at least lip service to the need for improved 
supervisory and executive selection methods, 
and in many cases fine programs in this area 
have actually been installed and are operating 
effectively. An increasing number of articles оп 
this subject have appeared in the psychological 
journals during the past few years. 

The test, How Supervise?, consists of two 
forms of 70 questions each and a third form, 
Form M, containing тоо questions. All of the 
items in Form M come from the first two forms. 
Form M is recommended for use in selecting 
higher level supervisors. For each statement the 
candidate is asked to indicate whether he agrees 
or disagrees or whether he considers the prat- 
tice desirable or undesirable. The candidate ca? 
mark his answer as uncertain if he wishes (0 C? 
so. In other words, the test is basically a tue 
false test. 

The validity data available for this te 
cate that it can make a moderate contribu 4 
better supervisory selection. This contributio 
probably comes from both the verbal aspects M 
this test and the knowledge of supervisory jd 
tices involved in the questions. The test iif it 
competitors in the sense of available tests 
may be purchased from test publishers, althoug 
Richardson, Bellows, Henry and Company 
market a test which measures the same inform 


st indi- 
tion (0 
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tion as this one and is an improvement over it. 
This test is a supervisory judgment test with 
multiple choice questions that cover the general 
field of supervisory practices. In addition, Pro- 
fessors John R. Kinzer and John E. Horrocks 
have published in preliminary form a test for 
supervisory selection called "Joe Greene: A 
Supervisor's Opinionnaire." This test, also in 
multiple choice form, seems, on the basis of in- 
ternal evidence, to be a significant contribution 
to the supervisory selection field, although no 
validation data appear to be available in the liter- 
ature for it. 

It is probably inadvisable that How Super- 
vise? should be the only part of a supervisory 
testing program. It is quite possible, indeed, 
that validation studies may indicate that this 
test does not contribute anything unique in a 
full battery of tests for supervisory selection. 

As a relatively crude basis for screening first- 
level supervisors of relatively low educational 
level, this test might make a contribution, al- 
though before using it, one should first explore 
the greater validity that may be obtained from 
the use of supervisory judgment types of tests 
in multiple choice form and the use of aptitude 
tests in the field in which the supervision will 
take place. Pioneer efforts generally suffer by 
comparison with later improved versions and 
this seems to be the case here. 


For reviews by D. Welty Lefever, Charles I. 
Mosier, and C. H. Ruedisili, see 3:687. 
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*Interview Aids: Purdue Vocational Series. Ap- 
plicants for industrial employment; 1943; 4 interview- 
ing aids; 1 form; no data on reliability and validity ; 
no key; no norms; $1.70 per 25 of any one aid; 50¢ 
рег specimen set; cash orders postpaid; (5-10) min- 
utes per aid; edited by Joseph Tiffin; C. Н. Lawshe, 
С. А. Satter (1), and Lawrence С. Lindahl (2) ; Sci- 
ence Research Associates, Inc. * 
а) NUMBER I: INTERVIEWER'S RATING SCALE. 
) NUMBER 2: CAN YOU READ A WORKING DRAWING? 
C) NUMBER 3: CAN YOU READ A MICROMETER? 

NUMBER 4: CAN YOU READ A SCALE? 


Witam W. Warre, Associate Professor of 
Industrial Engineering, Columbia University, 
New York, New York. 

INTERVIEWER'S RATING SCALE. This scale is 
designed to aid an employment interviewer in 
evaluating the qualifications of applicants for 
work. It provides a positive record instead of 
the rather vague notes of opinion frequently 
found in industry. The subject is rated on six 
Personal characteristics: (a) appearance, (5) 
ability in self-expression, (c) friendliness and 


775 


sociability, (d) work record, (e) initiative, and 
(f) desire to work for the organization to which 
he has applied. Provision is also made for evalu- 
ating the subject's potentialities for each of two 
specific job titles, on the basis of his experience 
and training, as well as his information on the 
general occupational area involved. АП the char- 
acteristics and the degree of qualification are 
graded in five-step scales. The degrees of each 
characteristic are defined clearly in terms that 
are easily intelligible, rather than in the often 
used “Good-Fair-Poor’” scale or by means of 
single adjectives. The instructions for the use 
of the scale are satisfactory. 

Altogether, this scale seems an excellent 
medium for improving interview records in em- 
ployment offices by forcing interviewers to con- 
sider certain definite items with respect to each 
applicant and to evaluate these characteristics 
on a standard scale. 

CAN YOU READ A WORKING DRAWING? This 
test consists of a single sheet with directions on 
one side and three views of a machine part on 
the other. In the blueprint, the part is rather 
elaborately dimensioned, and there are 23 places 
in which the interviewee must fill in missing 
figures by selecting the proper point on another 
view of the part and reading the dimension. 
While the authors do not provide any data on 
validity, this reviewer believes that a person 
with an elementary knowledge of mechanical 
drafting would find the test easy, but that any- 
one without such knowledge would find it quite 
difficult. The test should, therefore, prove to be 
fairly satisfactory. | s 

One point which may cause difficulty if this 
test is used with people who are qualified ma- 
chinists should be noted. The dimensions are so 
placed in several cases that subdimensions add 
up to an overall figure, which is also shown. 
There is a note on the drawing to the effect that 
tolerances are +%4 of an inch, This means, for 
example, that, while subdimensions of 1% 
inches, 134g of an inch, and 21916 inches add up 
to a nominal total of 5% inches, if the machinist 
making the part uses the allowed tolerance of 
—%4 of an inch on each of the subdimensions, 
the actual total will be only 5964 inches, or less 
than the required 5764 inches which is allowed. 
In practice, the dimensioning of a part is car- 
ried out in such a way that tolerated deviations 
will not conflict, even if they are cumulative. A 
person who had had experience in machine 
work might notice this discrepancy. 
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CAN YOU READ A MICROMETER ? This test con- 
sists of a single sheet with directions on one side 
and sketches of ro micrometer readings on the 


other. Various standard types of micrometers. 


for measuring outside and inside dimensions, 
thread diameters, depths, and the like are repre- 
sented. The examples are well chosen and the 
figures are clear and easy to read. While the au- 
thor does not state what is considered to be a 
satisfactory score, it is obvious that the indi- 
vidual who has no familiarity with the mi- 
crometer will have considerable difficulty in 
making a high score. The test should be of value 
to interviewers in identifying people who have 
the ability to use a micrometer. 

CAN YOU READ A SCALE? This test is designed 
to indicate whether the interviewee is capable of 
reading dimensions from a standard machinist's 
scale or rule. It is printed on a single sheet, with 
simple directions on one side and 12 sketches 
on the other. The sketches show nuts, bolts, 
washers, screws, springs, and other machine 
parts as well as a caliper reading. The dimen- 
sions involved range down to. 64ths of an inch, 
but the sketches are clear enough so that the 
dimensions can be read easily if the subject is 
familiar with this type of work. 

The authors do not supply any information 
on the validity of the test or on what constitutes 
a satisfactory score. It is this reviewer's opin- 
ion, however, that any applicant who fails to 
read at least то of the dimensions accurately 
should be considered inadequately qualified in 
this particular field. 
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X*Interview Rating Scale for Prospective Em- 
ployees. Rating of applicants following an interview; 
1948; 11 ratings: mental alertness, personal neatness, 
courtesy, emotional stability, self-evaluation, maturity 
of judgment, temperament, frankness, impressiveness, 
ability to persuade others, personal dominance; $r.50 
per 25, cash orders postpaid; sample scale free to quali- 
fied test users; William J. Morgan and Antonia Mor- 
gan; Aptitude Associates, * 


Jay L. Oris, Professor of Psychology, and Di- 
rector, Research and Service Center for Busi- 
ness and Industry, Western Reserve U. niversity, 
Cleveland, Ohio. 

This single-page rating scale, with instructions 
on the reverse side, has been designed as a means 
of recording impressions of an applicant, either 
male or female. The rater is asked to enter iden- 
tifying information at the top of the form, select 
from each of 11 groups of three trait names the 
one most descriptive of the applicant, answer 


questions concerning the applicant's liabilities 
and assets, and finally give his recommei ations, 
The authors recommend the scale especially for 
applicants for contact occupations, 

The trait names describing each characteristic 
to be rated are arranged from most to feast de- 
sirable. However, there is no evidence that an 
effort has been made to select for the middle 
phrases terms which are equidistant from those 
at the extremes. In fact, the question could be 
raised as to whether some of the groups of three 
are on the same continuum. Examples of these 
are: "modest, complacent, boastful" ; “mature, 
sensible, naive”; and “persuasive, responsive, 
taciturn.” 

This rating scale fails to meet the definition 
for standardization in several important re- 
spects. It is realized that there are many prac- 
tical problems involved in determining the va- 
lidity and reliability of such an instrument, but 
these concepts are not mentioned in the instruc- 
tions. Normative data are not furnished, but the 
user is told that he can easily develop standards 
in his own situation. Trait names are not defined 
or discussed ; nor is assistance given in determin- 
ing the desirable characteristics for the various 
types of work. Rating each characteristic on a 
three-point rating scale gives only a gross de- 
scription. For example, the interviewer is asked 
to describe the emotional stability of the appli- 
cant with one of three words: “imperturbable, 
steady, excitable.” 

Perhaps a greater inadequacy than the lack 
of standardization is an absence of adequate in- 
structions accompanying this rating form. The 
interviewer is not aided in developing better 
interviewing techniques. He is not given any 
examples of the types of behavior which a 
described by these trait names ; nor is he taught 
to discriminate between the important and un- 
important interview information. s te 

The most important use of this rating SC? 
would be in a screening situation in which Т 
interviewer must see large numbers of peop 
and make some sort of rough systematic rating 
of each person. However, even here it is di js 
cult to justify its use. This rating scale, 1 ү 
present stage of development, makes little co 
tribution to industrial psychology. 


S: Rarns WALLACE, JR., Director of Research 

Life Insurance Agency Management А550 

tion, Hartford, Connecticut. dioc 
This one page interview summary is “15181 
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to serve as a systematic record of your impres- 
sions of a prospective employee, either male or 
female.” It purports to cover ІІ characteristics 
including “mental alertness,” “courtesy,” “self- 
evaluation,” “impressiveness,” “ability to per- 
suade others," etc. This "coverage" is obtained 
by the interviewer’s selection of one of three 
descriptive phases for each characteristic. For 
example, “mental alertness” is covered by in- 
dicating whether the applicant is “wide-awake, 
attentive, or apathetic.” 

The instructions state that numerical scoring 
is not necessary since "the ratings on the Scale 
speak for themselves.” The reviewer, unable to 
hear the magical voices, wrote the publisher for 
information on the reliability and validity of this 
unscaled scale. The reply stated, “Since we re- 
gard the Scale as being of value primarily as a 
clinical instrument for the training of personnel 
interviewers, we have not made reliability and 
validity studies.” 

Also included in the instructions is the state- 
ment that “Standards vary greatly in different 
occupations and in different companies. Norms 
are, accordingly, not supplied, but your own 
standards can easily be developed.” This re- 
viewer can see no reason for anyone to bother. 

This is an unusually blatant attempt to em- 
ploy an undefined and high-sounding vocabu- 
lary in the exploitation of business men. 
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XKahn Career Orientation Questionnaire: А Pre- 
liminary to Vocational or Educational Counsel- 
ing, Student Form. Grades 9-16; 1948-51; 11 scores: 
emotional stability, unrealistic approach, lack of voca- 
tional information, lack of vocational connections, lack 
of job experience, unsatisfactory family adjustment, 
deratisfactory school adjustment, physical limitations, 
clensiveness-inflexibility, intensification of defensive- 
ness-inflexibility, anxiety; 1 form, '48; revised manual 
GD; $2.50 per 25 sets of test, answer sheet, profile, 
nd evaluation sheet; 25¢ per specimen set; postpaid ; 
nontimed (20-30) minutes; Theodore C. Kahn; Guid- 
ance Tools, * 


ARTHUR E. Traxter, Executive Director, Edu- 
cational Records Bureau, New York, New York. 
This questionnaire, designed for use in schools 
ànd guidance centers where professional coun- 
seling services are available, is intended to pro- 
vide an introductory device to assist young рео- 
eed gaining insight into their vocational prepa- 
dh plans. The questionnaire is not in- 
e ed for use in clinical counseling. It is not 
uitable for employed adults. 
questionnaire makes up the first three 
Pages of a four-page folder with the last page 
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devoted to directions to permit examinees to 
score and interpret their own scores. The ques- 
tionnaire is divided into two parts. Part I con- 
tains 50 questions concerning the personal quali- 
fications of the subject and his vocational prepa- 
ration and plans to which he responds by check- 
ing yes, no, or uncertain. Part II calls for the 
listing of the occupations in which the subject 
can earn a living, the occupations in which he 
will be able to start after he has completed his 
training, and the occupations in which he is 
interested. 

The questionnaire is accompanied by three 
supplementary sheets. The “answer sheet” is 
actually a work sheet to be used by the student 
in scoring his paper and in analyzing his re- 
sponses in terms of the 11 categories which are 
not identified for the student. The names of the 
categories appear on a profile and analysis sheet 
intended for use by the teacher or counselor. The 
author warns that an individual’s profile is to 
be used in an exploratory way only and not as 
a basis for clinical counseling. The third sheet 
is used for recording miscellaneous evaluations 
made by the counselor. 

A rather poorly edited, typewritten, five-page 
manual presents some information concerning 
the construction and use of the questionnaire. 
The questionnaire was designed at the Veterans 
Administration Guidance Center of the Uni- 
versity of California Extension and was “stand- 
ardized” on two groups of junior college stu- 
dents in California. Little statistical informa- 
tion is given. The reliability coefficients of the 
scores of two groups of junior college students, 
with periods of one month and three months 
between administrations, were found to be .93 
and .83, respectively, figures as high as those 
usually found for this type of instrument. In one 
study of correlation of scores with counselor rat- 
ings, an r of .59 was obtained ; in a second study 
on another group, 7 was .39. Although these cor- 
relations are not high, they are statistically sig- 
nificant and they fall within the range frequently 
found when interest questionnaire scores are 
compared with ratings. The manual also reports 
positive correlations between four subscores on 
the questionnaire and certain categories of the 
Minnesota Multiphasic Personality Inventory. 

The scoring procedure, although somewhat 
involved and lacking in readily apparent reasons 
for some of its steps, should be interesting to stu- 
dents. 


In the selection of the questionnaire which is 
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intended to help the student interpret his score, 
some useful advice concerning the obtaining of 
professional vocational counseling is given. 
However, in view of the author's careful warn- 
ing that the questionnaire should not be used in 
clinical counseling, it is rather surprising to find 
the following printed statement to students 
whose scores fall within the 80-90 range: “You 
have an above average chance in securing and 
keeping a position in which you may be happy." 
A statement of this kind should probably be 
worded more cautiously. 

It is difficult to form an overall judgment of 
the worth of an instrument for which so few re- 
search data are available as there are for this one. 
The present reviewer is inclined to question the 
value of the scores, and particularly of the sub- 
Scores, which, as the author states, should be 
used with great caution. He feels that the ques- 
tions themselves are, on the whole, well con- 
ceived. Without reference to the scores, the re- 
sponses to the questions should be useful as a 
starting point in initial counseling interviews, 
although probably not more useful than those 
to the questions in several well known interest 
inventories. 


[778] 
*Lawshe-Kephart Personnel Comparison Sys- 
tem. For rating any aspect of employee performance 
by the paired comparison technique; 1946-48; Form 
A. ('46) ; no data on reliability and validity in manual 
(for data presented elsewhere by the authors see 1); 
manual (748); $2.50 per 50, postage extra; C. H. 
Lawshe, Jr., and N. C. Kephart; J. S. Mayer & Co. * 


REFERENCES 

JU ETUR Me EO 
(PA 1355040) iployees. 4 зус! 33:69-77 49. 
RzreN Н. BrrrNER, Assistant Director of Per- 
sonnel Research, The Prudential Insurance 
Company of America, Newark, New Jersey. 

This system is an attempt to make the adminis- 
tration of paired comparisons feasible for in- 
dustry. The paired comparison technique of rat- 
ing requires that each employee be compared 
with every other employee in his work group 
with respect to some defined characteristic of 
job performance, that the pairs be presented in 
a random order, that the preferences be tallied, 
and that an index be calculated which indicates 
each employee's relative position in the rating 
group. Although it has generally been found that 
this technique yields quite reliable ratings, it has 
not been used extensively in industry because of 
the time required in preparing the materials, in 
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the rating process, and in summarizing the те. 
sults. This has been particularly true when the 
number of employees to be rated is fairly large, 

The Lawshe-Kephart Personnel Comparison 
System provides a means of systematizing the 
preparation of materials and the summarization 
of results. Perforated sheets are provided oy 
which the pairs of names can be typed and then 
separated into slips on which a single pair is 
typed. Tables are given which enable the pairs 
to be arranged in random order with ease. Tally 
sheets for summarizing results and a table for 
converting preference frequencies to standard 
scores are provided. Thus, the work of preparing 
materials and summarizing results has bem 
made a simple clerical task. The manual not only 
presents the process of preparing materials but 
also gives step-by-step instructions for obtain- 
ing the ratings. А 

One study (т) reported by the authors indi- 
cates that ratings obtained by the technique were 
highly reliable with respect to agreement among 
raters and agreement between successive ratings 
made on different days by the same raters. It 
should be noted that the reliability of any ratings 
is due as much or more to the raters as to the 
rating technique itself. Because of their recog: 
nition of this, the authors have given emphasis 
in their manual to the preparation of raters. No 
data are given on the relation of the ratings to 
other measures of performance as evidences of 
the validity of the ratings. ) 

The authors conclude that the five to six hours 
required in their study to complete the entire 
rating process for 24 employees by one rater 
(including preparation of materials) is not id 
cessive. The rater himself spent only ops 
minutes judging the 276 pairs involved. Whet Я 
the time required is excessive or not € 
on the purpose and use of the ratings, the ү 
available for obtaining ratings, and, more ДЕ 
portantly, on whether ratings equally valid n 
reliable could be obtained by a less bay | 
ing technique. In the reviewer's judgment М * 
a definite possibility that the last considers) " 
is true. Nevertheless, the Lawshe-Kephart. aly 
sonnel Comparison System is definitely a 18 
efficient method of paired comparison rating: 

[779] jr an 


*Mathematical and Technical Test. Agr ly pub; 


over; 1948; also called M-T Test; orem te 
lished 1947 in Dutch (4 forms available in titude fof 
I form in Ghasa Indonesia) ; a measure o m or 
mathematical, technical, and mechanical od- 


Seah H А ing m 
training; 11 scores: completing pictures, COPY’ ing 
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els, calculations, completing the series, continuing pat- 
terns, filling up gaps, technical insight, figure series, 
geometrical figures, remembering drawings, total; 
Forms A, В; tentative norms; 25s. per 25; 3s. 6d. per 
set of figure cards for any one form; 3s. 6d. per set of 
keys for any опе form; 3s. 6d. per booklet containing 
instructions, scoring key, and interpretation of the test 
results; 3s. 6d. per manual; Is. 3d. per specimen set 
(does not include manual) ; purchase tax (British pur- 
chasers only) and postage extra; (75) minutes; J. 
Luning Prak; George G. Harrap & Co. Ltd. 


Cuartes R. LANGMUIR, Associate Professor 
of Education, Syracuse University, Syracuse, 
New York. 

This test is of interest in the United States 
principally as an example revealing the state of 
the art of test construction in Europe. The Eng- 
lish Forms A and B are translated from the 
Dutch. 

The то parts are short, speeded sections us- 
ing both familiar and novel item types. By com- 
parison with American tests constructed for sim- 
ilar purposes, three contrasts are striking: the 
M-T test contains no vocabulary section, the 
items seem much more difficult, and the scoring 
is excessively time-consuming, visually difficult 
and in five of the subtests is partly or wholly sub- 
jective. Even though the key, which is a test 
booklet overprinted in red, supplies scaled ex- 
amples for two of these five tests, American con- 
sumers would find the difficulty and ambiguity 
of scoring unacceptable. On the other hand, 
these two subtests attempt the measurement of 
performance abilities usually neglected in Amer- 
ican testing, i.e., spatial accuracy and draftsman- 
ship in freehand reproduction of such geometri- 
cal designs as a repeating border or an insignia. 
К Completing Pictures consists of the familiar 

put the missing leg on the chair" type of in- 
telligence test item. A number of the 20 pictures 
would not be applicable in the diverse culture 
of a large region, and a number: have multiple 
answers or are too obscure to survive careful 
editing. Copying Models is a graded series of 
nine difficult exercises similar to the copying 
items in the MacQuarrie Test jor Mechanical 
Ability. Calculations consist of 25 arithmetic 
Problems including five on English money. The 
25 number series items are conventional. Both 
number tests are highly speeded, six minutes for 
25 items, The continuing patterns items require 
шүл reproduction of a repeating geometric 
*sign. The five items are each scored one to 
sts comparison with keyed examples. The 
ection Filling Up Gaps is a novel, multiple 
choice, paper formboard item, 17 independent 
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item groups yielding a maximum score of 32. 
Technical Insight consists of one 7-choice item, 
three 2-choice items, three matching items, and 
seven brief answer what or why questions, as- 
sociated with 12 line drawings of mechanical 
situations. The 15 figure series items require 
freehand drawing of small geometric figures ex- 
tending a systematically changing series. The 20 
geometrical figures items combine space, num- 
ber, and verbal abilities. Sub-areas defined by 
overlapping squares, circles, and triangles must 
be perceived, identified, isolated, or counted in 
accordance with verbal directions. In the remem- 
bering drawings section a card showing a geo- 
metric line drawing is exposed for five seconds. 
The candidate’s reproduction from memory is 
scored one to five by comparison with a key. 
There are five items. The “time allowed for each 
drawing operation: reasonable enough to per- 
mit the majority to finish.” 

Forms A and B are structurally similar and 
appear to be equivalent. A reading of the 32- 
page manual suggests that issues of compara- 
bility of populations or of scores from alternate 
forms are not as well recognized in Holland as 
in the U.S. The statistics reported were derived 
from testing 3,400 Dutch students aged 11 to 
15 with an “older edition,” 700 pupils from large 
industrial and secondary schools, and 100 ap- 
prentices. No frequency distributions are re- 
ported. Evaluation of individual scores is com- 
plicated by increments and decrements for age 
and for errors. Various reliability coefficients are 
reported in the neighborhood of .8 computed 
from small groups otherwise unidentified. Cor- 
relation coefficients with a criterion identified as 
«combined results in trade schools" based on 20 
to 30 cases range from .32 to 64. In general, the 
data describing the experimental results would 
be regarded as inadequate by American con- 


sumers. 


F. W. WARBURTON, Lecturer in Educational 
Psychology, University of Manchester, Man- 
chester, England. | ' 
A publisher's note in the manual describes this 
as a nonverbal technical test. The term “non- 
verbal” is used in an unusual way, since the test 
contains arithmetical problems, number series, 
and some questions which require a sentence to 
be written by the subjects. It is a series of in- 
telligence, spatial, mechanical comprehension, 
memory, and arithmetic tests, and is said to 
measure technical aptitude on a practical com- 
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mon sense basis. Few, if any, of the factorists 
engaged on the analysis of ability have succeeded 
in isolating an aptitude peculiar to technical edu- 
cation or have ventured to describe a test as 
technical. Its claim to be called mathematical is 
almost as shaky. The mathematical knowledge 
required (except in the calculations subtest) is 
incidental, and does not extend beyond simple 
problems in percentages and the principles of 
leverage. 

These are matters of nomenclature. This test 
has proved useful in industry in Holland for 
selecting apprentices, skilled workmen, design- 
ers, and foremen. It has also been used when 
children leave the elementary school and enter 
industrial, extension, and secondary schools. 

There are 10 subtests and some of them must 
be difficult to score. For Continuing Patterns by 
drawing them freehand across the page, the 
marker is provided with five graded answers to 
each item. The Figure Series (patterns) and 
the Remembering Drawings tests also have to 
be drawn freehand and the marking cannot be 
completely objective. Technical Insight, a test 
of mechanical comprehension, is unorthodox in 
requiring written answers to some questions, 
such as “What do you think of this bow and ar- 
row?” Four of the questions in the Geometrical 
Figures subtest (Form A) are mistranslated 
and would lead English speaking subjects to 
give the wrong answer; “largest figure” actu- 
ally means “highest number.” All in all, one 
would like to have figures for the correlation be- 
tween different markers and for the reliability 
of individual markers. 

Practice items are given before each subtest. 
The instructions are, on the whole, adequate. 
One agrees that a stop watch would be useful. 

Raw scores are converted to a 12-point (nom- 
inally 10-point) scale, 3, 4, 5, 6—, 6, 6+, 7—, 7, 
7%, 8, 9, 10, in order to accord with the practice 
of class teachers in Holland. This scale is offered 
tentatively and is based on the examination of 
3,400 pupils. 

No details are provided of the sample used for 
standardisation. The only validity quoted for a 
group of more than 30 pupils is satisfactory, .43 
(90 pupils). Reliabilities seem satisfactory con- 
sidering the selectivity of the samples and the 
length of the time interval. They are .74 and .79 
after about two years. The two parallel forms of 
the test correlate .79 after 5 months and .84 
after one year (100 cases). 

It is puzzling to be told in the handbook that 
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the range of application is more than that of the 
Binet series, particularly as the age groups given 
in the statistical tables range only from 11 to 
15. 

Тһе author suggests that the test results be 
combined with those obtained from verbal in- 
telligence tests, and provides a table for weight. 
ing scores in these two types of test according 
to different types of school. Whether thes 
weights are based on multiple regression coef- 
cients or are inspirational is not stated, 

There is undoubtedly a real need for tests 
which will discriminate between children best 
suited for technical and for more academic form 
of education, and which can be used within tech- 
nical schools and in industry. A rather ill-defined 
factor of “practical ability” distinct from gen 
eral intelligence and mechanical ability appears 
to be measured rather inefficiently by visuo- 
spatial tests. In Britain these tests have bem 
found useful in selection for technical education 
and for trade apprenticeship amongst adoles- 
cents with little mechanical experience. On the 
whole, they seem to be superior to scholastic 
examination and perhaps to intelligence tests 
in picking boys for “practical” careers, although 
how well they work as early as 11 years of age 
is more doubtful. Arithmetic tests, too, have 0 
ten given good predictions of efficiency. 

The author has clearly put in a great deal of 
work in trying out various types of tests, and 
the battery he has eventually chosen bears ont 
experience in a similar field in Britain. ^0 
doubt the test has proved useful, particularly 
conjunction with intelligence tests and mechant- 
cal dexterity tests. sah 

The M-T test has the advantage of bane 
under a single cover а wide range of tests SU 
able for selection for technical education and i 
dustry in Holland, where over 12,000 Pup! Я 
have already been tested. Elsewhere, the pm 
ground would probably be more efficient! 
though somewhat more laboriously, coveret y 
using well established tests such as (а) RES 
Minnesota Paper Formboard, (b) Bennet 
Test of Mechanical Comprehension, (¢ )8 és d 
verbal intelligence test, and (d) a short 16 
arithmetical reasoning. 


[780] Ж 
*Miles Career Evaluation Inventory. Jer dire 
cants; 1947; an interviewing aid for person?" «oy 
tors, employers, and counselors; 5 scores: осо 
educational, personal, social, total; 1 form; sli only; 
reliability and validity ; available on royalty ? x 
$25 per year, $100 for unlimited use; nont! 


111) 
inutes; Lester F. Miles; Mrs. Lester F. Miles 
East 23rd St., Brooklyn 26, N.Y. * ‚ 423 
REFERENCES 


1, Mires, Lester F. Brass Hat or Executive, pp. 98-126. 
New York: Wilfred Funk, Inc., 1949. Pp. xvii, 2 з? ч 


[ 781] 
*Patterned Merit Review Form. For rating indus- 
trial and office personnel or sales personnel; 1950; а 
form to be used by personnel or home office representa- 
tives for summarizing supervisors’ evaluations of em- 
ployee performance; 2 forms; 1-99 copies of any one 
form, 15¢ each; 100-249 of any one form, 13€ each; $2 
per manual; postage extra; (5-30) minutes; developed 
by Robert N. McMurry & Co.; Dartnell Corporation. * 
a) PLANT AND OFFICE. Form No. MR-405. 
b) sates. Form No. MR-406. 


Harry W. Karn, Professor of Psychology, 
Carnegie Institute of Technology, Pittsburgh, 
Pennsylvania. 

The Patterned Merit Review Plan is essen- 
tially an information-gathering tool for the use 
of management. Its primary purpose is to stimu- 
late the supervisor's thinking about his subordi- 
nates so that he will arrive at sound evaluations 
on the basis of which to make decisions regard- 
ing promotions, transfers, layoffs, etc. In order 
to obtain evaluation data, an interview is con- 
ducted between supervisors and the representa- 
tive of the company's personnel department. The 
representative who conducts the interview does 
not evaluate the employee; his primary function 
is to provide the supervisor with a patterned 
frame of reference so that the supervisor himself 
can do the evaluating. 

The frame of reference used by the interviewer 


‘is the Patterned Merit Review Form on which 


the interviewer records notes as he talks with 
the supervisor about each of his employees. The 
form contains separate sections for job descrip- 
tion, strong and weak points in terms of job 
performance, promotability, etc. 

A complete patterned merit review on each 
employee consists of the following steps: (а) The 
Supervisor’s ranking of his employee in com- 
Tinea with other employees on the same job. 
a т determination by the supervisor of the 
viis | job duties of the employee. (c) The super- 
кез: 8 evaluation of the employee in terms of 
ine and weak points. (4) The supervisor's 
hel ep regarding the extent to which he has 
i ү the employee develop and to what extent 
Mul i. action toward future develop- 
the id the employee. (e) A determination by 
ipfe sae of his employee's promotability to 
of th © jobs. (f) An overall supervisory rating 

1e employee as “outstanding,” “above aver- 


арене h 
$6" "below average," or “unsatisfactory.” 
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These six steps are completed in sequence for 
one employee before discussing the next em- 
ployee. After all employees up for merit rating 
are discussed, the supervisor is asked to repeat 
the first step, i.e., rerank each employee in com- 
parison with others on the same job. The merit 
review procedure is next repeated on the same 
employees with a higher ranking supervisor. 
If the final ratings of the two supervisors differ 
considerably, the interviewer attempts to guide 
the supervisors into resolving their discrepan- 
cies. 

Complete and detailed instructions for accom- 
plishing the various steps in the merit review are 
contained in the Manual on How to Use the Pat- 
terned Merit Review Form. In this reviewer's 
opinion, thorough familiarity and adherence to 
the material contained in the manual should lead 
to a sound merit appraisal. The plan is simple 
and thorough. Although the interview is of the 
“controlled” type, the pattern or frame of refer- 
ence provides for sufficient flexibility to handle 
most appraisal situations. The ‘suggestive prob- 
ing questions to interviewers, which are printed 
on the interview form, are carefully couched in 
concrete terms. Questions are avoided which re- 
quire the supervisor to abstract out his em- 
ployee’s "ambitiousness" or “cooperativeness”” 
or "initiative" rather than consider the person 
as a whole. The application of the principle of 
multiple rating in the merit review plan is ob- 
viously sound. The greatest danger in the whole 
procedure appears to be the probable tendency 
for a lower level supervisor to agree to adapt his 
evaluations and rankings to those of the higher 
level supervisor in those cases where the two 
supervisors are called together to resolve dis- 
crepancies. As an overall appraisal, the present 
reviewer feels that the Patterned Merit Review 
Plan is one of the most promising tools available 
to anyone concerned with the installation of a 
formalized merit rating program. Reliability and 
validity data are presumably not available since 
none are included in the manual, Any appraisal 
of the plan must therefore remain in the realm 


of opinion. 


Frovp L. Косн, Professor of Psychology, Uni- 
versity of Southern California, Los Angeles, 
California. ‘ 

The Patterned Merit Review Plan is a com- 
bination of the forced distribution (ranking) 
and field review methods of merit rating. It is 
designed to utilize both quantitative and quali- 
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tative ratings. The interviewer fills in the form 
as the interview progresses. In addition to the 
general questions to be asked, the form contains 
questions (printed in red) which are designed 
to help the interviewer guide the interview so 
that specific incidents of behavior are brought 
out to substantiate the rating, inconsistent state- 
ments are resolved, etc. 

There seem to be several advantages to this 
plan. It is easily adapted to the evaluation of em- 
ployees on a particular job, and it is comprehen- 
sive. It circumvents the semantic problem exist- 
ent in most rating forms by permitting the 
supervisor to talk in his own language. It aids in 
holding the “Һа1о effect" to a minimum and helps 
the supervisor to face his human relations prob- 
lems objectively and self-critically. A reported 
corrected reliability of .97 would indicate a great 
advantage for this plan over other evaluation 
methods. 

The great disadvantage of the plan lies in the 
amount of time required for the operation and 
the resulting expense. 


[ 782 ] 
*Purdue Blueprint Reading Test: Purdue Voca- 
tional Test, No. 4. Grades 9-12 and adults; 1942; 1 
form; mimeographed manual; $2.60 per 25; 50€ per 
specimen set; cash orders postpaid; 30(35) minutes; 
manual by Joseph Tiffin; Н. F. Owen and J. N. Ar- 
nold; Science Research Associates, Inc. * 
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X*The Tear Ballot for Industry: General Opin- 
. ions, Industrial employees; 1944-51; a job satisfaction 
questionnaire answered by tearing in order to conceal 
identity of employee; 1 form, '44; manual (48) ; sup- 
plementary manual ('51); initial orders, $5 per 25; 
additional copies, $1 per 25 ; $3 per 100; postage extra; 
nontimed (3-5) minutes; Willard A. Kerr; Psycho- 
metric Affiliates, * 
REFERENCES 


1. GRIFFITH, Јонх W.; Kerr, WILLARD А.; Mayo, THOMAS 
B., JR., AND Торг, oun К. “Changes іп Subjective Fatigue 


and Readiness for Work During the Eight-Hour Shift.’ J 


Appl Psychol 34:163-6 Je x * (PA 25:4014) 
2. Kerr, WILLARD А.; KOPPELMEIER, GEORGE J.: AND SuL- 
LIVAN, JAMES J. “Absenteeism, Turnover and orale in a 


Metals Fabrication Factory." Occupational Psychol 25:50-5 Ja 
'$1. * (РА 25:7727) j 

з. VAN 7кїзт, RAYMOND, “Worker Popularity and Job Satis- 
faction." Personnel Psychol Аоста w "51. * (РА 26:6586) 

4. ZiNTz, Е. RAYMOND, AND Kerr, VERRE A. "Hearing 
Loss and Worker Morale." J Appl Psychol 35:92-3 Ap т. 
(PA 25:8305) 

Brent Baxter, Director, Agencies Research 
Division, The Prudential Insurance Company 
of America, Newark, New J ersey. 

The Tear Ballot consists of 10 general items 
and one “special problem" item. esponses are 
made by tearing (ripping) the edge of the bal- 
lot at one of five arrowheads that form a con- 


tinuum for each item. Similar tears are used to 


classify sex, age, office vs. nonoffice Work, super- 
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visory or nonsupervisory work, and shift Work, 

The manual states that the ballot is a practi- 
cal measure of (a) the job satisfaction of regu- 
lar workers, (b) the general causes of discon- 
tent of terminating workers, and (c) the reh. 
tionships between morale and personal charac. 
teristics, wage systems, and new company poli- 
cies. 

It is difficult to obtain adequate validation “ 
this job satisfaction scale because individual 
scores are not ordinarily obtained and criteria of 
job satisfaction are either lacking or are quite 
indirect. The test author cites the use of cri 
terion measures such as turnover (r = 25), ab- 
senteeism (.44), percentage of hearing los 
(.42), popularity measures (.82), satisfaction 
with the union (.74), empathy (.44), and num- 
ber of "serious worker-manager conversation: 
(.76).” 

You may deny the use of some or all of these 
criteria as adequate measures of job satisfaction. 
The scale author does not present his assump- 
tions regarding the nature of job satisfaction, 
leaving us with only an empirical approach, that 
shows how the scale scores correlate with other 
variables. Or, taking an operational considera- 
tion of the scale, the total score is the “sum” 0 
the answers to the то items making up the ballot. 
These deal with the kind of questions frequently 
found on such scales (i.e., worker feeling of job 
security, company interest in worker welfare 
feeling toward supervision, working condi 
pay, opportunity to gripe, good intentions 0 ' 
management, good sense of management, an 
worker personal happiness). Several of these 
questions are very general, perhaps being de- 
liberately vague in order to permit maximum 
amount of projection on the part of the «ш 
While the questions do tap the subjects genera 
considered to be related to.job satisfaction, It А 
not likely that workers ordinarily think abou 
their job in these broad terms. Bal- 

Job satisfaction as measured by the Tear " 
lot apparently is not the same in all D 
Factor analyses revealed matrices which al pi 
fer in significant ways according to the wor ith 
studied and the psychological climates s 
which they work.” “АП but one of these ana 1 
yielded three significant factors and the ех И 
tion yielded two. In no two analyses were A 
patterns of loadings obtained similar оп ™ 
than one factor.” PS 

The atten cover а агеаз of job p 
tion in ro questions raises a question 0 


M 
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bility. If one assumes that job satisfaction is 
made up of a great many different feelings, can 
it be evaluated by 10 questions? If it is assumed 
that job satisfaction has 10 major components, 
is one question enough for each component? It 
cannot be assumed that it is one general factor 
since a factor analysis of just these 10 items re- 
yealed three factors. Split half reliability corre- 
lations range from .65 to .82 with a median of 
75. No test retest data are available. Other job 
satisfaction scales having many more items have 
internal 7's as high as .92. 

Frequency distributions of raw scores and per- 
centiles are given for many different classifica- 
tions, e.g., age, sex, type of work, level of work, 
geographical location. These are called norms 
and suggested as “interesting comparative data 
which may help form a useful frame of reference 
in interpreting the results of future similar sur- 
veys.” Unless much larger samples of workers 
are obtained, these “comparative data” are rela- 
tively useless. The fact that the “norm” group 
were carpenters may have been the least signifi- 
cant aspect of their employment as it affected job 
satisfaction. 

The most novel part of the ballot is its method 
of completion. In the reviewer’s opinion it is 
doubtful whether the method is superior to other 
more usual approaches such as checking points 
on a continuum with a pencil. While the author 
emphasizes the guarantee of anonymity which 
the ballot affords, it is difficult to see how a tear 
is more anonymous than a pencil check-mark. 
The use of pencils on questionnaires is not as 
'foreign" to many groups of workers as often 
assumed. Tearing the edges of papers is cer- 
tainly a more unusual situation. In any case, it 
would seem that anonymity is more a function 
of the set of respondents, reflecting their past 
experience and the psychological freedom that 
their employers have established. 

, SUMMARY. The chief value of the tear ballot 
15 its brevity, which makes administration easy. 
The brevity, however, is one of its major draw- 
backs since it is thus able to tap only in a very 
general way the areas of job satisfaction. It thus 
is unable to be particularly effective as a diag- 
nostic instrument in determining strong and 
Weak areas of satisfaction. It might be useful to 
measure group differences but not individual dif- 
ferences, More research data will be necessary 
before it can be recommended conclusively for 
anything beyond a very rough general measure 
of the answers to the 10 questions comprising the 
$ [Л 
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ballot. In providing a scale for use in a wide 
variety of companies, it is less effective than a 
tailor-made scale. | 


[ 784] 

"Test of Practical Judgment. Grades 9-16 and 
adults; 1942-50; Form AH ('so—same as test copy- 
righted in 1942) ; preliminary manual ('42); 49€ per 
test and answer pad; separate answer pads must be 
used; $1.80 per 25 answer pads; 75 per specimen set; 
cash orders postpaid ; 35(40) minutes; Alfred J. Car- 
dall; Science Research Associates, Inc, * 


REFERENCES 
т. CARDALL, ALFRED J., Jn. A Test for Primary Business In- 


terests Based on a Fun 
tor’s thesis, Harvard University (Cambridge, Mass.), 1941. 

2. Нослроме, EDWINA, AND SMITH, Leo №. “Some Evidence 
on the Validity of the Cardall Test of Practical Judgment." 
J Appl Psychol 31:54-6 Е '4z. * (PA 21:2492) 

з. Carrincton, Domormv Н, “A Study of the Validity of 
Cardall's Test of Practical Judgment." Abstract. 4m Psychol 
3:364 Ag '48. * (PA 23:746, title only) 

4 Слкатхстох, DomorHY Н. "Note on the Cardall Practical 
Ju WI Test.” J Appl Psychol 33:29-30 F '49. * ( 23: 
3745 : 

s. Dursky, SrANLEY G., AND Krout, MAURICE H. “Predict- 
ing Promotion Potential on the Basis of Psychological Tests.” 
Personnel Psychol 3:345-51 au '50. * (83 25 3452) 

6. Ни, А. М. An Evaluation of the Cardall Test of Practical 
Judgment in Industrial Supervisory Selection. Master's thesis, 
University of Toronto (Toronto, Canada), 1950. Pp. 25. 


For reviews by Glen U. Cleeton and Howard 
R. Taylor, see 3:694. 
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*Probst Rating System. Adults; 1928-47; 8 rating 
forms; $125 per scoring and rating device and any 
combination of 1,000 service report forms; 20-99 re- 
port forms, 10€ each; 100-499, 2is¢ each; $1 per sam- 
ple set of forms; postage extra; John B, Probst; 
Probst Rating System. * 4 
a) PROBST SERVICE REPORT: FIRE DEPARTMENT FORM, 
Fire fighting personnel; 1932-47 ; Form F ('47). 
b) PROBST SERVICE REPORT: GENERAL том. Clerical 
workers, supervisors, salesmen, salesclerks, and per- 
sonnel not covered by the other forms; 1928-45; Form 
G (45). 
C) PROBST SERVICE REPORT: POLICE FORM. Police person- 
nel, guards, and watchmen ; 1932-46; Form PO $4); 
d) PROBST SERVICE REPORT: PROFESSIONAL токм, Social 
service, library, medical, D WU and other pro- 
fessions; 1932-46; [Form PR] (46). 
e) PROBST SERVICE REPORT: SKILLED [LABOR FORM. 
Skilled trades, crafts, and special semiskilled workers ; 
1032-45; Form S (745). noe 
f) PROBST SERVICE REPORT: EDUCATIONAL FORM, Teach- 
ers, principals, and supervisors ; 1931; [Form E]. 1 
g) PROBST SERVICE REPORT: LABOR FORM. Unskilled an 
semiskilled workers; 1932; Form 
Л) PROBST PERSONAL FITNESS REPORT: PERSONAL FIT- 
NESS FORM. Employees and applicants ; 1936-45; а рег- 
sonality rating scale primarily used by interviewers, ; 
formerly called Probst Personality Report: Personal- 
ity Form; [Form PF] (45). 
REFERENCES 
B. Service Ratings, A joint ИЧЕ of the 
[^ 


0 the CY бева 
d States and Canada, Chicago, Il: Bu 
Assembly, of the Оаа Administration, 1931. ХЕ Mo Vilu. 

2, Prous B. Measuring and Ка ing д, {РА M А) 


New Pio: Ronald Press Co., 1947. PP. xi, 166. 
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Mitton M. Manpett, Chief, Administrative 
and Management, Testing, U.S. Civil Service 
Commission, Washington, D.C. 

The Probst rating system provides for a check- 
ing by one or more raters of those statements 
from a long list which describe the employee 
being rated. The statements can be illustrated 
by the following two examples: “Must generally 
be told what to do,” and “Usually pleasant and 
cheerful.” Statements checked are weighted on 
а 9-point scale from plus 2 to minus 3 with some 
statements being given zero weight. The overall 
rating is presented on a 10-point scale. Special 
rating forms have been prepared for some posi- 
tions with the statements included in these spe- 
cial forms presumably being those most closely 
related to those groups; in addition, there is a 
general form for all other types of positions not 
covered by the special forms. The rater checks 
those statements which apply to the individual's 
work performance and which he has had a chance 
to observe. The summing up of the ratings is 
done generally by someone other than the rater. 

It almost seems as if the inherent characteris- 
tics of personnel specialists require that at some 
time or other they become interested in perform- 
ance ratings. The need for performance ratings 
seems so logical that anyone would seem to be 
an anarchist if he were to assert that repeated 
experience with formal rating systems seems to 
justify the generalization that their cost and the 
harm created by them outweigh the advantages 
that are obtained. The system under review here 
does not have any greater or lesser virtues than 
the systems included in the generalization. 

The major trouble with most performance rat- 
ing systems is that they start from the point of 
view of what system should be used rather than 
from the point of view of the administrative con- 
ditions in the organization where the rating sys- 
tem is to be installed. A second major weakness 
in the installation of most rating systems is that 
the installers are too ambitious in terms of the 
objectives desired. In trying to justify the in- 
stallation of such a system, they claim many ad- 
vantages which logically flow from the use of 
a performance rating system, but they forget 
that the inclusion of the additional objectives may 
interfere with the attainment of the primary ob- 
jective. What this primary objective is will ob- 
viously vary from place to place, but it seems 
likely that the most general goal should be that 
of improving work performance and supervisor- 
employee relationships. To the extent that other 
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goals are added, such as the use of the Tatings 
for promotion and layoff purposes, it js highly 
likely that the achievement of the primary goal 
will be impeded and that the whole system wil 
tend to flounder. 

There are several prerequisites to the success 
of a rating system. There has to be a sincere in- 
terest on the part of management in having rat- 
ings before ratings can be successful. Second, 
management must be able and aggressive if it 
is going to give honest ratings. Third, the raters 
must be given training if their ratings are going 
to be successful. Fourth, the rating system must 
make sense in terms of simplicity and relevance 
to the jobs being rated. Finally, there should be 
no mystery as to how the final ratings are arrived 
at, because a mysterious system will be a sus- 
pect system that will inevitably affect the hon- 
esty of the ratings given. 

The major advantage of the Probst system is 
that the task of the rater is relatively simple. 
The rater has a choice as to which statements 
to check for any particular employee, and the 
statements are relatively clear. The major dis- 
advantages of the Probst system are that though 
it is quite obvious whether a given statement is 
favorable or unfavorable, the rater does not 
know what the final rating will be that results 
from his work ; and the statements that he checks 
are generalizations rather than specific to any 
task that is performed. Technically, furthermore, 
it would seem unlikely that a 9-point scale for 
weighting each item could be justified in terms 
of validity or reliability and that a ro-point fi 
overall scale could be justified in terms of tie 
reliability of ratings. As a generalization, И 
improbable that in any organization or occupa 
tional group more than a 5-point scale is justifi А 
or administratively necessary. Actually, it 5 
highly improbable that, in order to achieve ui 
major objective of improvement of work b 
formance, any overall evaluation is needed. It { 
probably psychologically desirable not to prov! 
for an overall evaluation. 7 field 

The two most recent developments in ШЕП 
of performance ratings are the Army's wor ho 
the development of the forced-choice met м 
originally devised for the rating of officers 277 
the critical incidents technique. The major Үй 
vantage of the forced-choice method is tha ђе 
one sense the raters are forced to be poms di 
cause the choices are presumably phrased in oe 
a manner that the clues for determining P 
tively weighted items are at a minimum ап 
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raters do not know the scoring key. To that ex- 
tent, if this is an important long-range advan- 
tage, the forced-choice method is better than the 
relatively unsubtle method of the Probst rating 
system. However, the mysterious quality of the 
forced-choice method constitutes a disadvantage 
as well, and it has been necessary in some cases 
to add a more customary graphic rating. The 
critical incidents technique has the major ad- 
vantage, as compared with the Probst system, 
of having the statements which are to be checked 
based on concrete work tasks. To that extent this 
method is better than the general statements in- 
cluded in the Probst rating system. How- 
ever, neither method can overcome such usual 
difficulties of rating systems as the lack of com- 
petence of the raters, the differences in stand- 
ards used by different raters, the confusion of 
objectives, and the lack of desire on the part of 
some raters to rate honestly. 

Psychometrists have learned that the obtain- 
ing of criteria for validation studies is at least 
as important as the preparation of predictors. It 
is time that devisors of rating systems learn that 
the administrative situation connected with the 
use of a rating system is as important as, if not 
much more important than, the system itself. In 
а brilliant research study, Stockford and Bissell 
have demonstrated the validity of this point of 
view. Reign Bittner, in his outstanding sum- 
mary of the problems of merit rating, has given 
in step-by-step form the essential parts of a 
merit-rating program.? 

Many examples can be given of the impedi- 
ment to valid ratings that results from a mixture 
of objectives. One of these, the inclusion of state- 
ments of personality characteristics as well as 
statements relating to job performance, is pres- 
ent in the Probst system. Approximately 20 per 
cent of the statements are descriptions of per- 
sonality characteristics. If the primary objective 
15 work improvement, and if one recognizes that 
the same personality characteristics may lead 
to varying levels of work performance, what is 
the value of the inclusion of statements on per- 


(ашу, characteristics unrelated to specific job 
asks? : 


If rating systems are to be continued, and it 
з Dot certain that they should be іп а formal 
ense, then efforts will have to be made to create 


1 Stockf, t i 
Estap ockford, Lee, and Bissell, Н. W. “Factors Involved in 
DER Merit-Rating Scale." Personnel 26:94-116 S '49. 
2 Bittn У У 
P er, Reign. ‘“Developi: Employee Merit-Rating 

оше? Personnel dE 78 49. (PA 24:334) This 

e also describes the forced-choice method. 
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more propitious administrative conditions for 
them, to train raters, and to get their participa- 
tion in the preparation of rating systems rather 
than emphasizing the statistical virtues or the 
objectivity of any particular rating method. 


DALE Yopzn, Director of the Industrial Rela- 
tions Center, University of Minnesota, Minne- 
apolis, Minnesota. 

These rating scales have been extensively 
used in the public service, and several forms are 
also used by private industrial and commercial 
organizations. Wider application in public serv- 
ice personnel administration is explained in part 
by the specialized design of several forms (po- 
lice, fire department, educational, and profes- 
sional) and in part by the fact that Probst has 
long been a public official (municipal civil service 
director). Many of the experiments designed to 
appraise and improve various forms have in- 
volved public employees in federal, state, and 
local agencies. 

Probst's scales (except the Personal Fitness 
Form) consist of approximately 110 items plus 
provision for supplementary comments and a 
report on absenteeism. In the General Form, 38 
of these items refer to special knowledge and 
skill, 22 items to accuracy, reliability, and de- 
pendability, 25 items to moods, appearance, emo- 
tional traits, and physical characteristics, and 
25 items to industry and output. Probst (2) has 
reported that individual raters check from 20 to 
25 items on an average. A distinctive feature of 
the Probst scales is their provision for from one 
to three raters to indicate their choices of items 
on the same rating form. The first rating is gen- 
erally made by the employee's immediate super- 
visor. Using the same checked rating form, one 
or two additional supervisors may then rate the 
employee. i 

Scales may be scored by means of special 
stencils which facilitate tabulation of checked 
items. Scores are calculated as the sum of plus 
and minus values attached to the items. Balanc- 
ing of positive and negative values is aided by an 
ingenious arrangement that. designates plus 
values in black and minus values in red and 
records their cumulative totals on an attached 
“clide-rule.” Calculations provide numerical 
scores which are usually translated into letter 
grades according to the distribution indicated 
on the “slide-rule.” у ите? 

Items were selected by a series of appraisals in 
which “experts” chose those expressions they 
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regarded as most effectively describing the traits 
generally included in rating scales. No standard 
definitions of qualities to be rated are included. 
Rather, items are expected—by their judicious 
use of everyday language—to indicate what qual- 
ities and standards are implied. The formula for 
scoring and for weighting items was developed 
experimentally. The final score measures the 
spread between total numbers of favorable items 
and the net score. Equivalent letter grades were 
established in early experiments. Experience in- 
dicates that distributions of scores are approxi- 
mately normal, although no formal normalizing 
process has been applied. 

Probst (2) has reported numerous studies in- 
dicating the reliability of ratings thus provided. 
Few independent studies of the system are re- 
ported. Tests of validity face the usual difficulty 
of establishing a satisfactory criterion. Probst 
dismisses output or production records for this 
purpose with the observation that these indi- 
cators are not sufficiently inclusive. Hence de- 
pendence is placed on comparisons of Probst 
ratings with informal designations by depart- 
ment heads or immediate supervisors. Highly 
significant correlations in such comparisons are 
reported by Probst. 


For excerpts from related reviews, see 786. 
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[Re Probst Rating System. Рвовѕт, Joun B. 
Measuring and Rating Employee Value. New 
York: Ronald Press Co., 1947. Pp. xi, 166. $5.00. * 
(PA 21:2434) 


Personnel Psychol 1:111-7 sp '48. Dorothy 
C. Adkins. This book is really about “measuring 
and rating employée value” by a particular sys- 
tem of checking employee characteristics on one 
of several available check-list forms and sum- 
marizing these evaluations by means of a scoring 
formula. The reader who expects any orderly 
and thorough treatment of the basic problems 
of evaluating job performance will be disap- 
pointed. In fact, this reviewer emerged from two 
rather labored readings with only an unsatisfac- 
tory understanding of how the Probst system 
was developed and of the basis for some of the 
claims made for it. There are three parts to the 
book, covering twelve rather brief chapters and 
nine appendices. The first part, occupying 27 
pages, is on “Employee Rating Plans in Gen- 
eral.” This includes a brief statement on the 
purposes served by service ratings and presents 
a somewhat over-simplified account of the dis- 
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advantages of various well-known methods, The 

reader is brought up short early in the book by 

the discussion of the unreliability of personal 

judgment ratings, under the general heading “A 

Critical Analysis of Rating Plans." There he 

finds cited a “case study” of 78 public health 

nurses who were rated by two supervisors, It iş 

then stated that two nurses ranked sixth anf 

forty-second by one supervisor were rated sec. 

ond and forty-fourth, respectively, by the other 

One may wonder in vain how the other 76 rank- 

ings compared. Further reading brings a cita- 

tion of three raters, all of whom apparently 

agreed that five of a group of 48 clerks, appar- 

ently not engaged in supervisory work, were of 
supervisory caliber. The author states that the 

rating officers had considerable difficulty in 

agreeing on the relative merits of these five. 
Why should this be amazing or a cause for cor- 
cern? In fact, it seems surprising enough that 
there was agreement that all five were of super- 
visory caliber. This reviewer was plagued by the 
reading difficulty of the book which abounds in 
rather loose statements. For example: “They 
[over-all judgment ratings] also permit giving 
greater emphasis to the strictly personal qualifi 
cations bearing on the employee's total worth to 
the employer." Greater than any other rating 
plan? Can not the weighting of the components 
in other plans be adjusted to provide joris 
great weight? Is it not true, rather, that bone 

judgments involve unknown and variable 
weighting of particular qualifications, ш 
personal? At the end of the first part of the pis 
are given several criteria against which sk 
rating systems may be tested. Many would que ^ 
tion the inclusion of substantial adherence ien 
normal distribution if, as seems probable, hs 
is a selection factor in at least some аш Б 
groups. Another requirement that seems (0 ay 
reviewer highly questionable is that the 12 r 
officer need not measure an employee as t0 p 
relative excellence. Are we not deluding a 
selves if we believe a rating officer 15 Шу he 
sidering how other persons stand whe joye 
checks, for example, that a particular R 
is "exceptionally polite and courteous ! ӨЕ 
is the standard for exceptional weit be 
other requirement is “that the rating 0 show 
virtually forced to report honestly, or be "ings 
by ‘internal evidence’ in his own report mat tally 
that he has not done so.” Elsewhere, шоген this 
it is implied that the Probst system тео p 
requirement, and yet again it is specifica 
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ognized that 1o rating system can force raters 
to be honest, although the majority of them are. 
Another criterion is that the rating officer be 
required to mark only those traits and qualities 
which he knows from his own knowledge will 
properly describe the employee. The resultant 
“flexibility in reporting" is later argued as an 
advantage of the Probst system. It is stated that, 
since each reporting officer checks only the items 
he knows fit the employee, no two reporting 
officers are likely to check the same items for the 
same person but that “yet their reports generally 
produce identically the same rating." А case is 
cited where five rating officers checked, respec- 
tively, 10, 12, 16, 21, and 27 items, yet each set 
of items produced precisely the same letter rat- 
ing! If this finding were typical, it would forc- 
ibly raise the questions of why more than one 
rating officer is needed and whether or not the 
form could be shortened considerably. If sub-sets 
of items are highly correlated, their number can 
be reduced, although some rewording doubtless 
would be required. If they are not, the practice 
of basing different employees' ratings on differ- 
ent sets of items is open to serious question. It 
may also be noted that one of the gains to be had 
from requiring a supervisor to mark all items 
pertinent to a job is that he is thus forced to ob- 
serve characteristics he otherwise might have ig- 
nored. The title of Part II, “An Improved Rat- 
ing System,” refers to the Probst system. A sam- 
ple of one of the report forms is reproduced, and 
a so-called “scoring-rating device” is pictured. 
Under the heading “Selection of the Items for 
the Report Form,” one expects to find the meth- 
ods used to select the items, but he is confronted 
with only such statements as that “every item 
Was carefully selected, studied, analyzed, and 
then experimented with in various ways." One 
then may remember that there is an appendix on 
Item Selection." Turning to it, he finds a de- 
scription of several “experiments” that had spe- 
cial reference to why the item “has initiative” 
was not included in the report forms. The sup- 
Position seems to be that everyone should re- 
spond with a pretty exact dictionary definition 
in defining “initiative.” In one “experiment,” it 
was found “that only ro of the 17 had a good 
understanding of the true definition of initiative. 
a other seven were wrong." It seems to this 
F ons that social sciences now have available, 
"x е form of psychophysical scaling methods, 
€ convincing tools to demonstrate whether a 
Particular trait should be included and what 
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words should be used to describe variant de- 
grees of it. Persons trained in the measurement 
field may also be dissatisfied with the description 
of the "scoring-rating device.” This reviewer was 
puzzled to find that "the values and weights as- 
signed to the individual items are relatively un- 
important as compared with the scoring-rating 
process that must bring to a focus all the checked 
items on an employee's service report." But then 
a footnote was seen, indicating that "the formula 
for this evaluating process is explained in detail 
[in the Appendix] for the benefit of those who 
desire to learn more about the technical side of 
this work." After two or three hours with a 
six-page appendix on "Origin and Development 
of the Scoring Stencils and Formula Scales in 
the Probst Rating System," the reviewer was 
more confused than ever. At first it seemed that 
the author meant that, with a fairly large num- 
ber of items, differential weighting had turned 
out to be unimportant. Actually, however, 
weights ranging over 6 points, from —3 to 2; 
with additional intermediate values of —2%, 
—114, and —¥, were assigned. Moreover, “ће 
idea of a formula” and “experimenting with the 
spread” only added to the nonplus.* Nor did this 
reviewer find convincing the author's justifica- 
tion of the apparently higgeldy-piggeldy ar- 
rangement of items on the check-sheets. Turning 
to the statistical analysis of ratings, we find a 
large number of frequency distributions, which 
effectively demonstrate that the central tendency 
of ratings assigned in accordance with the scor- 
ing system is the one designated as “С.” The 
writer seems to find in this fact corroboration of 
the value of the system, and comes to attach 
absolute rather than relative significance to par- 
ticular letter designations for intervals of scores. 
If even a truly rectilinear distribution is normal- 
ized, however, the middle category can be called 
“C,” and the mean score will be “С.” If orders 
of rank used as a criterion are “normalized” by 
assigning letter designations of A, B, C, D, and 
E in a fashion such that the frequencies for each 
category approximate those of the normal curve, 
then any scoring scheme for ratings that is ad- 
justed empirically to make the ratings cor- 

1 After trial p various multiplying factors, the final scoring 


times the algebraic sum of the ts 
fona Satay all PON Shecked less 3 times the number 0! fa- 


when 
isors. This chan 
sheets are y only one or two supervi сы 
i re the relative weight of t 

im the formula, "The Te UL mor? ееп unable to rationalize 
this change in the scoring formula. 
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respond as closely as possible to the criterion 
categories will tend to yield a normal distribu- 
tion of ratings. If the ratings and the criterion 
correlated fairly highly, no other result would 
be possible. Incidentally, however, although 
curves that look like normal curves are super- 
imposed on a succession of bar graphs depicting 
frequency distributions, these are not the “‘best- 
fitting" normal curves. And in no case is any 
statistical check of normality of distribution pre- 
sented. In the one case the reviewer tested, the 
fit of a normal curve to the distribution was 
clearly unacceptable. In discussing the disadvan- 
tages of various types of ratings, the writer dis- 
parages over-all judgments as highly unreliable 
and seriously lacking in validity. But in seeking 
a criterion against which to evaluate his own 
system, he accepts the average or consensus of 
such judgments made by those who best know 
the individuals rated. An average of two over-all 
judgments was used as a criterion in some in- 
stances, and the judgments often took the form 
of simple rankings from best to poorest. Now 
since the Probst system calls for separate re- 
cording by three supervisors (who presumably 
"best know" the individuals rated) on a form 
containing about 100 items and then a separate 
scoring process, one may question why the more 
easily obtainable criterion ratings should not 
serve as the measure of employee value. Under 
“Testing a Rating System,” the description of 
the Cleveland Civil Service Commission’s ex- 
periment as providing evidence of validity raises 
question regarding the objectivity of the meth- 
ods for establishing this characteristic of the 
ratings. Those yielded by the Probst system 
were re-submitted to a conference of the super- 
visors, who reviewed the ratings and then as- 
signed second ratings. The correlation of 92 
(presumably a Pearson product-moment co- 
efficient) between the Probst ratings and those 
later assigned by the conference of supervisors 
clearly does not show the relation between the 
instrument whose validity was in question and 
an independent criterion. The foregoing citation 
does not represent an isolated instance. It is 
typical of the type of finding set forth as evidence 
of validity. Admittedly a crucial experiment in 
this area is exceedingly difficult to attain. This 
fact does not, however, obviate need for caution 
in interpreting results of experiments that by 
their very nature can not be decisive, Part III, 
“The System in Practice," covers four chapters, 
on "Special Experimental Tests of Rating Meth- 
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ods,” “Filling Supervisory and Key Positions,” 
“Measuring Personal Fitness,” and “Unfinished 
Business.” The first of these cites a study com. 
paring ranks based on production records and 
Probst ranks. With a rank-difference correlation 
coefficient in the neighborhood of ‚50, Probst 
conclusion is that “one’s faith in production te. 
ords alone as an effective measure of service 
value is severely shaken.” Granted that there are 
factors other than production that affect the 
value of an employee, however, perhaps we 
should not dismiss so cavalierly the possibility 
of measuring objectively those factors that can 
be so appraised. Should we not reserve more 
subjective judgments for those factors for which 
no objective measures are available? The chap- 
ters on “Filling Supervisory and Key Positions’ 
and on “Measuring Personal Fitness” presett 
a justification for the need of a special form to 
rate personal characteristics related to success- 
ful dealing with other persons, and several stud- 
ies of the relation between rankings based on 
the Probst Personal Fitness Report and ot 
rankings made without the form. As before, the 
question of independence of the “test” and the 
criterion, and the question of why the simpler 
criterion procedure does not suffice, at once 
arise. The last chapter, which presents fragmen 
tary reports on studies from which the author 
does not feel justified in drawing definite co 
clusions, is followed by several appendices, some 
of which have been referred to above. This те 
view should not be closed without directing 
attention to the fact that it should not bar 
garded as a critique of the Probst system. ‘a 
application of the check-list idea to pula! 
of employee value, as well as other dedit 
the plan, doubtless represented definite prog" 
at the time of the inception of the c 
can scarcely be doubted, however, that fu e 
improvements would result from applica i 
of advancements in the field of messen 
and, in particular, of the psychophysical sca 
techniques developed over the last 20 de | 
Pub Personnel R 8:171—2 J1 "47. Char dics 
Meyer. Persons interested in the Probst ge 
rating system and the scoring technique 11У itten 
will find this text interesting. Avowedly gus 
"to meet the needs of those who are ad i meth 
of thoroughly organized and time-teste pil 
ods for correctly rating the principale i en 
and work performances of their indivi oo av 
ployees,” most of the book is devoted ш н 
thor's own service report forms. * Mos 
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criteria listed as goals of development of a 
"good" system are those commonly accepted. 
Some would take issue, however, with the theory 
that service rating scores for a select group 
should adhere to a normal distribution. Perhaps 
more disagreement would be manifested with the 
recommendation that the rating officer be per- 
mitted to select the traits to be checked. On this 
the author argues that the officer might not be 
sufficiently familiar with the service value of 
an employee to check all traits listed. Others 
have argued that the requirement that all traits 
be checked compels the officer to acquire the 
necessary familiarity with the employee's serv- 
ice. * [The] chapter headed "Development of 
a More Effective Rating System” . . . and sub- 
sequent chapters do not, as one might infer, deal 
with ways and means, but narrows down to a 
discussion of the Probst system. A chapter on 
statistical analyses of ratings, for example, con- 
sists largely of distributions of scores attained in 
rating various occupational groups, although 
some of the elementary statistical measures are 
defined and illustrated. * Although both text 
and appendices emphasize that words carry dif- 
ferent meanings to different reporting officers, 
no direct explanation substantiating the use of 
single words or phrases is developed, except 
the pragmatic one that the report form as con- 
structed works. In fact, the word “lazy” (the 
first item on the report) is indicated as a shock 
word which, as such, apparently conveys the 
same meaning to all reporting supervisors and 
accordingly leads them to consider carefully all 
subsequent items. Basic problems of semantics 
and control in terms of trait definitions and train- 
ing of reporting supervisors are barely covered. 
In one place the statement is made that “overall 
judgment, unaided, is highly unreliable and seri- 
ously lacking in validity.” Later on, the descrip- 
tion of the procedures of validation includes 
overall ratings as the principal criterion. The 
author concludes that successive ratings of the 
same group of employees will reveal a tendency 
toward higher ratings because supervisors be- 
come accustomed to a higher standard of per- 
formance and thus visualize a higher type of 
worker as their “average” employee. More cyni- 
cal observers of rating systems could attribute 
the phenomenon to a less ideal cause, such as 
lack of training of the supervisors, or perhaps 
lack of courage. Scattered throughout the book 
are conclusions borne of experience that are of 


general application and worth noting, but they 
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must be sought by the reader among the plethora 
of statements on the Probst system. Enough 
credit may never be given the author for his con- 
tribution in the initiation and development of 
the check-list type of rating system. Observa- 
tion of the reviewer has led him to the conclusion 
that the check-list approach offers more than 
any of the other types. However, in a text en- 
titled as this one is, more precise statistical sub- 
stantiation and more thorough documentation 
with fewer personal inferences would reflect 
more credit to the rating system and add ma- 
terially to the stature of this book. 
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[787] 
American Institute of" Accountants Testing 
Programs. Grades 13-16 and employees and applicants 
for employment in the field of accounting ; 1946-51; а 
project to facilitate the placement of seniors and col- 
lege graduates and to aid in the guidance and appraisal 
of undergraduates majoring in accounting ; 3 tests ad- 
ministered in 2 programs ; IBM; prepared by the Com- 
mittee on Selection of Personnel, erican Institute 
of Accountants; Ben D. Wood, Director ; Arthur E. 
Traxler, Assistant Director; programs administered 
by Educational Records Bureau. * 
а) COLLEGE ACCOUNTING TESTING PROGRAM. Grades 
16; 1946-51; tests available three times annually (fal 1, 
midyear, and spring) and are rented to participating 
institutions; rental fee includes answer sheets, electro- 
graph pencils, practice tests, and о and reporting 
services; individual score reports available to seniors 
at $2.50 per any number up to 12. : 
1) Orientation Test. Grades 13-16; 1046-51; an in- 
telligence test weighted toward the field of business ; 
3 scores: verbal, quantitative, total; Forms А ('46), 
В (46), С (749); 35¢ per student; 50(70) minutes. 
2) Achtevement Test: el I. Grades 13-15; 1946- 
51; dor A (аә, В (47), С (51) ; 35€ per stu- 
dent; 120(140) minutes. 
3) Achievement Test: Level II. Grade 16; 1946-51 ; 
Forms А (747), В (49), С $4; D (750) Ё 35€ per 
student; Forms A, B: 240(280) minutes; Forms C, 
D: 120(140) minutes. ! 

) GNE Interest Blank for Men, Revised. 
Grades 13-16; 1927-51; same edition as published by 
Stanford University Press; blank scored for 27 
scales and plotted on an accountants profile standard- 
ized specifically {ог use in this program; 1-24 Stu- 
dents, $1.70 each; 25-49, $1.50 each; 50 or more, 
$1.25 each ; tests may be purchased at 10¢ per copy 
and scored locally ; nontimed (40) minutes; Edward 
K. Strong, Jr. 

b) PROFESSIONAL, 
ployees and applicants for emplo 
accounting ; 1947 


esting centers established by the State Societies, either 
e des stated by regional center or by s a 
rangement ; details may be obtained from Education: 


Records Bureau; $12 per examin 


d score report. н 
T Orientation Test. Same as ax above; $2.50 per 
examinee if kel vd scored locally $5 per examinee 
if test is scored by project осе. Ў 
К, “Achievement Test: Level II. Same as ag above; 
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$2.50 per examinee if test is scored locally; $5 per 
examinee if test is scored by project office. ї 

3) Vocational Interest Blank for Men, Revised. 
Same as a4 above; $2 per examinee if test is scored 
by project office. 
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[788] 
*Dental Aptitude Testing Program. Dental school 
applicants; 1946-51; tests administered three times an- 
nually (May, October, March) at centers established 
by American Dental Association; application form and 
bulletin of information may be obtained from pub- 
lisher ; examination fee, $10; fee includes scoring serv- 
ice and reporting of scores to any 5 designated dental 
schools; scores not reported to examinees; $1 per ad- 
ditional report; 308(400) minutes in 3 sessions; pro- 
gram administered by Council on Dental Education, 
American Dental Association. 
а) CARVING DEXTERITY TEST. 90(105) minutes; pre- 
pared by the Committee on Aptitude Testing (Bert L. 
Hooper, Chairman). 
b) AMERICAN COUNCIL ON EDUCATION PSYCHOLOGICAL 
EXAMINATION FOR COLLEGE FRESHMEN. See 277; 3 
scores: quantitative, linguistic, total; 38(65) minutes. 
C) SURVEY OF THE NATURAL SCIENCES. 6 scores: biol- 
ogy, chemistry, physics, factual, application, total; 75 
(80) minutes. 
d) SURVEY OF OBJECT VISUALIZATION. See 3:681; 25 
(35) minutes. 
€) INTERPRETATION OF READING MATERIALS IN THE 


countin 


NATURAL SCIENCES: TESTS OF GENERAL EDUCATIONAL 
DEVELOPMENT: COLLEGE LEVEL, TEST 3. See 3:526; $ 
(85) minutes. ) 


REFERENCES 


т. PETERSON, SHArLER: "The Dental Aptitud i 
gen Will Become Nation-Wide for the dort КЕЕ КАШУ 

ы 26:112-4 О ’50, *. Tu Dakin 

2. PETERSON, SHAILER. “The Dental itude Testi 
pra Will Become Nation-Wide for the Da arant 

d 35:119-21 Mr 's1. * 


DRIVERS 


[789] 
*[Driver Selection and Training Forms.] Truck 
drivers; 1943-50; a part of the White Motor Со% 
Continuing Control System of Truck Management; 
individual in part; 2 parts; manual out of print; post- 
"age extra; developed for White Motor Co.; Dartnell 
Corporation. * 
`а) [DRIVER SELECTION.] 1946-50; developed by Robert 
N. McMurry & Co. (except for Identical Forms). 
1) Employment Application. 1946; Forms No. ш 
(city delivery drivers), No. 211 (over-the-road driv- 
ers), No. 311 (long distance drivers); 1-99 copies 
of any one form, 7¢ each; 100-249, 6¢ each. 
2) Telephone Check. 1946-50; Form No. OT-203 
(^50) ; 1-99 copies, 6¢ each; 100-249, 5¢ each. 
3) Patterned Interview (Short Form). 1946-49; 
Form No. OP-202 ('49); 1-99 copies, rof each; 
100-249, 8¢ each. : 
4) Serviceman Placement Interview. A supplemen- 
tary interview for servicemen; 1946; discontinued: 
Form No. 14. Р 
5) Identical Forms. 1947 ; use is optional ; also called 
Visual Speed and Accuracy Test; Form No. 15 
[’47] ; no data on reliability and validity ; no manta'; 
no norms; $1.35 per 25, cash orders postpaid ; sped- 
men set free; 4% (10) minutes; L. L. Ташен, 
must be purchased directly from Science Researc 
Associates, Inc., the publisher. r 6: 
6) Physical Examination Record (Drivers). 194; 
Form No. 19; 1-99 copies, 5¢ each; 100-249, 4f each. 
b) [DRIVER TRAINING.] 1943-46. No 
1) Traffic and Driver Knowledge, 1946; Form i 
16; no data on reliability and validity; no ded 
no description of normative population; 1-99 paata 
6¢ each; 100-249, sé each; 6¢ per key—Form Am 
17; Amos E. Neyhart and Helen L. Меуһаг 
distributed by the Institute of Public Safety, 
sylvania State College. " Train- 
2) Road Test—In Trafic: For Selecting and driv- 
ing Truck Drivers. 1943-46; 3 scores: speci ОШ! 
ing skills, general driving habits and attitudes, E x 
Form No. 18 (46); no norms for part scd (4 ); 
description of normative population; manta hu 
1-99 copies, 5¢ each; 100-249, 4¢ each; tum ШР 
cylinder essential for administration; (3 Ameri- 
utes; Amos E. Neyhart; also distributed by + Publi 
can Automobile Association and Institute 0 
Safety, Pennsylvania State College. 


S. Клімѕ WALLACE, Jr., Director of Research 
Life Insurance Agency Management Assoc 
tion, Hartford, Connecticut. ten 
For use in their “Continuing Control 2d 
of Truck Management," the White Motor ep 
pany asked Robert N. McMurry to Шү oe 
selection battery and procedure for truck drivin 
and Amos E. Neyhart to prepare tests of p i 
knowledge and skill. The result is а гета" 
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and generally sound, program for selection and 
diagnosis in training. | 

The first screen in the selection process is a 
personal history blank called the Selective Ap- 
plication Form. The form is differently weighted 
for city delivery drivers, over-the-road drivers, 
and long distance drivers. 

The form for long distance drivers includes 
ten significant items and is scored for three age 
groups in order to equate for the relation of age 
to the criterion—survival in the job. Validity 
data for 1,016 of these drivers are presented in 
tabular form, showing the proportion of men 


at eight score groups who survived for less than: 


six months, for six months to two years, and for 
two years and over. No correlation coefficient is 
given, nor can one be obtained from the data. 
This will disturb some readers. It does not dis- 
turb the reviewer who believes that success ta- 
bles which may be readily understood by man- 
agement are usually superior to correlation 
coefficients, especially when the latter are com- 
puted in violation of the assumptions underly- 
ing them. In brief, 49 per cent of the applicants 
inthe highest score category survived for two 
years or more compared to 6 per cent of those in 
the lowest. There is a regular decline in survival 
às the score decreases. This is very acceptable 
validity. 

Itis unfortunate that similar validity data are 
not presented for the other two driver types. 
Since the interpretation given for the score for 
long distance drivers in terms of norms and rec- 
ommended cutoffs is much fuller than for the 
other two, one is led to suspect either that work 
has not progressed as far with them or that the 
Validities attained are considerably lower. The 
Suspicion is fortified (but not completely justi- 
fied) by the fact that the forms for neither city 
delivery drivers nor over-the-road drivers are 
corrected for age or accompanied by norms. 
“ne second step is a one-page blank (Tele- 
ан Check) to be used during a telephone in- 
"i ki With a previous employer. This is of 
к Standard type and merits neither favorable 
or unfavorable comment. 
id Кыр step is a two-page form called the 
bin : Interview, supplemented by a special 
Же or the serviceman. Like all such docu- 
Pirical 1 is defended on logical rather than em- 
is bas donde It has the usual defect that it 
a od. a Some common-sense stereotypes 
and ү. at is good background for a position 

What is bad. Since the interview is always 
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with us, we seem forced to prepare such docu- 
ments on the assumption that an organized job 
of getting information and weighing it must be 
superior to a disorganized one. One wishes that 
the authors had not felt it necessary to go beyond 
this and state : “Interviewers quickly learn to rec- 
ognize those patterns of personality adjustment 
traits which are most likely to lead to success in 
driving a truck. They learn quickly, also, to spot 
those patterns which mean 'danger.'" This is 
pious hope and nothing more and the authors 
know it. It is beneath the dignity of their basic 
approach. 

However, all credit is their due for making a 
valiant, if ineffectual, attempt to validate their 
interview. They present data on 108 men, each 
one interviewed after the company had decided 
to hire him. Of the eight men who were scored 
“outstanding,” 75 per cent were still on the job 
after an indeterminate time (minimum of eleven 
weeks), while 38 per cent of the 39 men rated 
“good,” 26 per cent of the 46 men rated “ques- 
tionable,” and 13 per cent of the 15 men rated 
“poor” satisfied the criterion. Unfortunately, the 
failure to specify who did the interviewing makes 
the study meaningless. This problem of vali- 
dating an intrinsically artistic device used by an 
unspecified but heterogeneous group is one 
which nobody has solved and few have dared to 
attack. The Planned Interview “looks” adequate 
to the reviewer. This, of course, means nothing. 

These three steps constitute the basic selec- 
tion program. For those companies which can 
devote additional time to processing driver appli- 
cants, four supplementary aids are offered; a 
test of perceptual speed and accuracy, a traffic 
and driver knowledge test, a road test, and a 
physical examination form. The first of these, 
Thurstone’s Identical Forms Test, has shown - 
some promise of adding substantially to the va- 
lidity of the selection procedure. Further study 
of its contribution is planned. In the interim, the 
use of a very low cutoff score is recommended. 

The Traffic and Driving Knowledge Test is 
composed of 57 multiple choice items mainly 
directed at traffic laws, efficient and safe driving 
practices, and engine maintenance. Examina- 
tion of the test suggests the inclusion of con- 
siderable verbal component. Furthermore, the 
rationale underlying some of the items is quite 
obscure. It is easy to understand why an item 
on the best method for starting an empty truck 
on ice or for conserving gas while driving is rele- 
vant to the test purpose. But an item like “Which 
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of these things should you think about to keep 
your attention on driving?" or “Which do you 
consider the best way to decrease accidents 
among commercial drivers ?", suggests that the 
authors have attempted to get at attitudes as well 
as knowledge, The effectiveness of this “port- 
manteau" approach cannot be evaluated since 
no information is given concerning reliability, 
item consistency, or validity either for driving 
knowledge per se or for the survival criterion. 
Cutoff scores for five ratings are suggested, and 
the opinion is offered that a score below 45 (av- 
erage) shows that the applicant needs instruc- 
tion to improve his knowledge of traffic and 
driving and if hired with no instruction will be 
a poor risk as a driver. 

The Road Test in Traffic represents an at- 
tempt to evaluate skill in driving with the phase- 
check technique. Standardization of the tasks is 
attempted by specifying a 5-mile route including 
right and left turns, stop signs, traffic lights, 
grades, curves, and a railroad crossing. This 
seems inadequate, as does the instruction to ex- 
aminers to "try the test out on three drivers 
whose driving ability is already known....an ex- 
cellent driver, an average driver, and a poor or 
inexperienced driver * score the tests and as- 
sign Letter Grades * The excellent driver should 
have a Letter Grade of A or B. The average 
driver should have a Letter Grade of C or pos- 
sibly as high as B or as low as D. The poor driver 
should have a Letter Grade of D or E. If your 
results agree with this closely, you should be 
ready to start using the Road Test in Traffic 
for driver selection." This procedure is sug- 
gested as a check on the examiner and the stand- 
ard route. It seems probable that it will, instead, 
increase the subjectivity of the examiner's judg- 
ments. 

It is particularly unfortunate that no data on 
objectivity or reliability are given for this test 
whose validity is mainly a matter of fiat. The 
statement is made that applicants who score A, 
B, or (possibly) C have been found to be good 
risks as truck drivers, while those scoring E 
have been found to be very poor risks and should 
be disqualified. However, no supporting data 
are given. Norms are also absent. 

The Physical Examination Form needs no 
comment. 

The authors have recognized as few others 
have done the importance of integrating a selec- 
tion system with the training which is to follow. 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK ж 


They present their tests of knowledge and фа 
as both selection instruments and " 


aids for determining the quantity and 

of training appropriate for individuals. Unies. 
tunately, one receives the impression that the 
integration broke down somewhere, The ab 
sence of selection validity data for the two dul. 
purpose tests is symptomatic of such а bresk- 
down, Equally disturbing is the fact that no dats 
are presented on the interrelations of the varios 
instruments in the battery. The selection 

is based upon a series of multiple cutoffs. This 
may be the most effective method but one would 
like to see the data which led to its preferment 
over the multiple correlation technique, Simi- 
larly, there is no evidence that the knowledge 
and skill tests have been thoroughly investigated. 
If the Road Test in Traffic does provide a good 
measure of driving skill, why should it not ale 
be used as a criterion for the selection tests, or 
at least as a method of refining the survival erb- 
terion? If driving knowledge is important, why 
is it not also important to know more about its 
relation to driving skill and to survival? The 
failure to investigate these questions leaves the 
reviewer unhappy and dubious of the valid- 
ties of the tests for diagnostic and training por 


poses. 

All things considered, however, this systems 
compares favorably with the great majority of 
those found in industry. With proper followup 
procedures and more careful analyses of the im 
struments used, it could serve as a model of 
the proper use of psychological techniques 8 
selection and training. 
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*Road Tei. Telle, Drivers; M, iJ 
scores: speci. s, general 
r1 editions; no norms 


and attitudes, total 
scores; no descri normative qe 
коче pas оов, тон таеп se im 


essential H E 
to ified test users; (30-60) minutes; Ao 
Neyharts American Automobile Association and І 
Sate College, Pa pa 
a) та SELECTING AND qum TRUCK pavers. 190 
A "45; manual (745). 

D] ка TESTING, RATING, ane TRAINING PASSENGER OM 
DRIVERS, 1947; 1 form. 

qose Truck. drivet: 
*T raffic and Drivi ^ 
1946; 1 form; mo data on reli. Ы and validity ж 
manual ; no description of normative popula Ра 
prepared = key, qat Neyhart and rey 
or the White Motor Co.; Institute of Public 66 
Pennsylvania State College, State College, 
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Aptitude Test for Elementary School Teach- 
enin-Training. Prospective school 
eio: 1045-11; also called Ec sva - 
prt; $1.75 per 10; "i per manual D id per 
specimen set; tpaid; Henry Bowers; J. M. & 
Sons (o Lx Ы У тч et, distributor ; Psycho- 
exttic iates. 
4) PARTS 1-V. 1946; 5 scores: opinions, books, occupa- 
tions, interests, aspect of judgment; 1 form; nontimed 
( ) minutes. 
0) PART VI, PERFORMANCE TEST, REVISED EDITION, I 

; a controlled interview in which the examinee de- 
Dres yo 5 speeches of 3 to 34 minutes duration on 
ferent subjects; examinees also required to serve as 
audience for other examinees; 1 form, '50; (120-240) 
minutes per examinee in 4 sessions, 
4) PART ҮП, THE HIGH SCHOOL PERCENTILE. 1946; a 
— кої record form intended for use only in Ón- 


d) PART VII, PSYCHOMETRIC EVALUATION AND PERSONAL 

BATA, 1951; а high school record form intended for use 

outside of Ontario; available only from Psychometric 
es, 


E REFERENCES 
h 
"Seh 36 618- 8. * 
% iomas, йу, New Dai i Validity of e Anik 
м de n-Training.' " Сов 


Ronert М. W, Travers, Human Resources Re- 
search Center, Air Training Command, Lack- 
lend Air Force Base, San Antonio, Texas. 

It seems fair to describe this test as an attempt 
to do what cannot be done at the present time. 
The purpose of the project of which the test is 
à major outcome was to develop an aptitude test 
which would predict “the ability to teach lessons 
in classrooms where the problem of control is 
simplified by the presence of one or two experts 

fo evaluate that ability," This statement 
must be carefully examined because it is based 
"pon assumptions that do not seem tenable at 
the present time, 

First, it implies that a so-called expert can 

*rve a teacher and judge the extent to which 
desirable changes are being produced in chil- 
ren. This does not seem to be the case at all, 

the studies by A. $. Barr and his associates 
lect consistently low correlations between ef- 
" Yeness of teaching as measured by super- 

‘rs’ ratings and by objective measures of 
P krowth. Those who achieve high ratings 

“ipervisors are likely to produce just about 
much and no more pupil growth than those 
№ achieve low ratings. This fact is discour- 
чен Tom an administrator's point of view but 
fet to be faced. It is also necessary to face the 

that measured pupil growth is a much more 
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defensible basis for the assessment of teacher 


effectiveness than are supervisory 
the latter cannot be accepted at the 
аз a usable criterion, 


kar qus de эрү implies that effective- 
ness of teaching is a concept derived from the 
observation of teachers; that is to say, that ef- 
fective is apparent when it is observed. 
Nothing could be further from the case. Teach- 
iar a © ven йа ма be собно оба. 

ve if it provi in 1з which are 
considered recor n 5 ‚ In terms of 
the philosophy of another individual, these same 
pupil changes may not seem desirable and the 
eer ipei Ee be considered effec- 
tive, effectiveness of teaching depends not 
only on the amount of pupil change which it 
produces but also on whether these changes are 
considered desirable. 

ie wih tds t a гонаг to prit i not 
able which test is to not 
a meaningful one at the present time, Meaning- 
ful and valid measures of teaching effectiveness 
cannot be obtained. However, even if acceptable 
and valid measures of teacher effectiveness were 
available, it is doubtful whether the instrument 
under eap would be particularly useful 
for оп purposes. 

The iet eode of seven parts as follows: 
(a) Opinion (attitudes towards various social 
activities); (b) Books (attitudes towards the 
titles of six imaginary works) ; (с) Occupations 
(preference for certain 4 (4) In- 
terests (expressed interests in activi- 
ties) ; (e) An aspect of judgment (judgments 
of the degree of truth or of several propo- 
sitions) ; (f) Performance (ratings of various 
traits in a standardized situation); and (g) 
High school (a measure of high school 

. Thus the measures 


achievement. 


hatever form it may occur, the i 
there is опе, which underlies the items included 
i test is not given. implies a theory 
sist аеро ро А 
can be best examined in relation to Part VI of 
the test, in which the student is rated for various 
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characteristics while he delivers three, four, or 
five brief addresses. While the student is deliv- 
ering these speeches he is “rated” for the follow- 
ing characteristics : 


FAVORABLE DESCRIPTIONS UNFAVORABLE DESCRIPTIONS 


Businesslike Bashful 

Cheerful Colorless 

Colorful : Hesitant 

Enthusiastic Listless 

Pleasing voice Nervous : 
'oised Unsatisfactory voice 

Self-confident Vague 


The rater indicates those descriptions that are 
“highly applicable," and those that are “clearly, 
but not highly, applicable." In addition, ratings 
are made for alertness, logical procedure, and 
general impression. ; 

Now the assumption which underlies this en- 
tire procedure is that individuals are character- 
ized by stable traits of the kind listed, and that 
these traits will manifest themselves in a variety 
of situations which include the test situation and 
the teaching situation. This is largely contrary 
to what is known concerning the nature of per- 
sonality. Characteristics of social behavior are 
not manifested consistently from one situation 
to another—it hardly seems necessary to refer 
to classic studies in which it was shown that most 
persons were not consistently honest or dishon- 
est but varied in their behavior from one situa- 
tion to another. Thus the assumptions underlying 
those parts of the test based on traditional trait 
theory are largely unacceptable in terms of a 
modern theory of behavior. 

The author of the test has commendable cour- 
age in attempting to provide a solution for a 
major educational problem, but he has selected 
one of such complexity that the techniques 
needed for solving it are not available and are 
not likely to be available for many years to come. 


Epwin Wannt, Statistician and Research As- 
sociate, Teacher Characteristics Study of the 
American Council on Education, University of 
California, Los Angeles, California. 

The 47-page manual provides a wealth of in- 
formation regarding the validity and reliability 
of the various tests. Fourteen pages are de- 
voted to discussions of validity. The test and 
manual obviously have resulted from a consider- 
able amount of careful research. 

Several facts appear to limit the usefulness of 
this test in the prediction of success in practice 
teaching, particularly in the United States. (a) 
The population used in the development of the 
test was exclusively Canadian. Use of the scor- 
ing keys on the paper and pencil sections would 
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be extremely risky without experimental уеп. 
cation of these keys with local student teacher 
samples. (b) Although five pages of the manual 
are devoted to detailed instructions for the per- 
formance test, no information is given regard- 
ing the scales entitled “alertness” and “logical 
procedure.” Since these two scales contribute 
so heavily to the final score on the Performance 
Test, more detailed instructions should have 
been furnished. (с) The most important test, 
Test VI (Performance), depends on the avail- 
ability of at least three competent raters, each 
of whom should have appraised the efforts of at 
least 500 students. There is no question that the 
validity coefficients for the performance test are 
high ; however, it seems quite possible that these 
high correlations are due to the extensive ex- 
perience of the raters involved in the validation 
studies rather than to any particular character- 
istic of the test itself. 

For the above reasons it is recommended that 
the ATEST be used for research purposes only. 
Persons interested in the problems of pretrain- 
ing selection of teachers will find the manual 
very interesting and instructive reading. 


J Ed (London) 82:460-2 Ag’50.C.M. Flem- 
ing. * while specialized for Canadian conditions, 
is in line with recent work on interests and opin- 
ions in their relevance to teaching skill * 
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XBarr-Harris Teacher's Performance Record 
For teacher rating and in-service training; 19433 
scores: understanding and use of objectives, philosop | 
of teaching, professional competency, personal fitness, 
pupil growth and achievement, general merit; 1 ipm; 
no norms; 15¢ per copy; 10@ per manual; poses 
tra; A, S, Barr and А. E. Harris; Dembar Publi 
tions, Inc. * 


[ 794] Ў 
*The Case of Mickey Murphy: А Case-Study Tr 
strument for Evaluating Teachers Unde д 
ing of Child Growth and Development í e =“ 
Revised Edition). Teachers in training and € О 
enced teachers; 1042-48; 3 scores: drawing 20 pro- 
sions and making decisions, soundness of ana МЕА fe 
cedures for adjustment; 1 form, '43; no Се” вер" 
reliability and validity; manual (748); $5 per in set 
arate answer sheets must be used; 256 per p е ex 
(does not include manual or scoring key) фев aller; 
tra; nontimed (50-80) minutes; Warren ^ 
University of Nebraska Press. * 


FRANK S. Freeman, Professor of P. qun 
and Education, Cornell University, Ithaca, 
York. RUM en 
This instrument was devised primarily & i 
aid to instruction rather than as a test to 62 d. 
ate or measure the students’ mastery of P 
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ples and procedures in dealing with problems of 
human development and behavior. It can be 
used, however, for the latter purpose in a pre- 
liminary way ; but such use should be only the 
first step for intensive study and discussion of 
the problems presented in the case. Since this 
instrument is primarily a teaching device, the 
author has, apparently, not felt it necessary thus 
far to go through the usual statistical procedures 
to establish its reliability and validity. It does 
not seem that there is great need to be con- 
cerned with its validity as a teaching aid; but, 
if it is to be used as a measure of students’ psy- 
chological competence, the value of the instru- 
ment will be enhanced if, as evidence of pro- 
fessional agreement, an analysis were made of 
responses given by various known groups of ex- 
perts. Also, evidence of the instrument's relia- 
bility, as answered by expert persons, would be 
highly desirable. One study reports split half re- 
liability coefficients (corrected with the Spear- 
man-Brown formula) ranging from .56 to .64. 
But these low coefficients were based on scores 
of freshmen and seniors in a teachers college. 
A much more significant measure of reliability 
would be the coefficients obtained from scores of 
competent professional persons. With respect, 
by implication, to validity and reliability, the au- 
thor states in the manual that the values as- 
signed to various responses are based upon com- 
parisons of responses by groups of persons with 
different degrees of training and experience. 
Items receiving greatest weight are those that 
differentiate most clearly between persons of 
recognized professional competence and those of 
little or no professional training, the latter rang- 
ing from college sophomores to graduate stu- 
dents who have had teaching experience. The 
scores of the undergraduates and of the experi- 
enced teachers are presented in separate percen- 
tile rank tables. 

The materials of the case study have been ar- 
ranged into separate and successive parts, each 
bs adding more and often complicating data, 
ü еасһ being accompanied by its own ques- 

Ons, hypotheses, or interpretations—some re- 
nM as correct or warranted, while others are 
eee to test the student’s comprehen- 
Mae ae elements and forces operative in hu- 
ЖЫ Еујог, The data of the case deal with the 
in actors affecting behavior of all children 
кс Society : relationships to parents, siblings, 
E Ts, teachers, and other adults; achieve- 

7t in school studies, physical development, 
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mental development, interests, and ambitions. 

The reviewer has used Baller’s case study in 
several classes in clinical psychology at the 
graduate level. The instrument proved to be 
a very effective teaching device, in which the 
students took a keen interest. The case provides 
a wide range of materials; it deals with types of 
problems that are frequent and important in 
schools and among school children ; the arrange- 
ment of materials into progressive stages is ef- 
fective ; and at times both students and instructor 
will disagree with answers, hypotheses, and in- 
terpretations offered by the author of the case, 
thereby providing additional material for analy- 
sis and discussion. 


[795] 
*Diagnostic Teacher-Rating Scale. A scale for 
rating teachers by children in grades 4-8; 1938; each 
scale has space for rating four teachers; 8 scores: lik- 
ing for teacher, ability to explain, kindness-friendli- 
ness-understanding, fairness in grading, discipline, 
amount of work required, liking for lessons, total; 
Forms A, B; mimeographed ; no data on reliability and 
validity in manual (for data presented elsewhere by the 
author, see 1-2); 3€ рег scale; 156 per specimen set; 
postpaid; nontimed (30) minutes; edited by H. 
Remmers; M. Amatora Tschechtelin; Division of 
Educational Reference, Purdue University. * 
REFERENCES М 

IRA; IPSKI " HN 
P ioter Dites AT Measuring the. m of 
Elementary-School Саа UTE Tue Teachers.” J E 
PONI S5 M. “A Diagnostic 14757) cr Rating Scale.” 
J Psychol 30:395-9 О '50. * (РА 25:2740) 
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XExceptional Teacher Service Record. A form 
for the summarization and evaluation of evidence of 
superior teaching; 1947; I form; no data on reliability 
and validity; no norms; manual (747) ; $3 per 25, post- 
age extra; Dwight E. Beecher; the Author, Congdon 
Campus School, State Teachers College, Potsdam, 
* 


N.Y. 


Leo J. BRUECKNER, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. m 

"This record form is an 8-page folder listing 
four groups of items that may be regarded as 
outstanding services rendered by a teacher. The 
form was developed to assist administrators to 
apply the system of merit rating established by 
the New York Legislature. The titles of each 
group and the number of areas included in each 
are as follows: (a) direct service to pupils, such 
as providing rich school experiences—I5 areas; 
(b) community services, such as direct leader- 
ship of youth activities, as Boy Scouts—2 areas; 
(c) nonschool activities, such as work in public 
relations, research undertaken, and leadership of 
community organizations—II areas; and (d) 
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professional training, including advanced study, 
demonstration teaching, and travel—9 areas. 
The experimental edition of the form contained 
many of the items included in the Report of the 
Advisory Committee on Teachers’ Salaries ap- 
pointed by the Commissioner of Education of 
New York State in 1947. Consideration was 
also given to “research and expert judgment” 
in compiling the final list. 

The record form assumes the maintenance of 
a cumulative record form for each teacher in 
which supervisory reports, anecdotal records, 
documentary evidence, and records of excep- 
tional pupil growth are recorded. On the service 
record form each area is defined by concrete il- 
lustrations of typical examples. A space is pro- 
vided for recording the dates on which evidence 
concerning each point was submitted and filed 
in the cumulative record, and the “number of 
the evidence.” To arrive at a rating based on 
this evidence two credit points are assigned “for 
each area in which evidence is frequent and con- 
sistent, indicating regular practice; otherwise 
one credit is allowed for evidence in each area." 
When no evidence is recorded, no credit is 
given. The total number of points for each of the 
four parts of the form is then recorded on a sum- 
mary form. This number is then converted into 
a "ranking score" by means of tables that are 
provided. Finally the results for the four areas 
are totalled. On the basis of this figure the indi- 
vidual then receives a ranking within the fac- 
ulty group involved. As far as the reviewer has 
been able to discover, no report has as yet been 
made of the validity of the blank, the statistical 
basis of the points assigned, or of the results 
where the form is in use. The point rating for 
items appears to be quite subjective since no 
standards are given for assigning points. Changes 
in the weighting of the various items may also 
seem desirable to many. 

'There can be no doubt, however, as to the 
great value of the development of a systematic 
plan for recording outstanding services of teach- 
ers to be used as one basis for establishing merit 
ratings. The blank itself should serve as an ex- 
cellent device for stimulating teachers in service 
to improve their contributions to the enrichment 
of the educational program of a community and 
to broaden their activities in relation to the life 
of the school. The service record in its present 
form should undoubtedly be regarded as tenta- 
tive. Communities might well undertake the re- 
vision or extension of the items included in the 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 79 


analysis in the light of existing conditions, Un- 
doubtedly the extension of the list will lead to 
an even richer list of suggestions of the types of 
exceptional services teachers can render. 


Epwin Wawpr, Statistician and Research As- 
sociate, Teacher Characteristics Study of the 
American Council on Education, University of 
California, Los Angeles, California. 

The express purpose of this form is to pro- 
vide “a quantitative method for the summari- 
zation of evidence of superior teaching.” 

The Record consists of four sections: (а) di- 
rect service to pupils, (b) community service, 
(c) nonschool activities, and (d) professional 
training. Each of these sections consists of a 
number of different areas in which the teacher 
can earn credit points. Sample types of accept- 
able evidence are given for each of these areas. 
One credit point is allowed for evidence of su- 
perior teaching in each area, and two credit 
points are allowed for each area in which the 
evidence is frequent or consistent. Additional 
credits (number unspecified) are allowed in 
cases where the evidence indicates unusually ex- 
tensive benefits from the service rendered. The 
system assumes the maintenance of a cumulative 
record folder for each teacher in which evidence 
of exceptional service will be kept. я 

Scores are calculated by totaling the entries 
in each section separately to give a total credit 
on a "frequency" basis; with this, a "variety 
factor may be included by adding the number of 
different services credited within the section. 

The total of these two sums is then converted, 
in accordance with tables which are provided, 
into a “ranking score" for each section. Direc, 
tions for scoring are confusing. The "variety 
factor may be added but apparently need not be. 
If added, the tables for converting credit points 
to ranking scores are not adequate. For examp o 
in the first section the maximum ranking а 
of 100 corresponds to 30 credit points. I : 
teacher earned 2 points in each of the 15 D 
of this section, he would receive 45 credit e bt 
(30 points for “frequency” plus 15 points in 
"variety") ; yet, there is no provision ma m 
the ranking score for credit points of more 
30. 
A single sheet serves as the manual. N 
dence of reliability or validity is given. | "m 

An instrument of this type may possibly 5 inr 
a useful purpose in ranking teachers accor " 
to activities engaged in and services perform 


o evi- 
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however, much work is needed before it may be 
safely used for the judgment of superior teach- 
ing. There is а danger that school systems which 
use this form will be misled into thinking that 
their rankings of teachers are reliable and valid. 

In view of the lack of evidence as to the re- 
liability or validity of this instrument, it is sug- 
gested that it be used for research purposes 
only. 


[ 797] 
*Graduate Record Examinations Advanced Edu- 
cation Test. Senior year college through graduate 
school and candidates for graduate school; 1946-51; 
available only in Graduate Record Examinations pro- 
grams (see 527) ; 180(220) minutes; prepared by the 
Advanced Education Test Committee appointed by 
Educational Testing Service: G. Lester Anderson 
(Chairman), R. Freeman Butts, Walter W. Cook, 
Harl Douglass, and Donald Durrell; Educational 
Testing Service. * 
REFERENCES 


1, Saum, JAMES A, “The Graduate Record Examination and 
Its MUN ED in the Stanford School of Education." Abstract. 
Calif J Ed Res ipm du Ы 

2. Treacy, Jonn P. “Interpretation of Scores on Advanced 
Test in Education." Cath Ed R 49:171-4 Mr '51. * 
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*How I Counsel. Counselors and prospective coun- 
Evi 1950; IBM; Forms A, B; preliminary norms; 
ad per copy; 256 per specimen set; separate answer 
I (IBM Form ITS 1100 A 155) may be used; 
| М answer sheets must be purchased directly from 
the International Business Machines Corporation ; 
Hec nontimed (30) minutes; Stanley C. Benz 
ши Н. Н. Remmers; Personnel Evaluation Service 
and Research, Division of Educational Reference, Pur- 
due University. * 
REFERENCES 
«Ди» Ѕтлміку C. “Some Measured Dimensions of Coun- 
Guiden PP. 56-62. In Proceedings of the Thirteenth Annual 
aan Conference Held at Purdue University, November 
Balere 22, 1947. Purdue University, Division о Educational 
Ты гелсе, Studies in Higher Education [No.] 63., Lafayette, 
2 hee Division, November 1947. Pp. 77. Paper. * 
та, Benz, SrANLEY C. An Investigation of the Attributes and 
morb aad of High-School Counselors. Edited by H. H. Rem- 
Studie "urdue University, Division of Educational Reference, 
Divisa in SES Education [No.] 64. Lafayette, Ind.: the 
d Осо r 1948. Pp. 41. Paper. * 
"An o ELLAND, WILLIAM A., AND SINAIKO, Н. WALLACE, 
Abstract! d igation of a Counselor Attitude Questionnaire." 
«Ut Ат ye 3:363 Ag '48. * (РА 23: 38, title only) 
Univer ieee . J. AND Remmers, Н. Н. fie Validity of 
Mr? sity Counselor Self-Ratings.” J Ed Psychol 40:168-73 
49. * (PA 24:1485) 


ine P. Евоєнілсн, Associate Professor 
ducation, University of California, Berke- 
Фу, California. t 
К, А to the manual, this test “is designed 
hs i how well a person is acquainted with 
phi уе counseling techniques, to reflect his 
па of counseling, and to indicate his 
toes of pome of the basic principles of 
test F ehavior, This is a big order for a single 
page a m the information presented in the 4- 
judge en manual, it is impossible to 
nf whether or not the test can do so much. 

ortunately, the manual does not mention that 
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summary data concerning the construction of 
this test were published in 1948 (2). In that year 
the test contained 100 items. In its 1950 format 
the same items are divided “on the basis of 
subject matter, difficulty, and the discriminat- 
ing power of the items” to make up the two 
equated Forms A and B. Even though this 
means that some of the data reported in 1948 
are not directly applicable to the present form 
of the test, the data reported in the 1948 mono- 
graph provide a better basis for understanding 
the test than does the manual. 

Each of the two forms of this test contains 50 
statements for which the testee indicates “agree,” 
“not sure,” or “disagree.” The authors have 
classified the items under 11 “basic principles of 
counseling" which repeat most of the generally 
accepted clichés about counseling, such as, “соп- 
sider the student as a whole person," or “сеп- 
ter the interview around the problem expressed 
by the student." That the test does not adequately 
measure each of these 11 principles may be in- 
ferred from the facts that in Form A only one 
item is devoted to each of two of the basic prin- 
ciples, and that two additional principles are 
measured by two items each. Although it is 
theoretically possible to have a perfect one-item 
test, this reviewer does not believe that the basic 
principle that the counselor should "strive to be 
personally well adjusted" is adequately meas- 
ured by the item, *a counselor should be a stable, 
well adjusted person." This condition should 
not be a serious handicap if the test user clearly 
recognizes the uneven distribution of items to 
the various principles. 

The reliability coefficient is reported in the 
manual as .834 for the "combined scores on both 
forms of the test." This coefficient is comparable 
to the previously reported (т). .831 split half 
reliability coefficient obtained for a group of 
360 composed of 306 teachers in a stratified 
sample of schools in Indiana, 40 teachers attend- 
ing a university summer session guidance work- 
shop, and 14 full-time veterans' counselors. 

There are some indications that the test yields 
scores which differentiate groups in the ex- 
pected manner. The mean score of 14 veterans 
counselors, for example, was significantly higher 
than that obtained by various groups of teach- 
ers. Counselors with higher test scores were 
more critical of counseling. services in a uni- 
versity than were persons with lower test scores 
(4). If it is assumed that critical evaluation 1s 
correlated with counselor competence, then the 
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fact that counselors with higher test scores rated 
counseling services in a university lower than 
did persons who did not do as well on the test 
may be taken as an indication of validity. How- 
ever, the validity data available to this reviewer 
do not seem to be substantial enough to justify 
the use of this test for individual selection pur- 
poses as recommended in the manual. 

The test's greatest usefulness at the present 
is as a discussion stimulator in counselor train- 
ing. For this purpose, it makes a fine companion 
to such tests as Tests of Human Growth and 
Development. 'To make the present edition of 
this test more useful, the authors should rewrite 
the inadequate manual. They should include the 
information which they have summarized in 
other publications (т, 2) and omit the overam- 
bitious statements regarding the test's useful- 
ness. The authors have provided an interesting 
instrument. It may be that further research will 
demonstrate its usefulness. Persons concerned 
with the selection and training of counselors may 
find this test a useful research device. 


Mitton E. Haun, Professor of Psychology, 
and Dean of Students, University of California, 
Los Angeles, California. 

This combination of an attitude inventory and 
an information test is an interesting step toward 
the better selection of specialized personnel in 
schools and colleges. After completing Forms 
A and B, the reviewer was of the opinion that 
the answers would be acceptable to the majority 
of competent practitioners in the field. The in- 
strument is somewhat slanted toward advising 
rather than toward clinical or psychological 
counseling. 

Validity of the instrument is based on the cri- 
terion of expert judges. While this is a standard 
step in final validation, it must be accepted as 
tentative until much more rigorous approaches 
are available. Acceptable validity must rest on 
follow-up of candidates selected on the basis of 
this test plus investigation of satisfied versus 
dissatisfied workers in relationship to the test. 

The reliability coefficient for one form (50 
items) is given in the manual as .73. This would 
not warrant great reliance on the instrument for 
use in individual cases, even if satisfactory va- 
lidity had been established. The combined Forms 
A and B (100 items) are listed as having a re- 
liability of .83, not as high as one might wish for 
individual application in selection. 

The tentative norms in the manual are based 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 794 
———— ——n——T— LT 


on a sample of 346 high school teachers. This 
norming appears to the reviewer to be inade. 
quate. A further norm problem with either form 
is that a standard deviation at the extremes is 
covered by a difference of not more than three 
raw score points. A single raw score point on 
both forms makes the difference between the 
8oth and goth percentiles ; two raw score points 
between the 8oth and 95th percentiles. 

If the present forms of the inventories are ac- 
cepted as preliminary ones, the following steps 
are needed before they can be effectively useful 
in selecting candidates or measuring the out- 
comes of training: (a) Formulating a definition 
of counseling as contrasted to advising, psycho- 
logical or clinical counseling, and psychotherapy. 
(b) Lengthening the forms to give more ex- 
tensive coverage to the area described in the 
definition and to gain greater reliability. (c) 
Conducting research to establish reasonable va- 
lidity. (d) Conducting research to provide ade- 
quate norms. 

SUMMARY. This test appears to be the pre- 
liminary release of an instrument which may, 
provided the standard additional steps for sound 
validation are taken, be useful in selecting candi- 
dates for counseling; measuring outcomes of 
training ; evaluating knowledge, beliefs, and tech- 
niques of counselors; and locating areas which 
should be emphasized in counselor training. It 
is not recommended in its present stage of de- 
velopment for other than research purposes. 


[799 ] i 
How I Teach: Analysis of Teaching Practices. 
Teachers; 1942; also called Purdue Teachers [о 
ination; IBM; Forms A, B; $2.25 per 25 IBM i 
answer booklets; 756 per stencil for machine Lon 
of test booklet; $1 per manual, postage extra; e 
per specimen set, postpaid; nontimed (30) т 
Ida B. Kelley and Keith J. Perkins; Education: 
Bureau, Educational Publishers, Inc. * \ 
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May V. ЅЕАСОЕ, Professor of Education, Uni- 
versity of California, Los Angeles, California. 

The scale is designed to measure college stu- 
dents’ and instructors’ working knowledge of 
mental hygiene principles as they relate to teach- 
ing and learning in college. 

The experimental edition of the scale consists 
of 162 statements, refined from an original list 
of 246 items which were taken from the litera- 
ture on mental health and submitted for evalua- 
tion to college instructors and college students. 
Later, the scale was submitted for criticism 
and response to nine experts in psychology and 
psychiatry, and 48 additional items were elimi- 
nated from scoring as a result. Answers are 
recorded in terms of degree of agreement on 
a five-point scale. “Right” answers are based 
on consensus of experts. Scoring is done by 
matching the answer sheet to the margin of 
each page, and the number right constitutes 
the total score. Norms are precentile ranks 
based on the scores of 314 teachers in 12 insti- 
tutions of higher learning in Indiana. Norms are 
available in the reference cited for college stu- 
dents as well. Reliability (odd vs. even) is .91 
onan N of 219. An abbreviated scale of 90 items 
based on item analysis has a reliability of .75 for 
men and .81 for women. 

An analysis of results from 326 college men 
and 1,056 college women, chiefly from teacher 
training institutions, showed that women excel 
men in the insights measured; training in psy- 
chology and education are related to understand- 
ing of mental health problems; and students 
Planning to teach older children seem to have 
somewhat less insight. 

á ps test is an important pioneer contribution 
ust Study of college mental health. It is re- 
the @ 0 pues scales for college instructors on 
jus hand and to college level tests of “ad- 
ted nt" and certain personality measures on 
m Ja though no studies of interrelationships 
in ound in the literature. The original stand- 
m procedure seems sound. 
КН. ае һаз сегїаїп mechanical de- 
doy de ch as hand written answers, relatively 
Т е inclusion of nonscored items, and 
covered ota well organized manual. Populations 
Eas are limited largely to a single state and 

r ii ety teacher training institutions. 
ane usion, this test, though in need of re- 
ФОНЫ, апа improvement of mechanics and 
i к 5 а promising instrument in а relatively 

eloped field. Until research on interrela- 
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tionships is available, its chief uses will probably 
be as a research instrument and as a teaching 
device in college courses in education. 


D. A. Worcester, Professor of Educational 
Psychology and Measurements, University of 
Nebraska, Lincoln, Nebraska. 

The authors state that their problem was “to 
construct and evaluate an instrument to measure 
teachers’ knowledge of the psychological nature 
of the children in their care.” The validity of this 
instrument is indicated by the fact that those 
teachers nominated as “best” by their principals 
or superintendents tended to make higher scores 
than those rated as “worst.” There were, how- 
ever, many exceptions. The items of the inven- 
tory were chosen from the literature and from 
several practical sources such as case histories 
which had been prepared by teachers. The con- 
tent of the test, taken mostly from the field of 
mental hygiene, appears to be good. However, 
many of the items are subject to disagreement 
among experts, and the opinions of 10 judges 
distributed among eight fields are a rather small 
sampling of expert judgment even though some 
of the judges, without doubt, were skilled in 
more than one field. In some instances “best” 
teachers disagreed with the judges, but their 
opinions apparently did not carry weight in the 
scoring. 

The inventory is accompanied by a leaflet of 
directions which is all that is needed for the 
practical use of the instrument. However, the 
complete manual is needed to identify the three 
parts of the test: Teaching Practices, Opinions, 
and Factual Data. The manual also gives de- 
tailed reports of many statistical analyses made 
upon the scores of teachers of different schools, 
different subjects, different trainings, and so on. 
All of these analyses are made from a method of 
scoring which is different from the one recom- 
mended to test users. Because the correlation 
between the two methods is high (.95), the sim- 
pler one—the number of right responses 15 
recommended. However, the authors suggest 
that the more involved scoring method, which 
gives two points for the best answer and one 
point for the next best answer, has advantages 
when individual teachers are being studied. 
Since the authors recommend that the test be 
used for the selection of teachers—certainly in- 
dividual study—norms based upon this scoring 
method are necessary. But only in the complete 
manual, which must be purchased separately, 
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are these norms available. Also, while the re- 
search shows many significant differences (e.g., 
between the scores of men and women high 
school teachers and between teachers with longer 
and shorter experience on the present job), no 
differential norms are presented. 

Forms A and B each have 75 items. In al- 
most all cases a change in three answers changes 
the score by at least 10 percentiles. In Form A, 
for example, the score for the roth percentile is 
15.6 and for the goth percentile is 34.7. A student 
getting half of the possible score stands at the 
9oth percentile or above. This situation prob- 
ably arises from requiring exact answers to 
questions upon which there is so much disagree- 
ment among experts. 

The reliability of the two forms is indicated 
by an intercorrelation of .77. The authors sug- 
gest that one form be used at the beginning of 
а course and the other at the end. The authors 
also state that care was used to check duplica- 
tion as to areas: “If....one form had two items 
about delinquency and the other form had none, 
one item on delinquency was taken from one form 
and put into the other." An examination of the 
list of areas covered by items shows that of the 
94 areas named, only 20 are represented in both 
forms; 65 areas are represented by a single item 
in one form only. The fairly high reliability co- 
efficient may show, then, that a person who 
knows one part of his field probably knows an- 
other part, or, perhaps, the areas were not ac- 
curately identified. Apparently, the two forms 
do not cover the same content. 

The reviewer thinks that the items of the in- 
ventory were carefully chosen and worded. He 
believes the instrument has value as a teaching 
device in some courses in education, particu- 
larly, perhaps, in a course in mental hygiene. He 
questions its use for the employment of teachers. 


For a review by David G. Ryans, see 3:403. 
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edited by H. H. Remmers; Jean Harvey, William R. 
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Dean A. Worcester, Professor of Educational 
Psychology and Measurements, University of 
Nebraska, Lincoln, Nebraska. 

Although this test was developed in 1943 and 
revised in 1947, there is little evidence as to the 
nature or amount of revision. The title of the 
test appears to be misleading; it is preponder- 
antly a measure of one’s understanding of mental 
hygiene, as is stated on the sheet of general in- 
formation. While it perhaps should be inferred 
that a person who agrees with the majority of 
a group of experts in the field of mental hygiene 
is a better teacher than those who do not agree, 
there is no available evidence which compares 
the scores on this test with teaching success. It 
is to be noted, too, that the title is a double one, 
How Teach and Learn in College, and this im- 
plication is carried out by a student’s section 
of a rather elaborate data sheet. No norms of 
any kind are given for students, nor is there any 
evidence that students with high scores are any 
more successful in their college experiences, es- 
pecially those conventionally included under 
learning experiences, than are those with low 
scores. 

One is also a little inclined to question the 
adequacy of the norms given for teachers. Scores 
of 36 and lower are assigned a percentile rank 
of 0; a score of 50, a percentile rank of 5; and 
scores of 104 and over, a percentile rank of 100. 
Since the total possible score is 159, only 68 score 
values lie within the range equivalent to регсеп- 
tile ranks of o to roo, and От score values lie 
outside. This suggests that a large number 0 
disagreements with the criterion group Wl 
found among those scoring in the high places. 

It is noticeable that on many items a perso? 
will agree or strongly agree or, on the other band, 
disagree or strongly disagree in the same ^ ү 
rection” as the key but not in the same amount. 
Since in most of these instances the keyed E 
swer cannot be objectively verified but 1$ M 
matter of personal understanding, it might ; 
wise to make allowances for these responses tha 
are nearly correct. j ntal 

The test may have value for use in а me di 
hygiene course, especially in a college of =й 
cation, but this writer does not see how ер 
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be recommended as a means of forecasting 
teaching efficiency or success in learning at the 
college level. 
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XMinnesota Teacher Attitude Inventory. Ele- 
mentary and secondary school teachers and teachers 
in training; 1951; IBM; Form A; $3 per 25; separate 
answer sheets must be used; $1.85 per 50 IBM answer 
sheets; $15 P 500; 50¢ per set of machine scoring 
stencils; 60€ per specimen set; postpaid; nontimed 
(20-30) minutes; Walter W. Cook, Carroll H. Leeds, 
and Robert Callis; Psychological Corporation. * 
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Dwienr Г. Anworp, Professor of Education, 
and Director of Guidance Testing, Kent State 
University, Kent, Ohio. 

The basic approach to the MTAI seems sound 
and similar to Strong’s very successful Voca- 
tional Interest Blank. The inventory consists of 
à series of items related to attitudes of teachers 
M Pupil-teacher relations. A large number 
vil езе and similar items were given to teachers 
ü Were rated by principals as successful and 

unsuccessful in pupil-teacher relations. From 
d Tesults, the final scale and scoring were 
al [Wo studies of validity were made; 
0 "p yielded coefficients of correlation 
NE ui бо between the scores on the test 
pil's mre criteria : principal’s estimate, pu- 
sistent. ing, and visiting experts’ rating. Con- 
ihe and thorough work has gone into con- 

ea Ае and validation of this inventory. 

"У this represents a serious attempt оп a 
3 important and very difficult problem. 
оа eee of teaching success has been 
authors, Very difficult if not impossible. The 

5 of the MTAT in the first paragraph in 
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the manual indicate “It is designed to measure 
those attitudes which predict how well he (the 
teacher) will get along with pupils.” This is 
certainly a very laudable purpose, although re- 
search which has been done heretofore indicates 
that studies of this kind have not been successful 
in securing any significant prediction. It must be 
said also that none of the data presented by the 
authors in the manual actually involve studies 
of scores made by students in or before teacher 
training in relation to later success in teaching. 
Data are presented to show that persons scoring 
high on the test tend to be better teachers than 
persons scoring low on the test. This is distinctly 
valuable and makes the instrument worthy of 
publication. 

Norms are presented for high school seniors, 
students in training, university freshmen, and 
experienced teachers. These norms are divided 
into the various levels by age of students being 
taught and by amount of education of the teach- 
ers. The samplings involved seem adequate for 
students in training and for experienced teachers 
in Minnesota or the Midwest. The question 
might be raised as to whether norms should not 
have been presented for the successful teacher. 
Although the whole inventory is based on the 
idea of measuring the difference in scores be- 
tween successful and unsuccessful teachers, no 
actual data are presented as to distribution or 
percentiles of these successful and unsuccessful 
teachers. 

No comment or explanation is given in the 
manual of the great differences in the median 
scores of Table I, students, and the median 
scores of Table II, experienced teachers. The 
students in education, as a group, score approxi- 
mately 30 to 45 points higher than do experi- 
enced teachers. As an extreme but important 
example, experienced four-year academic sec- 
ondary teachers make a median score of 23. 
Graduating education seniors from the four-year 
academic secondary program have a median 
score of 68. A student whose score is 35 stands 
at the 9th percentile when compared to students, 
and at the 6oth percentile when compared with 
experienced secondary academic teachers. These 
differences are so great that the inventory will 
be of limited value until the following questions 
are answered. Which attitudes and scores are de- 
sirable, those of students or those of experienced 
teachers? Will these discrepancies be found in 
other groups? If they are confirmed, they mean 
that the inventory is of no value or that teacher 
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education programs are unrealistic or that ex- 
perienced teachers have undesirable attitudes. Tt 
would seem difficult to use this instrument in 
counseling without assuming an answer to these 
questions. It is entirely possible that this inven- 
tory may help force educators to face the rather 
unpleasant implications raised here. The МТАІ 
will be a tool for further research on these same 
questions. Such research is seriously needed. 

Statements in the manual are not clear as to 
how the inventory may be used in counseling or 
advising students. The following items are def- 
initely confusing: “We have no information as 
to how appropriate the Inventory would be for 
use with high school seniors who plan to enter 
teaching”; and “Since high school....counselors 
will be using the Inventory as one of the bases 
for judging vocational fitness in the area of edu- 
cation, it seems desirable to present norms for 
high school seniors and university freshmen.” 
The variation in norms makes counseling with 
this instrument confusing. It is difficult to know 
how to advise a student whose score stands at 
the 9th percentile for students and at the 6oth 
percentile for experienced teachers. As is true 
with so many manuals, no adequate explanation 
is given as to just what may or may not be said 
to students about their scores. Every manual 
should include usable, defensible quotations, ac- 
tual sentences which may be used in explaining 
results to students, instead of the usual general 
statements from which any of a dozen different 
inferences may be drawn. 

From a study of the manual and norms pro- 
vided, a counselor may say to the counselee as 
follows: “Here is an inventory which will in- 
dicate the extent to which your attitudes about 
pupil-teacher relations are similar to, or not simi- 
lar to, those of persons who are actually teach- 
ing or are in preparation for teaching. It will 
also show to some extent whether your attitudes 
are like those of successful teachers.” There is 
no evidence on which to say more. 

The МТАТ presents a new and promising ap- 
proach to a difficult but important problem. It 
deserves considered use in counseling and ex- 
tensive use in research. 


Lee J. Cronsacu, Professor of Education, 
Bureau of Research and Service, University of 
Illinois, Urbana, Illinois. 

Test development of exceptional quality lies 
behind this inventory. Publication was preceded 
by patient and careful research which is well re- 
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ported in the manual. However, the research js 
not at this point adequate to support wide prac- 
tical use of the test, but the research program is 
continuing. 

The inventory is a device designed to predict 
which teachers or potential teachers will estab- 
lish good relations with pupils. There are 150 
opinion statements to be marked "Strongly 
agree," "Agree," etc. From a large trial set, 
items were chosen which were marked differ- 
ently by criterion groups of teachers. The item 
selection and empirical scoring key were checked 
on several samples. 

The central question is whether the inventory 
is valid for the recommended uses of teacher 
selection and guidance. The manual does a seri- 
ous disservice by failing to make clear that the 
validity of the test as a predictive instrument 
has not been investigated. Until such studies are 
reported, the test should be used only on a re- 
search basis by those who are willing to conduct 
follow-up studies of their own. The studies re- 
ported in the manual make one optimistic that 
the validity of the test will prove satisfactory. 
The authors wisely seek to predict a particular 
aspect of the teaching job, success in establish- 
ing rapport with children, rather than a nebu- 
lous global criterion. Ratings of this quality by 
principals, observers, and pupils themselves 
correlate .45 to .49 with scores on the test. When 
the three types of rating are combined into а 
more reliable criterion, correlations with test 
score in three studies are remarkably good : 60 
63, and .46. In design, replication, and care m 
reporting, these studies are distinguished. There 
is a clear correspondence between inventory 
scores and teaching behavior at the time the test 
is given. 

Split half reliability is .93. On retest, scores 
are only moderately stable if teachers are © 
posed to significant experiences. Retest relia- 
bilities during early professional courses, an 
during the first months of teaching cee. 
are near .70. There is evidence of considera à 
change in mean score during the college Д 
It is therefore somewhat doubtful whether t 
test, given at the time of college entrance, jel 
predict the attitudes and practices of а n 
several years later. For short term predic 
or in the hiring of experienced teachers W T 
attitudes have become stabilized, one can exp 
better results. А 

The norms are excellent in many аре, 
stratified random sample of Minnesota teat 
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was used, and norms report separate percentile 
conversions for groups of distinct types; e.g, 
academic secondary teachers with five years 
training. The norms for students, however, are 
based on samples of opportunity and are stop- 
gaps at best. 'The manual should advise users to 
obtain their own local norms. 

The inventory is probably as well designed 
and executed as any actuarial test can Бе, The 
reviewer nonetheless questions whether actu- 
arial tests are the most profitable type to develop 
and use. The strictly empirical test is designed 
to allow scores to include diverse characteristics 
so long as they predict the criterion. A test de- 
signed to measure a psychological quality should 
be more homogeneous, and scores on items 
should have a logical meaning. While the items 
on the inventory are rather homogeneous, and 
scores reflect to some extent a child-centered, 
permissive attitude, the emphasis on predicting 
а criterion has led the authors to use weights 
which are sometimes illogical. 

For some reason, a person who responds 
"Strongly disagree" persistently will tend to 
tama much better score than one who expresses 
milder disagreement. In particular items there 
are scoring weights like these: Item 1, "Most 
children are obedient" ; weights: strongly agree 
(1), agree (—1), undecided or uncertain (0), 
disagree (—1), strongly disagree (о). Item 8, 
A child’s companionships can be too carefully 
supervised” ; weights: strongly agree (0), agree 
(1), undecided or uncertain (0), disagree 
(-1), strongly disagree (—1). An agreement 
or disagreement on Item 1 is penalized; a sus- 
pended judgment or a strong agreement is re- 
warded. In Item 8, an agreement is good, but 
3 emphatic agreement, which is logically identi- 
hu not scored positively. The authors intend 
mo differences "with respect to faking, re- 
RV set, test-taking attitude, and role-playing 
dated in the direction of increasing the [pre- 
an db. There is evidence that scores 
weight Ж raised by faking, and perhaps the 
ing $ which seem illogical do counteract fak- 
hi. Батя! test permits statements to be 
ing, uc d probability of success. For counsel- 
Seit "ug the writer would prefer to use a 
that де describes the person's qualities so 
diction ns gain self-understanding. Since pre- 
ii Я а test almost never allows a final 
Vantage about the individual, there is an ad- 

in obtaining descriptive information 
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which can be supported or denied by supple- 
mentary evidence. Descriptive tests are also 
more useful for evaluating training and for re- 
search. 

| The Minnesota Teacher Attitude Inventory 
is a promising tool for research on teacher suc- 
cess. At its present stage of development, it 
should be employed only by people trained in 
research who can validate it for their purposes. 
It should not be used to select applicants for 
teacher training or beginning teachers until 
further validated. When so validated, it is likely 
to prove highly suitable for such screening. It 
may be used for guidance and for discussion in 
teacher training classes with very cautious inter- 
pretation. 
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National Teacher Examinations. Applicants for ; 


teaching positions and teacher college seniors; 1940- 
51; an examination program for use in selection of 
teachers and the appraisal of teachers-in-training ; 
IBM; 2 parts; tests administered annually in Febru- 
ary at centers established by Educational Testing Serv- 
icé; application form and bulletin of information may 
be obtained from publisher; Common Examinations 
with or without 1 Optional Examination: $6 for full- 
time students, $10 for other candidates; Common Ex- 
aminations with 2 Optional Examinations :.$8 for full- 
time students, $12 for other candidates; 1 or 2 Optional 
Examinations: $5.50 for all candidates; fees include 
scoring service and reporting of scores to examinee and 
any 2 agencies designated at time of application; $1 
per each additional report requested at that time or 
any time thereafter ; prepared under the direction of 
the Staff of Educational Testing Service with the con- 
sultation of the Committee for the National Teacher 
Examinations ; Educational Testing Service, * | 
а) COMMON EXAMINATIONS. 6 scores: professional in- 
formation, history-literature-fine arts, science and 
mathematics, English expression, nonverbal reasoning, 
total; 185(210) minutes. { 

b) OPTIONAL EXAMINATIONS. Candidates may not elect 
more than 2 of 9 tests: Education in the Elementary 
School, Biological Sciences, English Language and 
Literature, Industrial Arts Education, Mathematics, 


Physical Sciences, 
80 (00) ‘minutes per test. 
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Harry N. Riviin, Professor of Education and 
Chairman of the Department; and Director o 
Graduate Studies; Queens College, Flushing, 
New York. 

These examinations are objective written 
tests, administered under standardized condi- 
tions, which are designed to measure the relative 
adequacy of the preparation of candidates for 
teaching positions in elementary and secondary 
schools. New editions are prepared annually. 

The examinations are available for a 30-day 
confidential inspection by officials of teacher ed- 
ucation institutions and school systems whose 
positions give assurance that the reviewers are 
likely to make intelligent use of the information 
gained by reviewing the specific content of any 
one annual form of the examinations. The Edu- 
cational Testing Service furnishes confidential 
review copies to superintendents and to exami- 
ners or personnel directors attached to their 
staffs. In similar fashion, the ETS seldom raises 
any question with regard to furnishing the prest- 
dent or dean of teacher education institutions 
with confidential review copies of the tests. | 

All examinees take some examinations if 
common and, in addition, such optional exami- 
nations as they wish to take to demonstrate their 
qualifications for specific types of teaching pos 
tions. The administration of the examinations 
was modified, beginning with the 1951 edition, 
so that the Common Examinations are admin- 
istered in the morning session, with ше 
made for candidates to take one or two Optio 
Examinations in the afternoon session. 3 

The Common Examination, which takes 
minutes of testing time, includes a test of К 
fessional information (education as a 50012 H 
stitution, child development and ещо ү 
psychology, guidance and шелше шү, d 
ucation, and general principles and met “ы: 
teaching); English expression; history; m 
ture, and fine arts; science and mac 
and nonverbal reasoning. The Optional E 
inations, which take 80 minutes of testing up 
each, are offered in education in the de i 
school, biological sciences, English шо 
literature, industrial arts education, pi 
matics, physical sciences, social sciences, 
and Spanish. 

All scores are converted into scaled $n 
that the results on one part of the d em i 
can be compared with those on another р 


res 50 
ation 
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the results of an examination administered one 
year can be compared with those of another year. 
At the candidate's request, the results are made 
available to colleges and superintendents of his 
choice. 

Inaugurated under the auspices of the Amer- 
ican Council on Education in 1939, these exami- 
nations have been administered annually since 
1940. Few examinations have been the center 
of greater controversy. Within the first 10 years, 
more than 68 articles were published explaining, 
defending, and attacking them. This reviewer's 
analysis is based upon this published material 
and upon two complete sets of examinations 
which were made available to him for the pur- 
pose of this review. 

One of the major advantages of the National 
Teacher Examinations is that it introduces an 
objective basis for comparing the graduates of 
teacher education institutions so that the student 
who comes from an institution that grades se- 
verely is not penalized in competing for a posi- 
tion with graduates of other institutions that 
grade more leniently. 

The examinations are carefully constructed. 
The distribution of emphasis is sound and the 
items are, with few exceptions, unobjectionable. 
There is a definite attempt, particularly in the 
professional information test, to measure not 
only the candidate's knowledge but also his abil- 
ity to apply it to classroom situations. According 
to ETS reports, the examinations are suffi- 
ciently reliable for the use for which they are 
intended. That the examinations are valid, the 
ed submits, is evident from the way in which 
Ww and the keys are reviewed by experts 
1 the field and by the success with which they 
Teneo between good and poor candidates. 
eat w p have been used are evaluated statis- 
Sone and are then used in subsequent examina- 

15 as a partial basis for evaluating other ex- 
amination items. 

р ашуг of previous examination results 
Ris е that there is a perfect positive correla- 
cn ween the average standard scores of each 
the 3 and the amount of educational training 
Dod has had. Thus, candidates with the 
de. di jun a higher average standard score 
MAT ose with a master's degree ; those with 
Mon 2 degree have higher average standard 

nire an do those with only a baccalaureate 
age d rns college graduates have higher aver- 
"EL ndard scores than do those candidates 

"o ave had less than a four year college course. 
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In a recent small scale comparison, teachers in 
training judged by faculty members as "likely 
to succeed" attained significantly higher scores 
on the teacher examinations than did their fel- 
low students judged "likely not to be successful 
teachers" (Biserial correlation coefficient .48). 

The examinations admittedly measure only 
what the candidates know about educational 
theory and about their subjects. Those who con- 
struct the tests are careful to emphasize that the 
ability to teach is not measured. Those who are 
to use the results of these examinations must 
be equally careful not to assume that the tests 
measure what even the authors and editors do 
not claim them to do. 

The subject matter tests could well be used to 
measure the mastery of college students who 
have majored in a given subject area. Though 
there are usually some items on each test that 
are directly related to the content of the second- 
ary schools, the major source of the items is the 
college curriculum and not the secondary school 
curriculum. To be sure, the richer the prospec- 
tive teacher is in his area of specialization, the 
more he has to share with his students. There 
is, however, the question of whether the person 
who has taken an endless number of college and 
university courses is really well prepared to 
teach the subject to junior and senior high school 
students if he is not thoroughly familiar with 
those aspects of the subject matter that play 
the greatest role in the lives of adolescents. It 
must be admitted that there are many items that 
test the scholarship of a graduate student but 
that do not indicate his ability to apply his schol- 
arship to secondary school teaching or his fa- 
miliarity with the content of the secondary 
school. The modern secondary school is more 
than a watered-down version of a college or a 
university. A candidate with a perfect score may 
not be a better equipped prospective teacher 
than is another candidate who misses a half 
dozen items. It may be that these subject matter 
tests would be more useful to superintendents if 
they drew more heavily upon the secondary 
school curriculum. ў 

Everyone who has ever edited a standardized _ 
test knows how easy it is for the editor to modify ~ 
questions prepared by even competent item 
writers. There аге, therefore, bound to be a num- 
ber of items on any published test to which com- 
petent people can take exception. The present 
examinations are no exception. There were a 
number of items on this test for which this re- 
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viewer would accept answers different from 
those given in the official answer key. While it 
is possible that this reviewer is in error in his 
answer to the items, there is also the possibility 
that there are situations in education where the 
experts will disagree in their choice of the best 
solution, This is particularly true where the ex- 
amination, in order to measure the candidate’s 
ability to apply his information to the classroom, 
presents the candidate with a classroom situa- 
tion with the request that he indicate what he 
would do under the circumstances. The descrip- 
tion of the situation has to be brief and, there- 
fore, includes some data that the test authors 
consider to be the determining factor in the situ- 
ation, Other educators, however, thinking of the 
circumstances under which such a situation may 
arise, may attach more significance to what the 
authors consider to be relatively trivial. While 
the official key may be the correct answer for 
most instances of the type of situation described 
in the item, there may be a few cases where the 
accepted solution may be so inadvisable that 
those who know schools are reluctant to offer 
the blanket advice suggested in the answer. 

One of the difficulties in the preparation of 
the National Teacher Examinations is that we 
do not have a national school system. Our ele- 
mentary schools, for example, vary from highly 
departmentalized subject matter schools to ex- 
perience curriculum schools. The background 
which a teacher needs differs so much from one 
extreme of educational practice to the other that 
it is hard to conceive of an examination in ele- 
mentary education that is equally appropriate 
for so wide a variety of schools. Most of the 
items in elementary education, for example, are 
still conceived in terms of a school which teaches 
isolated subjects and may not be most appropri- 
ate for selecting teachers for elementary schools 
which expect their staffs to be able to build their 
curriculum with the children. 

How, then, should these examinations be 
used? Regardless of how useful these tests are 
as a means of measuring what a candidate knows 
about teaching and about the subject he teaches, 
they do not measure many of the most important 
aspects of the candidate's ability to teach. There 
are, for example, no ways in which these tests 
measure the ability to develop rapport with 
children, the effectiveness in communicating to 
children, or the candidate’s own personal ad- 
justment, to mention only some of the factors 
that are important in teaching success. For this 
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reason, those who use the National Teacher Ey. 
aminations should supplement them with other, 
more personal, ones. 

This reviewer has served as one of the outside 
consultants used by school systems to help evalu- 
ate candidates for teaching positions whose score 
on the National Teacher Examinations was 
known to the examining committee. In these 
situations, knowing the score made the inter- 
view much more efficient and effective since it 
was unnecessary to attempt to evaluate orally 
what had already been measured more objec- 
tively and more thoroughly. The examiners, 
therefore, were free to measure the subjective 
elements of the student’s personal qualifications 
for getting along with children and with col- 
leagues and for serving as an effective leader in 
а democratic classroom and school situation. 

For what they are intended to serve, the ex- 
aminations are undoubtedly valuable. They do 
measure the candidate’s background of knowl- 
edge about teàching in a typical situation. For 
Schools that are interested in knowing more 
about a candidate than just what he knows about 
teaching, these examinations should be supple- 
mented by an examination of the candidates 
college record, of his success as a teacher, and 
of his personal qualifications. 

[ 803 ] Д 

X*The Purdue Rating Scale for Instruction. Col- 
lege; 1927-50; ratings on 26 characteristics of Ше 
structor and teaching situation; formerly called The 
Purdue Rating Scale for Instructors; IBM; 1 n 
'50; manual (’50); norms (’50); 3¢ per IBM s |. 
answer sheet; 5€ per profile chart; 15¢ per rd 
25€ per specimen set; postpaid; nontimed (шо) РС 
utes; Н. Н. Remmers, D. N. Elliott (scale), and 1. em 
Baker (manual); Personnel Evaluation Келе 
Service, Division of Educational Reference, Fur 
University. * 
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du ed by Н. Remmers. Purdue University, Division, o; 
ron ee Reference, Studies in Higher Education INo.] 
t i Further Studies in Attitudes, Series 15. Lafayette, Ind.: 
* Division, March тоо. Pp. o6. Paper. * f 
sects, BAKER, P. C. лхо Remmers, Н. H. "Progress in Ке. 
zh on Personnel Evaluation." J Teacher Ed 2:143-6 Je '&t. 
son; 5 Canter, Geran C. “A Factor Analysis of Student Per- 
ПА tv Traits.” J Ed Res 44:381-85 Ja 'st. * (РА 25:8264) 
Stude Drucker, A. J., AND Remmers, H. Н. “Do Alumni and 
Ed Pan Differ in Their Attitudes Toward Instructors?” J 
sychol 42:129-43 Mr 's1. * (PA 25:7698) 


Keynern І, Heaton, Richardson, Bellows, 
Henry and C ompany, Inc., Philadelphia, Penn- 
Sylvania. 
, The 1950 edition of this scale presents the 10 
instructor traits included in the original 1927 
Ner together with 16 characteristics which de- 
hing the teaching situation (“Suitability of the 
5 od or methods by which the subject matter 
n Presented,” “Suitability of the size of the 
Ery Degree to which the objectives of the 
urse were clarified and discussed,” etc.). 
у effort has been made to validate the in- 
abi. as а “ ‘true’ measure of the instructor s 
Шу.” The authors present the instrument as 


"primarily a device for ascertaining student 
judgments concerning the traits in question" 
but have not validated it on this basis. The man- 
ual includes a report of findings from experi- 
mental use of previous forms. These reports in- 
dicate that such instruments have motivated in- 
structors to make needed improvements. The 
manual also includes a discussion of the instru- 
ment and factors to be considered in its use, so 
presented as to be reassuring to the instructor 
who is not used to the technical details involved 
or who is reluctant to encourage an expression 
of student criticism. 

The rating form itself is designed for either 
IBM scoring or for hand scoring. The mechan- 
ics seem unnecessarily complex in that the first 
IO items are to be marked and scored on one 
basis and the last 16 items on another. There is 
no apparent reason for the difference in format. 

The scale can be useful as an encouragement 
to criticism. The need for supplementary ex- 
changes of more analytical and specific sugges- 
tions for improvement, and the need for criti- 
cism from those who are more expert than 
students in judging the educational process, is 
obvious. 

[ 804] 
*Scale for Rating Effective Teacher Behavior. 
Teachers; 1947; 38 behavior items grouped under 6 
categories: fairness, cheerfulness, sympathetic under- 
standing, control, ability to stimulate response, knowl- 
edge; 1 form; no description of normative population ; 
manual ('47) ; norms ('50) ; $2 per 25, postage extra; 


Dwight E. Beecher; the Author, ngdon Campus 
School, State Teachers College, Potsdam, N.Y, * 


REFERENCES 


5 , Dwicut F. The Evaluation of Teaching: Bach- 
улын ко, Concepts. Introduction by Maurice E. Troyer. 
New York: Syracuse University Press, 1949. Pp. xi, 105. Pa- 


per. (PA 24:3901) 
Leo J. BRUECKNER, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. 
The items in this scale consist of “those quali- 
ties which available evidence indicates are uni- 
versally apparent to pupils” when they are asked 
to appraise the work of teachers. Each item is 
expressed as observable teacher “reaction be- 
haviors” which can be noted by an observer of 
the activities of a teacher in a.classroom situation. 
In stating the items an unfortunate use 15 
made of such “universal” expressions as “no 
pets,” “all pupils,” “every opportunity,” and 
“never disregarded.” The observer is required 
to make an appraisal on an either-or basis. 
Either the item as stated in the list was observed, 
in which case a plus rating is given, or the re- 
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verse type of behavior was observed, in which 
case a minus rating is given. Space is provided 
on the blank for “anecdotal records of behavior.” 
When the item is not observed, as the author 
indicates is often the case, no rating is given. 
There is no differentiation in the weighting given 
to the various items in the scale. The score on 
which a rating is based is the percentage of items 
observed for which the rating was plus. 

The dichotomy used in appraising the items 
in the scale is obviously open to question. For 
example, a rating of Item 12, "friendly in man- 
ner and tone of voice to all pupils," would be an 
expression of a judgment and would be compli- 
cated by the word "all." How would an observer 
rate the item if the manner was friendly but the 
voice did not appear to be friendly? How would 
the item be rated if the teacher for even an in- 
stant betrayed irritation with some single pupil 
in the class who raised a disturbance? Tech- 
nically the rating would have to be minus be- 
cause of the word all, whereas otherwise the 
rating might have been a plus, as judged by the 
general impression given. Universals should not 
be included in a rating device. Nor should rat- 
ings be either plus or minus, since degrees exist 
in the extent to which any observable behavior 
may be present. The use of some sort of profile 
chart might make possible some expression of 
the degree to which any item of behavior was 
present. 


Epwin Wanpt, Statistician and Research As- 
sociate, Teacher Characteristics Study of the 
American Council on Education, University of 
California, Los Angeles, California. 

This rating scale consists of 38 items of ob- 
servable teacher behavior which are reacted to 
most frequently and intensely by pupils. The 
basic assumption of the author is “that pupil- 
teacher relationships are a vital, if not the most 
important, factor in effective teaching.” 

The 38 items are grouped into 6 categories: 
indications of fairness (6 items), indications of 
cheerfulness (5 items), indications of sympa- 
thetic understanding (7 items), business-like 
procedure in controlled situations (8 items), 
ability to get pupil response (6 items), and 
knowledge and technique (6 items). 

The scale is filled out by an observer who sees 
the teacher in the classroom. Each item of be- 
havior which the teacher exhibits is marked 
plus, and each item which is noticeably lacking, 
in the face of opportunity for its occurrence, is 


marked minus. Sufficient observation time js 
allowed for the checking of approximately 25 
items. Two or more classroom visits wil] usually 
be necessary for one “complete observation,” 
The author recommends that two Such “com. 
plete observations" be made at intervals of sey- 
eral weeks or months, and that the results be 
averaged. The score on the scale is the percent- 
age which the plus items are of all items checked, 
For example, a teacher who received 20 plus 
items and 5 minus items would receive a score 
of 80. 

Scores on the scale correspond fairly well with 
estimates of teaching efficiency obtained from ad- 
ministrators. A contingency coefficient of 72 
was obtained between judges' ratings and scores 
on the scale based on a sample of 5o teachers, 
(The coefficient .88 reported in the manual is 
not a contingency coefficient but rather a ques- 
tionable estimate of the product moment corre- 
lation coefficient.) 

The reliability of the total scale of 38 items, 
as reported for two small samples, seems to be 
adequate. A correlation coefficient of .90 was 
obtained between two sets of observations made 
by Beecher (N — 50). A correlation coefficient 
of .79 was obtained between the observations 
made by two independent judges (N — 50). 

The norms consist of the nine deciles. Al 
though the median should equal the fifth decile, 
different values are reported. Because the num- 
ber of teachers, the kinds of teachers, and the 
Schools from which the teachers came are not 
reported, the norms are meaningless. If norms 
are desired, users of the scale are advised to 1 
nore those furnished norms and to develop their 
own local ones. 

The one-page manual furnished with the scale 
is inadequate. Persons using the scale will prob- 
ably wish to read Beecher's monograph (1), 
Which contains an account of the development 
of the scale. (Some of the data reported in this 
review were obtained from this monograph-) 1 

The choice of items (“observable teacher ү 
haviors to which....pupils react most frequent 
and intensely") has produced a scale шаги 
of teacher behaviors which many educators "e 
are associated with effective teaching. The ie 
is easy to use, and, if the method suggeste a 
the author (two or more “complete obse hn 
tions," each consisting of two or more duum д 
visits) is followed, it would seem that 4 Я i 
ple of important teacher behaviors can be 
tained. 


805 ] 


TESTS & REVIEWS: VOCATIONS—SPECIFIC—EDUCATION 


805 


а арР 


[805] 
XTeacher Opinionaire on Democracy. Teachers; 
1949; democratic aspects of teacher philosophy ; Forms 
G, H; manual [49] ; $1 per 25; $1.50 per 25 booklets 
containing both forms; 35€ per specimen set; postpaid ; 
nontimed (20) minutes ; Enola Ledbetter and Theo. Е. 
Lentz (manual) ; Character Research Association. * 


СковсЕ W. HARTMANN, Professor of Psychol- 
ogy and Chairman of the Department, Roosevelt 
College, Chicago, Illinois. 

Two forms, G and H, each containing 65 
attitude statements, attempt to measure the 
degree of "democraticness" manifested by 
teachers in dealing with educational situations 
by means of expressed agreement with such 
items as “Children in a teacher-controlled class- 
room are no better than slaves,” “Loyalty to 
authorized subject-matter is one of the greatest 
enemies of democratic living in the classroom,” 
etc, The instructions declare the purpose of the 
instrument to be a determination of “how teach- 
ers feel about what is and is not wise in our treat- 
ment of children in school.” The context sug- 
gests that that variant of the progressive outlook 
championing maximal pupil freedom of purpos- 
ing in the elementary years is taken as norma- 
tive by the key. That ethically we can be free 
only to do the right, that in the long run, a hu- 
mane, kindly, friendly temper prevents the ac- 
ceptance of every goal on an equal basis, is a 
neglected or rejected position. 

_In principle, there is no question that instruc- 
tional procedures, as well as personalities, vary 
from more to less democratic in spirit ; but some 
Segmented responses can be dangerously mis- 
leading when treated in isolation or mechani- 
cally. Thus, the AAUP would officially affirm, 
in the honored name of hard-won academic free- 
dom, that “Pupils’ wishes should have nothing 
'o do with promotion or dismissal of teachers,” 
yi the key considers that an undemocratic posi- 
RA Certainly it is arguable whether a demo- 
К ee omens means the elimination of all 
Ж pulsory courses, permitting gum-chewing 1п 

"e that a principal is unnecessary, simpli- 
кж ык pear cal ga d d 
"authoritati lv" рез at REM 
eu tively by the six harmonizing judges 

ployed in the validation process ! The Achilles’ 
kel is right here. 
in ably is given as .87, and percentile 
Eee Бие, from 400 teachers appear 1п the 
ЛА al which sketches, in readable essay fash- 

» Some of the philosophic, social, and profes- 


sional complications encountered in the develop- 
ment of this device. 

The chief merits of the test seem to be in its 
pioneer novelty, bold willingness to see what 
can be done to formulate an operative consensus 
of values, adaptability to a variety of in-service 
training conditions, and utility as an agency for 
promoting insight via conceptual analysis into 
the “logic” of one’s instructional practices. But 
as its authors somewhat apologetically recog- 
nize, despite their evangelical motivation, there 
is some peril that an unofficial supreme court 
will try to freeze in impressive test form both 
its sound and unsound decisions as to what is 
or is not truly democratic conduct. It would 
seem more in accord not only with field theory 
but with the very nature of democracy itself that 
there can be no tenable restricted, local, abso- 
lute, authoritative, or “final” determination of 
either its universal essence or specific pedagogi- 
cal manifestations. 


C. Rogert Pace, Chairman, Department of 
Psychology, Syracuse University, Syracuse, 
New York. 

One of the most commendable things about 
this test is the clear and concise manual: it pre- 
sents specifically the basic assumptions under- 
lying the Opinionaire ; it describes the Opinion- 
aire as a tentative and research instrument ; it 
gives extensive item analysis and reliability data ; 
it suggests uses for the test; it is written in a 
spirit of humility and as an invitation for others 
to try the Opinionaire, criticize it, and share in 
its improvement. 

The Opinionaire consists of two forms, each 
containing 65 statements which the subject is 
asked to mark plus or minus, indicating general 
agreement or disagreement. Both forms are 
printed in the same four-page booklet. — , 

Specific criticisms follow : First, and of most 
importance, is the choice of basic assumptions 
underlying the construction of the Opinionaire 
— that is, the definitions of democracy. Lentz 
and collaborators recognize this when they say 
in the manual, “The acceptance of the instru- 
ment and the key will vary from one education- 
ist to another * it seemed impossible to the au- 
thors to get the test entirely outside the realm 
of controversy.” Yet the basic structure of the 
test is so important that one wishes the authors 
had sought a wider range of scholars, especially 
scholars outside the field of education, before ar- 
riving at the design of the instrument. The key 


806 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 806 


was devised by the authors with a check by six 
outside judges. 

Second, the total score is simply the number 
of statements marked which agree with the key ; 
yet the statements themselves include quite a 
variety of topics—from racial equalitarianism 
to the method of making administrative de- 
cisions to the desirability of pupils' asking per- 
mission before going to the toilet. Presumably, 
the statements can be grouped about various 
subtopics, but we do not know what relationship 
exists among these subtopics or whether, statis- 

ically, they should be combined to arrive at a 
total score. Using the score as a criterion for 
item analysis or for other statistical study will 
produce little improvement in the instrument ; 
for, as Davis has so aptly said, “Ву the time the 
tryout forms for a test have been constructed, 
much of the opportunity for selecting items has 
already passed." + 

Third, some of the statements in the test lack 
clarity and simplicity as a consequence of the 
authors' tendency to be epigrammatic. Here are 
two examples : "Teachers don't want to encour- 
age pupils to want what they (pupils) want, be- 
cause they (teachers) are afraid the pupils won't 
want what the teachers want them to want" and 
“ ‘Living cannot take place without learning’ is 
truer than the phrase ‘learning cannot take place 
without living.'" Other statements are difficult 
to respond to because they contain two concepts 
which are not necessarily related (e.g., "In a 
democratic school the principal is unnecessary. 
Decisions concerning the school life should be 
made by pupils and teachers jointly.") or be- 
cause they contain prejudicial words which the 
authors define in a special way (e.g., "When a 
child dares to question the teacher's authority 
or opinion, i.e., when he begins to think for him- 
self, then democracy is working"). 

Beyond these specific criticisms, most of 
which are of the sort one could frequently make 
about test materials still in the rather early stages 
of development, it seems reasonable to ask 
whether authors and publishers should print 
and sell tests for the chief purpose of inviting 
others to experiment with their materials. The 
intent of the Teacher Opinionaire on Democracy 
is most admirable and important. If purchasers 
buy it with the idea of using it to promote dis- 
cussion about democracy, rather than with the 

1 Davis, Frederick B. Chap. 9, “Item Selection Techni ues,” 
р. 266. In Educational Measurement. Edited by E. F, Tina- 


quist. Washington, D.C. erican Council on Education, 
1951. Pp. xix, 819. $6.00. * 


idea of using it as a measuring instrument, it; 
publication will have been worthwhile 


[ 806 ] à 

Teaching Aptitude Test: George i 
University Series. Grades 12-16; bot Washington 
data on reliability and validity; mimeographed man. 
ual; $3 per 25; 30¢ per specimen set 3 нн, 37 (45) 
. Wallace; (4s) 


minutes; Е. A. Moss, J. Hunt, and F, 
ter for Psychological Service, 


University, * George Washington 


REFERENCES 

1-8, See 3:405. 

May V. Seacor, Professor of Education, Uni- 
versity of California, Los Angeles, California, 

An important contribution of the test is its 
attempt to get away from entirely verbal and re- 
mote abstractions and to devise practical situa- 
tions requiring the application of judgment and 
professional information. Correlations with 
grades in student teaching, teacher ratings, and 
supervisor’s ratings are low, in part because of 
the low reliability of such ratings. 

The test has, however, certain limitations. 
The date of.publication (1927) means inevitably 
that some of the terminology is now obsolete (as 
“mischievous pupils are usually vicious pupils"), 
the pictures are noticeably crude, and the ty- 
pography a little difficult. More important, the 
sections on “observation and recall” and “recog 
nition of mental states from facial expressions 
seem based on assumptions current psychologi 
cal thought might question. 

In conclusion, this reviewer considers the test 
useful chiefly as a research instrument. Revision 
might improve its predictive value. 


For a review by А. S. Barr, see 3:405. 


i [ 807 ] f Е 
*Wilson Teacher-Appraisal Scale. Rating ba 
structors by students in grades 9-16; 1948; 19 ral id 
knowledge of subject, sense of humor, prese 
student teacher relationship, self confidence, pen 
toward students, interest in students, answering La 
tions, explanations, attitude toward differences, fairnes 
ments, examinations, examination questions, ła Tome 
of examinations, fairness in grading, course И. 
work, enjoyment, overall; 1 form; no data ОП er $0, 
bility and validity; no manual; no norms; $1.50 P 


postpaid; sample scale free; Howard Wilson; p 
nomic Institute. * 
ENGINEERING 
[808] ага Special 


*College Entrance Examination Board, for 
Aptitude Test in Spatial Relations. Candid? ge 
college entrance; 1939-51; available only Pd Pro- 
Entrano Examination Board Admissions e 7 
втат (ѕее 526); 70) minutes; р! min- 
Staff of Educational Testing Service; program ad 


d 
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istered by Educational Testing Service for the College 
Entrance Examination Board. * 

Rosert L. THORNDIKE, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. [Review of Forms VAC2, 
WAC2, ХАС, YAC, and ZAC.] 

This test, which is taken by a rather small 
fraction of CEEB examinees, is designed pri- 
marily as an aptitude measure for candidates 
for engineering and technical schools. As in all 
CEEB examinations, a new form is assembled 
for each testing based largely on the existing 
pool of items for which an accumulation of item 
analysis data is available. During the academic 
year 1950-51, the several editions of the test 
included two or three of the following six item 
types: (а) block counting, (b) surface develop- 
ment, (с) intersections (of a plane and a 3- 
dimensional form), (d) identical blocks, (е) 
relation of solid forms, and (f) number deco- 
rated cubes. All of the item types deal with static 
3-dimensional forms, and appear to require in 
varying proportions abilities to visualize, to 
manipulate mentally and translate in space, to 
reason deductively, and to be careful and syste- 
matic in analysis of a problem. 

Inspection of the test items indicates that they 
are generally well conceived and adequately 
drawn, though in isolated instances the visual 
cues seem rather meager. From the internal sta- 
tistics reported for the various forms of the test, 
one may conclude that the test (а) has an ap- 
propriate level and spread of difficulty for the 
group tested; (b) is primarily a power test, 
though somewhat speeded; and (c) has. igh 
reliability, represented by odd-even coefficier ts 
Over .90, , 

Reported validity data are limited to one set 
of correlations with freshman grades at Cooper 
Union School of Engineering for a group of 
87 students, The highest correlations were with 
grades in descriptive geometry (.40 and .49) and 
physics (.40 and .34). The correlation with 
TRAD grade-point average was .33, and 
le test appeared to make a small addition 
to the composite prediction resulting from 
other College Board tests and high school 
grades, 

I general this appears to be a workmanlike 
in a somewhat specialized aptitude area. 
on Re! further evidence would be desirable 

the range and level of the test’s validity as а 


HEELS of different phases of college achieve- 
nt, 


[809] 

XCollege Entrance Examination Board Test in 
Pre-Engineering Science Comprehension. Candi- 
dates for college entrance; 1949-51; available only in 
College Entrance Examination Board Admissions 
Testing Program (see 526); 60(70) minutes; pre- 
pared by the Staff of Educational Testing Service; 
program administered by Educational Testing Service 
for the College Entrance Examination Board. * 


[810] 
*Engineering and Physical Science Aptitude 
Test. Grade 12 and technical school entrants; 1943- 
51; 7 scores: mathematics, formulation, physical sci- 
ence comprehension, arithmetic reasoning, verbal со! 
prehension, mechanical comprehension, total; IBM; 
1 form, '43; revised manual ['51] ; $4.25 per 25; sepa- 
rate answer sheets must be used; $2.10 per 50 IBM an- 
swer sheets; 50€ per set of manual and machine scor- 
ing stencils; 50¢ per set of manual and hand scoring 
stencils; 60€ per specimen set; postpaid; 72(80-90) 
minutes; manual by Henry Borow; Bruce V. Moore, 
C. J. Lapp, and Charles Н. Griffin; Psychological Cor- 
poration. * 
REFERENCES 

H., AND Borow, Henry, "An En- 

gineering and Physical Science Aptitude Test." J Appl Psychol 


f the Stanford Scientific and the Engineering and Physical 
Science Aptitude Relentifie ава Psychol Meas 8:683-7 W 38, * 
(ВА atn i W. LesLie, Jm. Occupational Aptitude Pat- 
terns of Counseled Veterans. joctor’s thesis, New York Uni- 
versity (New York, N.Y.), 1949. Pp. viii, 385. 

. TREUMANN, MILDRED JENKINS, AND SULLIVAN, BEN A. 
“е of the Engineering and Physical Science Aptitude Test 
as a Predictor of Academic Achievement of Freshman En- 
gineering Students.” J Ed Res 43:129-33 О '49. * (PA 24: 
BENE W. Leste, Jm. Occupational Aptitude Pat- 
terns of Selected Groups of Counseled Veterans, American Psy- 
chological Association, Psychological Monographs: General and 
Applied, Vol. 65, No. 5, Whole No. 322, ashiny 
‘Association, Inc., 1951, Pp. v, 49. Paper, * (P. 26:2794) m 

6. Grecc, Сокс W. “An GELS M EE NR ШШ 

Validity of the Engineering an ysical Science Apti- 
o уан "у Ed Res 45:299-305 D '51, * (PA 26:6546) 
Joux W. FRENCH, Research Associate, Educa- 
tional Testing Service, Princeton, New Jersey. 

The materials for this test were compiled from 
earlier standardized tests with the purpose of 
measuring aptitude for training 1n work of an 
engineering and physical science nature. To 
point out that a test contains nothing new might 
be taken as an adverse comment. However, in 
the area of physical science and engineering pre- 
diction, we have gotten beyond the notion that 
the best test is the one containing the newest 
and most ingenious mind twisters. The method 
of test construction used here can be depended 
upon to result in a highly satisfactory test bat- 
tery, since selection of materials can be based 
on a history of good performance. À 

Selection of the subtests and some of the items 
was based on correlation with Form 1936B of 
the Cooperative Physics Test for College Stu- 
dents: Mechanics given at the end of a physics 
course. Although this is an excellent selection 


on, D.C.: the ` 
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technique, several different tests should have 
been used as criteria since the EPSAT is sup- 
posed to predict various aspects of physics and 
engineering as well as mechanics. For example, 
the use of such a criterion as end-of-course pro- 
ficiency in engineering drawing would probably 
have led to the inclusion of a test of spatial abil- 
ity, a well known predictor of success in graphic 
work. f 

Although the range of materials may have 
been limited in some degree by the narrow na- 
ture of the criterion used for their selection, the 
quality of the materials is very high, as are the 
tests from which they were selected. The mathe- 
matics part is varied and appropriately inde- 
pendent of specific knowledge; the formulation 
part is a highly ingenious measure of the ability 
to put verbal expression into quantitative form ; 
the part on physical science comprehension has 
a wide coverage and includes information that 
an alert high school student can pick up outside 
of specific courses, although a course in physics 
would certainly help; the arithmetic reasoning 
part is good but is probably too short to be ade- 
quately reliable; the verbal comprehension part 
covers information that, although technical, can 
be learned outside of specific courses; and the 
mechanical comprehension part reflects the high 
quality of Bennett's original instrument. 

The manual as revised in 1951 is considerably 
above most test manuals in both comprehension 
and understandability. There are included a fair 
discussion of purpose, excellent directions for 
administration, satisfactory directions for scor- 
ing, a good explanation of the construction of 
the battery, and detailed discussions of its va- 
lidity and reliability. The discussion of test va- 
lidity, although based on small numbers (53 to 
296), includes descriptions of five different 
studies. For one of these a very unusual table 
gives the validity of the battery for each of the 
eight semesters of work at an engineering col- 
lege. The discussion of reliability recognizes that 
reliability computed by the split half technique is 
spuriously high. The figure of .96 is given as the 
split half reliability for the battery. As a lower 
limit for the true reliability the figure .84 is 
given as the reliability of the battery estimated 
by correlating Parts I, IIT, and IV with Parts 
II, V, and VI. No mention is made of a Spear- 
man-Brown correction. A table of intercorrela- 
tions among the parts of the battery and one with 
other tests are included. Percentile norms are 
given for six groups of students, one group of 
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applicants for engineering training at an airplane 
plant, and two employee groups. The numbers 
in these groups range from 113 to 6,695. Very 
appropriately no national norming is attempted, 
The norms for the nine groups mentioned are 
all presented in one table and give a very clear 
picture of the meaning of the test scores, 

This test is a very good composite of earlier 
test materials designed to measure physics and 
engineering aptitude. With some limitations re- 
sulting from the use of a test of mechanics asa 
criterion for selecting subtests, the choice of 
materials is wise and the resulting test is of wide 
coverage and good validity. The nature of the 
test and the excellence of the manual will result 
in unusual ease of administration and clarity of 
interpretation. 


For reviews by Norman Fredericksen and 
Robert M. W. Travers, see 3:698. 


[811] 

*Graduate Record Examinations Advanced En- 
gineering Test. Senior year college through gradu- 
ate school and candidates for graduate school; 193%- 
51; available only in Graduate Record Examinations 
programs (see 527); 180(220) minutes; prepared by 
the Advanced Engineering Test Committee appointed 
by Educational Testing Service: Ernst Weber ( Chair- 
man), Ovid W. Eshbach, Frederic T. Mavis, William 
E. Reaser, and John Henry Rushton; Educational 
Testing Service. * 


[812] 
*Pre-Engineering Ability Test. Engineering non 
entrants; 1951-52; an abbreviated adaptation of. d 
Engineering Inventory ('43) ; IBM; Form ZPA e 
[revised] manual ('52); $10 per 25; separate m 
sheets must be used; 80¢ per 25 IBM answer $ z 
25¢ per stencil for scoring answer sheets; cash or 3 
postpaid; $1 рег specimen set, postpaid; 80(90) 
utes; Educational Testing Service. * 
REFERENCES f -— 
ME Vivon, EgxNETE, W. "Pre-Engincering Inventory, 
P» ravi, RW. rhe "Measurement and Guidance Proj 
ect in Engineering Education—A Report of Progre 
Plans.” J Eng Ed 36:676-84 Је '46. * (PA 21:297) 
3. Jex, FRANK Bird, The Predictive Value of 
Engineering Inventory, Master’s thesis, University 
(Salt Lake City, Utah), 1947. d Examin® 


, “Entr: 
4. Lone, Louis, AND Pru, Jauss D. Ed & Psychol Meos 
» J Eng 


Pre 
(и Utah 


cord 
. “The Projects of the Graduate Jg, 


ference on Testing Problems, New Yor 
1946, Herschel Т. Manuel, Chairman, Edited by К. d Guidance 
Sponsored by the Committee on Measurement ер Series 1 
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Reports of Committees and Conferences, E i 
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10. LORD, FREDERIC; Cowes, Јони T.; AND CYNAMON, 
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Jerome E. DorrELT, Assistant Director, Test 
Division, The Psychological Corporation, New 
York, New York. 

This test was made by selecting items from 

two tests of the Pre-Engineering Inventory. The 
purpose was to produce an instrument which 
would be cheaper, shorter and easier to adminis- 
ter than the Inventory and, at the same time, 
would yield a score approximating the validity 
of the Inventory composite score. This compos- 
ite score, which is based on three of the seven 
Inventory tests, had been found to be an effec- 
tive predictor of general engineering aptitude. 
The new test is composed of 85 items divided 
into two parts : Comprehension of Scientific Ma- 
terials and General Mathematical Ability. Each 
part is separately timed but only a single overall 
score is obtained. 
‚ The comprehension section requires the read- 
ing of scientific selections, tables and graphs and 
answering questions based on the readings. The 
point is made that no specific factual knowledge 
is needed to answer the questions because the 
passages supply all of the needed information. 
There are, however, several items which defi- 
nitely require knowledge beyond that imparted 
by the passages. There is also one item on the 
number of chemical elements which does not 
have a correct response, if we recognize recent 
developments in chemistry and physics. 

The mathematics part of the test consists of 
problems in arithmetic, algebra, and geometry. 
One group of four items is based on a very long 
Passage from analytic geometry. It might have 
жу more advisable to reduce the amount of 
Ж jx and to substitute additional items. Space 
is etr is provided in the booklet. This 
the 5 an sue to the student but it means that 
x A ets are not reusable. It would certainly 
RUE aper for the test purchaser if provision 

iade for separate scratch paper. 
bili € manual gives .9o as the estimated relia- 

Y Coefficient for a group entering the fresh- 

Шап class of ineeri idi 
Ба ап engineering college. The validity 
Ns Иза by a series of statistical steps 
QS or the two parent tests in the Pre- 
coeffci ving Inventory. The estimated validity 
tei x against a criterion of average first- 
noted а es is reported as .57. It should be 
» however, that the data used in the various 
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steps leading up to the validity coefficient are 
average values (medians) taken from a study 
of the Pre-Engineering Inventory with 14 school 
groups and the coefficient of .57 is a summarizing 
value. The test user will still want to study care- 
fully the validity of the instrument in more spe- 
cific situations. 

The norms consist of a table equating raw 
scores on the Pre-Engineering Ability Test with 
composite scores on the Pre-Engineering In- 
ventory, based ona study at one engineering col- 
lege. This table also includes three columns of 
data for the Inventory (not the ability test) 
which show percentile ranks for various com- 
posite scores. Unfortunately, there is a misprint 
in the number of cases for one of these columns : 
the N should be 2,043, not 7,043. Another table 
shows comparable scores on the Pre-Engineer- 
ing Ability Test and three other tests. However, 
the use of such a table is limited to comparisons 
of students’ scores on the four selected tests. 

In general, this test reflects a competent job 
of extracting the essence of two other tests. It has 
face validity and will probably be found as useful 
as the longer Inventory in the selection of en- 
gineering candidates. Additional norms tables 
and reliability and validity data for specific situ- 
ations would be very desirable.* 


Dewey B. Sturt, Dean, College of Liberal Arts, 
State University of Iowa, Iowa City, Towa. 
This test is an abbreviated form of the Pre- 
Engineering Inventory developed in 1943 by the 
Graduate Record Office of the Carnegie Founda- 
tion as a joint project of the American Society 
for Engineering Education, the Engineers' 
Council for Professional Development, and the 
Foundation. The Long Form of the original In- 
ventory required six hours for administration 
and the Short Form, four hours. The engineer- 
ing colleges felt that a shorter, less expensive test 
would serve their needs better ; hence work was 
started on the Pre-Engineering Ability Test 
which consists of only two parts and requires 
only 80 minutes of working time. д 
The two parts of ће test аге Comprehension 
of Scientific Materials and General Mathemati- 
cal Ability, these two subtests being the ones 
which contributed most to the validity of the 
composite score of the Pre-Engineering Inven- 


i writing of this review (a copy of which was sent 
o einen lichen), the publisher has issued a. slightly revised 
al and test which is not subject to some 0 
icisms noted in this review. Re-use of the test 
ible i А H diio: o ЕП 

i hical error in one of the norms es has 
amines, the type ne f the items has been slightly changed. 
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tory. The 9o best comprehension items and the 
80 best general mathematical ability items were 
first selected and one half of the selected test 
items for each part were chosen for the 80-min- 
ute Pre-Engineering Ability, Test. While only 
a single overall score is obtained, the two parts 
are timed separately, 45 minutes for Comprehen- 
sion of Scientific Materials and 35 for General 
Mathematical Ability. 

Since the Pre-Engineering Ability Test is in 
essence a revision of the Pre-Engineering Inven- 
tory it was possible to use the normative and 
validity data of the latter test (10) in estimating 
similar data for the new test. The composite 
scores of the two tests were first equated and 
norms were then provided for engineering fresh- 
men from public colleges, private colleges, and 
the two groups combined. The estimated validity 
of the test is .57 and the reliability .90. Data are 
also given showing the correlation of the test 
with the American Council on Education Psy- 
chological Examination (.77), the mathematics 
section of the College Entrance Examination 
Board Scholastic Aptitude Test (.71), and the 
College Entrance Examination Board Achieve- 
ment Test in Physics (.69). The test manual 
conforms to the standards now usually expected 
for tests of this type. The purposes of the test 
(selection, placement, guidance) are discussed, 
the construction of the test described, normative 
data provided, and information given on validity 
and reliability. The directions for administra- 
tion and scoring are clear. 

The Pre-Engineering Ability Test promises 
to be an effective instrument for the prediction 
of scholastic success in engineering. It has a good 
background and should appeal to engineering 
colleges as an easily administered test. The fact 
that it is published by an organization with a 
reputation for sound and efficient work will. en- 
hance its prestige. Whether it will be superior 
to subject matter tests in science and mathemat- 
ies or other aptitude tests in the field as a pre- 
dictor of success in engineering remains to be 
seen. One may rest assured, however, that the 
Educational Testing Service will make appro- 
priate studies as the test becomes widely used in 
the field. 


[813] 
Stanford Scientific Aptitude Test. Entrants to 
schools of engineering and science; 1929-30; 1 form, 
"30; manual ('30); examiner's direction booklet and 
scoring key (730); $4 per 25; 256 per specimen set; 
cash orders postpaid; nontimed (60-120) minutes; 
D. L. Zyve; Stanford University Press, * 
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]озЕРн E. Moore, Professor of Psychology and 
Head of the Department, Georgia Institute of 
Technology, Atlanta, Georgia. 

This test purports to measure a conglomerate 
of 11 mental and character traits considered to 
be components of scientific aptitude, These traits 
are as follows: experimental bent; clarity of 
definitions; suspended versus snap judgment; 
reasoning ; inconsistencies ; fallacies; induction, 
deduction, and generalization; caution and 
thoroughness; discrimination of values in se- 
lecting and arranging experimental data; ac- 
curacy of interpretation ; and accuracy of obser- 
vation. A separate exercise is devoted to each 
trait and each exercise is weighted, the assigned 
weights ranging from two to seven points. No 
time limit is set for taking the test but an overall 
working time of two hours is suggested. _ 

According to the author, scientific aptitude 
rather than specific ability is stressed because 
“ ‘capacities’ or ‘aptitude’ are controlled by laws 
inherent in the organism and are very loosely 
dependent upon the individual’s achievement 
whereas the abilities of an individual are chiefly 
determined by his experiences and achievement. 
For this reason the test is constructed so 
it is not dependent upon any information 
yond that usually acquired in the elementary 
school ; and, even so, whatever specific informa 
tion is needed is provided right in the test. ji 

The test itself is an excellently printed ie 
the material is well spaced and is very les! ў 
А combined manual and scoring key fin 
general instructions for giving and scoring : 
test. The scoring is needlessly cumbersome sn 
very time consuming. А separate informa 4 
booklet explains the rationale of the test f 
scribes the different exercises and RTT 
their significance, and summarizes the гёз jul 
conducted by the author 22 years ago t0 € үй 
lish the validity and reliability of the (P^ 
validity of .82 and a reliability of .93 97 

rted. PN 
P The criterion group on which the origina 


gu] 


data were obtained consisted of 50 research stu- 
dents at Stanford University, of whom 21 were 
from the chemistry department, 10 from the 
physics department, and 19 from the electrical 
engineering department. Correlations between 
the estimates of faculty members of the scien- 
tific aptitude of these students and scores ob- 
tained by the students on the test ranged from 
4 for the whole criterion group to .95 for the 
physics group. The reviewer ran a test of sig- 
nificance of difference between the mean scores 
for the different groups making up the criterion 
group, and found that none of the differences 
was statistically significant. 

The author also tested several other groups, 
including 246 unselected freshmen, 79 science 
freshmen, 14 non-science faculty, 21 science 
faculty, and 47 non-science seniors and gradu- 
ates of Stanford University. Only the scores of 
the latter two groups were substantially differ- 
ent, being, as one might expect, much lower than 
those of the other groups. In testing the differ- 
ence between the mean scores of the two fresh- 
man groups, the reviewer obtained a critical ra- 
tio of 2.29. Although the obtained difference fa- 
vored the science group, it was not statistically 
significant, 

The author rests his claim for the validity of 
the test mainly on the results of the research re- 
ported above. The reviewer considers the au- 
thor's data almost entirely inadequate. Face va- 
lidity is not enough ; facts are needed which show 
how well this instrument can predict or forecast 
performance in science subjects and scientific 
achievement. No data of this kind are presented. 

he reviewer suggests a follow-up study of the 
actual performance in science courses of a large 
number of students who take the test during 
their freshman year. 

. In summary : This test contains many unusual 
as of material. It is well printed and attrac- 
tedi y arranged. It is easy to administer but 
+ ous to score. It was developed over 22 years 
den and, unfortunately, ‘has had little research 
© on it since. The validity data are based on 
к Sta ese by student and faculty groups 
КО ү University and a comparison of the 
of ШЕ, tained by individual students in some 
entific е groups and faculty estimates of the sci- 
piss aptitude of individual students in the 
So While the data indicate that the test 
"iui y has high face validity, they do not an- 
ver the fundamental question of whether stu- 
ts who perform best on the Stanford Scien- 
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tific Aptitude Test also perform best in science 
courses. a 


Dewey B. Sturr, Dean, College of Liberal Arts, 
State University of Iowa, Iowa City, Iowa. 

This test is one of the earliest attempts to 
construct a predictive instrument for a subject 
matter professional field. In keeping with much 
of the early test construction practice the test 
is based largely upon hunches and guesses as 
to the type of exercise which will work in a 
given situation. The test attempts to measure a 
considerable variety of factors believed to be 
associated with success in the study and practice 
of science. In the words of the author, the test 
“Gs concerned with detecting a conglomerate of 
basic traits which enter into what may be called 
aptitude for science or engineering." 

The test consists of II exercises or parts as 
follows: Experimental Bent; Clarity of Defini- 
tion; Suspended versus Snap Judgment; Rea- 
soning; Inconsistencies ; Fallacies; Induction, 
Deduction and Generalization; Caution and 
Thoroughness ; Discrimination of Values in Se- 
lecting and Arranging Experimental Data; Ac- 
curacy of Interpretation; Accuracy of Observa- 
tion. Each of the exercises consists of relatively 
few items and is weighted in accordance with 
its capacity to discriminate between groups of 
science and non-science students. The weights 
vary from 2 to 7 and the maximum weighted 
scores from 10 to 28. The total maximum score 
in the test is 203. It requires two hours for ad- 
ministration. 

The validity data supplied by Zyve are limited. 
For a group of 50 science students Zyve (2) 
found a correlation of .50 between aptitude 
scores and scholarship and a correlation of .74 
between aptitude scores and ratings of these stu- 
dents by a group of judges, the judges being 
asked to rank the students on aptitude for sci- 
ence. For this same group the Thorndike In- 
telligence Examination correlated .27 with schol- 
arship. Correlations of .13 to .50 were found be- 
tween group intelligence test scores and scores 
on the test. Benton and Perry ( 4) found a cor- 
relation of .45 between the American Council on 
Education Psychological Examination and the 
aptitude test scores. These investigators found { 
correlations of .30 and .37 between aptitude 
scores and scholarship based upon four years of 
work. 

Cooprider 
dictive value o 


and Laslett (6) studied the pre- 
{ the Stanford test along with the 
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Engineering and Physical Science Aptitude 
Test, the American Council on Education Psy- 
chological Examination, and the Ohio State 
University Psychological Examination. Scores 
on the Stanford test were found to correlate no 
better with grades in engineering and science 
courses than the scholastic aptitude tests. The 
test also failed to differentiate between scientific 
and general scholastic ability. Long (5) reported 
significant correlations with scores in the appro- 
priate scales of Strong’s Vocational Interest 
Blank. 

The format of the test and the manual reflect 
the early vintage of the test. The directions are 
scanty, norms are limited to means for three 
different groups, reliability data are not dis- 
cussed except in passing, and the interpretation 
of the scores is left almost entirely to the user. 
Actually the manual gives only the general di- 
rections for giving the test and the scoring key. 
The test is described and factual data are pre- 
sented in the explanatory booklet which accom- 
panies the test. There is no separate answer 
sheet. 

This reviewer would agree with A. B. Craw- 
ford, writing in The 1940 Mental Measurements 
Yearbook, that the test as it stands cannot be 
recommended for use in the measurement of 
scientific aptitude. While the Stanford Uni- 
versity Press reports that “the demand for the 
test has remained steady for several years,” the 
small number of published articles would make 
it appear that the test is not used in research 
studies which are brought to the attention of 
professional counselors and test constructors, 
The test does have some interesting exercises 
deserving of further study and research. As a 
finished instrument for use in 1951, however, it 
falls below the standards now generally required 
for tests in the aptitude field. 


For a review by A. B. Crawford, see 40:1676. 
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*Iowa Legal Aptitude Test: 1946 Revision. Law 
school applicants; 1946-48; 8 scores: analogies, rea- 
soning, opposites, relevancy, mixed relations, memory, 

information, total; the test may be shortened to in- 
` clude only the subtests in reasoning, relevancy, and in- 
formation; 1 form, ’46; no norms for part scores; man- 
ual (’48) ; $13 per 100 sets of test and answer sheet ; 
separate answer sheets must be used; $3 per 100 an- 
swer sheets; postage extra; 40€ per specimen set, post- 
paid; 170(190) minutes or 90(110) minutes; Michael 
Adams, L. K. Funks, and D. B. Stuit; Bureau of Edu- 
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Tests?” J Legal Ed 4:69-85 au '51. * 
ALEXANDER G. WESMAN, Associate Director, 
Test Division, The Psychological Corporation, 
New York, New York. 

This test was constructed through the coopera- 
tion of a number of law schools and is intended 
for general use by law schools and college per- 
sonnel counselors. 

Part 1, Analogies, requires the examinee to 
select from a group of six words a pair of words 
which bear the same relationship to each other 
as do words in the stimulus pair. Part 2, Reason- 
ing, calls for a judgment of the truth or falsity 
of a conclusion drawn from postulated major 
and minor premises. Part 3, Opposites, requires 
the selection of an opposite with changes in the 
part of speech, for example, an adverb which 1$ 
most nearly opposite in meaning to a given ad- 
jective. In Part 4, Relevancy, a long summary 
of a legal controversy is presented, together with 
35 postulated propositions of law; the examinee 
is to indicate whether the proposition favors the 
plaintiff, or the defendant, or is irrelevant to the 
cases of both. Each item in Part 5, Mixed Ке 
lations, presents 6 words, 3 of which are to be 
identified as somehow related to one another. 
Part 6, Memory, contains 25 questions base 
on a 23-line legal case report which the examine? 
had studied for:8 minutes at the beginning 0 
the examination (a little more than 2 hours as 
lier). Part 7, Information, contains 80 di vi 
items concerned with more or less common fac 
of law. я tial 

The test will be acceptable to many potent’ 
users because of its “face validity.” The е 
ing, relevancy, memory, and information ie 
have content appealing to teachers and ome 
tioners in law; even the other three parts Ж 
share in this appeal, since they deal with e d- 
skill, a talent generally deemed desirable shen 
ding lawyers and jurists. No doubt the stu H 
will also be favorably affected by the reas 
ness of the content. One suspects that the 
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validity is based on a somewhat stereotyped no- 
tion of what the law profession is all about ; but 
if teachers and students share the stereotype, the 
sought effect is accomplished. 

There may be some question as to whether an 
aptitude test should sample knowledge in the 
feld. The reviewer believes that it may properly 
do so. It is reasonable to expect that those who 
have learned relevant information and developed 
relevant skills will learn more effectively in that 
same area than those who have not. Since the 
principal purpose of this test is to predict which 
students are most likely to succeed in law schools, 
the inclusion of information content is defensible 
in so far as it aids in that prediction. 

The manual contains generally good direc- 
tions for administering the test; unfortunately, 
that is its best feature. A single paragraph pro- 
vides the only information with respect to re- 
liability and validity of the test. The long form 
“has yielded coefficients of correlation between 
total test scores and measures of first-year law 
school achievement in four law schools of .44, 
49, .52, and .53, respectively. At the latter two 
schools, the total scores in Parts 2, 4, and 7 
were also correlated with first-year grades and 
correlation coefficients of .56 and .60 were 
obtained." Comparison of these coefficients 
with those obtained at the University of 
Iowa Law School using prelaw grade-point av- 
trages as predictors results in some disappoint- 
ment in the tests. The undergraduate record 
yielded validity coefficients ranging from .45, 
їп colleges other than Iowa, to .67 for Iowa un- 
dergraduates (4). It may be that the authors of 
a test can present evidence that the test can 

gment the predictive power of prelaw grades 
appreciably ; unfortunately, the manual does not 
offer such evidence. 

M reliability of the long form is cited as 
1ng 93; for the short form (Parts 2, 4 and 
| m as .80. No information is offered 
Nia lanual as to the extent to which speed is 
aei ria the test. This isan important omission 
"d е aforementioned reliability coefficients 
а И рае by the Kuder-Richardson 
test; the. if speed plays a significant role in the 
t б} уя Coefficients may be overestimates of 
other ha ly of the instrument. On the 
«ЫЙ » the combination of somewhat heter- 
шла content in the several parts would pre- 
reliabili et in underestimates of the real 
vided us · One wishes that the authors had pro- 
ormation with respect to the speeded- 


ness of the test parts and estimated reliability 
by means of split'half techniques if the test is 
unspeeded. Since only one form of the test is 
available, alternate form estimates of reliability 
could not be attempted ; a test-retest estimate 
would have been welcome.* 

Norms are presented for the long form in 
terms of percentile equivalents of total scores 
achieved by 1,280 students in the first year of 
law at 6 schools. Norms for the short form are 
based on only 986 students. Since the 1,280 stu- 
dents must obviously have taken the three parts 
which comprise the short form, it is unfortunate 
that all were not included. The authors' belief 
that the norms are “fairly typical" is based on 
the fact that the law schools included were "all 
fully accredited and at the same time none of 
them exercised particularly stringent entrance 
requirements" may be typical ; the reviewer does 
not know whether or not this is the case. In any 
event, law schools which do have stringent re- 
quirements will probably find these norms too 
low. 

It should be noted that the above criticism is 
addressed to failure of the manual to provide 
adequate information rather than to positive de- 
ficiencies in the test itself. Law schools which are 
willing to investigate the validity and reliability 
of the test in their own local situations, and to 
develop local norms—steps which would be rec- 
ommended even if the manual had provided bet- 
ter information—may find the Jowa Legal Apti- 
tude Test a useful adjunct to their admission pro- 
cedures. 


[815] 
XLaw School Admission Test. Law, school appli- 
test administered four times annually 
(November, February, April, and August) at centers 
established in each state by Educational Testing Serv- 


sions; prepared by the, 

Service under the direction of the Law School Admis- 

sion Test Policy Committee ; 
Service. * 
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ALEXANDER С. WrsMAN, Associate Director, 
Test Division, The Psychological Corporation, 
New York, New York. [Review of Form 
YLS2.] 

The Law School Admission Test is actually 
a program rather than a test and must be evalu- 
ated as such. The most recent form available for 
review, YLS2, is an outgrowth of earlier forms 
developed by the testing organization since 1947. 
At present, the test consists of two booklets 
which contain six tests. Book І consists of a 
Principles and Cases section (46 items, 45 min- 
utes) in which the applicant judges the rele- 
vance of stated principles to the described cases ; 
a Data Interpretation section (35 items, 60 min- 
utes) intended to measure mathematical reason- 
ing; and a Reading Comprehension section (46 
items, 60 minutes) containing eight passages 
with rather general content. Book 2 contains a 
section called Debates (60 items, 20 minutes) in 
which the applicant indicates whether a state- 
ment supports, refutes, or is irrelevant to the 
resolution it follows; a Best Arguments section 
(32 items, 45 minutes) which requires the evalu- 
ation of arguments which are offered in de- 
scribed disputes ; and a somewhat speeded para- 
graph reading section (30 items, 1o minutes) in 
which a word is to be located which spoils the 
meaning of each of the paragraphs, 

No direct estimate of the test-retest reliability 
of the current form is available. This fact is per- 
haps not especially surprising since the test is 
part of a continuous program in which new 
forms are to be prepared each year. Evidence 
presented for a previous form (WLS) indi- 
cates that it was very reliable. A coefficient of 
-91 was obtained for a group of 72 applicants 
who were retested at their own request; since 
they had been low scorers originally (thus being 
restricted in range) and claimed illness or other 
conditions predisposing to an underestimate of 
their real ability (which would also result in 
important score changes), the coefficient of .91 
may be accepted as probably an absolute mini- 
mum of the reliability of Form WLS. It is re- 
grettable that test-retest coefficients have not 
been presented for the present form since there 
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are some sharp differences in content between the 
previous and present forms ; in a program as ex. 
tensive as this, into which so much experimental 
and statistical work has gone, one would 

some opportunities to have arisen for obtaining 
such reliability estimates.t However, the nature 
of the tests and the data for previous forms 
would lead to confidence that the present total 
test scores are unquestionably highly reliable, 

Validity studies of the present form must nat- 
urally await the completion of a year’s academic 
work by the students. Extensive studies have 
been reported for form WLS; a summary re 
ported by Schrader and Olsen (4) presents va- 
lidity coefficients for 21 groups of students for 
the Law School Admission Test and validity 
data for prelaw grades, both alone and con- 
bined, for 14 of these groups. The 21 LSAT 
coefficients ranged from .18 to .65; the average 
r was .44. To the extent that LSAT scores were 
considered in the admission process, these co- 
efficients are underestimates. Validity coeffi- 
cients for the prelaw grades alone against aver- 
age first year law school grades ranged from 
—.04 to .50; LSAT score and prelaw grades 
combined yielded coefficients from .34 to 68. 
The poorest of these coefficients are discourag- 
ing (if one forgets the unreliability of the cri- 
terion grades and similar considerations) ; the 
best of them are heartening. It is a welcome sight 
to see so many studies reported, and perhaps the 
variation in success of prediction will indicate 
to the cooperating law schools how у d 
lidity is. Certainly the differences found "t 
school to school emphasize the inadequacy 5 
tests and programs which cite validity data! 
only one or two institutions. 

The content of the present form seems ^ 
been selected for face validity as well as pen 
cal validity. In keeping with the view set үн 
by the Policy Committee of the participa mE 
schools, no specific legal information has dit 
included, although an applicant may mu 
easier to read many of the items if he has 21804 
ously encountered such legal terms p "ilio, 
ranty,” “title certificate,” “statutory 86тай, je. 
or is familiar with stilted legal phraseology’. 
absence of questions on specific infor cate 
explained by Schrader and Olsen : “A sys ole d 
effort has been made to minimize is test 
specific subject-matter preparation in This 
and to focus attention on basic abilities. i 

1 The reviewer is informed that a retest study. Liner by the 


one described above for Form WLS will have 
time this volume is published. 


to have 


f 
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emphasis should be kept in mind in comparing 
the validity coefficients (cited above) with those 
summarized in reviews of other law school tests 
by Kandel, Adams and Stuit.” (4) The reviewer 
questions that this proposition can stand on its 
own feet. If it can be demonstrated that an in- 
formation test does not predict law school grades, 
or that it adds nothing to a prediction from pre- 
law grades, then the proposition is acceptable. 
If, on the other hand, an information test is a 
valuable member of a predicting team, it has a 
rightful place in a prediction program which 
cannot be gainsaid by such a phrase as “basic 
abilities.” The reviewer would prefer empirical 
demonstrations of ineffectiveness of good gen- 
eral legal information tests to having such tests 
summarily read out of court. 

The instructions sent to supervisors and the 
attractive pamphlet which each applicant re- 
ceives are well prepared and contain much useful 
information. Of particular merit are the pro- 
vision of a set of sample questions to prepare the 
student for the kind of examination he will face 
and a detailed statement informing him of such 
regulations as apply to the use of slide rules and 
other devices, the value of a watch for pacing 
himself, etc. Although the instructions to the 
student are generally clear and concise, one 
statement is simply silly. The student is informed 
that there will be no correction for guessing and 
advised to use shrewd guesses but to “omit ques- 
tions about which you have absolutely no knowl- 
edge, since you can use your time more profit- 
ably in other parts of the test.” It is obvious that 
а student who has necessarily already spent time 
reading a question can far more profitably im- 
mediately record a guess which might do him 
н good and does not penalize him than omit 
e pw and lose whatever possibilities reside 
Chance guesses. This admonition may well 
pair the applicant's confidence in other infor- 
mation given him. 

{ ну desirable aspect of the program is 
vis Ee of analyses of test results by un- 
in 5 aduate college. This kind of report provides 
RAT concerning the caliber of students 
Mu from individual prelaw schools which 
esi dm be gathered by most law schools ex- 
Чеш n а considerable period of time. Activi- 
Rivier as these are potent advantages of a 
chase oo over the simple outright pur- 
Pu pe The cost is, of course, much 
failure ^ ut proper cost accounting of student 
© should persuade administrators that hav- 
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ing 8 service program is less expensive than not 
having. one. Another excellent feature is the 
availability of two parallel forms of the test each 
year; this is valuable both for security and for 
retesting if the occasion demands it. 

Perhaps just because the organization which 
runs this program has so much experienced tal- 
ent available to it, the reviewer takes exception 
to certain flaws which do not, for the most part, 
affect the quality of the tests or program as such. 
For example, reliability coefficients based on 
odd-even and Kuder-Richardson techniques are 
reported for tests which are admittedly speeded. 
This is improper. As another example, а col- 
umn of validity coefficients is corrected for range 
on the assumption that each of the schools would 
have an applicant population as heterogeneous 
as the total population, and on the further as- 
sumption that all such applicants would have 
been admitted if the tests had not been used to 
select them. It is doubtful that these assumptions 
are warranted. 

The format of the test booklets is reasonably 
good and the type face (reduced typewriting) 
one of the clearer of its species. This last is for- 
tunate, for the size of the type is, in the reviewer's 
judgment, much too small when it is remem- 
bered that the applicant spends several hours 
reading it. The applicant may not mind very 
much when there is enough space between the 
lines, but the longer paragraphs are quite formid- 
able and might well contribute to the reading 
difficulty of the passages. 

In evaluating the program as a whole, cer- 
tain considerations arise. There seems to be lit- 
tle doubt that the program сап contribute to bet- 
ter selection of students in those law schools 
which do not have their own programs. It seems 
equally clear that the services performed by the 
program are real contributions to the cooperat- 
ing institutions, especially if those institutions 
do not have skilled measurement people on their 
staffs. The existence of a program, permitting 
joint effort and economies, is a good thing. How- 
ever, are these the best tests for the purpose— 
the prediction of law school success? We have 
mentioned above the lack of evidence with re- 
gard to the possibilities of an information test. 
Can validity equal to that now being attained be 
achieved by more efficient tests in less time, or 
is four hours really necessary? It would be in- 
teresting, for example, to see comparative pre- 
dictions from the same organization s Verbal 
Factor and Profile Mathematics sections of the 
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Graduate Record Examination, or some similar 
test of general academic aptitude. 

If the separate parts of the Law School Ad- 
mission Test were shown to be differentially 
predictive of specific law school courses, the 
length of the present examination could be justi- 
fied. If the law schools were interested in the 
broad cultural backgrounds of their applicants, 
a long multipart test which accomplished that 
purpose could be justified. That a single-score 
test with a single purpose needs to be four 
hours long seems less credible—however, both 
the law schools and their candidates may believe 
that a long examination is ipso facto a fairer one 
than a shorter test of equal reliability and va- 
lidity. Although a number of studies of various 
item types, some time-consuming and others less 
so, have been done, one wonders if shorter item 
types could not be found to compete with the 
present series. 
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*Purdue Test for Machinists and Machine Oper- 
ators: Purdue Vocational Test, No. 2. Grades 9- 
12 and adults; 1942-49; 6 scores: lathe, planer and 
shaper, grinder, milling machine, general bench opera- 
tions, total; Form BH ('49—same as test copyrighted 
in 1942) ; preliminary manual ['49]; 406 per test and 
answer pad; separate answer pads must used; $1.90 
per 25 answer pads; 75€ per specimen set; cash orders 
postpaid; 50(55) minutes; manual by Joseph Tiffin; 
H. F. Owen, C. C. Stevason, H. G. McComb, and C. D. 
Hume; Science Research Associates, Inc. * 


For a review by William W. Waite, see 3:702. 
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*Medical College Admission Test. Candidates for 
medical school; 1946-51; formerly called Professional 
Aptitude Test; 4 scores: verbal, quantitative, under- 
standing of modern society, science; test administered 
twice annually (May and Noyember) at centers estab- 
lished by Educational Testing Service; application 
form and bulletin of information may be obtained from 
Educational Testing Service; examination fee, $10; 
fee includes scoring service and reporting of scores to 
any 3 recognized medical schools designated at time of 
application; $1 per each additional report requested at 
that time or any time thereafter; 360(390) minutes in 
2 sessions; prepared by the Staff of Educational Test- 
ing Service for the Association of American Medical 
Colleges; distributed by Educational Testing Serv- 
ice. 
REFERENCES 
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Morey J. Wantman, Associate Professor of 
Education, and Director, Bureau of Educational 
Statistics, The University of Rochester, Roches- 
ter, New York. 
This test is intended as an aid to admissions 
officers of members of the Association of Ameri- 
can Medical Colleges and of “certain other pro- 
fessional schools.” Scores of all applicants are 
reported to all members of the Association of 
American Medical Colleges. The scores are re 
ported in intervals of 10 on a standard. scale 
with a mean of 500 and a standard deviation of 
100; the range is 200 to 800. The group on 
which the 1950 scale is based consisted of ap- 
proximately 5,000 candidates who took the ч 
amination in May. Percentile ranks терн 
for specific standard scores imply that ће distr! 
bution of each test is normal. The actual distri 
butions deviate from the normal slightly. à 
This test comes up to the usual high iar : 
of test construction characteristic of the s 3 
cational Testing Service. The format of the T 
booklets is excellent, the bulletin of ШЫ 
to applicants is clearly written, and the we d 
ganized instructions to examiners are as © 
lete as possible. 
А А тод examination of the 1951 Bullet m 
Information suggests, however, that it can ^ 
some revision. On page 18, the section on ite 
vice to Candidates," the bulletin ЕЧ =. 4 
candidates as follows: “Do not cram. A m" 
review of science may be of value, but inte The 
last-minute cramming will not be helpfu 
test is to be taken in stride." This ne 
been asked many times by applicants E 10 
pret these instructions, and has been het #4 
advise them to spend as much time as t coller 
afford in reviewing their elementary 


н 
Vir. 
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course work. Now that this reviewer has had 
an opportunity to examine the contents of the 
test of Understanding Modern Society, he will 
probably find himself advising applicants to re- 
read the Constitution of the United States, cop- 
ies of Time and N ewsweek for the last two years, 
and so on. 

The bulletin goes on to mislead the applicant 
by stating that “there is no reason to become 
worried or disturbed if you find that you are 
unable to answer a number of questions in the 
. «test or if you are unable to finish. The test is so 
designed that the average person taking it will 
answer correctly only about fifty per cent of the 
questions.” This same type of misleading state- 
ment appears on the directions for each test book- 
let, These statements give some of the applicants 
a mistaken confidence in a cautious, methodical 
pace, Actually, at least 79 per cent of a random 
sample of 500 applicants finished 75 per cent 
of each of the sections of the 1950 test, and as 
great a percentage as 89 of this sample finished 
at least one section of the test. 

Again, the directions on page 19 of the 1951 
Bulletin of Information, are somewhat ambigu- 
ous. The bulletin’s statement with respect to 
guessing is as follows: 


Many candidates wonder whether or not to guess the 
answers to questions about which they are not certain. 
In general, it is best to answer all questions about 
which you have any knowledge at all, because a shrewd 
guess is more often right than wrong. On the other 
hand, it is best to skip questions about which you have 
absolutely no knowledge, since you can use your time 
more profitably in other parts of the test. If you com- 
plete a section of the test before time is called, it is wise 
to go back and reconsider any questions about which 
you were not certain at first. 


The directions on the test booklets are по 
clearer in view of the fact that there is “no cor- 
rection for wrong answers.” The applicant who 
is testwise, either through experience or coach- 
Ing, may proceed to answer at random all re- 
maining questions in a section as he observes the 
time limit for the section approaching. 

Another question may be raised regarding the 
Paley with respect to the reporting of scorés. 
"n is not clear why the applicant who submits 
imself to almost six hours of examination time 
and pays a $10 fee is not entitled to a report of 
n Scores, The Graduate Record Examination 
a d Scores to applicants who are applying for 
ERAS to graduate schools of arts and sci- 
of S The reasons for failing to do so in the case 

applicants to medical schools is not clear. 

The Bulletin of Information reflects fairly 
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accurately the content of the various sections of 
the test. However, two points might well be 
made in connection with the test content: (а) 
A review of algebra might aid an applicant on 
the Quantitative section; and (b) Some of the 
questions on Understanding Modern Society 
call for answers which could be judged to be 
matters of opinion. This situation has led some 
applicants to report to this reviewer their mis- 
taken judgment that this particular section is a 
test of “Conservatism—Radicalism.” 

The statistical data supplied by the Educa- 
tional Testing Service for the 1950 test give 
further information about the contents of the 
test. The correlation between the Verbal score 
and the score on Understanding Modern So- 
ciety is reported as .77. In general, the inter- 
correlations between the reported scores are 
high. They range from a high of .77 to a low of 
58. The latter correlation is between the Quan- 
titative and Understanding Modern Society 
scores. The intercorrelations involving the 
Verbal, Science, and Understanding Modern 
Society scores are all higher than .67, suggest- 
ing that these three sections are heavily loaded 
with a “verbal factor.” 

The Science section is claimed to sample only 

“concepts and problems taken from the basic 
college courses in biology, chemistry, and phys- 
ics.” Schultz’s study (то) of the Science section 
indicates that the claim is justified. His evidence 
led him to the conclusion that : 
The results of this study would appear to offer no sup- 
port for the hypothesis that taking additional courses 
in biology, chemistry, and physics beyond a_certain 
minimal number leads to better scores on the Medical 
College Admission Test science section. This is a gen- 
eral conclusion, of course, and may very well not apply 
to any specific individual or in any given situation. 

The reliability data reported for the 1950 tests 
cannot be judged to be more than “satisfactory.” 
The reliability coefficients, computed by means 
of the Kuder-Richardson formula 20 are .93, -89, 
.gt, and .94 for the Verbal, Quantitative, Sci- 
ence, and Understanding Modern Society scores 
respectively. The corresponding standard errors 
of measurement are 4.1, 2.8, 4.9, and 4.7 for raw 
scores. For standard scores these are 26.7, 33:2, 
30.7, and 25.4- Thus the 95 per cent confidence 
interval for a Quantitative score extends over 
approximately 130 points. Such a large zone of 
approximation for a “true score, coupled with 
the fact that the Kuder-Richardson technique 
for estimating reliability may be overestimating 
the reliability since speed is a factor in the test, 
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prevents one from concluding that the reliability 
of the test scores is more than “satisfactory.” 

In the last analysis, any test must stand or 
fall on its demonstrated validity for a specific 
purpose. The validity of the Medical College 
Admission Test for aiding in the selection prob- 
lem for medical schools has not yet been demon- 
strated. The justification of its use together with 
the other evidence in the selective procedures 
is based on faith that it has “intrinsic validity.” 
This view has been expressed by Stalnaker (rr), 
In this same reference, Stalnaker examines the 
criterion data which might be used to evaluate 
the test and finds the limitations of these data so 
great that he does not see any reason for waiting 
for the completion of validity studies before tak- 
ing the test out of the experimental category. 

This attitude on the part of the director of the 
project is somewhat disturbing in view of the 
fact that the published evidence up to 1951, ad- 
mittedly meager, is on the negative side. Two 
studies (3, 5) carried out at the University of 
Utah Medical School used first quarter medi- 
cal school grades as the criterion. Stuit and 
Schlicher + and Young and Pierson (3) have re- 
ported that first year medical school grades are 
indicative of later success in medical school. In 
neither of the studies could the evidence be used 
to defend the use of the test scores to predict the 
criterion. In fact, in the Young and Pierson 
study, where the now discarded Moss 5, cholastic 
Aptitude Test for Medical Students (testing for 
Science achievement only) was also adminis- 
tered, the Moss test was the best predictor of 
the criterion; the correlation reported was .44. 
This finding raises the question as to why the 
Moss test was dropped as a selective device be- 
fore the Medical College Admission Test had 
been demonstrated to be superior to it. 

In a study reported by Morris (9), premedi- 
cal science college grades correlated . 55 with first 
year medical grades at the University of Iowa 
Medical School. The best predictor among the 
scores of the Medical College Admission Test 
was the science achievement test Score; the cor- 
relation was .48. 

The limitations of criterion data in the form 
of grades must be admitted. Nevertheless, from 
the practical point of view, the test constructors 
must face the fact that at the present time these 
data constitute the criterion they are trying to 
predict. Until the criterion has been revised, the 
test scores must be required to correlate as highly 
as possible with it. If this requirement is not in- 
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sisted upon, then the M edical C. ollege Admission 
Test must be considered an additional criterion 
of success in medical schools. If the American 
Association of Medical Colleges maintains the 
latter, then one cannot question the intrinsic va- 
lidity of the test unless he wishes to question the 
judgment of the Association regarding the crj- 
teria by which they characterize medical School 
Success. 1 

Even though Stalnaker expresses the view 
cited above, he reports (тт) that the study of 
the present criterion and its prediction is being 
pursued. The staff of the Medical College Ai- 
mission Test and the American Association of 
Medical Colleges are working in the following 
areas: medical school drop-outs, tests of inter- 
ests, devices to detect serious emotional insta- 
bility, the interview as a predictor, study along 
psychiatric lines of the nature of students enter- 
ing medical school, and routine correlational 
studies. 

It would be interesting to know the value of a 
multiple correlation which could be obtained 
from some of the variables in the above list, 
when applied in combination with the currently 
used predictors. As one aspect of this, a multiple 
correlation between the separate scores of the 
Medical College Admission Test and medical 
school success should be investigated. One would 
not expect this correlation to be much higher 
than the highest zero-order r’s in view of the 
high intercorrelations among these scores. On 
the other hand, a multiple correlation based : 
the previous college record and the M hel 
lege Admission Test may be appreciably highe 
than the zero-order r’s. Stuit and Schlicher's 
summary of correlations between the aun d 
premedical college work and medical sc i 
grades indicates that there is a positive pps 
relationship between these two variables. t 
cobsen similarly concludes : “1 believe that eu 
was uniform agreement that the academic a 
ord, for all of its weaknesses, was still the 

Ы КХАН анн ide that 
single criterion and the best single gu tst 
we might have in the choice of a competen 
dent." ? 

Morris’ (9) study confirms Jacobse a 
clusions. Furthermore, when Morris use 5 
medical science test scores in combination ^. 
the premedical science college grades to P 

an 
of Anti nbus ede den 


П. Association of American Medical Colleges, [1950]: PP i 
34. * » J As 


2 Jacobsen, Carlyle, “Student Selection Problems. 
Med Col 25:7-11 Ja ?so. 
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first year medical school grades, the zero-order 7 
was increased only to a multiple correlation of 
60. Thus, the relationship between the Medical 
College Admission Test science score and previ- 
ous scholastic science record is high enough so 
that the Medical College Admission Test does 
not improve the prediction to any great extent. 

In summary, this reviewer would rate the 
Medical College Admission Test as technically 
among the best of the available standardized 
tests, Its reliability is satisfactory but its validity 
for aiding in the selection of medical students 
remains to be demonstrated. 
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*George Washington University Series of Nurs- 
ing Tests. Prospective nurses; 1031—50; 4 parts; [re- 
vised] mimeographed manual ('50); 2% per profile 
chart ['44]; боё per specimen set; postpaid; (180) 
minutes; Thelma Hunt; Center for Psychological 
Service, George Washington University. * 
а) APTITUDE TEST FOR NURSING. 1931-40; Forms І 
(31), 2 (40) ; $2.75 per 25; Form 1: 45(55) minutes; 
Form 2: 70(80) minutes; F. A. Moss is co-author of 
Form І. 
b) ARITHMETIC TEST FOR PROSPECTIVE NURSES. 1940— 
49; Form 2 ('49—a revision of Form 1); $2 per 25; 
20(25) minutes. 
C) READING COMPREHENSION TEST FOR PROSPECTIVE 
NURSES. 1940; Form 1; $2.50 per 25; 30(35) minutes. 
d) GENERAL SCIENCE TEST FOR PROSPECTIVE NURSES. 
1944; Form 1; $2.50 per 25; 30(35) minutes. 
с) INTEREST-PREFERENCE TEST FOR PROSPECTIVE NURSES. 
1944; Form 1; $2.50 per 25; nontimed (30) minutes. 


REFERENCES 
1-6. See 11599. íi 
Ad: Bers, lnwix Avcusr. “A Study of Success and Failure 
Drac Student Nurses." Abstract. Am Psychol 1:249-50 л 
46. * (РА 20:3872, title only) n 
Three ео, SATEEN . “Prediction of Деде SES in 
$ 0 eu Б e '50. 

(РА 25:4038) ursing." J Appl Psychol 34:1 
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КИНЕ Achievement Tests for Basic Profes- 
к prial Nursing Program. Students in state approved 
i 0015 of professional nursing ; 1943-51; to be admin- 
P Б at or near completion of instructional period in 
ul ШЕ piner area; IBM; 14 tests; examination fee, 
pa Der test per student tested; fee includes all test 
Eu ias, Scoring test papers, and reporting results ; 
р imed (90) minutes per test; prepared by the. De- 
partment of Measurement and Guidance of the National 
pied of Nursing Education, Inc. in cooperation with 
tion, Ыга; National League of Nursing Educa- 

, " 


( ANATOMY AND PHYSIOLOGY. 1943-49; Form 149 
b) 


2) CHEMISTRY, 1943-49; Form 149 (749). 


MICROBIOLOGY. 1943-49; Form 149 (40). 
(gp). DAMENTALS OF NURSING. 1945-50; Form 750 


(оттон AND DIET THERAPY. 1946-49; Form 149 
ip (pem AND THERAPEUTICS. 1944-49; Form 
ў à 


(49) 
MEDICAL NURSING. 1944-49; Form 149 (49). 
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h) SURGICAL NURSING. 1944-49; Form 149 (^49). 

1) OBSTETRIC NURSING. 1945-49; Form 149 (лө): 

j) NURSING OF CHILDREN. 1945-49; Form 149 (^49). 

0 ттун DISEASE NURSING, 1946-49; dona 
9 (49). 

І) PSYCHIATRIC NURSING. 1945-49; Form 149 ('49). 

m) (ism FOUNDATIONS OF NURSING. iue M Boni 

750 (^50). 

$: TUBERCULOSIS NURSING. 1946-50; Form 750 ('50). 
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*NLNE Graduate Nurse Qualifying Examina- 
tion. Registered professional nurses who are candi- 
dates for advanced college training or are entering 
merit systems; 1945-51; ІВМ; test dates determined 
by test user; application form must be obtained from 
the school or merit system to which the candidate has 
applied ; 8 tests; Plan A: A.C.E. Psychological Exam- 
ination, Reading Comprehension, and NLNE Gradu- 
ate Nurse Examinations, $6 per examinee; Plan B: 
NLNE Graduate Nurse Examinations and a choice of 
either A.C.E. Psychological Examination or Reading 
Comprehension, $5 per examinee; Plan C: NLNE 
Graduate Nurse Examinations only, $4 per examinee ; 
fee includes all test materials, scoring test papers, re- 
porting results to the school to which examinee has 
applied, and an examiner to supervise testing if Plan A 
or B is used for 15 examinees or more; if test taken 
individually or with smaller group, the test user or 
candidate must make arrangements with individual ex- 
aminer; $1 per additional report. Plan C prepared by 
the Department of Measurement and Guidance of the 
National League of Nursing Education, Inc. in coop- 
eration with nurse educators; National League of 
Nursing Education, Inc. 
а) AMERICAN COUNCIL ON EDUCATION PSYCHOLOGICAL 
EXAMINATION FOR COLLEGE FRESHMEN. See 277; 3 
scores: quantitative, linguistic, total; 1948 Edition; 38 
(65) minutes. À 
b) READING COMPREHENSION: COOPERATIVE ENGLISH 
TEST: HIGHER LEVEL, TEST C2. See 547; 3 scores: vo- 
cabulary, speed, level; Form T (43); 40(45) minutes. 
C) NLNE GRADUATE NURSE EXAMINATIONS. б tests; 
Form 149A ('49) ; nontimed (60) minutes per test. 

1) Medical Nursing. 

2) Surgical Nursing. 

3) Obstetric vites id 

4) Nursing of Children. б 

Communicable Disease Nursing. 
à Psychiatric Nursing. 


[ 821] 

*NLNE Practical Nurse Achievement Examina- 
tions, Students completing program in approved 
schools of practical nursing 1950-51; ап end-of-pro- 
gram examination; IBM; Form 150 (’50) ; examina- 
tion fee, 50¢ per Student tested; fee includes all test 
materials, scoring test papers and reporting results; 
nontimed (120) minutes; prepared by the Department 
of Measurement and Guidance of the National League 
of Nursing Education, Inc. with the assistance of rep- 
resentatives from the field of practical nurse education ; 
National League of Nursing Education, Inc. 


[822] 

E [Practical Nurse] Pre-Admission and 
E n Examination. Candidates for admis- 
sion to approved schools of practical nursing; 1950- 
5r; an examination program administered by i divid- 
ual schools for use in the selection and guidance 0 
applicants and students; IBM; 3 tests; examination 
fee, $4 рег candidate; fee includes all test materials, 
scoring test papers, and reporting results to any one 
approved schoo of practical nursing; nontimed (210) 
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minutes; tests b and c prepared by the Department of 
Measurement and Guidance of the National League of 
Nursing Education, Inc.; National League of Nursing 
Education, Inc. * 3 

а) GENERAL ABILITY EXAMINATION. Ап adaptation of 
Army General Classification Test (see 280); Form 
650 ('50). 

b) GENERAL INFORMATION AND JUDGMENT EXAMINA- 
TION. 4 scores: information, judgment, arithmetic, 
total; Form 650 (’50). 

с) VOCABULARY EXAMINATION. Form 650 ('50). 


[823] 

X*NLNE Pre-Nursing and Guidance Test Bat- 
tery. Candidates for admission to state-approved 
schools of nursing; 1941-51; examination program for 
use in the selection and guidance of student nurses ; 
IBM; tests administered on specified dates throughout 
the year at centers established by the National League 
of Nursing Education, Inc.; application form and 
schedule of examinations may be obtained from the 
League or the director of any state-approved school of 
nursing; application form must be signed by the au- 
thorized head of the school to which the candidate has 
applied; 5 tests; examination fee, $5 per candidate 
tested if taken with group of 15 or more; less than 15, 
candidate or school must also assume administration 
costs; fee includes all test materials, test administra- 
tion (for groups of 15 or more), scoring test papers, 
and reporting results to any 1 state-approved school 
of nursing; $1 per additional report; 190(220) min- 
utes; National League of Nursing Education, Inc. 
4) AMERICAN COUNCIL ON EDUCATION PSYCHOLOGICAL 
EXAMINATION FOR COLLEGE FRESHMEN. See 277; 3 
scores: quantitative, linguistic, total ; 1948 Edition. 
b) READING COMPREHENSION: COOPERATIVE ENGLISH 
TEST: HIGHER LEVEL, TEST C2. See 547; 2 scores: speed, 
level; Form T (43). 
C) MATHEMATICS. ` Ап adaptation of Cooperative 
Mathematics Test for Grades 7, 8, and 9 (see 370) ; 
Form Q (ло). 

А TEST OF GENERAL PROFICIENCY IN THE FIELD OF 
NATURAL SCIENCES: COOPERATIVE GENERAL ACHIEVE- 
CAT TEST, REVISED SERIES, TEST II. See 595; Form T 


€) COOPERATIVE GENERAL CULTURE TEST, PART II: HIS- 
TORY AND SOCIAL STUDIES. See 6; Form T (43). 
REFERENCES 


т. SHAYcoFT, Marton Е. “A Validation Study of the Pre- 
Nursing and Guidance Test Battery.” Am J Nursing 51:201-5 
riers 
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[ 824] 
* Aptitude Associates Test of Sales Aptitude: a 
Test for Measuring Knowledge of Basic Princi- 
ples of Selling. Sales applicants; 1947-50; Form A 
(°47); manual ('50); тоё per test; 20¢ per scoring 
key; 75¢ per manual; 75¢ per specimen set; cash or- 
ders postpaid; nontimed (20-30) minutes; Martin M. 
Bruce; the Author, Apt. 6A, 624 East 20th St, New 
York 9, N.Y. * 
REFERENCES 
1. МсСовмаск, R. L. An Investigation of “A Test of Sales 


Aptitude.” Master’s project, University of Minnesota (Minne- 
apolis, Minn.), 1951. Pp. 54. 


MirroN E. Haun, Professor of Psychology, and 
Dean of Students, University of California, Los 
Angeles, California. 

The manual states : “This test can be of value 
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as an aid in appraising sales aptitude for select. 
ing sales personnel and for vocational guidance 
purposes. However, it should be emphasized 
that it is an aid only. Many other factors are 
important. Interest, intelligence, motivation, ap 
pearance, health and the like must be taken into 
account.” 

The preliminary work done in developing the 
test is better than that of many tools offered for 
sale prematurely. The norms appear to be ac- 
cumulating satisfactorily, since the test has a 
copyright of 1947. If attention is paid to further 
accumulation of norms for specific sales groups 
as well as more comprehensive data for men and 
women in general, the author is meeting his eth- 
ical obligations in this regard. As the norms now 
stand, they are inadequate for maximal useful- 
ness. 

The manual presents as evidence of validity, 
in distinguishing salesmen from nonsalesmen, 
chi square and critical ratio values which are 
significant at or beyond the 1 per cent level of 
significance when the two distributions are com- 
pared. Representativeness of the original and 
subsequent samples was also investigated, ac- 
cording to the manual. З 

"The test has an estimated test-retest relia- 
bility of .74.” The reviewer questions whether 
the instrument should be used for predictions 
with individual cases, because sound forecasts 
cannot be made when the coefficient of reliabil- 
ity is this low. The method of determining the 
reliability coefficient is not given. › 

The reviewer raises a question regarding tse 
of the word "aptitude" in naming this test. The 
low r of .02 with an intelligence test plus к) 
learned and specific nature of some items, ae 
as 7, 18, 22, 28, 36, 37, 43, and 49, indicate tha 
this instrument has a loading of learned specifics 
which do not appear related to insights йү, 
from usual experiences. May it be that the tes 
is as closely related to trade as to aptitude ete 
Paterson, Strong, Kuder, Bills, and others we 
contributed research which permits the hypot 4 
esis that an appropriate measured interest a 
tern is the best single indication of sales арі A 
The low correlation of .08 cited in the pee 
between this instrument and the persuasive 
terest scores on the Kuder Preference gar 
runs counter to this hypothesis unless dun 
taps a new and unique factor in general s€ this 
A more tenable assumption may be that 
test is a form of trade test for general sales. 


à is in- 
The reviewer questions the value of this 
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strument for vocational guidance in educational 
institutions with clients who have had no sales 
experience or training. The first objection is an 
ethical one—the correct answers to some items 
suchas 36 and 37 are lies to outwit lower echelon 
employees (receptionists). The second objection 
is that the specific learning involved for answer- 
ing some items is not part of the students’ experi- 
ence, 

In summary, this is an instrument which will 
be useful in selecting general sales personnel 
from applicants with sales experience and train- 
ing. The author properly cautions the user 
against its use as a single criterion of selection. 
The "estimated" reliability coefficient of .74 is 
lower than the acceptable minimum for use with 
individuals. Additions to norms are needed, par- 
ticularly some evidence of discrimination be- 
tween successful and unsuccessful sales persons 
in a variety of sales areas. If the test is revised, 
the reviewer would recommend that the trade 
test aspects be emphasized and that the title be 
changed to conform better with the usual con- 
cepts of aptitude. 


DoxArD G. Paterson, Professor of Psychology, 
and Member of Staff of Industrial Relations 
Center, University of Minnesota, Minneapolis, 
Minnesota. 

This 4-page test contains 50 items (most of 
them describing hypothetical sales situations) 
each followed by four possible answers, one of 
which is to be checked as being the best of the 
four. A 9-раре mimeographed manual and a 
Single page scoring key are provided. 

Test-retest reliability is reported to be .74 for 
à heterogeneous group of 658 “men, women, and 
salesmen.” In view of the reported sex differ- 
епсе and the sales-nonsales differences, this is 
decidedly suspect. Correlation between test 
Score and total score on the Otis Self-Adminis- 
lering Test of Mental Ability is .02 for 145 col- 
lege students and .08 for a heterogeneous group 
of 178 “men, women, and salesmen.” Correla- 
tion between test score and persuasive interest 
Score on Kuder Preference Record, Form BB, 
1 .08 for 88 “individuals.” 

Validity is based on the extent to which test 
Scores differentiate between salesmen and indi- 
Viduals in other occupations. Weights (from 
+5 to —5) for scoring each of the possible re- 
Sponses to each item were derived on the basis 
of obtained differences between sales and non- 
sales groups, Naturally, distributions of total 


test scores for sales and nonsales groups given 
in the norms show large differences. It is claimed 
that these differences hold up in cross-validation 
and distributions of scores of additional special 
groups of salesmen bear out this claim. 

Should independent validations confirm the 
author’s claims, this test would represent an im- 
portant contribution to the problem of sales se- 
lection. One such independent validation by Mc- 
Cornack (т), however, casts serious doubt 
upon the universal validity of this test. He found 
that design engineers and accountants were not 
sharply differentiated from “successful” life in- 
surance salesmen. He also found the split half 
reliability to be in the neighborhood of only .29. 
It is probable that the complicated weighted 
scoring system developed for this test maximizes 
chance differences. For this and other reasons 
personnel psychologists would be loathe to rec- 
ommend this test for practical sales selection 
work at the present time. 


[ 825] 

*Aptitude Index. Prospective life insurance agents ; 
1938-50; I score, combining an evaluation of personal 
background, interests, and attitudes ; [revised] Form 
('50) ; manual ('48) ; separate scoring keys for Unite 
States and Canada; distribution is restricted to life in- 
surance companies which are members of Life Insur- 
ance Agency Management Association ; (30) minutes ; 
Life Insurance Agency Management Association, * 


REFERENCES 


-5. 11646. 

9 See FD K. “Selecting Salesmen by Personal His- 
tory Ln ДЕГИЗ aad STU] Abstract. Psychol B 36:528 
Ji^ $07 13:6431, title only. : , т 

ИН. р, F. H. “Progress in Selection of Agents. 
[Proc AI МО. з Life Agency Officers & Life Ins Sales 

p 0. i 

е Бо ALEERT K. Ф®кесепї Research in the Selection of 
Life Insurance Salesmen.” J Appl Psychol 25:11-7 Е '41. 
GA TERE K. “Scoring Rating Scales After the Re- 
sponses Are Punched on IBM Cards,” pp. 28-34. In Proceed- 
ings of the Research Forum, Endicott, New York, August 26-30, 
1946. New_York: Jntermatonn. Business Machines Corpora- 
tion, 1947. Рр. 9& Rains, Jn. “The Basis of Decision.” [Proc] 
Annual Mtg Life Ins Agency Mgmt Assn, 1947. 19471141764 


"48. * У i ru 
. F. An Analysis of Factors Related to Life In 
BUS e Boos thesis, Purdue University (Lafayette, 


Ini. 1945. Аівевт К. “Тһе Weighted Application Blank.” 


12. Kurtz, 
Mgmt Rec 10:274 da "A p. F. “A Comment on Wal; 


й Kann, 1 
laces US i TM Байма "to Life Insurance Selling. 


Taped, 5390 ене. 5 Ge! Misure qual t 


AND HADLEY, J. 
i e БЕШ J Appl Psychol 33:132-40 Ap '49. . 
(РА 24357) “How Association Research Can 


n. у 

Hap Арест Lo) Proc} Annual Mtg Life Ins Agency 

Ж E oa НГА a Note on Kahn and Hodlers 
*Factors Related to DE bo Selling.’ " J Appl Psyci 

‘Ag '49. * 24:2884) E » Proc] 

А s, Jr. “The Revised Index. тос 

Annual Mig Life Ls Mgmt Assn, 1948. 1948:97-102, 

discussion tos d TwrcneLt, Constance М. 

16. WADAG 5, ае тг Validities." Personnel Psy- 


А 24:2131) 

р W.: ASSISTED BY JOHN J. HorkINS; 
aD алоо ану and Its Effect оп Selecti ion Test Morro 
Personnel Psychol 4:141-50 su '51. (PA 26:1734. 
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Donatp G. Paterson, Professor of Psychology, 
and Member of Staff of Industrial Relations 
Center, University of Minnesota, Minneapolis, 
Minnesota. 

Part I is a personal history blank covering the 
following ten items: age, prior experience in 
life insurance selling, number of dependents, 
recent civilian occupation, education, employ- 
ment status, organizations belonged to and of- 
fices now held, net worth, minimum current liv- 
ing expenses, and amount of insurance carried. 
Part II contains three subtests: (а) 21 Yes- 
No-? questions pertaining to interests and atti- 
tudes; (b) self-ratings of 18 personal character- 
istics on a I to 6 point scale; and (c) forced 
choice tetrads regarding attitudes toward 12 sit- 
uations and activities. Both parts are scored by 
hand. Item weights (item analysis against ex- 
ternal criterion of “success”) are assigned, and 
total weighted score for Parts I and II consti- 
tute the Aptitude Index. These scores are then 
divided into a 5-step (A, B, C, D, and E) rat- 
ing. 

Research on both parts of this selection device 
was begun in 1932-33. The first edition of the 
Aptitude Index appeared in 1938. Form 4, the 
latest revision, appeared in 1948. Continuous re- 
search is being carried on. 

Extensive validity studies have been made and 
published. Two criteria have generally been em- 
ployed, namely, survival rate during first year 
and production during first year equal to or 
greater than the median production for one-year 
survivors in a given agency. Survival rate and 
production level for A, B, C, and D or E ratings 
on the Aptitude Index for United States com- 
panies and for Canadian companies clearly show 
the value of the device, especially for rejection 
of applicants. Validity coefficients are not com- 
puted because of statistical difficulties inherent 
in the fact that it is easier to predict failure than 
to predict success. The predictive valüe of the 
ratings is also shown to be dependent upon the 
efficiency of agency management. This latter fact 
is considered to be so important that future re- 
visions of the Aptitude Index will undoubtedly 
take it into account with a probable further in- 
crease in validity. Cross-validation studies of 
item weights have been found to hold up in a 
satisfactory manner. It is unfortunate that va- 
lidities are not reported separately for Parts I 
and II. The personnel psychologist would like 
to know the relative validity of personal history 
versus personality and interest measurement. 


No estimates of the reliability of the Aptitude 
Index as a whole or of its parts have been ob. 
tained, except for corrected Spearman-Brown 
r's of .64 and 54 computed for two subtests of 
Part II with the original validation sample, The 
association which finances the Index research 
has offered the opinion that evidence concerning 
reliability is not essential to test users so long as 
sound evidence of validity can be presented, 
However, since the Index is used with individ- 
uals in selection (especially rejection), it would 
seem reasonable to expect that test users should 
be provided with data concerning the reliability 
of the test for use in individual diagnosis. 

Comparative studies of the reliabilities and 
validities of Part I and Part II, and of Strong's 
Vocational Interest Blank should be undertaken. 
It is possible that the Strong test is actually su- 
perior to the Aptitude Index. Marian Bills’ 
work at the Aetna Life Insurance Company with 
both the Index and the Strong test shows clearly 
that agencies relying on the Aptitude Index 
alone are not doing as good a job as could be 
done. 

Even though the Aptitude Index is confiden- 
tial and distributed exclusively for use by pat- 
ticipating agencies, it would seem to be in the 
interest of the life insurance business to make it 
available to bona fide guidance agencies with 
proper security provisions. Since guidance and 
counseling agencies in schools, colleges, govern- 
ment agencies, and social agencies are here to 
stay and are being manned with professionally 
trained personnel psychologists, it would seem 
that enlightened self-interest would lead the Life 
Insurance Agency Management Association to 
cooperate directly with such agencies. 


ALBERT S. Тномрѕом, Associate Professor of 
Education, Teachers College, Columbia Univer- 
sity, New York, New York. ЕД 
The Aptitude Index is one of the few i е 
which has had a long history of continuous 4 
velopment, cross-validation, trial-by-fire unt 
field conditions, and a research-oriented agency 
behind it. It is a pleasure to review it sineta 
though it is available only to members of y 
Life Insurance Agency Management Dp 
tion, the association has provided the pisi 
information from which an evaluation may vé А 
cally be made. Would that publishers of ү 
commercially available tests were as ope? 
above board. d the 
The first edition appeared in 1938 a" 
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latest form was developed in 1950. Part I con- 
sists of personal history items, such as previous 
occupation, number of dependents, education, 
financial status, etc. Part II consists of Bern- 
reuter-type personality inventory questions, self- 
ratings according to short personality descrip- 
tions, and questions concerning motivations and 
interests. Detailed instructions for administer- 
ing, scoring, and translating into a letter-grade 
rating (A, Excellent, to E, Poor) are given in 
an accompanying manual for use by field-office 
managers. The Index is designed for use only 
with applicants having no previous life insurance 
selling experience. 

The rationale for the Index is that success in 
selling ordinary life insurance is predictable 
from personal history data and self-ratings when 
these are empirically validated and weighted. 
That the rationale is, in general, a sound one is 
evidenced by the results of periodic validation 
studies which rather consistently show a positive 
relationship between Index scores and criter- 
ion data such as survival rate and production 
level. 

The rationale has certain limitations, however, 
which the research group responsible for the 
test are aware of and attempt to alleviate. The 
predictive value of personal history items varies 
with such factors as time, locality, age of testee, 
etc. Periodic revisions are necessary to maintain 
correct item weights, a separate scoring key has 
been devised for use in Canadian companies, and 
total scores are given different letter ratings ac- 
cording to the testee’s age and sex. Research 
data indicate that the Index has differential 
validity depending upon the age, marital status, 
and educational level of the applicant and upon 
the financing arrangement (commission vs. sal- 
ary) and quality of management of the employ- 
ing agency. Fortunately, those applicants ob- 
taining low ratings have poor chances of success 
regardless of the variables mentioned above. 

Probably the basic limitation is the attempt to 
develop an industry-wide selection test based on 
items as subject to variable influences as are 
Personal history and self-rating items. To insure 
predictive efficiency in a specific agency will re- 
Чшге not only continuous checking and research 
(such as the LIAMA has done and will continue 
to do) but also local validation studies. Many 
Msurance companies are research-oriented and 
the combination of vigorous industry-wide re- 
search and local research efforts bodes well for 
the continued improvement of this selection tool. 


[ 826] 
*[Bigelow] Hiring Kit. Sales applicants; 1944-48; 
a series of 8 hiring aids to facilitate the selection of 
sales personnel; manual ('48); distribution is re- 
stricted to business firms employing the consultant 
services of publisher ; Edwin G. Flemming and Burton 
Bigelow; Division for Sales Personnel Selection, Bur- 
ton Bigelow Organization. * 
а) PRELIMINARY EXPLORATORY INTERVIEW QUESTIONS. 
1947; (6) minutes. 
b) RESULTS OF PRELIMINARY EXPLORATORY INTERVIEW. 
1947; а checklist to aid in the evaluation of the pre- 
liminary interview; (10) minutes. 
с) PERSONAL HISTORY RECORD. 1944; title on test book- 
let is Application for Employment; (20-30) minutes. 
d) EVALUATION OF PERSONAL HISTORY RECORD. 1946; 
(10-15) minutes. 
е) QUESTIONS TO ASK PREVIOUS EMPLOYERS. 1947 ; (10- 
30) minutes per employer. 
}) PATTERNED ANALYTICAL INTERVIEW: PART I, INTER- 
VIEW GUIDE. 1947; (30-45) minutes. 
g) PATTERNED ANALYTICAL INTERVIEW : PART II, INTER- 
VIEWER'S JUDGMENTS BLANK. 1947; (20-30) minutes, 
h) QUESTIONS FOR MEDICAL EXAMINER. 1947. 


[ 827] 


XSales Situation Test. Prospective salesmen} 1951; 
Form A; no data on reliability ; mimeographed manual 
[51]; $2.50 рег 25; 356 per specimen set; postpaid; 
nontimed (30-40) minutes; Milton L. Rock; Aptitude 
Test Service. * 
REFERENCES 

1. Rock, Mitton L. “А Sales Situation Test.” J Appl Psy- 

chol 35:331-2 О ’51. * (PA 26:4228) 


[ 828 ] 
*Steward Selection System, 1951 Edition. Sales 
applicants; 1934-51; а revision of Composite Inven- 
tory and Examination (see 40: 1651) ; 20 scores: men- 
tal ability, dominance, stability, vocational interests, 
maturity, appearance-grooming-speech, health-vitality- 
strength, education, occupational experience, financial 
status, memberships and activities, family status, mis- 
cellaneous factors, caliber level, aptitude for selling, 
mental health, interest in selling activities, probable 
effort in job, capacity to survive, total; 3 parts; $1.75 
per manual ('51) and keys; $2.50 per specimen set; 
postage extra; Verne Steward; Verne Steward & As- 
оине 1 RY OF BASIC FACTORS. 1934-51} 
а) PERSONAL INVENTO I . 1934-51; 
D E materials include the Otis Self-Administering 
Tests of Mental Ability and an adaptation of Bernreu- 
ters Personality Inventory; 1 torm, ; $8 per 25; 
nontimed in part (60-80) minutes. 
b) PERSONAL INVENTORY OF BACKGROUND FACTORS. 
1034-51; I form, '51; $575 Der 25. ° 
c) GUIDE TO EMPLOYMENT DECISION : FOR SALES PERSON- 
NEL ONLY. 1945-51; I form, '51; $3.25 per 25. 

REFERENCES 

1-3. See 40:1651. 

4-5. See 3:704. 
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Donatp G. PATERSON, Professor of Psychology, 
and Member of Staff of Industrial, Relations 
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Center, University of Minnesota, Minneapolis, 
Minnesota. 

One is in a quandary as to what to do about 
reviewing Verne Steward's Sales Selection Sys- 
tem. It is not a test but a whole personnel selection 
system, and it is almost impossible to put your 
finger on the reliability and validity of any part 
of the system or even on the reliability and valid- 
ity of the system as a whole. In the reviewer’s 
opinion, in spite of Steward’s attempts to clothe 
his System with an aura of science by means of 
testimonial statements from two psychologists, 
enough scientific evidence has not been provided 
by him or by anyone else to permit an objective 
and searching appraisal of its reliability and 
validity. The reader is, therefore, referred to the 
two unfavorable reviews which apeared in the 
Third Mental Measurements Yearbook, one by 
Milton E. Hahn and the other by Floyd M. 
Ruch. The unpublished Mortensen doctoral 
thesis which has been completed since the previ- 
ous reviews were written and which is supposed 
to have evaluated this System is so defective in 
design that its findings are of little value, even 
if the findings were to be published in detail 
which they have not been. 


Атвевт S. Тномрѕом, Associate Professor of 
Education, Teachers College, Columbia Univer- 
sity, New York, New York. 

The 1951 edition, like the edition described 
fully in the previous Yearbook, consists of a 
mental ability test, a personality inventory, vo- 
cational interest items, self-analysis questions, 
and a detailed personal information blank. These 
data, plus those obtained in personal interview 
and from references, form the basis for ratings 
made through the use of the Guide to Employ- 
ment Decision. The end result is a set of ratings 
on 19 factors on a four-point scale (unqualified, 
borderline, acceptable, superior) and a “Final 
Rating for Person as a Whole.” An accompany- 
ing manual gives detailed instructions for ad- 
ministering and scoring the tests and presents 
“favorable cues” and “danger signs” to be con- 
sidered in rating the applicant on the Personal 
History factors and Key to Success factors. 

What we have here is not a battery of selection 
tests, objectively scored and weighted to predict 
success in a given occupation, but rather a selec- 
tion procedure providing guides and tools for 
acquiring information about the applicant which 
can then be compared judgmentally with em- 
ployment standards so that a final employment 
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decision may be made. Steward recommends 
that these employment standards be set by job 
and analysis, with adjustments made as the re- 
sult of experience. Whether the selection pro- 
cedure will work depends, therefore, upon the 
adequacy of the applicant data obtained and the 
validity of the employrnent standards for the 
specific situation involved. The Steward mate- 
rials per se do not insure that these two require- 
ments will be met. 

It is a rather trying experience to attempt to 
evaluate the Steward materials in the usual way, 
i.e., by studying technical data on the reliability 
and validity of the instruments involved, An 
August 1950 statement by the author, entitled 
Development and Validation of the Steward 
Sales Personnel Selection Materials, describes 
eight research studies ; but the descriptions deal 
merely with the subjects, instruments, and cri- 
teria, and do not include the results of compari- 
sons between the selection instruments and the 
criteria. From descriptive data provided by the 
author, the reviewer was able to find two studies 
which approached validation in the usual sense: 
a follow-up study of 51 agents hired with the 
Steward system; and a comparison of 21 "suc- 
cessful” and тї “marginal” agents employed by 
a company with its own selection system but on 
whom Steward ratings were obtained. In both 
studies a positive relationship was found be- 
tween ratings, using the Steward materials, and 
the criterion. 

It would appear that the author has recently 
been making a sincere attempt to obtain and 
present research data necessary for evaluating 
his materials. Those presented so far, howeven 
do not cause this reviewer to disagree with the 
overall evaluations made by the reviewers In 
The Third Yearbook. As a "battery of orden 
tests,” they need further validation (as do mos! 
tests). As a “system of selection procedures 
they would appear to depend upon the m 
judgment and occupational acumen of the p { 
and thus require empirical validation !n 
specific situation. is 

The most disturbing thing to the present 
viewer is the test author's assumption that uu 
materials, developed on life insurance agente 
apply equally well to other sales personne il 
author states that “the materials can be 
with success in any field of selling as 500 Ж 
suitable employment standards have га 
tablished.” No evidence to support this $ 
ment is presented. 
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For reviews by Milton E. Hahn and Floyd L. 
Ruch, see 3:704- 


[ 829] 
E for Ability to Sell: George Washington 
а ау Series. Grades 7-16 and adults ; 1929-50 ; 
Form 2 (’50—a revision of Form 1); manual (’50) ; 
$3 per 25; 30¢ per specimen set; postpaid; 45(50) min- 
utes; original test: F. A. Moss, Herbert Wyle, Wil- 
liam Loman, and William Middleton; revised test: 
Thelma Hunt, Robert George, and William Schnell; 
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University. * 1 Эх 
For a review by Floyd L. Ruch, see 3:703. 
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Flemming; Division for Sales Personnel Selection, 
Burton Bigelow Organization. * 
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Army Air Forces Aviation Psychology Program 
Research Reports, Nos. 1-19. Washington, Bc: 
U.S. Government Printing Office, 1947-1948. Individual 
volumes are also listed separately. 

a) FLANAGAN, Јонм C., Eprror, The Aviation Psy- 
chology Program in the Army Air Forces. Report No. 
1, 1948. Рр. xii, 316. Out о print. * (РА 22: 4144) 
For reviews, see B147. 

b) DuBors, Рнплр H., Eprror. The Classification Pro- 
gram, Report No. 2, 1947. Pp. xiv, 394. $1.50. * (PA 
22: 4143) For reviews, see B123. 

c) THORNDIKE, Ковевт L., EDITOR. Research Problems 
end Techniques, Report No. 3, 1947. Pp. viii, 163. Out 
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on Pil, Де Neat E., EDITOR. Psychological. Research 
of dd Training. Report No. 8, 1047. Pp. xix, 488. Out 
i e (PA 22:4153) For reviews, see B274. 
Eprror Ее H., AnD Јонмѕом, A. PEMBERTON, 
ing. Ri з. Psychological Research on Bombardier Train- 
110. Report No. 9, 1947. Pp. x, 204. Out of print. * (PA 
D) o) For a review, see B226. 
on Monee, Launor F., EDITOR. Psychological Research 
186 ии Training. Report No. то, 1947, Рр. іх, 
Bio Ara * (PA 21:3708) For reviews, see 
k * 793. 
on porns, Nicmoras, Eprtor. Psychological Research 
Ы Шы le Gunnery Training. Report No. 11, 1047. 
Вто», ii, 508. $1.75. * (РА 22: 4645) For reviews; see 


1) Соок, S 
, Stuart W., Eprror. Psychological Research 
on Radar Observer Training. Report No. 12, 1947. Pp. 


x, . » 
5 j5. Out of print, * (PA 22: 4139) For reviews, see 


m 

on paty, Joun T., Eprron. Psychological Research 

vii, 239 it Engineer Training. Report No. 13, 1947. Pp. 

n) Wet 25: * (РА 22: 4141) For reviews, see B107. 
KERT, FnEDERIC, Eprtor. Psychological Research 


on Problems of Redistribution. Report No. 14, 1947. Pp. 
vii, 298. $1.25. * (PA 22: 4652) For reviews, see B419. 
о) Bryou, SIDNEY W., Еютов. The Psychological Pro- 
gram in AAF Convalescent Hospitals. Report No. 15, 
1947. Pp. viii, 256. Out of print. * (PA 22: 4415) For 
reviews, see В50. 

P) Crawrorp, MEREDITH P.; SOLLENBERGER, RICHARD 
T.; Warp, Lewis B.; BROWN, CLARENCE W.; AND 
GHISELLI, EDWIN Е., Eprrons. Psychological Research 
on Operational Training in the Continental Air Forces. 
Report No. 16, 1947. Pp. vii, 367. Out of print. * (PA 
22:4140) 

q) LEPLEY, WaıLrram M., Ештов. Psychological Re- 
search in the Theaters of War. Report No. 17, 1047. Pp. 
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and 3:984. 

ғ) Deemer, WALTER L., JR, EDITOR. Records, Analy- 
sis, and Test Procedures. Report No. 18, 1947. Pp. ix, 
621. Out of print. * (PA 22:4142) For review, see 
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s) Frrrs, PauL M., EDITOR. Psychological Research 


on Equipment Design. Report No. 19, 1947. Pp. xii, 276. 
Out of print. * (PA 22:4175) 

J General Psychol 41:141-7 Jl "49. HI. 
Eysenck. The groans from members of the re- 
viewer's Department which greeted the arrival 
of a perspiring postman carrying the nineteen 
books which constitute this series of reports will 
have been echoed by members of many other 
Psychological Departments, but the first feeling 
of incredulous disbelief very soon changed to one 
of awe and admiration as the whole picture of 
the work of American Psychologists in the 
Army Air Forces Program unfolded itself. Here 
is richness indeed, and there will be few Psy- 
chologists churlish enough to withhold their 
heartfelt congratulations from the men who orig- 
inated this program in 1941, carried it through 
during the following five years, and finally pro- 
duced this rather overwhelming testimony to 
their success, between three and four million 
words long. These 19 volumes are edited and 
written by different persons and it is remarkable 
that a comparatively great degree of uniformity 
in style and presentation has been secured * For 
him who runs as he reads,....[ Vol. 1] will, per- 
haps, give a sufficient general picture of what is 
contained in the whole series ; for most readers, 
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however, it will probably serve to whet their 
appetites by presenting only some choice sam- 
ples from the abundantly laid table. * Volume 
6....gives a detailed account of this most impor- 
tant single contribution of the Air Forces psy- 
chological research. Volume 7....gives an inter- 
esting account of a rather specialized type of 
investigation which will undoubtedly become 
of greater importance eventually. All these vol- 
umes [1—7] are of general interest to psycholo- 
gists and should be read and pondered in great 
detail. The same may be said about Volume 
15..and Volume 18....which forms the fitting 
complement to Volume 3. The other volumes... 
are of rather more specialized interest and, in 
so far as they have general psychological rele- 
vance, are probably covered adequately by the 
summary given by J. C. Flanagan in Report 1. 
By and large, the reader will find that all reports 
whose title begins, Psychological Research on 
are of specialized interest to Aviation Psy- 
chologists mainly, whereas all other reports 
contain much that is generally valuable. * The 
first point which emerges from these reports is 
that on the whole, psychology has made an out- 
standing contribution * The success of the selec- 
tion program finally developed as indisputable 
and stands as a monument to what scientific 
psychology in its present status is capable of 
doing, and as an inspiration to those who have 
always held that the main scientific advance 
during the present century will lie in the field 
of mind rather than of matter. * It is very inter- 
esting to note that printed tests alone succeed 
almost as well in selecting successful pilots as 
does the whole battery ; this is certainly an un- 
expected and important result of the program. 
The over-all correlation of .66 is miraculously 
high when we consider that the criterion itself 
had a reliability which was far from perfect and 
may not have been much above .7 or thereabouts, 
and that there are a number of factors in pre- 
diction which will inevitably reduce it below 
perfection (good candidates may receive poor 
instruction, have personality clashes with their 
instructors, have personal problems which 
preoccupy their minds and so forth). In 
view of these adverse factors, the correla- 
tion found expresses a degree of successful 
selection which is most heartening and en- 
couraging and probably comes nearer to per- 
fection than most of us would have dared 
hope at a time when the program was inau- 
gurated. Detailed individual follow-ups were 


conducted of high testing individuals who failed 
and low testing individuals who succeeded, The 
main points of differentiation between these ар. 
pear to lie in the fields of motivation, emotional 
maturity, self-discipline, and personality gen- 
erally, This finding shows up what I consider to 
be probably the weakest point of the Army Air 
Forces Aviation Program, namely, its failure 
to deal adequately with the problems of tempera- 
ment and character. A number of studies are, 
indeed, reported dealing with neurotic ques- 
tionnaires, Rorschach, and other projective tech- 
niques; these apparently were found largely 
useless and therefore not incorporated in the 
program. Unfortunately, little attempt was made 
to incorporate and try out objective tests of 
temperament, such as, for instance, the Flicker- 
Fusion ("strobotac") Test, which received a 
preliminary validation as described in Report 
15. There are a considerable number of tests of 
this kind and it would appear possible that had 
use been made of them, even better prediction 
would have been achieved. It is interesting to 
note that while pilot-selection was extremely 
successful, bombardier selection was found very 
much less effective. This is probably due to the 
fact that satisfactory criteria was finally worked 
out for pilot efficiency but it was not found pos- 
sible to work out satisfactory criteria for bom- 
bardier efficiency (navigators are intermediate 
between the other two groups). Another fact 
which emerges from this work is the go 
mental importance of the criterion. Much w 
was devoted to increasing the reliability an 
validity of the criteria of navigator nonga 
and it is interesting to note how very low D 
reliability of instructors' ratings was before rd 
final objective assessment procedures were H 
troduced. This finding suggests that he " 
the low prediction correlations found in jr 
trial work may be due to faulty criteria 12 x4 
than to faulty tests. The work done in the : 
tual theatres of war is inevitably diet ii 
by a less high standard of experimental o. d 
and analysis than that carried out in the 3 
States; reasons for this are, of course, obvi F 
and inevitable. Of particular imparts Ph 
haps, was research on problems concerne Min 
the causes of mission failures. Flanagan * 

cludes his discussion of this point by say!n£* ; 


i ined 

Much improvement in operations could yar failures 
systematic tabulations of reasons for miss dures, rou" 
and the development of operational P raining pro 
tines, classification procedures, and special tales.’ so 


grams....The myth that each separate operatio 
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important that its success cannot be jeopardized for the 
sake of obtaining accurate information on which to 
hase future operations has been relegated along with 
the horse cavalry and the battleship to the limbo of lost 
causes. The effectiveness of future military operations 
must be a matter of continual systematic, and thorough 
scientific observation and experimentation, 


Few psychologists would disagree with this con- 
clusion and it seems a pity that psychologists 
were not called in early enough to help in this 
type of "operational research." There are very 
many other points which one would like to dis- 
cuss, among them, the ingenious assessment 
techniques for group therapy sessions, described 
in Volume 15; interesting adaptations and new 
developments in statistical methods, such as the 
Gillman and Goode G coefficient (an estimate of 
the correlation of a bivariate normal population 
when X is truncated and Y is dichotomized); 
studies of individual reactions to combat; the 
detailed analysis of methods of using Hollerith 
machines for record and analysis processes ; the 
many psychological issues arising from the large 
number of factorial studies described in Vol- 
ume 5 by Guilford and others, and the fascinat- 
ing problems opened up by some of the work on 
motion picture testing by Gibson. However, 
there is only room for certain more general ob- 
servations. In the first place, these studies indi- 
cate the imperative need in psychological re- 
search for large numbers. In Volume r, Flana- 
Бап gives a table showing the planning figures 
"sed in regard to the appropriate size of the 
sample considered adequate to achieving the 
Purpose listed, relating to the predictive value 
ofa specific measure. For exploratory purposes, 
100 is considered sufficient, for testing the null 
hypothesis, 400, for decisions with respect to 
use, 1,600, for establishing precise regression 
Weights, 6,400. Those values are considered ap- 
Propriate when evaluations are to be made in 
terms of product moment correlation coefficients 
[eut Continuous scores. When other procedures, 
mM biserial correlation coefficients or tetra- 
ee correlation coefficients are to be used, 
bae the samples should, of course, be in- 
са to compensate for the larger sampling 
id s of the coefficients. When these numbers 
s ges with those appearing i most 
will ПЕШ. researches published in journals, it 
inc ° Seen that very many of these researches 
eS 5 below the level considered necessary 
fel i oratory purposes; hardly any reach the 

beyond that necessary for testing the null 
YPothesis. This is a very serious business. Many 


alleged contradictions in different research find- 
ings which puzzle the reader can probably be 
explained in terms of the smallness of the sam- 
ples involved and the very large probable errors 
which occur. It is possible that the numbers 
given above are slightly exaggerated because of 
a second point, to which I wish to draw atten- 
tion. By and large, most of these predictive 
values for individual tests are relatively small. 
Correlations of the order of .2 or thereabouts, 
which, in the pre-war period would have been 
looked down upon as useless, were the order 
of the day and from them it was found possible 
to build up very high multiple correlations. Now, 
as the probable error of a correlation is a func- 
tion of its size, clearly these small correlations 
require larger numbers of subjects for ascertain- 
ing their significance and their appropriate 
weights in a regression equation than do the 
rather higher correlations often dealt with in 
research. This point leads on to a third one, 
which is probably the most important single psy- 
chological finding which emerges from this pro- 
gram, namely, the very large number of semi- 
orthogonal factors which emerge in the cog- 
nitive field. Where, previously, we had been 
thinking in terms of 10, or at the most, a dozen 
relatively independent mental abilities, these 
studies and more particularly, Volume 5 by Guil- 
ford, show that we should be thinking rather in 
terms of 100 or more. If that be so, and the 
evidence in favour of this contention is very 
strong, and if complex ability, such as flying, is 
determined by a large number of these separate 
abilities, then it is quite obvious that a relatively 
pure test of one of these abilities cannot show 
more than a rather low correlation with the 
criterion. If there are 20 different independent 
abilities involved in predicting a certain cri- 
terion, such as flying ability, then, on the av- 
erage, each pure component ability test can- 
not contribute more than four or five per cent. 
This corresponds to a correlation coefficient in 
the neighbourhood of .2. Thus, it appears that 
whenever we are dealing with a complex ac- 
tivity, we should give up the hunt for high cor- 
relation coefficients and rest content with a com- 
bination of relatively pure tests showing low 
correlations with a criterion, and correlations 
in the neighbourhood of zero among themselves. 
It will be seen that this conclusion is precisely 
opposed to that of the "organismic" school as 
expressed, for instance, in the OSS publica- 
tion, Assessment of Men. It isa strictly “atomis- 
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tic" method and its success in the present case 
as opposed to the failure of more “organismic” 
methods (it was found here, as it had been 
found previously in the work of the Navy Psy- 
chologists, that interviewers who tried to arrive 
at an over-all picture by trying to imbricate 
their personal global views with the score pat- 
tern on selection tests, only succeed in reducing 
the accuracy of the prediction) is of obvious 
systematic interest and importance. A word may 
be opposite with regard to the implications of 
this program for peace-time ‘psychology. Is it 
only during war that large numbers of psycholo- 
gists can work together toward the achievement 
of a worth-while over-all objective, so that they 
can pool their resources and their abilities and 
succeed not only in forwarding the practical as- 
pects which led to the inauguration of the pro- 
gram, but also in the discovery of important 
scientific principles? Will psychology return to 
its pre-war “hole-in-the-corner” attitude of 
small-scale single-person research, lacking any 
affective liaison or leadership in the field of re- 
search? Do we have nothing to learn from the 
point of view of organization from the experi- 
ence of the many hundreds of psychologists who 
contributed to the fulfillment of this program? 
These questions are easy to ask. To make posi- 
tive suggestions as to what precisely can be done 
to utilize the many lessons which we can learn is 
much more difficult and can hardly be under- 
taken in a review of this kind. However, it is 
possible by raising questions of this kind that 
perusal of these volumes will be most rewarding. 
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gence of Eleven-Year-Old Pupils. Preface by 
Godfrey Thomson. Sponsored by The Population In- 
vestigation Committee and The Scottish Council for 
Research in Education. Publications of the Scottish 
Comal for Research Preis Lid Тон 9 ERN 
A 7 = dni td., 1949. Pp. xxviii, 151. 

Am Sociol R 15:149 Е 50. Wm. Н. Sewell. * 
The major findings...are that there is clearly 
a negative association between size of family 
and average intelligence score in 1947, just as 
there was in 1932, but that instead of the ex- 
pected fall in average intelligence of 1947 eleven- 
year-olds in comparison with the 1932 group, 
there was а substantial increase in scores, both 
on the group and the individually administered 
tests. It is obvious that the committee was taken 
aback by this, but to its everlasting credit made 
no concerted attempt to explain away the re- 
sults. Several interesting speculations are of- 
fered by Professor Thomson as possible ex- 
Planations of the rise in average score—some 
of which are basically environmental, and others 
fundamentally genetic—but he rightly points 
2 that until these are thoroughly tested by 
e accumulation of evidence, judgment must 
e suspended as to their correctness. Meanwhile 
d efforts will be made (says Professor 
Het) “to examine (and, let us hope, find 
m E сап refute) every conceivable explana- 
"SN ich may leave open the possibility that 
of Pac selection is going on behind a facade 
"i тыру improvement." One wonders if 
icon x zeal to examine possible explanations 
jus ave been exhibited had the results gone 
tole, ahh The undertaking reported in this 
hs ds is a remarkable one. The Committee 
atid "età an excellent job, both in the planning 
tW rue of the survey and in the presenta- 
da. pe results thus far reported. Social psy- 
lig ia: $ and sociologists will profit from read- 
бй volume and will anxiously await the 
ing re that portion of the inquiry deal- 
ыз the association between sociological 

Ts and intelligence. 
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Ann Eug 15:186-7 Mr ' 5o. L. S. Penrose. * 
The high standard of....[the] earlier [publica- 
tions]...is maintained in the present volume. * 
One of the objects....was to re-examine the well- 
established finding that there is a negative re- 
lationship between the intelligence of a child and 
the number of its brothers and sisters. In the 
present data the correlation between these two 
variables was found to be —0.28 in the large 
sample and —0.32 in the small sample tested 
with the Terman-Merrill Revision, Many au- 
thorities find it difficult to believe that a nega- 
tive correlation between intelligence and sib 
number is compatible with a genetically stable 
population.: Sir Godfrey Thomson....cautiously 
suggests that selection adverse to intelligence 
may be going on behind a facade of temporary 
improvement and urges that the investigations 
be continued. It is perhaps worth while to point 
out, however, that the relationship between an 
individual's intelligence and the number of his 
sibs is not necessarily a good index of the true 
relationship between intelligence and fertility in 
the population. Before making predictions about 
probable rise or fall of intelligence level, the ac- 
tual fertilities of individuals at all levels, includ- 
ing idiots, should be taken into account. It has 
been customary among eugenists to accept argu- 
ments based upon the inverse relationship be- 
tween family size and intelligence as correct. 
This would imply that the observed stability 
of intelligence level is illusory. The results of 
the Scottish Survey, reported with such care, 
on the other hand, might not unreasonably be 
accepted as valid. If so, the arguments based 
upon observations concerning differential fer- 
tility with respect to intelligence must be fal- 
lacious. There are many by-products of the work 
which it is hoped will be fully reported upon in 
the future. One of these is the follow-up study of 
children with exceptionally high test scores. 
Another, even more important, is the light 


thrown by sociological investigation upon the 
f differential fertility with 


nature and causes 0 i 
special reference to education and economic en- 


vironment. It is perhaps too much to expect that 
an inquiry of this sort will finally solve any of 
these problems, but the team, which has been 
brought together to produce this excellent body 
of data, has earned the gratitude of all those who 
are prepared to give serious thought to the sub- 


ject. 

i Brit J Psychol, Gen Sect 40:157-8 Mr '50. 

R.W. Pickford. * a most important and valuable 
‘ [1 / 


uf 


L^ 
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contribution to the problems of intelligence and 
its inheritance * a very large amount of im- 
portant sociological material was collected ; this 
will be analysed and published in subsequent 
books, and will be of the greatest value to psy- 
chologists and sociologists. Indeed, one can 
hardly overestimate the interest and importance 
of surveys such as these. Everybody, the world 
over, who is dealing in any way with such mat- 
ters as the distribution of variations of intelli- 
gence and their inheritance, must buy this book 
and read it carefully. * Prof. Thomson, in a 
most admirable Preface, explains the circum- 
stances under which the work was done, makes 
due acknowledgements to those who have helped 
and sponsored the 1947 survey, and discusses 
the significance of the conclusions with great 
lucidity. * The whole book is a model of clear 
and unprejudiced exposition and of sympathet- 
ically co-ordinated team-work. * Because of 
all....[the] imponderable influences, much more 
study will have to be done, especially in the di- 
rection of splitting up the population into sub- 
divisions on the basis of measured intelligence, 
and these again in terms of environment. Then 
changes, if any, in the average intelligence level 
and its distribution for each of the subgroups 
might be observed in the offspring of those now 
tested, and be considered in relation to environ- 
mental influences. The formidable task of carry- 
ing out surveys, planned in this manner, over 
three generations, would not be beyond the 
powers of the Scottish Council for Research 
in Education, which has made a magnificent 
start. 

Brit J Psychol, Stat Sect 2:189. N "49. Ar- 
thur Summerfield. * It is emphasized that the 
present volume, written by eight contributors, 
constitutes only a preliminary report. Addi- 
tional light on the main issue may therefore be 
expected from the further analysis of the data — 
particularly from the “sociological schedule." 
Meanwhile, the account of the planning, the ad- 
ministrative work, the selection of tests, ques- 
tionnaire-items, and the like will furnish much 
valuable assistance to those contemplating fu- 
ture surveys. The discussion of the methods 
used for coding data for the Hollerith counter- 
sorter, for checking such operations, and for 
detecting errors, forms a novel feature of special 
interest to statistical psychologists. One would 
have welcomed a little more information as to 
the nature and extent of the errors encountered ; 
and, since the volume is a compilation rather 
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than a narrative, an index, like that to the earlier 
volume, would have been most helpful, 

Brit J Sociol 1:154-68 Je ’50. Cyril Burt 
[See Burt's review for critical comments to 
lengthy to excerpt. ] 

Ed R 2:83-4 O ' 49. W.J. Sparrow.* Chapters 
are contributed by different members of the 
Mental Survey Committee and the method in- 
volves some repetition, particularly of accounts 
of procedure, but this is only a minor defect in 
a description of a research outstanding alike in 
its range, thoroughness, and the significance of 
its findings. 

Eug R 41:196-7 Ja’50. R. С. K. Ensor.* The 
enquiry of 1932 had yielded one very remark- 
able result. It had shown that, taking the average 
trend, there was a negative association between 
intelligence and size of family, ie. the higher 
the number of points scored by the children on ; 
the group intelligence test, the smaller tended to 
be the families of which they were members. The 
natural (though not, of course, conclusive or in- 
evitable) inference to be drawn from this was 
that the proportion of intelligent people in the 
Scottish population must decline. A major 
(though not, again, the only) motive for the 
1947 enquiry was to bring that inference itself 
toa test. It was thought that a fifteen-year inter- 
val would suffice to show whether the standard 
of measurable intelligence in the Scottish schools 
had in fact declined. There were two possible 
flaws in the reckoning. First, 1932 was the con- 
cluding year of a long world economic p 
which had brought much hardship to the Шү 
of industrial Scotland ; and secondly, 1047 7А 
a year of full employment following а ad 
war years, during which the nutrition and im 
of Scottish children were generally higher | : 
ever before. In other words, the special d 
mental features of the two years were 50 НЕА 8 
and pointed in such opposite directione | 
they were bound to disguise to some pni 
effects of any trend in heredity. In the 9 
the findings of 1947 were, in relation to on 
of 1932, somewhat baffling. On the one ЭДИ 
the negative association between intelligen¢ ind 
the size of family was shown as strongly 25 iR 
* On the other hand, the average Scores jn 
telligence points over the whole лара in 
definitely higher than over the 87,498 e 
1932. That is, the standard of peser ii 
year-old intelligence, instead of having ty risen: 
during the fifteen years, had apparent y 
Such is the showing of the group tests 
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are we to account for the apparent contradiction 
between the results of the group tests? The re- 
viewer's own inclination is to refer much of it 
to the environmental difference between 1932 
and 1947 alluded to above. An eleven-year-old 
child, with any given degree of inherited intelli- 
gence, must be expected to make a better show 
at a group intelligence test, if he has behind 
him seven years of good nutrition with all the 
vitamins, than if his history for the same period 
has been one of underfeeding and hardship. But 
there is another large factor which might help 
to disguise а decline in inherited intelligence— 
the factor which Professor Godfrey Thomson 
calls “test sophistication.” Between 1932 and 
1947 many developments had intervened to 
familiarize school children, if not with tests 
themselves, at all events with "procedures in 
class teaching, in school reading books, in news- 
paper ‘puzzle corners, and in wireless broad- 
casts, similar to those used in tests." But such 
explanations, however high one may rate their 
claims, cannot be proved. Only another com- 
parable enquiry held in 1962 might silence 
doubts on the subject. * Thomson....contributes 
à luminous preface * 

J Mental Sci 96:545 Ap '50. M. B. Brody. * 
This report is the first of what should be a most 
valuable series based on the enormous amount 
of data. It shows beyond possibility of dispute 
that on the average, members of large families 
are less intelligent than members of small fami- 
lies. * many important results [are] given and 
discussed in this report of a research which in 
its careful preparation, meticulous execution, 
pellucid exposition and scientific reasoning, is a 
model work, * Psychologists, psychiatrists, edu- 
cationalists, geneticists and sociologists will all 
tead this book with profit and pleasure. 

Lancet 257:657 О 8 '49. * Before any final 
conclusions are drawn...we must wait for the 
more detailed analysis of the data which the 
Council promises, and possibly a third survey 
undertaken at a longer interval from the first. 

eanwhile, the enterprise of the Scottish re- 
search council in organising so formidable a task 
Шау well encourage education authorities else- 
bu to undertake similar school surveys; and 

e detailed account of the methods finally 
dopted will show how such schemes may best 

* planned and how the inevitable difficulties 
Шау be overcome. 

Scottish Ed J 32:547 Ag 26 49. * The report 

“gives in detail—perhaps too much detail— 
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the organisation and procedure involved in so 
extensive an investigation. * One, though only 
one, of the main objects of the survey was to 
establish whether there had in fact been a drop 
in the intelligence of Scottish children over the 
fifteen years separating the two surveys. In the 
group test, the 1947 children had an average 
score 2.28 points higher than the 1932 children, 
this difference being equivalent to between two 
and three points of І.О. In the Individual Binet 
Test, the average I.Q. in 1947 was 102.5 as 
compared with the average of 101.5 1.0. for the 
1932 children. But, as the authors are careful 
to point out, these figures do not lead immedi- 
ately to the conclusion that the intelligence of 
Scottish children has risen, rather than fallen 
as some feared. Direct comparison is mislead- 
ing. In the group test, for instance, children have 
had more experience of such tests than in 1932, 
and though there are indications that this “test 
sophistication" may not account for all the in- 
crease, it doubtless accounts for some of it. And 
it is not unknown for Binet questions to appear 
in school readers. Unfortunately, circumstances 
compelled the use of a different version of the 
Binet Test in the two surveys, so again direct 
comparison is not possible. Indirect comparison, 
however, is, and a chapter is devoted to the 
statistical adjustment of the 1947 Binet scores 
to those of 1932. The conclusion reached is that, 
because the Binet Test used in 1947 appears to 
be easier than that used in 1932, there is in fact 
no substantial change in the average level of 
І.О. The authors reserve any final conclusions 
on a possible rise in the level of intelligence, but 
are quite definite that there is no direct evi- 
dence of a fall. The same problem is, however, 
tackled from another angle in this report. Fig- 
ures are given which clearly show a steady fall 
in the average intelligence test score as the num- 
ber of children in the family is larger. The less 
intelligent, therefore, should contribute more 
members to the next generation than the more 
intelligent. But again interpretation is difficult. 
It is possible that the small families, with the 
higher intelligence, may be the incomplete larger 
families of a coming generation, and the larger 
families may be the completed families of a more 
recent generation. * there is again no positive 
evidence of a decline in intelligence. The final 
verdict must, however, await the results of fu- 
ture investigations based upon the present sur- 
vey. It is in this that the real value of the survey 
lies. It gives a foundation for a large number 
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of future investigations along various lines. The 
data about the 1947 eleven-year-olds will act as 
a standard of comparison against which further 
developments can be measured; and reference 
to the sociological schedules printed in the book 
shows that there is a wealth of information avail- 
able. * The children tested in the survey have 
now nearly all left the Primary School and are 
progressing through the Secondary School. 
Soon they will be leaving school. It was not the 
expectation of the sponsors of the survey that 
action would follow immediately upon their 
findings, but many readers may legitimately 
wonder what is the use of this mass of informa- 
tion that has been collected. What is the point of 
knowing so much about the educational, intellec- 
tual, physical and social conditions of these chil- 
dren if we do not know how these conditions af- 
fect their education and their future? Large fami- 
lies, for instance, may be associated with lower 
intelligence—but may they not have compensat- 
ing advantages? The answers can only be given 
after further investigation. The Committee re- 
sponsible for the survey have planned to follow 
the future progress, in school and beyond it, of a 
group of 2000 of these children. This is an es- 
sential part of the whole plan of the survey, for 
without such a follow-up much of the value of 
the material collected would be lost. There are 
difficulties in following up even such a small 
group as 2000 children, but it is to be hoped 
that these difficulties will be surmounted. As 
time passes, there is no doubt that the contribu- 
tion of this survey to Scottish Education, and 
to educational knowledge generally, will be rec- 
ognised as considerable....[and] the various or- 
ganisations which sponsored and co-operated in 
the survey, and indeed the Scottish educational 
world generally, may justly regard with pride. 
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XU.S. Clerk, CAF 1 to 7: Clerical Series for Fed- 
eral Clerical and Filing Tests, Revised Edition. 
Milwaukee, Wis.: Pergande Publishing Co., 1948. Pp. 
128. Lithotyped. Paper, $2.00; cloth, $2.50. * 
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ЖАвт, LAWRENCE EDWIN, AND BELLAK, Leopotp, Enr- 
tors. Projective Psychology: Clinical Approaches 
to the Total Personality. New York: Alfred A. 
Knopf, Inc., 1950. Pp. xvii, 485, xiv. $6.00. * (PA 25: 
2445) 

Ann Am Acad Pol & Social Sci 274:247-9 
Mr °5т. Manford Н. Kuhn. * It was with hope- 
ful anticipation...that this reviewer began his 
perusal of Projective Psychology * On the 
whole this reviewer was rewarded. The papers 
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on the thematic apperception test (TAT), the 
mosaic tests, figure drawing tests, and finger 
painting tests are informative. Perhaps more 
than any other the paper on the sentence соп. 
pletion test is of value to the social scientist in 
that it points toward a type of projective test- 
ing which is relatively amenable to adaptation 
for use by the social scientist himself. The three 
chapters on the Rorschach (ink blot) Test will 
be of no use to the uninitiated; they are ad- 
dressed in esoteric language to fellow Rorschach 
specialists. АП through the volume there are 
evidences of the increasing awareness on the 
part of projective testers that it is necessary to 
develop a greater range of reliability checks and 
other standardization procedures. To the social 
scientist the failure up to now on the part of 
projective tests to pay sufficient attention to 
cross-cultural and intergroup variation, to repre- 
sentativeness in sampling procedures, and par- 
ticularly to the problem of reliability in the in- 
terpretation of test results has served partially 
to discredit the claims made for the findings 
Therefore the social scientist will welcome signs 
that these matters are being attended to. The 
first two chapters of this symposium—wnitten 
by its editors—are certainly well worth reading. 
The first is a rewarding examination of the psy- 
choanalytic mechanism of projection on which 
the whole conception of projective testing was 
originally based. In it the author argues cogently 
for the use of several differentiating concepts 
instead of the catchall concept of projection. ni 
second chapter is an excellent argument for fie 
theory as opposed to orthodox learning р 
in the study of personality. Apparently p 
chapters are to be regarded as the "project í 
psychology" referred to in the title of this T 
posium. But we ought to consider carefully Т 
implications of this title. Apparently there » : 
it the tacit assumption that the application m 
given technique, or class of techniques, i 
disentangled from its theoretical onemi aid 
be expected to yield a separate and autono En 
body of theory which can then stand prin ү 
techniques are properly constructed ad io: 
test deductive extensions of already tested £ i 
eralizations. While new techniques O ten е 
lead to unexpected discoveries of in ы 
to theory, it does not follow that one vore 
a theoryless technique and from the арр Thus 
of this technique he discovers a theo Бу, 
while we study attitudes in social psy‘ 


ting 
we would not properly say we were constru 
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an “attitude social psychology.” There are a 
number of shortcomings to this symposium as 
far as social scientists are concerned. For one 
thing, each paper is written by someone who is 
a special protagonist (I have chosen this word 
carefully) of the particular test about which he 
is writing. The results are (1) that undoubtedly 
too much is claimed for each test, (2) there is 
an apodictical tone given to the interpretive 
schemes invoked, and (3) precise operational 
referents of the testing procedure are often lost 
sight of. Then again, as far as social scientists’ 
interests are concerned, far too much attention 
is paid to psychopathology—particularly to what 
each test will do in diagnostic classification of 
emotionally disturbed subjects. On reflection, 
however, I realize that the objections I have 
raised refer to (probably very temporary) short- 
comings in the whole field of projective testing 
as such, and these should not be leveled at the 
particular editors and authors of this symposium 
at all. These people have done a good job. This 
book ought to be widely read among social sci- 
entists. We social scientists ought, however, to 
construct our own tests of indirect expression 
in ways consistent with our theory of social in- 
teraction, and we ought to use our findings as 
corroborative checks against the bales of evi- 
dence we already have collected on direct ex- 
pression—particularly our findings from atti- 
tude testing. Incidentally the bibliographies 
Which accompany each paper in this volume are 
excellent—invaluable to the social scientist who 
Wishes to acquaint himself more thoroughly with 
this field. 

В Menninger Clinic 15:194 S ’51. Helen D. 
Sargent. The editors, in their introductory chap- 
ters, have made an ambitious attempt to rede- 
=з the concept of projection and to reconcile 

with general psychology as well as with psy- 
choanalysis, Bellak’s demonstration that pro- 
ee can be explained in terms of association, 
Pilani and apperception is valuable. The effect 
are confused, however, by Abt's desig- 
his of projective psychology as "the psy- 

ogy of revolt" and by his effort to make it 
а System in its own right. * 

ine Opin & Attitude Res 5:429-31 f '51- 
of wh odger. * Doubts about the scientific value 
vs at is to come are aroused by the very first 
ds of Dr. Lawrence K. Frank's intro- 
he on. In it he asserts that “projective psychol- 

method emerging as a body of concepts ап 
s clearly to be distinguished from most 
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of the other divisions or schools of psychology.” 
This seems just as meaningful as a statement 
that nationalism is emerging as a body of con- 
cepts and methods clearly to be distinguished 
from most of the other religions and races of 
mankind. Dr. Frank's mixing of his categories 
is, it transpires, emulated to some extent by 
most of his fellow-writers, The result must be 
that the critical reader in search of knowledge 
about “projective psychology" will feel inclined 
to wonder whether there may not perhaps be 
some incompatibility between a taste for science 
and an interest in this “division or school” of 
psychology. Nor will his confidence in the au- 
thors be increased when he finds, explicitly in the 
introduction and implicitly elsewhere, an appar- 
ent claim that nobody recognized “the dynamic, 
constitutive character of all perception” until the 
psychoanalysts came along. In short, the book 
makes a bad start by giving the earnest seeker- 
after-truth the impression that it may turn out 
to be a propagandist document, wooly, ignorant 
and pretentious. However, he may decide to 
persevere, at least to the extent of searching the 
essays very carefully for reports on validation 
studies. If he does so, he will be disappointed. 
He will find, for the most part, an abundance 
of unsupported claims. For instance, Dr. Sid- 
ney Levy’s essay on figure drawing is full of 
statements like this: “If the head is markedly 
enlarged, the subject may either be very aggres- 
sive, have intellectual aspirations, or have head 
pains or other somatic symptoms" (p. 277). 
Now, Dr. Levy may have ample justification 
for his assertion (which sounds rather like the 
sort of "interpretation" of a big head that one 
would expect from a moderately intelligent but 
not very imaginative schoolboy) ; but if he has 
that justification, he does not reveal what it is. 
Dr. Molly Harrower, on Group Techniques for 
the Rorschach Test, is more scientific. Here is a 
paragraph from her contribution, which deals 
with a series of tests made up of her Multiple- 
Choice Rorschach, the drawing of a man and a 
woman, and the composition of a description or 
story to match the drawings. (Part of the in- 
terpretation of the composition was graphologi- 
cal.) “The psychological tests located a group 
of fifty-nine students with potential emotional 
or personality difficulties in a total of 632 stu- 
dents tested. Of this group of fifty-nine, fifty- 
four had been previously referred to the school 
psychiatrist [author's italics]. The other five 
were then interviewed. Two of these were felt 
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to need further help. On the other hand, only 
one student who had not been isolated by the 
psychological tests was referred to the psychia- 
trist by the customary avenues. Such agreement 
between test and counseling case-finding pro- 
cedures is definitely encouraging” (p. 169). 
This is better, though the adjective “encourag- 
ing” seems odd. Unless there is more than meets 
the eye in the bit about customary avenues, ог 
unless the diagnostic instruments used before 
the previous referrals were very similar to Dr. 
Harrower’s, “remarkable” would surely be more 
appropriate. The essay near the end of the book 
by Drs. Daniel Brower and Arthur Weider be- 
gins with a brief caricature of the “traditional” 
activities of psychologists in industry and busi- 
ness. Having successfully disposed of rating 
scales by depicting them as diagnostic instru- 
ments rather than merely as recording devices, 
they proceed to talk in an optimistic and wholly 
unconvincing way about the use of the com- 
moner projective techniques in the industrial 
and commercial areas. Not a word is said about 
the suspicions these techniques often create in 
candidates for employment, training or promo- 
tion: nor is there any acknowledgment of the 
difficulties produced (not only in the occupa- 
tional field) by the fact that most of them are 
very time-consuming. Our experts on “pro- 
jective psychology” will have to do a lot better 
than this, if they want their work to be taken 
seriously. 

J Abn & Social Psychol 46:277-8 Ap ^51. 
Julian B. Rotter. Since L. K. Frank wrote his 
well-known article on “projective techniques" 
in 1939, there has been little accomplished in 
systematizing, clarifying, or even adequately de- 
scribing this field of personality measurement 
that has mushroomed in the last decade. In Part 
I of Projective Psychology, Abt and Bellak at- 
tempt to provide just such a theoretical rationale. 
The authors seem to suggest, perhaps unfor- 
tunately, that an explanation of the psychologi- 
cal processes involved in responding to am- 
biguous material and analysis of such material 
requires a special set of psychological principles. 
Projective psychology, in contrast to academic 
psychology, deals with “molar concepts," the 
"whole individual," the "total personality." It 
is also "dynamic" and "functional." In general, 
the theory outlined attempts to integrate Gestalt 
psychology and psychoanalysis. Several postu- 
lates regarding personality are stated which are 
generally accepted in psychology today. These 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 836 


are of the level of generality of: "personality iş 

a system...it functions between stimulus and 
response....it is dynamic and motivational., it 
selects and controls....it is a configuration, i 
rests both upon differentiation and integration 
-..it is influenced by environmental and cultural 
factors." At a more concrete level it is pointed 
out that projection as psychoanalytically de. 
fined does not adequately describe the behavior 
of subjects responding to “projective” tech- 
niques. A breakdown of this process is suggested, 
including inverted projection, simple projection, 
sensitization, externalization, purely cognitive 
perception, and expression. To a systematically 
or experimentally oriented psychologist, these 
theoretical chapters may be disappointing, They 
are, nevertheless, an interesting beginning. Some 
major difficulties are the high level of generality 
and the wholesale use of vague, ill-defined terms, 
such as “the whole personality.” Perhaps most 
symptomatic of its weakness for experimental 
or systematic purposes is that the subject matter, 
the unit of study, is never defined. The authors 
never satisfactorily answer the questions: Is it 
a theory of all perception or all behaviors; if not, 
which ones or under what conditions? In fact, 
the only definition of the unit of study Js given 
by the authors in referring to material included 
in Part II: “Our justification for including them 
in our Projective Psychology rests upon the be- 
lief that these procedures are referred to 2 
‘projective’ by the clinicians who utilize the meth- 
ods in their day-to-day work of study and diag- 
nosis of personality" (p. 69). For all its short- 
comings, this attempt to provide some -— 
and to clarify the assumptions of projective e 
ing is a step in the right direction and one s 

reading. Part II provides chapters, contribu Ў 
by individual authors, dealing with а samp ^ 
clinical techniques. The selection of "i м 
tions for this section is justified by the А 
in that they have attempted to obtain ma ‘a 
that is not available elsewhere and to m M" 
judicious choice of methods currently Ж aU 
and popular in psychodiagnosis. At best, t " 
thors only half succeeded in their first aim. i 
the possible exception of the Picture Fru en 
tion method, they have succeeded fairly bi я 
including a chapter dealing with SOMME their 
the more popular clinical techniques, * 7 
introduction, the editors state that ..- ^; di 
preferred to have the separate chapters едип 

several projective instruments and prO ике 
prepared by leading workers whose comp 
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with a particular technique or application in a 
special setting has been established" (p. xi). 
Herein lies the strength and weakness of this 
section. Although individual chapters vary 
greatly, for the most part they provide helpful 
and condensed essays on clinical application. At 
the same time, these chapters are generally short 
on validity data, statements of limitations, or 
critical evaluations of the techniques discussed. * 
For the most part, this book is a reflection of 
the current interest in projective techniques 
rather than an evaluation of them. It is charac- 
terized by enthusiasm, perhaps overselling, and, 
it sometimes seems to the reviewer, the frequent 
tendency in this field to squeeze too much magic 
insight from too specific or too few samples of 
behavior. Readers seeking a careful and critical 
evaluation of the theoretical, experimental, and 
clinical status of projective methods in the field 
of psychology will be disappointed. Those seek- 
ing a source book providing a general rationale 
for the clinical use of projective methods and 
brief discussions of clinical applications may find 
much that is useful in this book. 

J Consult Psychol 15:85 Е ’51. Laurance F. 
Shaffer. A series of fourteen essays....that vary 
greatly in intention and quality. Two introduc- 
tory chapters by the editors attempt to frame a 
theory of “projective psychology” with indif- 
ferent success. Among the ten chapters on vari- 
ous projective methods, Ruth Munroe's is out- 
standing in presenting a full and useful manual 
for her Rorschach inspection technique. Several 
other chapters, as those on Rorschach content 
analysis, the mosaic test, figure drawing 
and finger painting, abound with sweeping 
hypotheses supported by little or no evidence. 
In a chapter on the Szondi, Deri does not at- 
tempt a miniature manual but reports a research 
that might have been a good journal article if 
the groups had been large enough to justify sta- 
tistical tests of significance. As a whole, the vol- 
ume illustrates clinical psychology’s major need, 
more persistent and more critical research on its 

asic instruments. 
2 General Psychol 45:271-5 О ’51. Dorothy 
ilden Spoerl. This is an important book. Not 
only because it summarizes a large number of 
lective techniques, but also because more 
than many of the current writings on the sub- 
Ject, it attempts a rationale for the total theory. 
nd furthermore because it attempts, in small 
Part at least, to show how this material can be 
applied in various fields. That it does not alto- 


gether succeed is not to its discredit in view of 
the general atmosphere of ambiguity that per- 
tains currently in the field of projective theory. 
The present reviewer's chief quarrel with Abt 
and Bellak’s presentation is with their apparent 
insistence that projective psychology is a field 
apart, even a revolt against “many of the main 
currents of academic psychology.” At times in 
their presentation they seem to recognize that 
projective psychology is a tool and a technique, 
as when they say it is “a useful and respectable 
approach to the study and diagnosis of per- 
sonality,” yet at other times they seem to set it 
up as an area that stands alone and is even op- 
posed to the psychology of personality in some 
respects. Why, we ask, the extension of the mul- 
tiplicity of fields in psychology when this tech- 
nique can otherwise be interpreted, and why the 
erection of another psychological-god-head 
around whom a new cult may grow? However, 
aside from this undercurrent of specialization, 
the fundamental structure of projective tech- 
niques is more clearly presented here than in 
any writing which the reviewer has seen. The 
first two chapters, although slightly divergent in 
view and terminology, can be considered as a 
unit, and in these a number of important the- 
oretical points are made. First there is the in- 
sistence that projective tests are a process of 
clinical inference, and that as such they must be 
accepted as hypotheses until checked against 
other available findings in the situation. This is 
a healthy attitude that enhances and extends the 
usefulness of projective tests. Secondly, the 
authors clarify the role that the Gestalt theory 
of perception plays in a system of projection. 
This is an important point that has not often 
been made so explicit. Doubtless there are some 
academic psychologists who will object to the 
tacit implication that projective techniques are 
primarily an “apperceptive distortion.” That ap- 
perceptive distortion plays an important part 
cannot be doubted, yet there are many instances 
in the current projective techniques where a 
more simple interpretation on the level of selec- 
tion of stimuli (a level which the authors recog- 
nize) is quite sufficient. Abt and Bellak stress, 
and rightly, that the lack of structure and the 
degree of ambiguity present in the stimulus al- 
ways strengthens the projective process, so that 
they are able to state explicitly (p. 57) that 
“what is important is the amount of structure 
of the stimulus field, and the essential thing 1s 
the way in which the subject’s performance 18 
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interpreted." In view of this conclusion the name 
given to the interpretive process seems of rela- 
tive unimportance. The.discussions of Part II 
concern themselves with a number of specific 
projective tests as they are used in clinical psy- 
chology. This part...is well arranged, with in- 
troductory paragraphs that serve to orient the 
reader to the material and to tie the book into 
a more cohesive whole than sometimes happens 
when so many authors are represented. This part 
brings both an imposing list of techniques and 
an important list of authors. There are some of 
us who would like to see, not only in this book 
but in the field of projective techniques in gen- 
eral, the distinction between expressive and pro- 
jective techniques more clarified. Numerous геї- 
erences are made throughout this section....to 
adaptive, expressive, and projective elements, 
but not always are the distinctions made explicit. 
One of the clearest statements of this difference 
is that made by Allport in his Use of Personal 
Documents in Psychological Science, where on 
page 112 he makes the distinction clear: 

The primary difference between the concept of ex- 
pression and projection seems to lie in the nature of the 
unconscious determinants of behavior which the investi- 


gator assumes. Expression stresses manner of perform- 
m projection rather more the content that is pro- 
duced. 


Take for example the question of figure draw- 
ing and finger painting both of which are given 
full treatment in Part II. Either of these tech- 
niques contains much which is as expressive, as 
well as much which is projective. The expres- 
sive aspects are surely those which are con- 
cerned with style, the pressure, use of the full 
page, placement of the figure, type of line, sure- 
ness of execution, rhythm, speed, or choice of 
color. And these expressive aspects tell us a 
good deal about personality structure, but they 
are not projective, at least surely not in the 
Freudian sense which has been made basic. Sid- 
ney Levy in his essay in Part II on figure draw- 
ing has made this clear by listing in his record 
blank (pp. 286, 287) these expressive aspects 
as "graphic analysis," while the clearly projec- 
tive aspects are listed separately. In none of the 
other presentations is the distinction between 
expressive and projective made clear. So much 
work has been done in the field of expressive 
movement, as well as in the field of projective 
techniques, and each has so much to offer in 
the final appraisal of personality that it seems 
important to keep the two separate so that the 
full value may be drawn from each. To one who 
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is unfamiliar with any of the various technique; 
considered in the series of essays which com. 
prise Part II it will be a mine of information, 
particularly because of the adequate bibliograph. 
ical references given for further study of any 
specific technique. The Rorschach is fully pre. 
sented with three essays which include content 


"analysis, the inspection technique, and group 


methods. This three-fold presentation will serve 
to clarify many questions for academic psycholo- 
gists who have had a one-sided view of this 
method. The chapter on the Thematic Apper- 
ception Test is one of the clearest in the book, 
particularly valid for its suggestions in regard to 
interpretation, its presentation of the responses 
most frequently found for each of the pictures, 
and its synthesis of a number of approaches both 
to the scoring and interpretation of this useful 
device. Wertham's essay on the Mosaic Test 


‚ (although perhaps too brief) is clear and sug- 


gestive and should lead many to consider this 
newer technique. The three essays which pre- 
sent figure drawing, sentence completion, and 
finger painting are also extremely adequate and 
helpful. There are two chapters about which one 
might raise some questions, and these are the 
essays which present the Szondi Test and the 
Bender Visual-Motor Gestalt Test. Both are il 
luminating presentations, making quite dear 
the possibilities of these techniques. One might, 
however, well question their inclusion as pro 
jective techniques. Let us look into this ques 
tion more closely in regard to each of them. The 
discussion of the Szondi test seems to pares 
the tendency to call any method which inclu i 
a certain element of analytical orientation pid 
intuitive hypothetical interpretation as à n 
jective technique. If one were to coin 4 ЕС 
word (and surely one who objects to nm , 
the terminology in Bellak's introductory chaP 
ter, as the reviewer does, should resist the ii 
tation) he would be tempted to commit A 
horrible sin of asking if the Szondi Test 18 ji 
an “Identicative” technique. Certainly ie | 
is interesting, and the presentation of se 
findings is convincing, yet equally surely je to 
is little that is projective in asking POT, m 
select the pictured faces liked and disli к Ж 
reality what is being studied is the deg" ín 
which one can identify with certain ур А 
there is nothing here which comes и sh pie 
heading of "apperceptive distortion" in t da the 
that it has been used on Part I. MEE Test 
case of the Bender Visual-Motor Gestalt 
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(except insofar as one interprets the expressive 
aspects of the subjects re-presentation of the 
forms through drawing) projection does not 
seem to be the underlying process. Yet it is an 
extremely important test of perception, and the 
writer was particularly struck by the degree to 
which this essay clarifies such studies as Wer- 
ner's of physiognomic perception, or Piaget’s of 
the syncretic nature of the perceptions of the 
child, or of Goldstein and Scheerer’s on the con- 
crete thinking that reappears in certain persons 
who have distorted or wholly lost their ability of 
abstraction. Interesting and important as these 
are, we would raise the question as to whether 
either the Szondi or the Bender test are truly 
projective techniques. The only really disap- 
pointing portion of the book is Part III which 
isentitled Projective Tests in Nonclinical Areas. 
By far the shortest section of the book (50 
pages) its inclusion suggests that projective 
techniques are useful in nonclinical areas, while 
at the same time its limitations as to areas ne- 
gates that suggestion. Both of the essays, the 
one on projective techniques in business and in- 
dustry and the other on projective techniques 
in action research, are important, illuminating, 
and well presented. Yet surely there are many 
other “nonclinical areas” which should have 
been presented. What about projective tech- 
niques in personality and social psychology? 
What about projective techniques as they throw 
light on the problems of general psychology 
usually subsumed under chapters on emotion, 
motivation, learning, and educational psychol- 
ogy? In other words why the limitation? In par- 
ticular we would like to question why projective 
Psychology in the nonclinical area of education 
$ 3 ү been more thoroughly considered? 
Bis 8 of teachers аз collaborators in action 
they iP roshanaky does seem to admit that 
i M t be able to profit in a limited fashion 
4) he findings of projective psychology (р. 
th a The advantage of this procedure lies in 
d that the collaborators (teachers) in the 
: hia of accumulating data will to some extent 
ig : to understand and appreciate what the 
hay "Ws mean." In view of the emphasis which 
кыра in Part I on the idea that the in- 
tative ations are inferences to be used interpre- 
бы rather than as quantitative findings it 
Project though education is one field where 
ets des үе psychology has much to offer. Teach- 
jecti perately need the resources which pro- 

үе psychology makes available, especially 


Bao 


elementary school teachers who have so many 
artistic productions available, some of which 
have strong projective as well as expressive as- 
pects, and they could be trained to see these re- 
sources, After all teachers are a professional 
body of intelligent human beings, and the inclu- 
sion of a chapter on Projective Techniques in 
Education would have rounded out this section 
of the book in a most desirable’ fashion. Alto- 
gether we would reassert our opening conten- 
tion: this is a valuable book. It is particularly 
valuable in view of the degree to which projec- 
tive techniques have been popularized by unsci- 
entific writers in non-academic magazines. It is 
a book with which the academic psychologist 
should, and surely will, familiarize himself ; but 
it is also one that should have wide reading in 
other disciplines as well. If this book is widely 
read the values and the dangers, as well as the 
possibilities and the limitations, of projective 
techniques will come to be far more clearly un- 
derstood than they are at present. 

J General Psychol 44:305-16 Ap '51. Robert 
M. Allen. * The overall impression is that this 
book is a collection of articles, a compendium 
proposing justifications for the inclusion of each 
particular technique in the family of projective 
methods. It does fall short of the task of inte- 
grating each contribution into the editors’ pur- 
pose of establishing a unique theory of person- 
ality that would be acceptable to young and 
vigorous projective psychology. This volume 
will be suitable for the reader interested in a 
survey of the more commonly used techniques 
of the projective variety. [See original review 
for additional critical comments not excerpted. | 

Psychol В 48:539-49 N ?sr. Irving Lazar. * 
Part Two consists of ten papers describing eight 
different tests. * The authors are well known 
clinicians with considerable experience in the use 
of the instruments they discuss. Тһе quality of 
these papers is, however, highly uneven, Except 
for Bellak's paper on the ТАТ, none of the au- 
thors relates his discussion to the hypotheses in 
Part One. Only three present data on sizable 
groups at all useful in evaluation of the instru- 
ments they describe; four present clinical data 
only, or studies of small samples such as in 
Deri's study of Szondi Test changes in nineteen 
depressives before and after electric shock ther- 
apy, with two control groups of ten Ss each. The 
other papers are either theoretical reviews of the 
literature or descriptions of tests and how they 
are scored. The erratic quality of this second 
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section is in part compensated for by the fact 
that there are some tests described in detail here 
which have not previously been included in such 
collections. Part Three consists of two papers 
—one on the use of projective tests in business 
and industry, and the other on their use in ac- 
tion research. These are both reviews of selected 
literature with some examples of applications. 
While not as complete as the reviews by Camp- 
bell and by Deri, Dinnerstein e£ al., which have 
appeared in this Journal, they are useful sup- 
plements to them. This is not an instructional 
manual like Rapaport, Gill, and Schafer's Diag- 
nostic Psychological Testing; it is considerably 
more detailed in its description of fewer tests 
than is Rosenzweig and Kogan's Psychodiag- 
nosis, and it does not attempt the bibliographic 
coverage of Bell’s Projective Techniques. It 
does present a coherent theoretical structure, 
descriptions of eight specific tests, and an in- 
troduction to their use in a variety of situations. 
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Brit J Psychol, Stat Sect 1:219-20 N ’48. E. 
A(nstey). This book....should be read by every- 
body in this country who is engaged either in per- 
sonnel selection or in training students for that 
work. The language is simple and clear. Rela- 
tively few technical terms are used; and they 
are defined both in the text and in an unusually 
full index. Abundant illustrations and examples 
are given. The book will thus be found very 
readable by non-psychologists. If it finds its way 
into the hands of senior Civil Servants and busi- 
ness men responsible for the methods of select- 
ing new entrants to their respective organiza- 
tions, it should convince them that in the United 
States scientific selection is no longer an in- 
triguing possibility, but an established fact. 
Based as they are on wide experience, the recom- 
mendations in the book are eminently practical, 


and take due account of public feeling, * Th 
emphasis in this book is on achievement tests 
which are used far more extensively in the 
United States than in the United Kingdom: by 
most of the points made are of general applica 
tion. It is assumed throughout the book that 
limited-response tests are best. The advantages 
of this type of test, particularly аз regards ob- 
jectivity of scoring, are numerous and impor- 
tant; but certain types of inventive-response 
test, e.g., Arithmetic and Abstractions items, 
can be easily scored and have other advantages, 
* Chapter V presents a discussion of the de 
velopment of performance tests which will be 
of special interest to industrial psychologists 
and managers. In a book of this size a few minor 
blemishes are inevitable. The non-statistician 
will have no fault to find with Chapter III m 
“Basic Statistical Tools"; but the statistician 
will complain of certain inaccuracies. For ex- 
ample, leptokurtosis and  platykurtosis art 
wrongly defined and illustrated (p. 103); and 
a large-sample estimate of the standard error o 
r should not be used for a correlation of p 
with a sample of 42 (p. 133), since, when ris 
greater than 0.8, the sampling distribution be 
comes very skew. The Kuder-Richardson the 
oretical estimates of reliability (pp. 153-154) 
are interesting and useful; but it is argua»* 
that the reliability of a test under normal work 
ing conditions is best measured by the соте 
tion between two equivalent forms of the test 
given to the same subjects at an interval ? 
(say) a fortnight. In the section on item-andly 
sis (pp. 180-186) it is surprising that there" 
no mention either of Horst’s second rie 
(Psychometrika, 1936, pp. 229-245) veh 
(criterion) divided by r (test), or of the “a 
ardson and Adkins short-cut method (J. 2 nf 
Psych., 1938, pp. 547-552), which are anm 
the most promising. These criticisms, Ln 
are trivial when it is remembered what a bs 
of information has been packed into 4 Y? "i 
which is still small enough to be used as a P 
tical handbook. ` 247, Charli 
Ed & Psychol Meas 7:787-90 W 47: ^ 
I. Mosier. * It may be that a number of sir 
close professional association with ШП n 
author and her associates...have pre nee 
the reviewer to a favorable opinion of En awe. 
It is the reviewer's firm conviction, 10%” y 
that the effect of this association has pich в 
heighten his awareness of the need, M the 
filled by the book and his appreciation 
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limiting conditions which necessitated some of 
its apparent shortcomings. * Although directed 
toward achievement testing in the merit-system 
setting, with all of its examples drawn from 
that field, there is no area of psychological test- 
ing to which most of the principal contents of 
the book are not applicable even though the 
technical detail is sometimes greater than is 
usually applied. The book brings together basic 
principles and procedural details of test con- 
struction and analysis from widely scattered 
sources—professional journals, agency manu- 
als and the accumulated experience of workers 
in the field which is not elsewhere set down. 
Careful attention to these principles and tech- 
niques could vastly improve the quality of merit- 
system examining and, by increasing the compe- 
tence level of public servants, could proportion- 
ately increase the quality of public service. * The 
emphasis on a practical approach has resulted 
in some unfortunate omissions and distortions of 
emphasis. Thus nearly as much attention is de- 
voted to methods of filing items as to the de- 
velopment of a criterion. The fallacy of using a 
split-halves or Kuder-Richardson reliability es- 
timate for speeded tests is nowhere mentioned ; 
neither are the problems of item analysis of the 
final items in a time-limit test. Although special 
care has been taken to define, in an excellent 
glossary, the technical terms of statistics and 
psychological testing, equally esoteric terms 
from the federal personnel vocabulary are used 
without explanation. * There can be little dis- 
agreement with most of the [statistical] con- 
Pia included, although the practical applica- 
B. s of kurtosis or the standard error of a 
ndard deviation are likely to be infrequent. * 


Some readers will feel that the discussion of” 


кү should have referred to the logic ОЁ 
ai yr hypothesis and have at least acknowl- 
d eM existence of such sampling statistics 
the р de Analysis of Test Results, following 
hea rh Statistics, gives an elementary, but 
k ч ly sound introduction to the concepts 
liability? standardization, item difficulty, re- 
anal d , validity for a specified purpose, item 
ER » weighting and the practical applica- 
teal hy (inden toward the improvement of the 
an Pen though the distinction between internal 
qued ernal criteria is clearly made, the subse- 
terpret Conon of item analysis may be misin- 
Plyin Ps by the unsophisticated reader as im- 
the is at the former may be substituted for 

ater. The intimate interdependence of test 
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questions, directions, methods of administra- 
tion, and even order of items in defining the na- 
ture of the total test is hinted at, but not made 
explicit enough to impress properly the begin- 
ning student of the field. Chapter V presents a 
detailed discussion of certain types of perform- 
ance tests for the measurement of job profi- 
ciency. Since this chapter was written by dif- 
ferent authors, at a different time and originally 
for a somewhat different purpose than were the 
other chapters, it is incompletely integrated with 
the remainder of the book. The distinction be- 
tween speed and power tests, although appropri- 
ate to performance tests as well as to others, has 
far wider application. The proper relation be- 
tween reliability and validity, though briefly ac- 
knowledged, is not as clearly made here as it is 
in Chapter IV. The inexperienced reader may 
improperly infer that reliability is a desirable 
end in itself rather than a means to the real ob- 
jective of valid measurement. Similarly, the dis- 
cussion of internal weighting is not wholly con- 
sistent with, and is in general inferior to, that 
in Chapter IV. The suggestions for objectifying 
performance ratings and the detailed (some- 
times too detailed) consideration of the step-by- 
step process of constructing a performance test 
will prove helpful to many. Since the book has 
been written for practical use, concessions are 
occasionally, though rarely, made to expedi- 
ency. Procedures are given for situations in 
which something must be done, but under such 
limiting conditions that nothing scientific can 
be done. * The book may give the impression 
that the thoroughly sound techniques described 
characterize the bulk of examining by the U.S. 
Civil Service Commission. Within the review- 
er’s experience, written and performance tests 
are used for only a small proportion of the posi- 
tions for which the Commission examines, and 
the techniques of construction and analysis de- 
scribed in this training manual are as yet ap- 
plied to only a small part of these. It is hoped 
that publication of this book and its wide appli- 
cation will improve greatly the quality of ex- 
amining in the federal service, in other public 
personnel agencies, and in the broader field of 
educational and psychological examining. 

J Appl Psychol 32:426-7 Ag ^48. Fred S. 
Beers. * Adkins and her colleagues have not 
only followed the canons of scholarship admir- 
ably but have also made the techniques of meas- 
lear for intelligent laymen and rea- 


urement C d e GE AU 
ive for the specialist * it 1s 


sonably comprehens 
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the first volume of its kind with chief emphasis 
on the development of tests by and for public 
personnel agencies. Unlike most “practical” 
books this text is not superficial. Difficult, com- 
plex topics and techniques are not dodged, if 
they are necessary to an understanding of test 
development. Rather, they are faced squarely 
* Extensive tryout of the materials in training 
courses has demonstrated that persons new to 
the field of testing can learn, with the aid of 
this text, to apply the concepts and methodolo- 
gies germane to testing in the public service. 
For this reason, the volume should be invalu- 
able for federal committees and boards of ex- 
aminers functioning for departments of govern- 
ment under the policy of decentralized examin- 
ing. College teachers in the field of tests and 
measurements will find this book a valuable 
adjunct to their reference library or their list 
of collateral reading. Among others to whom it 
will be useful are college placement and testing 
services, college departments engaged in large- 
scale examining, and industrial concerns with 
well established or prospective personnel testing 
programs. The tabular and graphical materials, 
and oftentimes the text itself, should prove a 
boon even to the sophisticated technician and re- 
searcher. Although theoretical questions are 
strictly excluded, adequate discussion plus the 
necessary modus operandi of calculation are 
given for means, standard deviations, standard 
errors of differences, tetrachoric and point 
biserial correlation, and multiple regression. 
Thirty-four tables simplify the machinery of 
statistical calculation and serve as an excellent 
step-by-step process to orient and inform the 
newcomer to the field and to furnish handy tools 
for the seasoned researcher. Twenty-two figures 
supplement the tabular material. Twenty-four 
“exhibits” make clear many practical applica- 
tions of measurement and statistical methodology 
to problems of selecting personnel—trades jour- 
neymen, clerical workers, professionals, Dr. Ad- 
kins’ text should take a place among the signal 
and enduring contributions to the field, 

J Consult Psychol 12:196 My-Je '48. Lau- 
rance F. Shaffer. * In exceptionally readable 
style, it covers the problems of planning a test, 
writing items, and analyzing the test results. 
There is an excellent chapter on the elementary 
statistical techniques needed for test analysis * 
This manual will be widely useful, as a reference 
for those who construct tests, and as a textbook 
for students, 
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Occupational Psychol 23:61 Ja 49. P, E 
Vernon. This is a most useful book for amate 
constructors of psychological tests, and it i ш. 
fortunate that—at the moment—it is very dit 
ficult to obtain in this country, University de- 
partments and other psychological organizations 
should importune the book importers to pro- 
cure it * We say “amateur” test constructors, 
since it contains little which is not well knownt 
psychologists who have had experience of 
large-scale testing in the Fighting Services, the 
Civil Service, ог in industry. Indeed much of it 
is available in almost any standard book on 
mental tests and statistics, and quite elementary 
points are often described in unnecessary detail 
* But it does bring together a wealth of practical 
hints on planning, devising, trying out and 
standardizing new tests, and is throughout emi- 
nently sensible and sound. Though mainly con- 
cerned with written achievement tests....most of 
it is applicable to aptitude tests, whether written 
or performance, and to tests for children, Per- 
haps the most useful section (since we cannot 
recall any other accessible account) is that on 
the construction of practical or written trade 
tests for industrial workers. A few minor points 
may evoke disagreement among British psychol- 
ogists. There is no recognition that any methods 
of selection or assessment, other than obje 
tive tests, exist; consequently also there 1s per- 
haps an undue insistence on detailed job analy- 
ses in planning new tests. Matters which are 
barely mentioned, but which certainly Шей 
consideration, include the suitability of pis 
types of item-difficulty distributions for de 
purposes, the use of item-validity indices o d 
than tetrachoric correlation, of homogeneity in 
rection formulae in validation, of a 
variance in studying test results in di Ый 
groups, and of factorial methods in analy " 
what tests measure. But it would be e 
criticize the book for omitting these rather у 
nical issues, when it goes such а long oh 
wards setting the student on the right "Robert 

Personnel Psychol 1:259-62 su “48. f the 
M. W. Travers. * the main duse p л 
material on the preparation of written lone 
achievement is derived from a manual nin 2 
during the war by the Adjutant C UH the 
fice. It may be well to start by enumera ina 
merits of the volume. First, it is Minor "t 
relatively simple style and can be studie nd i 
reader who has had but little backer the 
statistics. The statistical sections begin W 
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discussion of simple concepts, as the frequency 
distribution and the class interval, and build up 
the reader’s background before the more com- 
plex statistical concepts are discussed. All in 
all, the chapters on statistics are the best in the 
book, and appear to have been given much more 
thought than the remainder of the material. 
Secondly, there is a wealth of useful material 
on the organization of testing programs. Most 
of this material is scattered through the book 
and represents a series of practical tips on man- 
agement. Thirdly, there is a healthy emphasis 
оп validation procedures, particularly in the 
chapter on the analysis of test results. However, 
for a book which claims in its title to be a vol- 
ume on the construction of tests, one is surprised 
to find that the weakest part of the entire volume 
is in the sections devoted to that specific topic. 
From the title of this volume one would have 
hoped that it would include a considerable 
amount of material giving good examples of 
well constructed achievement tests. However, 
this is not so; and those who would wish to see 
good examples of achievement tests should, with 
but a few exceptions, turn to other sources. Even 
where the authors are attempting to contrast 
well written test questions with poorly written 
ones, the supposedly good questions are often 
as clumsy and as inadequately worked out as 
the poor test questions. If the authors had given 
more careful thought to the selection of good 
examples, the book would have been improved 
very substantially. As it is, many of the ex- 
amples are drawn from specific fields of gov- 
ernment work and have limited illustrative 
value. The best examples of tests in the book 
are found in the section on performance tests. 
Another major weakness in the volume is the 
section on the planning of a written test, for 
the authors seem to advocate the haphazard 
and unsystematic test planning methods which 
Were characteristic of the work undertaken in 
this field 25 years ago. It is even doubtful 
whether the authors really believe in planning 
à test at all since they say that “rigid adherence 
toa detailed test outline leads to stereotypy and 
to too little emphasis on an imaginative prod- 
his (p. 29). The authors then go on to say that 
est plans should be used with * flexibility." 
no practice of making a test outline and 
+ isregarding it was current 25 years ago, 
n Were even at that time few who confessed 
i they used such haphazard procedures. It 
$ particularly unfortunate that this volume fails 
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to bring out the importance of very detailed 
planning in the development of tests. If any 
major advances have been made in the area of 
test construction during the past two decades, it 
is in the methods used in planning tests. The 
type of detailed test planning undertaken by 
Carroll L. Shartle in the War Manpower Com- 
mission, or by Ralph W. Tyler in education, 
represent considerable advances in test con- 
struction procedures. The appraisal of written 
test forms other than the multiple choice form 
is most inadequate. The essay test is discarded 
with the brief statement that it is “commonly 
considered impractical if the number is at all 
sizable.” (p. 6) It might have been better for 
the authors to have noted in this connection 
that the largest test construction and test service 
agency in the country outside of the Federal 
Government does use essay tests with large 
groups. Similarly, various forms of objective 
test questions are discussed at length and are 
then discarded without any proper indications 
of either the specific values of these forms or 
the reasons why they are not used. When the 
multiple choice item is discussed the only ad- 
vantage given to it is that it permits “the use of 
standard machine scorable answer sheets” (p. 
41). The more compelling merits of the multi- 
ple choice types of question certainly deserve 
to be stated in a work of this kind. It is not suf- 
ficient to appraise any form of test question 
solely in terms of the ease with which it is 
scored. When the above inadequacies are con- 
sidered in the light of the fact that the senior 
author of the book has undisputed competence 
in the field, one is led to the conclusion that the 
work was produced in haste to meet one of those 
unreasonable deadlines which the staffs of gov- 
ernment agencies often have to meet. Finally, 
it may be noted that 130 pages, or almost half 
the main body of the text, are devoted to sta- 
tistical procedures and problems related to tests. 
This apportionment of space to statistical prob- 
lems probably represents the personal bias of 
the authors. In general, in the test construction 
field, there are those who devote their energies 
to the improvement of test materials by the use 
of statistical techniques, and those who believe 
that the main improvements in the measure- 
ment of achievement must come through the de- 
velopment of better planning and better test con- 
struction. Many of the criticisms made in this 
review stem from the fact that the authors of 
the book seem to belong to the former school of 
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thought while the present reviewer belongs to 
the latter. 

Psychol B 45:546-8 N ’48. Dewey B. Stuit. 
According to the author "the original purpose 
of this book was to serve as a basis for training 
personnel..directly engaged in the prepara- 
tion of written or performance tests of achieve- 
ment for predicting job performance of public 
personnel...'The material in the book seems ad- 
mirably suited for such a training course and 
has actually been used for this purpose. The 
orientation of the book is public personnel ad- 
ministration; those interested in educational 
achievement testing may, therefore, not find the 
book quite as useful as would be the case if the 
book were written with their particular problems 
in mind. There is no material, for example, deal- 
ing with the question as to how one should 
attempt to measure various outcomes of instruc- 
tion. The emphasis, instead, is upon the techni- 
cal matters pertaining to the planning, construc- 
tion and analysis of achievement tests which are 
to be used in predicting success on the job. 
Chapter I is concerned with the planning of a 
written test. * the author might have stressed 
a bit more that the differences between so-called 
aptitude and achievement tests are often not very 
great. An aptitude test is really an achievement 
test but typically measures knowledge of a more 
generalized character than that which is found 
in so-called achievement tests. It would seem 
more logical to differentiate between aptitude 
and achievement tests on the basis of purpose 
rather than content. The topic of Chapter II is 
"Constructing and Compiling Written Tests." 
The discussion of how to convert ideas into 
multiple-choice test items and determining the 
kinds of problems which can be set by multiple- 
choice items is one of the best that this reviewer 
has seen. The examples of good and bad items 
illustrate the author's points very well and 
should be helpful to anyone who prepares tests 
using the multiple-choice type of item. The au- 
thor's rich experience in the construction of ob- 
jective tests is clearly demonstrated in this chap- 
ter. The sections on mechanics of recording and 
preserving items and on the compiling of a writ- 
ten test should be of special interest to all those 
connected with large-scale testing programs. 
Chapter III is entitled “Basic Statistical Tools." 
* The reader who is well versed in elementary 
statistics will not find anything particularly new 
in this chapter, but the person who wants a 
brief review of basic statistical tools and the 
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person conducting a training course for test 
constructors will find the contents of this chap- 
ter very helpful. Chapter IV, the “Analysis of 
Test Results,” is a “must” for anyone—except 
the most sophisticated—who is concerned with 
the building of tests. All of the sections in this 
chapter are treated very competently, but this 
reviewer liked especially the discussion of (1) 
the concept of a standardized test, (2) analyz- 
ing the difficulty of a test, (3) establishing the 
validity of a test, (4) weighting the parts of a 
test or of a total examination, and (5) establish- 
ing critical scores and transmuting raw scores, 
In these sections the author presents many ideas 
and suggestions which, while they may have 
been known to others, have not been set down 
so they could be used satisfactorily in training 
courses. The material is excellently organized 
and clearly presented. Chapter V, “Special Prob- 
lems in the Development of Performance Tests," 
was prepared as a unit independent of the re- 
mainder of the book. The material should be of 
interest to those who wish to prepare better 
performance tests in the trades fields and various 
vocational courses. If time and circumstances 
had permitted, it might have been better to have 
incorporated this material as a part of Chapter 
II. The glossary of terms in the appendix is very 
good and should be especially helpful in instruc- 
tional work. In the opinion of this reviewer the 
author is to be congratulated upon the work 
which she has done in the writing of this book. 
In contrast with most books on tests and meas- 
urements, this book gets down to fundamentals 
and really gives the reader some insight А 
how а test is actually built. All too frequently 
authors of books in this field have been ст 
with general descriptions of tests and vagt 
generalizations as to how they were constructe | 
This book will, therefore, be welcomed by ^ 
those who are concerned with the fundamenti 
or foundations of tests and measurements | 
contrasted with the патен pe 80у 
uently in textbooks published to date. " 
: EAR 13:49-52 Mr '48. F nim 
B. Davis. * Although the authors are well aw » 
of the fact that the contribution of an ite 
the variance of a weighted sum (such fe the 
total score of a test) is dependent in part er oe 
magnitude of the item variance, which is а at 
tion of item difficulty, this fact does not ane d 
have been related to the suggestion on uet y 
that if "items in one area are so easy yop 
will be passed by about 80 per cent of the 
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petitors, the test constructor can arrange in an 
orderly fashion to counter-balance these by more 
difficult items in another area." A word of cau- 
tion might have been appropriate at that point, 
because the suggestion would result in balanc- 
ing average item difficulty at the expense of 
reducing the relative effective weights of items 
in both areas in determining the total test score, 
Such deviations from the weights prescribed by 
the test outline could only lead to impairment in 
some degree of the test's validity, as defined by 
the outline. * Excellent suggestions for item 
writers are presented, though these do not by 
any means exhaust the topic. Apparently, the 
field is still open for a definitive treatment of 
the writing and editing of objective test items. 
The reviewer was happy to note the descrip- 
tion of the item writer's job as "essentially a 
highly creative task." The authors are appar- 
ently aware of the costly statistical treatment 
often. (and unfortunately) lavished on intrin- 
sically shoddy items. The reviewer would have 
made more explicit than the authors have (on 
р. 4I) the necessity for making sure that the 
keyed answer to a multiple-choice item is not 
only the best answer presented but is also the 
right answer, able to stand alone on its merits 
as precisely correct. The authors quite prop- 
erly refer to the need for making each individual 
item as reliable as possible. They do not, how- 
ever, mention the fact, demonstrated by Horst 
many years ago, that to maximize item reliabil- 
ity the incorrect choices in a multiple-choice 
item (where the correct answer is the most pop- 
Шаг choice and the choices represent a graded 
series of steps of knowledge of the point being 
tested) should be made of equal attractiveness 
to the subjects. Chapter III, titled “Basic Sta- 
tistical Tools," is, in the opinion of the reviewer, 
m quite up to the standards of the rest of the 
i In fact, because it seems likely that many 
the statements and concepts in this chapter 
will be subjected to critical review by statisti- 
ange this review will merely describe the chap- 
6. às а whole and cite only a few examples of 
1e inaccuracies and oversimplifications that will 
ү irritating to statisticians. * this material 
еу presented and is illustrated with some 
Monte MORAL routines. The reviewer s objec- 
к з arise from what seem to be avoidable over- 
mplifications. * In chapter IV the authors 
s considered a number of problems and pro- 
ee encountered in the analysis of test re- 
- * A good many of the points discussed in 


this chapter are familiar ground to test tech- 
nicians but have not heretofore been treated in 
references suitable for use by students of mental 
measurement. It is good to see them made ac- 
cessible in this book. One criticism that might 
be made is, perhaps, that certain procedures are 
presented rather uncritically. It is interesting to 
know what technique of item analysis is used 
at the U.S. Civil Service Commission ; it would 
be valuable to know what, if any, modifications 
in the technique could, on the basis of the ex- 
tensive experience of test-construction experts 
on the staff of the Commission, be expected to 
result in greater convenience and efficiency. In 
discussing reliability coefficients the authors 
state that the superiority of the Kuder-Richard- 
son procedures over other methods "is gen- 
erally recognized." While the reviewer would 
insist on the value and convenience of the vari- 
ous Kuder-Richardson formulas, he objects to 
so sweeping a generalization. When suitable 
validity coefficients are available for a test, the 
reliability coefficient is useful mainly for re- 
search purposes in revising the test or the bat- 
tery of tests of which it is a part. The reviewer 
believes that for research purposes, with the 
short, relatively homogeneous tests that will 
come more and more to be used in combination 
to measure a good many slightly correlated hu- 
man traits, the product-moment correlation co- 
efficient between two parallel forms carefully 
constructed to measure similar functions has a 
good deal to recommend it as a measure of re- 
liability. Such a reliability coefficient is easy to , 
compute by the sums or differences formulas 
and, of great consequence for research pur- 
poses, it is subject to precise tests of significance. 
Tt is as economical to compute as is a Kuder- 
Richardson coefficient obtained by one of the 
more precise formulas and does not require as- 
sumptions as untenable as those demanded by 
the use of the simpler Kuder-Richardson formu- 
las. An excellent feature of chapter IV is the 
emphasis on the relationship of test validity and 
test reliability. Very few psychologists realize 
how useful carefully constructed short (and 
therefore relatively unreliable) tests can be, es- 
pecially when they are used in combination to 


yield weighted composite scores. The sooner we 
can get test constructors to realize the absurd 


inefficiency of using tests with reliability со- 
efficients approximating .90 to .95 when their 
validity coefficients are of the order of 30 to 
до, the quicker progress will be made in laying 
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the basis for practical measures of value in edu- text for personnel staff training. To the М 
cational and vocational guidance. The final chap- viewer, the book is fundamentally concerned 
ter in the book presents a remarkably complete with values rather than figures. The author te. 
discussion of the problems encountered in de- peatedly and successfully drives home the im- 
veloping performance tests. This chapter will portance of validity and practicality in civil 
be of special interest to psychologists in industry service testing. While statistical methods are 
and in public-service agencies. Reading this described in considerable detail, they are recom- 
book may make many people realize for the first mended only as a means to an end * In the realm 
time what a complicated and specialized task it of human relations an amazing vista of de. 
is to construct examinations properly. It will — velopments appears just ahead of us. While the 
be encouraging to laymen to realize that we author and her collaborators deal with only one 
have psychologists in Federal agencies, espe- segment of these developments, they have con- 
cially in the U.S. Civil Service Commission, tributed appreciably in that one segment—the 
who are abreast of developments in psycho- forecasting of job success on the basis of meas- 
metrics. The reviewer hopes that the technical ured human skills and abilities. 

staff of the U.S. Civil Service Commission, [B23] 

which has been cut to the point where there is АНЫ Eur Handbook of Бел ОИ, 


actually no group of item writers and test con- and Records. Research Report No. 16. New York: 
structors available to write the examinations American Management Association, 1949. Pp. 227. Pa- 


that have to be constructed to select employees Рег. $3.50. * 

for the Federal service, will be built up to ade- Personnel Adm 14:77-8 Л '51. Robert W. 
quate size. Readers of Psychometrika should Pearson. * serves as a guide for developing and 
perhaps constitute a nucleus of support for pro- designing...forms. Publications of this type are 
viding sufficient personnel to permit the sci- all too rare. A great amount of research and 
entific preparation of civil service examina- analysis has gone into the compiling of this 
tions. Handbook and the method of presentation is an 
Pub Personnel R 9:101-2 Ap '48. Henry Е. excellent one. This presentation makes it pos- 
Hubbard. * Here assembled for the first time — sible to adequately cover the forms of each ma- 
is a substantial body of communicable knowl- jor activity in the field of personnel administra- 
edge on employment testing for the public serv- tion. Some 83 companies and individuals par- 
ice. * a much-needed volume which fills most ticipated in this study of personnel forms and 
adequately a previous gap in the modern library records. There are 137 illustrations of forms 
of personnel management. * The scope..is well used by various of the participating companies 
‚ Stated in the subtitle * the contents have been and many of the forms are analyzed. Therefore 
logically organized * In an unusually good pref- this is truly a “Handbook” and is an excellent 
ace, the-author makes no apology for the in- guide for those interested in improving 4? 
clusion of technical material in the text, but creating personnel forms. The bringing WEE 
comes quickly to grips with the problems con- of so many different forms not only gives ins 
fronting those who would improve written and user a wide range to study but also ШШ 
performance tests. * Adkins has done a superior the need for a great deal of research. Here si 
job in pulling together from many sources quot- opportunity is afforded to see how the SHE 
able writings, and in editing them into a well- companies have handled their personnel jo : 
knit and authoritative volume * A glossary of problems. * Of particular interest is the pu 
technical terms used in statistical analysis of test ^ ment of application forms. The application M 
results will help guide the layman through what perhaps more than any other personne val 
may seem in a few spots like jungles of profes- сап stand critical review since there is, 1 Malt 
„sional jargon. * Not recommended for light instances, much room for improvement. tir 
. summer reading, the book is sound, scholarly, such forms are quite formidable and reques is 
and scientific. Indeed, considering the technical formation for which there is no apparent А Й 
nature of the subject matter, the presentation and which has no bearing upon the eae 15 
can be called clear and comprehensive. The re- qualified employees. There seems to be a i the 
viewer suggests по mere once-over-lightly read- tance to change these forms either weenie 
ing, but a thorough study which should include questions are traditional or because long alid- 

re-reading and discussion. In brief, it isan ideal has endowed them with an appearance 0! Y 
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ity. The Handbook undertakes a thorough analy- 

sis of the forms, their purposes, and use. Vari- 
ous types of application forms, such as prelimi- 
nary applications, registration-type applications, 
selective applications, and weighted applica- 
tions, are discussed. There is a guide which may 
be used in considering the items to include on 
an application form and a check list. of ques- 
tions which have been culled from the hundreds 
of applications reviewed. Careful study of this 
chapter is recommended for those interested in 
improving and simplifying their application 
forms, There is some discussion of the physical 
characteristics of the forms which will perhaps 
be of assistance in the preliminary planning of 
anew or revised personnel form. The actual de- 
sign and format of many of the forms can be 
improved upon but it is mentioned that they 
are not offered as examples of the best in de- 
sign but rather as subjects for analysis and 
study. In fact the Preface contains a statement 
stressing the critical use of the Handbook as 
there is danger that some will select an attrac- 
tive form from each of the groups and expect 
them to mesh together effectively. Format alone 
does not prove the efficiency of a form. 

Personnel J 30:31-2 My ’51. E. Fassberg. * 
The chapter on application forms is particularly 
helpful. * Miss Ahern has several helpful sug- 
gestions for getting more out of reference forms, 
both as to the proportion of replies received, and 
the validity of the information. * provides a 
broad-gauge frame of reference within which 
пува may examine the functions of the 
Ni illustrated and pre-test new forms they 
nay draft for their own companies. Every per- 
sonnel executive will find this a convenient ref- 
erence book. 
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J Consult Psychol 15:354 Ag "51. Laurance 
F. Shaffer. * useful handbook * The main part 
..is an annotated inventory of techniques for 
measuring abilities, aptitudes, achievement, in- 
terests, and personality. There is a good chapter 
on the reporting of psycliological examinations 
to meet the needs of rehabilitation counselors. 
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Am J Orthopsychiatry 20:213-4 Ja '50. 
Adolf С. Woltmann. This is one of the first 
comprehensive books published on projective 
techniques and gratitude is rightfully expressed 
to author and publisher for having filled a wide 
gap. * The final chapter presents a good dis- 
cussion on criteria for adequacy, criticisms, and 
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research needs in the field of projective teh. 
niques. Both American and European studies 
alphabetically arranged in comprehensive bib 
liographies, are covered in each chapter, The 
bibliography on the Rorschach Test alone саг. 
ries 798 entries. Wherever possible, the subject 
matter is tabulated for quick orientation ani 
comparison. Rorschach workers will welcome 
the tabulations on pages 84 to 97 which give the 
German, French and English scoring symbol 
together with brief descriptions. Other tabul- 
tions cover handwriting analysis and elements 
in the analysis of drawing and painting together 
with their clinical significance. The grouping of 
Rorschach studies according to the topics in- 
vestigated constitutes another commendable 
feature. The description of the Thematic Apper- 
ception Test, its modifications and summarized 
applications to diverse clinical groups, form 
another clarifying part * Although the use of 
plastic creative art and of puppet shows as pro 
jective methods is not mentioned, and several 
names have been misspelled, it is a pleasure to 
recommend this book which fills a great need, 
serves as a reference source, and stands out by 
its comprehensiveness. Е 
Am J Psychiatry 106:796 Ар ’50. Paul Ё 
Meehl. This book attempts to bea general survey 
of projective techniques, a summarization of their 
literature, and a text usable as an UN 
manual" for students learning the (m 
Whether these aims are compatible can be P 
puted, but within the limits of such a d * 
fairly good job has been done. * It must 3 п 
riously doubted whether the introductory n 
idea has been achieved, since no one Pii 
methods could be satisfactorily studied t 
this book. The summary of literature | nds. 
fire, densely packed, and not particular tie 
cal. Habits of methodological sophisti ‘he 
would not be demanded or engendered "i 
student by studying Bell's reviews. For Sale à 
in studies failing to control the age i differ 
stress is put upon this fact. As is usta’, se 
ences and trends are noted without ect à 
cern for overlap and for the practica tite 
a given trend. In a few cases did S 
ture is missing from a context where hach 16 
be expected: e.g., in treating of Rors¢ on di 
liability, the negative findings of Ро ost 
tortion of protocol are cited, but not t Lm 
results of Coffin. However, for the bt dds 
coverage is good and there is not any 


mane. 
re 
play of bias in any direction. The theo 
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section is brief and quite disappointing, con- 
sisting of cliches such as “personality is a dy- 
namic process” and an enunciation of basic 
theorems (sic!) such as “personality struc- 
ture, as well as the influence of the field....re- 
veals itself in the behavior of the individual since 
behavior is functional.” On the other hand, the 
general critical section and the proposals for re- 
search are good and if taken seriously should 
help raise the level of scientific work in the field. 
For teaching purposes, it would haye helped to 
present some case material so that the kind of 
clinical information actually contributed by each 
device, and the part it could play in concrete 
clinical decisions, might be clarified. On the 
whole, the book is a contribution. It could be 
used as a general text in a survey course where 
students are to be exposed to all the major 
methods but expected to really study only a few, 
for which other manuals would be needed. It can 
be recommended to psychiatrists-in-training 
who want to learn more about the psychologist's 
tools without spending much time at it; and it 
isa handy reference book for any psychologist in 
the clinical field, even if the experts on single 
techniques may find things to complain of. 

Am J Psychol 63:134-6 Ja ’50. Pauline 
Snedden Sears. Projective techniques attain in 
this thoughtful volume a respectable adoles- 
cence, * Since....[the author] is interested in 
appraising the present status of all such tech- 
niques, rather than in propounding the virtues 
of one, the book gains in stature and conserva- 
tive judgment. It must be admitted that some- 
thing is lost in color and excitement. The true 
Protagonist of a certain projective device is 
facilitated in describing it by his faith and per- 
sonal experience ; although he has been at times 
not wholly restrained by scientific caution. Dr. 
Bell has undertaken the monumental task of 
collating and selecting from all, and the result 
15 an extremely valuable contribution. * pro- 
Vides a useful brief manual for the clinician and 
for the research worker, and for graduate stu- 
а а sound start in the field. There аге excel- 
ent bibliographies for each topic * This general 
and inclusive work is particularly valuable for 
Ned perspective in a field which seriously 
t Hs orientation. Tt gives a useful impetus to 
bie ^ еа of a projective approach with varying 
e but similar problems. Whether this idea 
tha not be better elaborated in one place, rather 
is ^ лац through the twenty-two chapters, 

air question. That might possibly have 
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brought out more clearly the want of basic 
theory, which is the main deficit in this field to- 
day. When an attractive method for sampling 
the “whole” personality is first adopted and 
then rationalized by reference to existing classi- 
ficatory principles (e.g. an outmoded Kraepelin- 
ian system) the efforts at theory later made 
are often sad and usually of far too limited ap- 
plicability. A few methods, including the TAT 
and some of Wolff’s procedures, have devel- 
oped more logically from broad theory to 
specific testing, a procedure more fruitful for re- 
searching upon personality. Bell does not under- 
take to differentiate between fragmentary hy- 
potheses and systematic orienting concepts as 
bases for a given method. If the scientific study 
of personality is to progress, it seems essential 
that more be required of a method than that 
“Gt works” fairly well in detecting a patient be- 
longing to an ambiguous and discredited psychi- 
atric classification. It should also discriminate 
among the normal by way of basic variables of 
personality. Few methods meet this require- 
ment. For clinicians who wish to select and use 
one of the projective procedures, the book is 
probably ideal, with its division into groups of 
techniques and all material relevant to a single 
technique gathered together in a separate chap- 
ter. For the use of graduate students who are to 
be introduced to the study of personality, how- 
ever, a more critical approach to interpretation 
of present results is needed, and more emphasis 
upon underlying theory. The projective ap- 
proach is undoubtedly sound, but a welter of 
nomenclature, without specified basic assump- 
tions or constructs, is not helpful in discovering 
new principles of development and structuring 
in personality. 

B Menninger Clinic 14:183 $ ^50. Robert К. 
Holt. Clinical psychologists now have for the 
first time a broad survey of the projective tech- 
niques. Bell has digested almost everything that 
has been written about projective tests of any 
significance. In a number of handy tables, he 
has summarized in easily surveyable form the 
principal methods of interpreting most of the 
tests and whatever diagnostic indications have 
been reported for each. The book will probably 
stimulate both practice and research with pro- 
jective methods. 

Brit J Psychol, Gen Sect 41:198 D "50. There 
will presumably be many compendia dealing 
with projective techniques. This one takes the 
story up to 1948, and with the help of full bib- 
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liographies provides a very comprehensive sur- 
vey of the field. The bibliography on the Ror- 
schach technique alone includes 798 items. * The 
book not only describes the principles of the 
tests but gives an introduction to their use, ade- 
quate at least for elementary purposes. No very 
detailed critical survey is offered, but the limi- 
tations as well as the usefulness of the tests are 
recognized, and the blend of serious apprecia- 
tion with reasonable caution makes the book a 
good introduction to the subject for students. 

Int J Psycho-Analysis 30:209—10 pt 3 '49. 
Joseph Sandler. * a useful contribution * [ Beli] 
interprets the term projection in a much wider 
sense than Freud, and is thus able to assemble 
the most diverse collection of methods in one 
volume. * The book provides a fairly objective 
résumé of the nature, use, and present status of 
the techniques discussed; references are full, 
well classified, and will be of particular value to 
the research psychologist. The pages on play 
techniques (including such methods as the 
World Test) will be of particular interest to the 
psycho-analytic worker. These methods, unlike 
the majority of projective techniques, may be 
applied in the actual therapeutic situation, and 
their use as diagnostic instruments goes hand 
in hand with their therapeutic use. * Unfortu- 
nately each projective method seems to have its 
own peculiar set of underlying concepts, and the 
psycho-analyst may find it rather difficult to 
translate the results of many of the projective 
methods into his own language. As a compen- 
dium of projective methods, however, Bell's 
work is most valuable. It is excellently written, 
full of charts and tables, and should find its place 
as a reference work. * 

J Abn & Social Psychol 44:556-8 О "49. 
Magda B. Arnold. * the purpose of this volume 
is fivefold: (1) to present a comprehensive re- 
view of the literature; (2) to facilitate compari- 
son of projective tests; (3) to serve as an in- 
troductory manual to projective tests; (4) to 
serve as a stimulus and help to the more exten- 
sive application of tests; and (5) to stimulate 
the invention of new projective methods. Of the 
several purposes cited, the one fulfilled best is 
the first one. The present volume is the best 
review of the literature up to date and can be 
used with profit as a handy compendium for 
clinician or academic psychologist who wants 
to explore new techniques or to refresh his 
memory with respect to some of the less known 
methods. Its value as a reference work is some- 


what lessened by the fact that in SOME cases the 
references to research studies are so brief tha 
the reader can form no adequate Concept as to 
the results reported. For instance, on page 136 
is stated that “Earl (154) evaluated the validity 
of symbolic interpretations of content (in the 
Rorschach) by securing free associations un. 
der hypnosis. He reported a high validity if ade- 
quate precautions were used in assigning sym- 
bolic significance to the responses.” What were 
the precautions and what kind of symbols were 
used? If we had been informed at least whether 
Freudian, Jungian, or some private symbolism 
had been employed, it would give us a hint as 
to what to expect ; as it is there is no other re- 
course but to look up the original study, for the 
title in the bibliography is similarly uninform- 
tive. The author’s ambition to have the present 
work serve as a test manual is a much more 
doubtful one, for it is obvious and acknowledged 
by the author that it can serve as a guide to test 
administration only for the simplest tests 
Whether the book will "facilitate the compari- 
son of projective tests” is a question which this 
reviewer does not dare to decide. There is little 
attempt to make such comparison easy by pro- 
viding a reasoned evaluation of the various 
methods, especially in regard to projective tests 
proper (Part II, Visual Stimulus Techniques). 
As to the last purpose, “to stimulate the inven- 
tion of new projective methods,” the profusion 
of techniques described will intimidate ia 
than stimulate the student in his first appro 
to the field. To provide a guide through the Lei 
it would have been helpful to make some Ей 
tempt at theoretical clarification and cono 
evaluation. As it is, a prodigious effort is Ау д 
first to assimilate and then to digest the * 
Offered. * a most adequate bibliography at 
end of each chapter * There is no A h 
compare the personality picture obtaine 5 
means of the various tests, not even the а 
schach and the TAT. Limitations of spac? i 
count for the abbreviated descriptions 0 case 
tests themselves and their rationale; 1 rad 
of the Szondi test, in particular, the comp i ible 
* 


; it almost imposs! 
way of presentation makes it a f the test 


tion O ү 
to form an adequate concep ndwriting s 


The chapter on the analysis of ha "ral 

especially welcome because it provides Г ade- 
a brief historical introduction but а V° T anch 
quate discussion of this long-neglect? is the 
of psychology. On the whole, Part Я 
most rewarding part of the book, pro 
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cause the available material was not too un- 
wieldy and could be covered adequately in a 
book of this size. As a compilation of widely 
scattered material in the whole field of personal- 
ity diagnosis by means of projective techniques, 
the book fills a real need and the author is to be 
congratulated on the painstaking way in which 
he completed his task. Considering this very 
real achievement, it is perhaps ungracious to 
harp on shortcomings, but to this reviewer, at 
least, a compilation without concise evaluation 
of theoretical implications is like a dish without 
seasoning. The author himself says: “While this 
volume will not undertake the systematic pres- 
entation of theory involved in projective tech- 
niques, which must be accomplished soon in this 
science, it will review much theoretical material 
and refer the reader to sources where more de- 
tailed discussions of theory may be found.” Thus 
there is to be some theory—but unfortunately 
the theoretical part is (both in the introduction 
and the “overview”) decidedly meager and the 
least rewarding portion of the book. According 
to the author, projective methods depend on a 
common concept of personality: personality is 
conceived as a dynamic process, but it is also an 
evolving unit, a structure ; this structure is func- 
tioning, revealing itself as behavior expressed 
in projective tests. Furthermore, personality is 
not a surface but a depth phenomenon, of which 
some traits are observable, some hidden ; infer- 
ences are possible from external observation to 
latent structure, the rules of inference to be 
furnished by Freud. The reader may well ask: 
Does the concept of “dynamic process” really 
add anything to that of a functioning structural 
unit? And is Freud really the only guide in in- 
ferring hidden from observable phenomena? 
These common concepts of personality, accord- 
ing to Bell, have five sources: Freud furnished 
the rules for inference of hidden trends; the 
Gestalt school including Lewin providing holis- 
tic analysis of motivation and perception ; clini- 
cal psychiatry described abnormal behavior and 
Personality structure; learning experiments 
added “the concepts of conditioning, of reward 
and punishment as motivating forces, and analy- 
sis of physiological processes in habit forma- 
tion.” Finally, cultural anthropology is tendered 
the inevitable bow and we are informed that 
Hy psychology of personality gained support 
Tom this valuable science in removing the mech- 
hey bias of earlier personality theories" (p. 
)- Apart from the question whether such re- 
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moval is desirable or undesirable, has the mech- 
anistic bias really been removed, and removed 
by anthropology? With such an imposing array 
of sources (the author's careful habit of enumer- 
ating throughout the book irresistibly reminds 
one of the classroom), the meager product is 
rather astonishing: a concept of personality 
combining the notions of process (which 
stresses temporal change), evolving unit (which 
emphasizes the unitary nature of personality) 
and depth phenomenon (which stresses its hid- 
den aspect). In none of these notions is there 
any recognition of the individuality of the per- 
son, or any suspicion that there may be a differ- 
ence between a process occurring in a chemical 
compound and in a person, between an evolving 
unit that is a plant and one that is a human be- 
ing, and between hidden or unobservable phe- 
nomena in animals and those in men. Passing 
from the theoretical foundations of personality 
to the author’s evaluation of projective tech- 
niques as a means of personality diagnosis (Part 
Ш, Ch. XXV. An Overview), the situation is 
no better. Professor Bell suggests several cri- 
teria by which projective methods may be evalu- 
ated, the first of which is the proposition that 
“the technique must stimulate behavior by the 
subject in which the different layers of the per- 
sonality may be manifested and, as much as pos- 
sible, distinguishable” (p. 494). This implies 
the assumption (a distinctly Freudian one) that 
there are such different and distinguishable (if 
not opposed) layers. Yet he later censures the 
practice of freely relating "the results of pro- 
jective techniques to personality theories which 
are themselves lacking in validity, but which are 
treated as being valid” (p. 498). Among the ex- 
amples of such theories he quotes those of Ror- 
schach, Jung, Sheldon. One might ask whether 
the theory of Freud has either greater or less 
validity than the theories he condemns. We are 
also told that the conclusion emphasized by 
many theorists “.... hat in projective techniques 
the individual has free self-expression...is ап 
oversimplified description of what takes place. 
By no means are all the responses 1n projective 
tests free. They include free behavior in the 
sense of its being beyond conscious control. 
They objectify this unconscious behavior so 
that it may be observed, but they also elicit 
conated behavior” (p. 494). If free behavior is 
“behavior beyond conscious control,” what is 
behavior that is not free? Surely the only sense 
in which the term “free self-expression” is used 
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of projective tests is “behavior not prescribed 
or controlled by the stimulus situation,” there- 
fore expressive of a person’s own motives, both 
conscious and unconscious. To contrast "free" 
behavior which is beyond conscious control, with 
"conated" behavior which is under conscious 
control, seems to imply that voluntary action is 
somehow unfree just because it can be controlled 
by the person, which surely is a contradiction 
in terms. After listing eleven criticisms of pro- 
jective methods, some of which are directed 
toward the use of such methods, others toward 
their abuses by some workers, Professor Bell 
enumerates seven major areas of research in 
this field which require the attention of scien- 
tists. Of these, the second area seems to be the 
most urgent to the present reviewer who is, at 
least in this respect, in complete agreement with 
the author: continuing research into the nature 
of the personality for the validation of present 
theories and for the advancing and validation of 
new theories. 

J Am Med Assn 141:568 О 22°49. * is some- 
what more detailed than is necessary for an in- 
troduction to the methods, but it is undoubtedly 
the best source of finding out "what all these 
technics are about." The hasty reader can find 
the gist of the material in the chapter summaries 
and then decide whether to read the entire chap- 
ter. Dr. Bell has organized his material very 
well and has an excellent expository style. Al- 
though he points out that projective technics 
are of little value except in the hands of the 
skilled examiner, he believes that they “аге the 
most promising for speedy diagnosis of the 
total personality. With all their present deficien- 
cies—and the author has taken pains to point 
them all out—they have established for them- 
selves a secure position in research and clinical 
study." ' 

J Consult Psychol 13:64 F "49. Laurance Е. 
Shaffer. Teachers of clinical psychology will 
greet this text with unrestrained and well justi- 
fied enthusiasm. It is an answer to their need 
for a comprehensive, clear, and sound book on 
projective methods, and a better answer to this 
need than might have been expected at the 
present time. Bell covers all types of projective 
diagnosis under a fourfold classification : word- 
association. (including incomplete sentences), 
visual stimulus techniques (Rorschach, TAT, 
and others), expressive movement, and play and 
drama. The administration and elements of 
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scoring and interpretation of each technique аге 
described in sufficient detail to Serve as an ip. 
troductory manual. As the most highly devel. 
oped methods, the Rorschach (127 Pages, 798 
references) and the TAT (46 pages, 91 refer. 
ences) receive the most extended description, 
Less well known devices are represented, hoy. 
ever, including the Szondi, Mira Myokinetic, 
and Bender Gestalt. An almost inevitable short. 
coming, that many interpretations are included 
without strict evidence for their validity, is more 
the fault of the field than of the author. There i 
a clear awareness of needs for further research, 
evident both in the sections on the validity of 
each test and in the generally cautious tone of 
the discussions. Many aspects of the volume 
make it a valuable reference for experienced 
workers as well as for students—the descriptions 
of several techniques hitherto available only in 
journals; the Rorschach tables comparing Ger- 
man, French and English scoring, and sum- 
marizing the scoring of normal details by Klop- 
fer, Beck and Hertz—to cite only a few. 
Psychoanalytic О 19:115-7 Ja ’50. William 
G. Barrett. * designed for advanced students of 
human personality and is written with exem- 
plary clarity and objectivity. The various tech- 
niques are described in sufficient detail for the 
reader to obtain an informed impression of their 
respective merits and applicability. The author's 
critical approach is generous and encouraging, 
but optimism does not substitute for keen criti- 
cal appraisal and constant awareness of the re 
quirements of scientific methodology. * In a dis- 
cussion of the nature of projection, and the “i 
of the term “projective” to describe these tec 
niques, the author carries the reader with him 
through various dilemmas of definition and ш 
age to the point where he decides to adopt t° 
term for his text “because it does des RN 
tially what takes place in the techniques an ү 
cause it has the advantage of carrying а - 
icance through usage beyond the strict ni 
ing of the term.” Although it seems that n fni 
may misunderstand the psychoanalytic Ce i 
tion of projection as an unconscious proren y 
that he expects the materials manipulate ied 
the defense mechanism also to be une x 
(cf. p. 2, “The expression of consciously їе de 
nized autobiographical material throug I 
jectives confirms the impression that more one 
psychoanalytic projection is involve This t 
cannot quarrel with his conclusions. +M 
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viewer is particularly sympathetic toward Dr. 
Bell's problems of definition and semantic prob- 
ingsas he himself vividly recalls the interminable 
discussions and the agonies of definition that 
took place at the Harvard Psychological Clinic 
in the middle thirties during the gestation of 
several of the tests described in this volume. 
The final chapter is noteworthy. The author 
enumerates "criteria for the adequacy of a pro- 
jective technique," outlines eleven justifiable 
criticisms of projective techniques and discusses 
"seven.major areas of research with projectives 
which require the attention of scientists." Limi- 
tations of space make it impossible to do justice 
to this excellent discussion. Suffice it to say that 
the points raised are of interest to all serious 
students of personality and should be particu- 
larly stimulating to the psychoanalyst. There is 
a well-chosen bibliography at the end of each 
chapter * 

Psychol В 47:89-91 Ja' 50. Arthur Weider. * 
a useful survey * Following each chapter the 
author presents a comprehensive and thorough 
bibliography. Several bibliographies contain 
references to the foreign literature as well as to 
the American journals. The most outstanding 
bibliography consists of 798 references and cov- 
ers 50 pages (these are approximately all of the 
references to the Rorschach technique for the 
past 27 years since publication of Psychodiag- 
nostiks). These features, as well as the author's 
lucid style of writing, recommend this volume as 
a text in clinical methodology for graduate stu- 
dents and as a reference book for psychologists 
and other clinicians concerned with human per- 
sonality. 

Psychosom Med 12:337-8 S-O '50. Gotthard 
Booth. Considering the need for progress in 
personality diagnosis the present volume prom- 
ises to be of great service. It fulfills its purposes 

to facilitate research into the respective merits 
of projective methods by permitting intercom- 
Parison of those procedures....and to encourage 
and help psychologists and students in the more 
pou application of projective technics." * 
he bibliography seems so all-inclusive that it 
Bo be most welcome even to the specialist. 

he text itself is very readable and includes a 
us deal of stimulating psychological informa- 
ев from nonmedical sources. Although the au- 

От 15 convinced of the great potentialities of 
Ka jective methods the discussion is consistently 

"са! The concluding “Overview” condenses 


in twelve pages the current problems of pro- 
jective personality testing. The author draws 
particular attention to the totally neglected inter- 
personal and personal aspects: of the testing 
methods. * The author’s demand for “calibration 
of the examiner" (р. 496) is as justified as it is 
unlikely to be fulfilled on a large scale. This re- 
viewer knows only one case in the literature: 
Roemer’s publication of Rorschach’s own Ror- 
schach record as basis for his criticism of the 
ten cards which Rorschach had selected for his 
standard series. (Z. Psychother. Sonderh, 10: 
310-70, 1938.) It is hardly coincidence that the 
only projective method applicable without con- 
sent of the tested person has been so far the 
“stepsister” (p. 291) of academic psychology 
and psychiatry. Graphology is adequately dis- 
cussed in this volume. In discussing the semantic 
handicap of personality descriptions (p. 497) 
the author comes out in favor of inventing new 
words (p. 501) because “some of our most 
common psychological terms are unsuitable 
for present-day personality descriptions.” But 
would not the neologisms of the groping psychol- 
ogists be as liable as those of the blocked schizo- 
phrenic of leading into a vicious circle of cumu- 
lative misunderstanding? This reviewer feels 
that the present-day confusion of psychological 
language is not likely to be remedied by better 
abstractions, but by more concreteness of ex- 
perimentation (e.g., the rationale of the Szondi 
method) and recognition of the multidimensional 
structure of personality (e.g., the variables which 
differentiate in the Rorschach test between hys- 
teria and schizophrenia are relatively independ- 
ent from those differentiating between arthritis 
and hypertension.) It may be an advantage for 
those doing research in the field of projective 
technics that the author presents only existing 
concepts without complicating matters by any 
new theories of his own. 

Rorsch Res Exch & J Proj Tech 12:255-7 no 
4°48. Edward M. L. Burchard. This is the book 
which all teachers of clinical methods and re- 
searchers in projective techniques have been 
eagerly awaiting since the rumor became cur- 
rent two years ago that it was in preparation. 
Here, for the first time, the student may find 
between two covers а description and evaluation 
of the entire field of projective techniques. 
Others...have written more-or-less exhaustive 
er a single technique or on à 


treatises on eith n 
small battery of tests, but Bell is the first to 
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attempt the staggering task of accumulating 
them all in a single volume. * АП available pub- 
lished research on each technique is cited, and 
the author constructs detailed and ingenious ta- 
bles to bring together related findings from di- 
verse sources. * The mere listing of the types of 
tests considered impresses this reviewer anew 
with the magnitude of the task attempted. How 
successfully has it been completed in the light 
of its five stated aims? The answer..is: very 
successfully with respect to all except aim num- 
ber three (provision of a manual for adminis- 
tration, recording, and interpreting); and to 
carry out that aim would require not one—but 
several—volumes of this size, and probably not 
one—but several—authors. The very nature of 
projective techniques, with their emphasis on 
dynamic interrelationships and their holistic ap- 
proach to personality, makes their interpretation 
difficult in print and impossible adequately to 
summarize. Every teacher of projective meth- 
ods knows the snail's pace with which students 
proceed from scoring through single factor 
identification to total personality interpretation. 
This final step can be achieved only through re- 
peated supervised’ practice, and in print can be 
approached only by multiple examples. Bell has 
conscientiously stated the interpretive principles 
for each test, but this reviewer does not believe 
that anyone could (or should attempt to) inter- 
pret a Rorschach, TAT, or Rosenzweig P-F 
protocol or a sample of handwriting or finger 
painting on the basis of material in this book 
alone. The rules are clearly stated, but interpre- 
tation requires more than rules—something 
which the author recognizes; thus his failure in 
this aim is a failure only in the sense that he has 
attempted the impossible. The very great value 
of this book resides primarily in its exhaustive 
review of the literature through 1947. Complete 
bibliographies are given after each chapter (that 
on the Rorschach includes 798 titles; on the 
ТАТ, от; on handwriting, 137 ; on word-associ- 
ation, 80; to cite a representative sample) and, 
within chapters, research studies on various con- 
ditions have been grouped in a very convenient 
manner. For the research psychologist this book 
is invaluable; for the beginning student it pro- 
vides the best orientation to the field ; and it un- 
doubtedly will be widely adopted as a textbook 
for introductory courses in projective tech- 
niques. * the publishers are to be congratulated 
on careful editing, a handsome job of book- 
making, and a sensible price for a volume which 
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unquestionably would have a large sale even 
at twice the price. 
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Personnel J 29:390 Mr ' 51. Hyman Brandt,* 
Benge has organized the basic fields of measure- 
ment rather well. It is the manner in which he 
measures interest, intelligence and the like, as 
well as the conclusions that he draws from rather 
unreliable and somewhat invalid measuring tools 
that leave much to be desired. The following 
comments with respect to the measures used by 
Benge are appropriate: (т) Interests—Cover- 
age of an interest field by the use of ten items 
(obviously labeled) is inadequate. Many people 
would have difficulty understanding or inter 
preting some of the terms he uses in his 
(2) Intelligence—Benge uses timed tests but 
permits an individual to take them without 
supervision, Lack of supervision, permitting 
laxity in time allocation, can render the rest! 
obtained from any timed test invalid, The choice 
of the content for the memory, voca vit 
reasoning and space perceptions tests 1$ 4 
poor and would be disapproved by most e | 
structors. (3) Mechanical Intelligence— E 
Benge leaves the realm of aptitude and «ШШ 
trates on information and p mee 
specific mechanical areas. Again 
aen of ten items is employed. ше 
scoring does not take into account the fred к 
and skill with which the individual perta pe 
tasks he reports as having done. (4) 50 strats 
velopment—In this area Benge de 
several inconsistencies. In a temperamen dud 
of 140 items, Benge insists that the be 0 
select 35 (no more—no less) even if thers 
solicit the assistance and advice of и r 
make up his mind. In contrast to this, d 
sonal Traits” scale of 120 items adl " 
“pin-pointed,” probing method for o + the 42 
replies with double checks on 10 out 0 expeti 
I would very much like to see some rove il 
mentation with this probing method to s ollow 
value for more accurate self-reporting. 
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ing this, Benge goes right back, on a "leadership 
qualifications" scale to the ludicrous by suggest- 
ing that an individual who is not in a position to 
evaluate the items realistically, do so by indi- 
cating what he thinks he would do if he were in 
such a position. The second section of Mr. 
Benge's book is devoted to the facts concerning 
jobs and career fields. It is an excellently or- 
ganized digest of the material found in the De- 
partment of Labor's Occupational Outlook 
Handbook. He also describes and illustrates a 
very intriguing job analysis form. The third sec- 
tion of the book is a short but pungent summary 
of how to get a job and contains pointers on how 
to improve oneself in order to get ahead on the 
job. The last two sections are in direct contrast 
to the first part in which Benge gives no evi- 
dence as to the reliability and validity of his 
short, unsupervised tests whose scoring keys 
are obvious and whose norms are unspecified. 
It cannot be too strongly stressed that compre- 
hensive measures, administered under proper 
supervision using appropriate testing facilities, 
are the prerequisites for good diagnosis and pre- 
diction. Only this way can we hope to obtain 
valid and reliable estimates of human abilities 
and behavior. 
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ЖВЕММЕТТ, GEORGE pm Di MUN Еотн M. 
А Summary of Clerical Tests. New York: Psycho- 
logical Corporation, 1949. Pp. vii, 122. Paper. $1.25. * 
(PA 23: 3926) 

J Appl Psychol 34:214-6 Je ’50. Donald E. 
Super. * There are two distinct parts: a discus- 
sion of the development and use of tests of cleri- 
cal ability, and a series of brief descriptions of 
specific tests, An historical sketch of the growth 
of clerical occupations and survey of develop- 
ments in clerical testing from 1912 until World 
War II are followed by a review of the types of 
items used in clerical tests, material which could 
be very useful to persons seeking ideas for cleri- 
cal test construction. Two chapters are devoted 
to job descriptions of clerical occupations and 
to reviews of studies in the selection of workers 
for these occupations: these are helpful as an 
Overview of what has been tried, but they im- 
Press this reviewer as being insufficiently analyt- 
eal and integrative. That some attempt was 
ye along these lines is illustrated by the au- 

Ors’ noting of the fact that Hay's Pennsylvania 
м. norms for the Minnesota Clerical Test are 
CT similar to those compiled in the USES work 

Stead, Shartle, and others. But in general the 
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criteria of success are not carefully examined 
(e.g., on page 26, Thorndike’s use of earnings 
at ages 20-22, when beginning workers have 
not yet had a chance to prove their worth and 
to earn accordingly) and sometimes they are 
not even described (e.g., Oberheim's "library- 
performance" criterion on page 24) : a serious 
defect in view of the recent emphasis on the 
need to validate the criterion. Sometimes results 
which might mystify or even disillusion the un- 
sophisticated are not discussed, as in the case of 
the perfectly logical megative correlations be- 
tween Otis scores and salary cited in Table V, 
p. 28. Chapter VI is a useful discussion of who 
should be tested, when, and by whom: content 
often omitted in the more academic treatises, but 
especially necessary in those read by laymen. 
The test summaries..take up 45 of the 115 
pages of text * These descriptions range in 
length from one-half page to about one and one- 
half pages for each of 32 tests. The descriptive 
material is more adequate than the evaluative, 
for while the authors have given as their objec- 
tive the inclusion of *a maximum of objective 
information regarding....reliability and validity,” 
this information ranges from “No studies re- 
ported,” through “Correlation coefficient of 63 
between speed scores and supervisor's ratings," 
to somewhat more detailed data not exceeding 
seven lines. The analytical comments sometimes 
do something to remedy this paucity of detailed 
objective evidence in the test summaries, as 
when it is stated "The manual does not define 
the superior group as to number of cases or num- 
ber of steps in the rating scale" (p. 99), or when 
the authors say (of one of Bennett's clerical 
tests), “It needs further study to show its use- 
fulness in differential placement work” (p. 88), 
or “This is another among the recently published 
tests (in this case not Bennett’s) which include 
some normative data but little except face va- 
lidity or item types known to be valid to indicate 
its areas of usefulness” (p. 99). But these com- 
ments are brief, and often nonevaluative, For 
example, concerning one test which undeserv- 
edly (according to the authors’ own criteria) re- 
ceives a full page of description, the authors 
write simply that it is an American revision of 
an English test, that the author points out that 
good secretaries do well on the whole test 
whereas routine office workers do poorly on 
certain subtests, and that scores increase with 
schooling and are probably related to intelli- 
gence. They do not state that the American vali- 
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dation of this test is extremely limited (only 3 
of 5 references have any such data) and that the 
mentioned lack of occupational norms makes it 
useless in vocational counseling or selection. 
Thirty-two generally available tests are de- 
Scribed in the manner just discussed; 8 tests, 
the use of which is restricted, and another 34 
tests which are no longer available or virtually 
unused, are also described more briefly. While 
it is admittedly difficult to categorize tests in 
this manner to the satisfaction of all readers, this 
reviewer is inclined to believe that Bennett and 
Cruikshank would have rendered a greater serv- 
ice to personnel psychology if they had been 
more analytical and evaluative in the discussion 
of each test, or if they had applied more rigorous 
standards in deciding which tests should be 
treated. One test publisher has told the reviewer 
in conversation that the mere mention of a test 
in a book on tests increases the sale of that test : 
if this is so, then a number of tests which have 
already been tried and found wanting are likely 
to receive a new lease on life from this publica- 
tion. It might have been a greater service to 
personnel psychology and vocational guidance 
if Bennett and Cruikshank had given more space 
to the tests which the evidence shows to be most 
worth using, and had relegated to a small-type 
appendix all of the less promising tests which 
needed to be discussed for historical reasons or 
for completeness of coverage. * Personnel psy- 
chologists will find this summary a time-saving 
survey of the field, a very valuable source of 
leads to studies they should be familiar with, and 
a helpful reminder of what is available in the 
way of tests; they will also find that the test 
summaries do not enable them to make final 
judgments about tests, but that they tell enough 
to indicate what is worth looking into more in- 
tensively both in the literature and in their own 
research. Vocational counselors will also find 
the survey and the leads helpful, but will need 
to go to Buros’ yearbook and to the more in- 
tensive treatises of specific tests found in some 
texts. Personnel managers and other executives, 
who may have had a little training in measure- 
ment but who are not psychologists, are likely 
to be confused by the large number of briefly 
described and sketchily evaluated tests. As this 
last is one of the largest probable consumer 
groups, and also one which is likely to make least 
use of the literature to which this booklet could 
lead them, this limitation becomes even more 


important. Bennett and Cruikshank are not the 
first authors who have, because of the very na 
ture of their field, written for too heterogeneous 
an audience; but it would be interesting to see 
what they would produce, if they wrote two 
versions of the booklet, one for technicians and 
counselors highly trained in testing, and one for 
personnel workers and counselors with at most 
a course or two in measurement. In closing, the 
reviewer, whose eye tends to be somewhat 
jaundiced when reading test discussions by test 
authors and publishers (timeo Danaos, et dona 
ferentes), would like to point out that Bennett 
and Cruikshank have conscientiously treated 
their own tests as objectively as those of other 
authors and publishers. 

J Bus Ed 25:40 Je '50. * will be of aid to 
those interested in the selection, evaluation, and 
training of clerical employees. It is far more 
complete than anything else available at present. 

Occupations 24:72, 74 O ’50. Roy М. Ander- 
son. * This is an excellent summary of an im- 
portant area of aptitude and ability testing and 
should be read by those who use clerical tests 
for vocational selection or vocational guidance. 

Personnel Psychol 3:228-9 su '50. W. J. E. 
Crissy. * a much-needed and well-prepared ref- 
erence work that should be of tremendous value 
to all concerned with the screening, selection 
and upgrading of clerical workers. It should be 
equally helpful to those concerned with the vo- 
cational guidance of high school and business 
school pupils who are preparing for clerical 
occupations. “This report....is an attempt (0 
bring together in a single publication pertinent 
information regarding tests used in selecting a" 
upgrading clerical workers.” This quotation 
from the Preface is borne out by the contents. 


(EA tr Attri- 
*Benz, SrANLEY С. An Investigation of a qu 
butes and Techniques of High-School Co 1 Ret 
ors. Purdue University, Division of Education? La 
erence, Studies in Higher Education [No.] t Pa- 
fayette, Ind.: the Division, October 1948. Pp. 4! 
per. $0.75. * 
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Ж*Внлтгл, C. M. Intelligence Testing 4 Kitabs 
tional Reconstruction, Bombay, India: Hin 
Ltd., 1949. Pp. v, 83, 4. Rs. 2. * (PA 23: 5304) 
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ЖВ:ев, У/пллам С. А Comparative Study. The 
Seminary Group and Four Other Groups о ntory: 
Minnesota Multiphasic Personality ДУ тег 
Washington, D.C.: Catholic University of + 5609) 
Press, 1948. Pp. xi, 107. Paper. $1.25. * (PA 22: 
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Bijou, SwNey W., Ештов. The Psychological Pro- 
gram in AAF Convalescent Hospitals. Army Air 
Forces Aviation Psychology Program Research Re- 
ports, Report No. 15. Washington, D.C.: U.S. Gov- 
ernment Printing Office, 1947. Pp. viii, 256. Out of 
print. * 

J Consult Psychol 12:59 Ja-F '48. Laurance 
F. Shaffer. For about one year, October 1944 to 
October 1945, an intensive program of clinical 
psychological service and research was carried 
out in the AAF Convalescent Hospitals. * The 
research programs of the hospitals were designed 
to invent and develop tools needed for the prac- 
tical services, and to evaluate the results of con- 
valescent treatment. The research chapters of 
the report review accomplishments in personal- 
ity and adjustment studies; interest, attitude, 
and biographical surveys; the measurement of 
disturbance of mental functioning ; and explora- 
tions in projective techniques. Even in its short 
career, the AAF convalescent program made 
а number of significant administrative and re- 
search contributions to the advancement of clini- 
cal psychology in hospitals. 

Psychol B 46:523-4 N ’49. George A. Kelley. 
* Psychological research in the convalescent 
hospitals was entirely oriented toward produc- 
ing instruments for more effective services. 
Some of the instruments, such as the Rorschach, 
the Shipley Personal Inventory, the Wechsler- 
Bellevue Scale and the Cornell Selectee Index, 
were adapted from civilian forms. Others, such 
as attitude scales specific to the military situa- 
tion, a biographical inventory and a test entitled 
Efficiency of Mental Application, were con- 
structed on the spot. Studies of validity were al- 
most wholly dependent upon psychiatric rather 
than behavioral criteria and reflect the well- 
known inadequacies of such criteria. Validity 
coefficients for the inventory type of instrument 
under these conditions ran quite high, e.g» а 
bi-serial у of .67 for the revised Shipley Personal 
Inventory, Two exploratory studies of biograph- 
ical information indicated the feasibility of using 
such items in the statistical prediction of a psy- 
chiatrist’s estimate of a patient’s prognosis. The 
authors are careful to point out, however, that 
Since the psychiatrists used some of the same 
biographical information, the criterion is, from 
One point of view, contaminated. The Rorschach 
Test and the Bender-Gestalt Test were studied 
їп relation to the available criteria without con- 
clusive results, This checks with other military 


B52 


studies of this type of instrument. It must be 
either that Dr. Rorschach's ink spots cast a 
peculiar magical spell over the psychological 
mind or that our system of statistical logic ur- 
gently needs some extensions. The Incomplete 
Sentences Test, by way of contrast, demon- 
strated substantial validity for differentiating 
groups of patients judged by psychologists to 
be psychologically unfit for duty (r — бї), al- 
though its correlation with the psychiatric cri- 
terion was considerably smaller (r = .39 and 
Дт). T 
For areview of the series, see B2. 


[B51] 
X*BrrrNGsLEA, Fren Y. The Bender-Gestalt: An 
Objective Scoring Method and Validating Data. 
Clinical Psychology Monograph No. 1. Brandon, Vt.: 
Journal of Clinical Psychology, 1948. Pp. pae г. 
$0.90. * (РА 22: 5419) (Reprinted from J Clin Psy- 
chol 4: 1-27 Ja '48. *] 

[ B52] 
XBrarm, С. W. Scorr. Measurements of Mind and 
Matter. London: Dennis Dobson Ltd., 1950. Pp. 116. 
10s. 6d. * 


Brit J Psychol, Gen Sect 42:387 N '51. B. S. 
* The book is not written from a psychological 
standpoint, and the psychological reader may 
be a little surprised to find such diverse matters 
as the Weber-Fechner law, the principles of 
Factorial Analysis and Rorschach all subsumed 
under a single chapter heading. Other sections 
of the argument, which is largely concerned with 


` the comparison of values of incommensurables, 


exhibit a comparable (and legitimate) degree of 
compression, and this may perhaps render the 
book more difficult than the author had intended. 
Nevertheless, it is a vigorous and provocative 
essay, and well calculated to add the reader to 
the number of those qualified for inclusion in 
е dedication. 
у Nature 168:136 Jl 28' 51. G. R. Noakes. * dis- 
cusses the meaning of measurement in general 
from the philosophical point of view, and shows 
the dimensional difficulties that can arise in prac- 
tice. The real argument of the book is that in 
the relatively unexplored field of “psychophys- 
ics,” which constitutes the frontier between phys- 
ics and psychology, à different approach from 
that of pure physics may be justified ; this in 
itself needs no elaboration, for anything amen- 
able to a purely physical treatment 1s surely well 
within the frontier. Nevertheless, a number of 
controversial points are raised, and the author's 
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own reflexions on them are given in a detached 
and critical manner. The book is addressed to 
the non-specialist and the general reader, the 
latter of whom may find it rather a tough prop- 
osition ; and its object is rather to give a wider 
currency to the fact that certain problems exist 
than to lay down solutions to them. 

Times Ed Sup 1855:878 N 17 '50. * Blair... 
discourses pleasantly on physical and psycholog- 
ical measurements, and though he does not get 
anywhere in particular, he shows that different 
branches of science have many problems in com- 
mon. Psychologists nowadays, by means of fac- 
tor analysis, choose the dimensions which are the 
most convenient for the full description of hu- 
man beings ; similarly the physicist often finds it 
preferable to employ dimensions other than the 
conventional Length, Mass and Time. There 
are, moreover, a number of qualities intermedi- 
ate between the physical and the psychological, 
such as the “firmness” of bitumens and cheeses, 
which are being studied by his colleagues and 
himself at Reading University. He ends by urg- 
ing different brands of scientists not to be afraid 
of overstepping their conventional boundaries 
and to cooperate in solving their mutual prob- 
lems. 
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X*BLUM, Mitton L., AND BALINSKY, BENJAMIN. 
Counseling and Psychology: Vocational Psy- 
chology and Its Relation to Educational and 
Personal Counseling. New York: Prentice-Hall, 
Inc., 1951. Pp. xxi, 586. Text edition, $5.00; trade edi- 
tion, $6.65. * (PA 25:6250) 

Personnel J 30:30 My '51. Donald A. Laird. 
Here is an account of the most approved present- 
day practices in occupational counseling which 
will help the personnel executive bring his think- 
ing up-to-date. This is more than a manual on 
how to test, though the more widely used tests 
are discussed. Scores are relegated to the back- 
ground and more emphasis given to the motives 
and problems which may lie behind test scores 
and influence job success. The goal presented 
for the counselor is to lead the individual to 
think things out for himself, rather than to give 
him direct advice. Test standings are useful for 
this self-evaluation, but not ends in themselves. 
The book is saturated with the non-directive 
interview, and should be helpful for the employ- 
ment interviewer who uses this method. (It is 
not often used in the employment office at pres- 
ent.) Two chapters on the counseling interview, 
and one on the qualifications and training of 
counselors, are especially important for per- 
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sonnel workers. Written as a text for с è 
people, personnel executives will be able to rea 
it, but the many interviewers who have not fin- 
ished high school will find it difficult in spots, 
Many portions of the book can be given good 
use, however, in training interviewers, It will 
be a valuable addition to the personnel library, 

Pub Personnel R 12:168 /1?°5т. * Personnel 
people will find much of the material directly 
useful. Chapters 4 and 5 deal with the interview, 
the most important technique in counseling, 
Three interviewing techniques are discussed and 
evaluated: the authoritarian, the nondirective, 
and the nonauthoritarian. * 


[ B54] 
*Bournitet, LORRAINE, AND Byrne, KATHARINE 
MaNN. You and Your Mental Abilities, Chicago, 
Ill.: Science Research Associates, Inc., 1948. Pp. 48. 
Paper. $0.75, * (РА 23: 5037) Instructor's Guide, Pp. 
12, Paper. $0.30, * [The booklet copyrighted in 1949 


is a reprinting of the 1948 edition.] 


[ B55] : 
ЖВоулвр, Jonn F.; Cozens, FrepertcK W.; AND 
HacMaN, E. Patricia. Tests and Measurements 
in Physical Education, Third Edition. Philadel- 
phia, Pa., and London: W. B. Saunders Co., 1949. Pp. 
xvii, 410. $4.50; 31s. * For reviews of the second edi- 
tion, see 40: B848 and 38: B316. 

[B56] [ 
XBovp, GerTRUDE. Appraising and Developing 
Reading Skills. University of Wyoming, College g 
Education, Bureau of Educational Research and Serv 
ice Monograph, No. 7. Larami, Wyo.: the Bureau 
1949. Pp. 41. Paper, lithotyped. $0.50. * 


[B57] 3 
*Bovrm, Roscoe A., AND Eaton, MERILL T. jn 
ardized Testing in the Schools of Indiana. it 
tin of the School of Education, Indiana perii 
Vol. 27, No. 1. Bloomington, Ind.: Indiana Uu y 
Bookstore, January 1951. Pp. 38. Paper. $0.75. 
25:7104) 

[B58] "T 
XBnrccs, Hersert S. Evaluating Manageri ference 
tions. Foreword by $. Avery Raube. Cony pA 
Board Reports, Studies in Personnel Policy, Board, 
New York: National Industrial Conference [Distri 
Inc, October 1951. Pp. 112. Paper. $2.00. d to edt 
bution is limited to associated companies anc Д 
cational institutions, and governmental agencies. 
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X*BnoADLEY, CHARLES V., AND BroaDLey, 
Know Your Real Abilities: Unders 
and Developing Your Aptitudes. ae 
McGraw-Hill Book Co., Inc., 1948. Рр. vil, it 2 
* (London: McGraw-Hill Publishing Co, 14 
(PA 22: 4129) im, 
Brit J Ed Psychol 18:120 Je ’48. This n n 
based upon some years of research NOR So 
son O'Connor's Human Engineering ac 
tory in U.S.A., where over 18,000 ан t 
reported as being tested yearly. It makes 
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ever, no pretence of being based upon precise 
estimates made as the result of psychological 
tests; indeed, the authors seem rather anxious 
to make it clear that they are not psychologists, 
though they have talked with others “who have 
taken aptitude tests." Some of the “known apti- 
tudes" appearing in their list will appear some- 
what strange to psychologists, but there are 
many interesting records of individual people 
who have changed their work on the results of 
their experience and of advice given. 

Nerv Child 7:346 Jl’48. Based upon the “re- 
search and aptitude tests of the Human Engi- 
neering Laboratory," professional groups are 
given leads for self-education, an attempt which 
seems to reach the height of overdoing testing. 


[ B60] 

xBrussEL, James A., AND HrrCH, KENNETH S. An 
Introduction to Rorschach Psychodiagnostics in 
Military and Civilian Psychiatry: With Color 
Illustrations of the Rorschach Cards, [Second 
Edition]. Including "A Rorschach Compendium" by 
Zygmunt A. Piotrowski. Revisions of papers from 
Psychiatric Quarterly, January 1942. tica, : 
State Hospitals Press, 1947. Pp. i, 51. Paper. * For 
latest edition, see Вбт. 


[ B61] 
BrusseL, JAMES A.; HrrcH, KENNETH S.; AND P10- 
TROWSKI, ZycMUNT A. A Rorschach Training Man- 
ш [Third Edition]. Utica, N.Y.: State Hospitals 
ress, 1950. Pp. 86. Paper. $075. * (РА 25: 4560, 
4581) For reviews, see 118. 


, [ B62] 

MES Joun N. The H-T-P Technique: А Quali- 
е and Quantitative Scoring Manual. Appen- 
um “Comments on the Analysis of Chromatic Draw- 
1089, by John T. Payne. Journal of Clinical Psychol- 
ipt Monograph Supplement No. s. Brandon, Vt.: 
ү of Clinical Psychology, October 1048. Рр. 120. 
ren $200. * (PA 23:8, 5529) [Reprinted from 

lin Psychol 5: 37-76 Ja ’49. *] 
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Wes CHARLOTTE; BUHLER, KARL; AND LEFEVER, 
Scor. ELTY, Development of the Basic Rorschach 
Standa With Manual of Directions. Rorschach 
Chantardization Studies, No. 1. Los Angeles, Calif. : 
PONE Buhler, 1948. Pp. ix, 190. Paper, mimeo- 

Phed. $4.05. * (PA 23:745) For reviews, see 110. 
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ба, СнАвготтЕ, Anp Lerever, D. Үкү. A 
teristi ach Study on the Psychological Charac- 
of A св of Alcoholics. Yale University, Laboratory 
УЧ ied Physiology, Memoirs of the Section of 
terly Dj оп Alcohol, No. 6. New Haven, Conn.: Quar- 
aper ournal of Studies on Alcohol, 1948. Pp. viii, 64. 
Studis 47 5. * (PA 22:1232) [Reprinted from Q 
120, cohol 8: 197-260 S '47. *] For reviews, see 
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mone Cuartorre; Lerever, D. WettTY; KALL- 

ment Hos. E.; AND Prax, Horace M. Develop- 
of the Basic Rorschach Score: Supple- 


mentary Monograph. Los Angeles, Calif.: Ror- 
sche Ende iden Studi Pp, iv, 7 i 
per, mimeographed. $2.50. ыа ÉL MEM 

[ B66] 
*Buner, Cuartorre; Lumry, Gavre Килу; AND 
Carrot, HELEN Sara. World-Test Standardization 
Studies, Child Care Monographs No. 4. New York: 
Child Care Publications, 1951. Pp. 81. Éa er, $2,00, * 
(Reprinted from J Child Psychiatry 2: 1-81 '51. *) 

[867] [ 
*Bunxer, М. N. What Handwriting Tells You 
About Yourself, Your Friends and Famous Peo- 
ple: A Key to Grapho-Analysis. Cleveland, Ohio: 
World Publishing Co., 1951. Pp. vii, 240. $1.95. * 

[ B68] 
*Burxe, Henry К. Personality Traits of Success- 
ful Minor Seminarians. Doctor's thesis, Washing- 
ton, D.C.: Catholic University of America Press, Inc., 
1947. Pp. viii, 65. Out of print. * (РА 22: 848) 

[ B69] 
*Buros, FaANcIs C., AND Buros, Oscar К. Express- 
ing Educational Measures as Percentile Ranks. 
Highland Park, N.J.: Gryphon Press, 1949. Pp. ii, 27. 
Paper. $0.60. * (PA 5: 2562, 24: 6137, title only) [Re- 
print of edition originally published by World ook 
Company in 1930.] 

[ B70] 
Buros, Oscar Krisen, Eprror. The Nineteen Forty 
Mental Measurements Yearbook. Highland Park, 
N.J.: Gryphon Press, 1941. Pp. xv, 674. Out of print. * 
(PA 15: 1992) 

Eug R 33:20 Ap '41. P. E. Vernon. * The 
outstanding features of these Yearbooks were 
the accuracy and comprehensiveness of the in- 
formation about each test, and the "frankly crit- 
ical" nature of the reviews. * The present vol- 
ume carries on the good work * The format, the 
printing and indices of the book could hardly 
be bettered. * A welcome feature of the volume 
is the greater attention than in previous years to 
work by English-speaking psychologists out- 
side America, and the collaboration of sixteen 
non-American (mostly British) psychologists. 
Tests published in this country probably require 
critical evaluation even more than those pub- 
lished in America, most of them being far be- 
hind American ones in technical competence. 

Harvard Ed R 11:386-9 My '41. S. Vincent 
Wilking. Among the forces which are operative 
in the field of mental measurement and whose 
aim is the improvement of published tests the 
reviewer believes that the 1940 Mental Meas- 
urements Yearbook bodes well to be a potent 
one. The job of editing a manual of this sort 15 
an ill-rewarded one and I believe that educators 
and psychologists will have much for which to 
thank Buros. * The present” yearbook supple- 
ments, but does not supplant, the ground cov- 
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ered by the 1938 edition. * There was an at- 
tempt to avoid having a reviewer appraise a 
test competing with one of his own although a 
perusal of the material will give ample indica- 
tion that this was not always possible (nor neces- 
sarily desirable). The list of reviewers includes 
many well-known names among educators and 
psychologists. An interesting element is the in- 
clusion of many British test specialists among 
the reviewers in addition to a large number of 
English tests for review. Many names well 
known in the field of measurement, however, are 
conspicuous by their absence. * Five hundred 
and three tests are reviewed, the majority quite 
adequately, but a noticeable number are merely 
noted or given cursory treatment. Some rather 
important current efforts such as the Co-opera- 
tive Reading Comprehension Test, C1, C2, are 
merely noted without comment. (This is also 
true of the rather popular Rorschach Method, 
which seems worthy of review). This factor, 
however, is surprisingly infrequent in occur- 
rence considering the magnitude of Buros’ task 
so that it should not be considered as a serious 
deterrent by the prospective user. While the 
section on research and statistical methodology 
books which appeared in the 1938 yearbook has 
been omitted, the general section entitled “Books 
and Reviews” is quite extensive. It purports 
to present a good sampling of books which per- 
tain to, or are closely related to, mental meas- 
urement. * The classification rubric “Closely 
related to mental measurement” has caused the 
inclusion of some books of a class which, in turn, 
draw attention to others of the class which are 
conspicuous by their absence. This is particularly 
true in the field of remedial reading where Har- 
rison’s book on Reading Readiness is reviewed 
while several other important books published 
throughout the past two years have been omitted. 
It might be advantageous if the yearbook re- 
turned to the more restricted classification of 
“Mental Measurement Books” so that users 
would not search futilely for books which could 
not possibly be included in the space available. * 
Many of the most important publications of the 
last two years such as Thomson’s book men- 
tioned above, Murray’s Explorations in Per- 
sonality, and the manual for the 1937 Stanford- 
Binet and Wechsler tests are covered in an en- 
tirely satisfactory manner. There appears to be 
an inordinately high degree of heterogeneity in 
the criticisms of the tests. Tests and test con- 
struction are criticized from innumerable points 
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of view—a tabulation of the factors which are 
brought up for discussion by the various review. 
ers in this volume may be illuminating. Tt must 
be pointed out, however, that no one contributor 
has attempted to cover all of the factors listed, 
They represent, rather, a composite picture of 
all the modes and points of attack used. Түр 
or Criticism. Test proper. Basic assumptions 
underlying the test content. Examples of good 
and poor items. Time limits—power versus 
speed tests, Administration technique. Clarity 
and adequacy of directions. Manual. Fallacies in 
presentation of theoretical background and sta- 
tistical explanations embodied in manual. In- 
adequate data upon which to judge the test, ie, 
on reliability, validity, and standardization. In- 
adequate reference to background experimental 
literature, Application and Use. Does it fit into 
current instructional practice? Value and prac- 
tical applicability of test results. The substantia- 
tion of claims for use. Difficulty of administra- 
tion which limits use. Format and appearance. 
Durability (in the case of vocational tests). Sto- 
tistical, Reliability: the sample, split-half or in- 
terform method, sub-test reliability. Validity: 
criteria, size of coefficient and interpretation, 
sample. Norms: discriminatory power of items, 
description of standardization population, ex 
pression of scores and scoring scheme, value | 
a profile, where given. It would seem that the 
possibility of setting up definite criteria for " 
viewing and criticism should be brought to frui 
tion. Buros, in his introduction, has Чү, 
this point and rejected it because he feels 5 
reviewers might be constrained to deal va 
those phases of a test whereof they know ШШ 
nothing. It seems, however, that if every see 
tent reviewer had a definite piece of groun A 
cover, the yearbook's value might be Di 
severalfold for an educator who, none {00 Vt 
versed in statistical peculiarities, would “4 
have strictly comparable information ant 
group of tests which he contemplated hid 
This would avoid the absurd situation W Ў 
occurred in the case of а single coe 
a widely used intelligence test where t i à 
confined his discussion to shooting holes ; " 
theoretical discussion presented in the ш d 
of the test manual. The teacher, psycholog, al 
student reading this review would have n 

an entirely distorted view of t 
test and its authors but would h 
criticism at all which was pertinent t 


е УО ich havé 
itself. In spite of the minor criticisms whi 
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been mentioned here, the 1940 yearbook is of 
great importance and value in the field of men- 
tal measurement and should attract the atten- 
tion and respect of all those who are in any way 
connected with the measurement of mental 
functioning. 
Indian J Psychol 17:196-7 S-D °42. H. P. 
Maiti. * The great zeal, the conscientious care, 
and the thoroughness with which Prof. Buros 
and his associates have worked for the success 
of the Year Book project is evident from the 
fact that each successive volume has been a dis- 
tinct improvement on the preceding one. The 
present volume has several features which make 
it vastly more valuable than the preceding pub- 
lications of the series. We can refer only to some 
of these here. The practical value of the reviews 
would rest mainly on two things: (1) On their 
representing the actual interests and viewpoints 
of possible test users, and (2) on their being 
honestly and authoritatively critical. The most 
„outstanding feature of the present Year Book is 
the systematic attempt made in these two direc- 
tions. In selecting reviewers for a particular test 
care has been taken not only to base selection on 
the basis of proved or widely testified ability and 
on freedom from bias but also to take into con- 
sideration the most important purposes for 
which the test in question is likely to be used. 
Some of the tests have also been reviewed by 
more than one reviewer and we are glad to 
note that in future this procedure of multiple re- 
view of a test will be more extensively applied. * 
The get-up of the volume is excellent and the 
classification of tests has been improved. There 
cannot be any reasonable complaint about in- 
dexing and cross-references. Generally speak- 
ing, the Year Book will help test users better 
than before in selecting between good and bad 
tests for their special purposes. Further, it will 
make authors and publishers more careful than 

efore in the matter of new tests. Prof. Buros 
үе “Тһе task of initiating, editing and pub- 
ishing a year-book which aims to be compre- 
m in „scope, unbiased in treatment and 
М 4 critical in attitude is not easy.” We agree 
Ms ш But he has done it all the same, and 
P. een I have no hesitation in making the re- 
ae that the service that Prof. Buros and his 
ek ia have been rendering to the cause of 
аі measurement is of very great value and 
: must feel grateful for it. The present volume 

ui ME a very useful book of reference and 
Sidance to test users of all types—teachers, 


principals, research workers and test techni- 
cians, It should find place in every up-to-date 
library, psychological laboratory, progressive 
school and training college. We hope that ade- 
quate financial aid would be forthcoming not 
only to keep the Year Book going but also to 
enable the promoters of the Year Book idea to 
develop their valuable service along useful di- 
rections. By way of a suggestion it may be 
pointed out that inasmuch as the value of the 
Year Book, both as an expert guidance to test 
users and as a directing influence over construc- 
tion and publication of tests in future, would 
depend principally on the quality of the reviews, 
an effort should be made to study how to secure 
able, frank and practically useful critical ap- 
praisals of the tests in future Year Books to a 
greater extent than has been achieved in the 
present work, As one goes through the pages of 
the book, one notes that the United States of 
America have received better representation 
(perhaps mainly for the War) in the lists of 
tests reviewed and in the lists of reviewers than 
other countries of the world. As the volume is 
meant as a book of reference, we hope that this 
shortcoming will be removed in future, specially 
when world conditions permit. 

Social Ed 5:631-2 D '41. Horace T. Morse. 
For the teacher bewildered by the uninterrupted 
flow of tests and crosscurrents of conflicting re- 
ports, Buros has again performed an invaluable 
service. The 1940 volume is an improvement 
over the 1938 volume, just as that one was an 
improvement over his previous test, reference, 
and book review compilations. The current vol- 
ume is improved not only in format, but the 
whole scope has been broadened. There are more 
and longer reviews, each with the title of the 
reviewer. An inspection of these titles reveals 
that the reviewers were widely selected and 
represent no one group or school of thought. A 
significant decision has been made to include 
reviews of old as well as of new tests. This is cer- 
tainly a necessary step if the editor's statement 
is true that “Despite the fact that tests have 
been on the market for 5 to 15 years, there ex- 
ists, for probably 9o per cent of the tests, a 
dearth of critical information concerning their 
reliability and validity.” (Introduction, р. I1). 
In the section devoted to the social studies, 
twenty-four tests are critically reviewed and an 
additional seven are listed without review. А 
careful reading of this section leaves the reader 
in no doubt that testing in the social studies 
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needs much real improvement. There is a gen- 
eral tendency among the reviewers to question 
the assumption made for many of these tests 
that they constitute a measure of “reasoned un- 
derstanding” and other less informational out- 
comes of instruction. For example: “....despite 
the names of the four parts of the test, it is still 
largely factual..." (p. 416) ; “....this test places 
a premium upon the pupil’s recall of a list of 
standardized facts and equally standardized re- 
lationships" (p. 424). The writer’s curiosity was 
aroused to the extent of counting the number of 
times similar statements appeared. There are 
22 such statements, including repetitions, in 
the less than 24 pages devoted to the social stud- 
ies. One competent reviewer reports on one test 
that in spite of this same weakness, it “....is 
clearly well above the average in this regard" 
(p. 425). It is thus apparent that there is still 
plenty of room for instruments which will ac- 
tually test for more than factual information, 
whatever their titles may be. In spite of this dis- 
concerting situation, there is still hopeful prog- 
ress evident. The more spectacular advances in 
test techniques have now given way to patient 
pushing of our knowledge into the unexplored 
areas little by little. The consolidation of these 
gains is a slow, painful, and undramatic process. 
Even though some test makers may be overly 
optimistic in the titles which they give to their 
tests or to sections of them, their real contribu- 
tions should not be lost sight of in a quibble 
over the titles. Wrightstone, for instance, has 
directed the attention of test users to an area 
for which tests have been sadly lacking for a 
long time, that of study skills. Criticism of Buros' 
method of compiling reviews based only on ex- 
amination of the printed tests and their manuals 
was voiced by reviewers of the 1938 yearbook. 
This is probably not as vital a weakness as 
claimed. The reviews are also definitely more 
than uncritical descriptions of the tests. A very 
helpful addition to this volume would have been 
a section analyzing and describing the current 
trends, with a critical evaluation of new devel- 
opments in test construction and usage. 


For reviews of other volumes in this series, 
see Ву, 3:788, 40:B856-8, 38:B325-6, and 
36:B45. 
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Rutgers University Press, 1949. Pp. xv, 1047. $12.50. * 
(PA 23:3523) 


Calif J Sec Ed 24:185-6 Mr ’4o, Henry) 
C(lay) L(indgren). * Although The Thiri 
Mental Measurements Yearbook cannot relie 
the test user of his professional responsibility to 
know well the purposes and limitations of the 
measuring instruments he uses or desires to Use, 
it can aid him in so many different ways that i 
is perforce a “must” for the professional book- 
shelf. In no other volume can one find listings 
of all the tests and all the books on testing which 
have appeared or have been revised since 1940, 
plus penetrating and critical reviews on most 
of them. No important test or publication lacks 
a review, and those which have merited the 
most attention from the profession are reviewed 
at length by several experts. For example, the 
Rorschach test and its manuals merit 30 pages 
of double-column type, a tribute to the interest 
and controversy aroused by this projective de 
vice. The Yearbook is truly a monumental work, 
and it is easy to see why Rutgers University re 
leased Professor Buros from his teaching loadin, 
order to permit him to devote sufficient time and 
attention to his duties as editor. * Although the 
reviews of the tests are interesting, informative 
critical, and penetrating, they are not of uniform 
quality. Some were written carefully and cor 
siderately; a few, unfortunately, give evidence 
of having been dashed off in haste. (In tribute to 
Professor Buros’ editorship, the latter are ve 
much in the minority.) Hence, the reader som 
times knows little more about a new test afte 
he has read the review than before. The thought 
has occurred to this reviewer, as doubtless it bas 
to others, that the reviews might be improved 4 
the reviewers routinely included standar ir 
which should be available in all test eo 
reliability, validity, populations on which s 
wére based, etc.—with critical comments cde 
adequacy of the tests in the light of these jd 
This has been done in a number of the 25 
with good results. For example, the pit j 
Herbert S. Conrad of the Tests of Genera or 
ucational Development * Reviews like “ics 
rad’s contain information, data, and per 
which can help.... [readers] to decide whet ў 
tests under consideration will fill their ga ў 
would be impossible in this brief review uns 
logue the many points of excellence 1n nee " 
This reviewer was impressed by the do itis 
ness of the section on personality tests, eh read 
heartily to be desired that this section ring i” 
by everyone who uses this type of pe 
strument, in the hope that the users of st! 
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might become infected with the hard-boiled 
skepticism with which the Yearbook’s contribu- 
tors regard measures of personality and adjust- 
ment. If users of such tests were to harken to 
the wisdom of these reviews, there would be an 
immediate reduction in what amounts to the 
wholesale and indiscriminate misuse of person- 
ality tests in the schools today. Indeed, any per- 
son who uses tests can gain vital information 
and valuable insight by reading the reviews 
which concern the instruments he is now using. 
If more members of the profession were to fol- 
low this procedure, the resultant critical attitude 
respecting tests currently in use should inevi- 
tably result in higher standards for tests now 
being constructed, for there is no reason why 
educators should be patient with poor construc- 
tion, inadequate norms, and extravagant claims 
which are typical, at least in part, of most tests 
published today. But the critical examination of 
currently used tests in the light of the reviews in 
the Yearbook and the subsequent making of de- 
mands on test authors and publishers are re- 
sponsibilities which the profession cannot shirk 
and must assume if there is to be any improve- 
ment in the quality of tests offered for sale in the 
future. The hard truth of it is that the publishers 
will offer for sale no better tests than the public 
will buy. It is to be hoped that the appearance of 
Buros’ volume will result in much soul-searching 
on the part of test authors and publishers, but 
there will be little progress made in the better- 
ment of tests unless the profession follows up 
this advantage. Test publishers are in business 
to sell tests, not to use them, and the initiation 
of any change for the better must come from 
the public which purchases and uses these tests. 
The Third Mental Measurements Yearbook has 
much which can be used to mold a more alert 
professional attitude in this regard. 

Cath Ed R 47:213-4 Mr’4o. Е. J. Houlahan. 
* indispensable * This work is heartily recom- 
mended to anyone who has a need for the use of 
tests or for information about them. 

Clearing House 23:565 My 749. Earl R. 
Gabler. * This monumental work....is a veritable 
gold mine of information for those who use tests 
in education, industry, psychiatry, and psychol- 
ogy. The wealth of information is organized in 
such a fashion that it will talk to you. The editor 
and his staff are to be commended for a pains- 
taking and scholarly publication that is indis- 
Pensable to workers in the field. 

Digest Neurol & Psychiatry 17:548 S 49. * 


An indispensable reference tool in the testing 
field, this book, like its predecessors, aims to 
foster optimum standards in the publication, use 
and appraisal of mental tests. It is a tremendous 
piece of work. It should do much to impress 
test users with the need for careful study of new 
tests and the values and limitations of the var- 
ious standard tests. * 

Ed 70:132 O’49. Ralph E. Pickett. It is with 
rejoicing that we greet the appearance of the 
Third Mental Measurements Year Book. Those 
of us who found the 1940 yearbook so valuable 
will be pleased that the same general plan has 
been used again. * monumental undertaking. 
Truly a “must” for all those working with edu- 
cational, psychological, and vocational tests, * 
Buros and his associates deserve, again, the high- 
est commendation. 

Ed Adm & Sup 35:316-8 49. А. S. Edwards. 
Those who know the earlier publications in this 
series need no recommendation for the present 
volume. It is one of the indispensable books for 
those who want to know the usefulness, validity, 
and reliability of the tests that are being so widely 
used in many fields. This yearbook, including 
the period 1940-1947, gives the best critical in- 
formation to be had about tests during that 
period. * As is well known to the many users of 
the former yearbooks in this series, the reader 
finds the most comprehensive listing and dis- 
cussion of all of the different kinds of tests made ^ 
available during the period which is covered, 
including many early tests, especially those not 
included in earlier yearbooks. The importance 
of the comprehensive listing of tests is matched 
by the even more valuable reviews which give 
from the point of view of professional men and 
women the best criticism available of the tests. * 
The cross references add immensely to the use- 
fulness of the volume. The indexes are well 
made and immediately take the reader to the in- 
formation he desires. The catchwords at the top 
of the right-hand pages are helpful. The page 
numbers appear at the inner edge of the page. 
On the outer edge of the page are the test num- 
bers so that reference can be made immediately 
by number to any part of the volume, the final 
number being 1254. The real value of the book 
can, of course, only be known by its actual use, 
and serious workers in the field of tests and 
examinations can hardly afford to be without it. 
The bibliographies are included on the same 
page as the tests and one can hardly turn a page 
without finding from one to seventy or eighty 
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or more references to the material. Since the 
book is one which is likely to be used continually 
it is appropriate to call attention to the fact that 
it is well made and bound in what appears to 
be very durable material. The print is clear and 
the double columns make for easy reading. Not 
only will the user of the tests find here necessary 
and valuable professional information but any- 
one who studies even small parts of the volume 
will come to have a respect for carefulness and 
accuracy in the making and using of tests and 
will probably come to realize that a critical un- 
derstanding and a reservation of judgment are 


quite as necessary in this field as in any ѕсіеп-, 


tific field. Tests are not merely to be accepted 
with great enthusiasm on the emotional level. 
They are to be understood, to be used for what 
they are really capable of doing, or not to be 
used at all—if they do not meet proper require- 
ments. One may also come to understand from 
the reading of such a book that one is not pre- 
pared to give any kind of a test merely because 
he has read over the test the night before; such 
ignorance is, of course, not as common at the 
present day when it is at least coming to be 
fairly well known that thorough and appropriate 
training should be required to use any techni- 
cal and professional instruments and especially 
to interpret the results of the examinations. 
The editor and the publisher are to be congratu- 
"lated. 

Ed Forum 13:498-9 M y ’49. This ponderous 
and monumental volume is a definitive volume 
on tests and their uses. * The present volume 
serves the expert in testing and the practitioner 
in testing much as an unabridged dictionary 
serves the general reader and writer. * stupen- 
dous undertaking * The reviews of tests are 
written by well-known persons * The binding is 
sturdy buckram. It can stand the hard usage 
which the volume ought to have. Though large 
in size, it is well proportioned and manageable. 
It is published by The Institute of Mental Meas- 
urements of Rutgers University through the 
Rutgers University Press. The creation of this 
Institute enabled the editor to give full time to 
the preparation of the volume. The University 
is to be congratulated on this important project. 
The price may seem high. However, attention 
is drawn to the fact that it is justifiable, even 
when the volume is used as a textbook, as it may 
be kept for permanent reference afterwards. One 
cannot see how a teacher of courses in tests and 
measurements will want to be without it. Using 
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it will increase his effectiveness and Save count- 
less hours of time. 

High Points 32:78-9 Mr ’50. William B, 
Reiner. * a book of tremendous usefulness to 
workers in the field of mental and psychological 
measurement. Teachers, counsellors, and super- 
visors will find it to be a treasury of informa- 
tion regarding tests of ability and achievement 
in subject matter, character, personality, and 
vocational aptitudes. No single volume in print 
to-day encompasses so large an area of such 
useful testing information with such fullness and 
detail. * Despite the magnitude of its contents, 
it is very simple to locate information about any 
test, book, or reference in it. The editors merit 
a pat on the back for the wise use of varied type 
styles. This eliminates the dullness that is so 
often present in such large factually packed 
tomes. It also speeds the job of locating head- 
lines and titles. The varied cross references and 
indexes enable the reader to gather pertinent 
points quickly. * Such outstanding authorities 
as Wrightstone, Tyler, and Remmers have con- 
tributed liberally to these revealing reviews. The 
critical and objective attitude of the editor has 
spurred test writers to strive for greater finesse. 
Its spirit is brightly reflected in the accuracy 
and the reliability of the book. The section 
“Books and Reviews"...has great utility as à 
bibliography and as a source of many revealing 
and provocative ideas on testing and evaluation. 
The Third Mental Measurements Yearbook is 
a volume that commands respect and admiration. 
It was handsomely designed by Luella Buros 
and is pleasing to the eye. It was wisely pame 
and expertly executed. Between its covers шз 
of educational, psychological, and mental tes 
will find many rich rewards. ^ 

Н Sch J 32:142-3 My ’49. A. M. Torta. 
essentially а set of opinions about feo n 
books on tests rendered by those who i 
tests, both by constructing them and by pon 
them. In addition there are bibliographies Jing 
ally good and in some cases excellent, we 
with each test. * Generally speaking, eac Wi 
has one or more candid reviews which pie 
it. The reviews are not slow in calling atten its 
to a test’s shortcomings or in appreciating 
good points. * The editor has arranged theat ia 
well under their proper heads and with T ote 
ous cross references. * This volume 15 rr Р 
to be used аз a reference book. Three ou ie d 
ing uses may be made of it. In the first or in 
a teacher or principal wishes to select 4 
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a certain field he can get a very good notion of 
what experts think about several tests on that 
subject. From those opinions he can make a 
preliminary or final choice of the test he wishes 
to use. In general, the more reviews there are 
of a test, the more highly is the test regarded. In 
the second place, the counselor or the guidance 
officer who must needs interpret the results of 
tests will find great help from reviews by men 
who have similar problems to solve. In the third 
place, the book is a must for those students who 
would like to know the trend of opinions about 
mental measurements and their improvement. 
Weakness in the volume appears in having no 
reviews for one test and five or six for another. 
A second limitation arises out of the nature of 
the reviews themselves. In some cases, reviews 
are highly critical and point out the fundamental 
weakness of the instrument described. In other 
cases, the reviews are more concerned with the 
format of the test, the time it takes to give, and 
its ease of scoring, and they describe rather than 
critically evaluate the test. As a whole, the book 
is an excellent one which fulfills an important 
evaluative function in a time when new tests ap- 
pear frequently and sometimes may be strongly 
backed by a competent sales organization. 

J Appl Psychol 33:181-6 Ap ’49. E. Donald 
Sisson. Colossal is the word for this sixth and 
latest offering in the familiar series of biblio- 
graphical works on mental measurements edited 
by Buros. Starting with a modest 44-page list- 
ing of tests in 1935, the [series has had] phe- 
nomenal growth * the rate of increase in vol- 
ume has been tremendous and as yet shows no 
dear change in trend. One is reminded of the 
fable of the sorcerer's apprentice ; and one hopes 
that Buros has his wonderful editorial leger- 
demain under better control. The Third Year- 
book follows the familiar pattern of the earlier 
models. There are two main sections: "Tests 
and Reviews" and “Books and Reviews." The 
first of these, comprising over two-thirds of the 
volume, is the section for which the Y earbooks 
are best known, and on which their reputation 
is founded. * It was originally planned that new 
editions of the Yearbook would be issued at 
two-year intervals, but this schedule was inter- 
rupted by the war. The first post-war model, 
therefore, covers a period of some seven years 7 
à fact which, in part at least, accounts for its 
gargantuan dimensions and excuses its minor 
Sins of omission. * The second main section... 
lists “549 books on measurements and closely 


related fields," accompanied in most cases by 
excerpts from reviews of these books culled 
from the journals. * The selection of this ma- 
terial was presumably the sole responsibility of 
the editor. Inclusion of all the words written 
about all of the books listed would obviously have 
been both impossible and ridiculous. On the 
other hand, wherever judgment must be em- 


* ployed in the selection and editing of material, 


one is entitled to ask questions about the basis 
of the selection, and to be suspicious of possible 
conscious or unconscious bias. In this case, the 
editor assures us that all the critical appraisals 
of the books listed, collected from all the reviews 
of these books appearing in the professional 
and scholarly journals, have been included, and 
only the purely descriptive or non-evaluative 
reviews or parts of reviews have been left out. 
This being true, the reader might conclude that 
Buros’ own Nineteen Forty Mental Measure- 
ments Y earbook, since it accounts for 50 reviews 
in 15% pages, was either the most important or 
the most criticized—surely the most contro- 
versial—book on the subject published since 
1041. Sharers of the honor of evoking the most 
critical comment would be the two volumes of 
Diagnostic Psychological Testing by Rapaport 
et al. (13 reviews, 16% pages) and Stoddard's 
The Meaning oj Intelligence (13 reviews, 10 
pages). * The reputation of the earlier Year- 
books derived principally from the “Tests and 
Reviews" sections; and the same will doubtless 
be true of the latest volume in the series. The 
major rubrics are essentially the same as those 
employed in the earlier ‘issues; Achievement 
Batteries (22 entries) ; Character and Person- 
ality (от entries) ; English (57 entries) ; Fine 
Arts (7 entries) ; Foreign Languages (36 en- 
tries) ; Intelligence (89 entries) ; Mathematics 
(62 entries) ; Miscellaneous, e.g. home eco- 
nomics, safety, computational and scoring de- 
vices (111 entries) ; Reading (70 entries) ; 
Science (44 entries) ; Social Studies (30 en- 
tries) ; and Vocations (86 entries). It is difficult 
to know what significance, if any, to attach to 
the numbers of entries in each category. Perhaps 
they illustrate the difficulties of fitting fabri- 
cated classifications to any given series of data, 
particularly when the data present themselves 
without regard to the principles on which the 
classification was originally compounded. When 
this occurs, the pattern must be stretched here 
and there, and the miscellaneous section orig- 
inally provided for overflow inevitably grows 
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bigger and bigger. In spite of this—and assum- 
ing the listings are as comprehensive as claimed 
—it appears that the war and post-war periods 
have provided an atmosphere more congenial 
to production in the field of character and per- 
sonality than elsewhere. This conclusion might 
be somewhat misleading, however, since one test 
alone accounts for nearly a quarter of all the en- 


tries under this heading. Needless to say, it is- 


the Rorschach which somehow merits 67 pages, 
including a bibliography of 598 titles! The orig- 
inal reviews themselves appear to this reviewer 
as a varied lot having only one factor in com- 
mon—all are critical. Criticism is, in fact, the 
dominant tone of the whole volume (nil nisi 
bonum is definitely not the editor’s watchword !) 
and while going through it page by page one 
may conjure up an image of the editor at the 
head of his ranks of contributors daring the 
would-be test maker to attempt to get away with 
anything shoddy or unscrupulous. The image is 
an inspiring one, and, though fanciful, not too 
remote from the editor’s intention. One of the 
major objectives of the Yearbooks, in fact, is: 
To impel authors and publishers to place fewer but 
better tests on the market and to provide test users with 
detailed and accurate information on the construction, 


validation, uses, and limitations of their tests at the 
time that they are first placed on the market. 


To achieve this objective, the editor has in- 
structed his cooperating reviewers to provide re- 
views that are “....frankly critical with both 
strengths and weaknesses pointed out in a ju- 
dicious manner.” Just how “judicious” are such 
randomly selected remarks as: 


This is just another test for neurotic tendencies. The 
reviewer can think of no reason for its publication or 
use.... The only excuse for publishing another test of 
neurotic tendency in this day and age is increased valid- 
ity over other tests in the field. This test is grossly lack- 
ing in this respect. 


With the perspective attained in the years since its 
publication....one may view (test maker’s) arrant non- 
sense with tolerant amusement. 


or, again: 


The instrument is a reversion to a type of psychologi- 
cal and sensory testing that belongs to the infancy of 
mental measufement, and has repeatedly been proved 
worthless as an index to higher mental ability. 
the reader must judge for himself. If such can- 
did criticisms are indeed warranted, the review- 
ers deserve full praise for their courage in 
saying so. Fortunately, a good many of the re- 
viewers have displayed this kind of forthright 
frankness. Certainly, this reviewer is not advo- 
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cating libelous brutality for the sheer sadistic 
enjoyment of contemplating the discomfiture 
brought about by a well-placed literary needle, 
But it is difficult to discern what value might 
accrue to the potential test user from that: 

of review which finds a little to praise and a 
little to blame in every test and sums up with an 
equivocal statement of possible usefulness asa 
crutch for intuitive hunches. Fortunately, this 
sort of review is not found very often in tle 
Yearbook. But while the general tone of the 
book is healthily evaluative, the basis of criti- 
cism varies considerably among the reviews. 
They might well be classified under headings 
suggested in an excellent review of the 1940 
Yearbook (Pedro T. Orata in The Teachers 
College Journal (Manila), 1941, 3, 59-61): 

т. Emphasizes functional or "true" validity, criticizes 
test for success or failure to measure ultimate educa- 
tional objectives..in measuring worthwhile results of 
instruction; and in general subordinates the techniques 
of test construction and mechanics of administration, 
scoring, and tabulation of scores to the higher values 
that the test and testing should engender in the pupils 
and in those who use it. ; 

2. * criticizes the test mainly for success or failure 
to meet the requirements of statistical validity and re- 
liability, evaluates it on the basis of commonly accepted 
techniques of test construction, and in general assumes 
functional validity or subordinates it to formal content 
and make-up of the test. y MES 

3. * evaluates it mainly from the point of view o a 
success or failure to meet the mechanical requiremen 
of efficiency in scoring, administration, and tabulation 
of test results. 

Each of these three points of view has merit, to 
be sure, and each could doubtless rally а ей 
siderable number of supporters to its side. i 
fact, it is probably more to the point to ariel 
reviewers in this manner than to classify t e 
products. And herein lies the fundamental ern 
ness of the Yearbook in the opinion of this " 
viewer. Granting that all of the contributors 
the volume are experts and qualified to i 
and be heard, they are not all equally sens! n 
to all phases of psychometry. In diagnosing nist 
man ailments, we don’t call in only the Ve int 
or the neurologist. Nor should we, in exam! ро 
the test, expect the subject-matter pice 
detect statistical ailments, or the fufcum 
to point up an undernourished teaching 0 j Á 
tive. We should call in all the specialists ^ 
hold a thorough clinical examination sa dis 
case. That something like this was dn pt 
indicated by the editor's statement of o eri 
in the 1940 Yearbook: to provide reviews An 
ten by persons of outstanding ability que n 
ing various viewpoints..." But in calling 
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experts, the editor has made the assignments of 
cases, which implies that he already knew the 
patients needs. Though some such procedure is 
a practical necessity in a venture of this kind, 
it has the definite disadvantage that the treat- 
ment of the various types of tests is apt to be 
unbalanced. A cursory survey, for example, in- 
dicates that nearly all the reviewers assigned to 
the Achievement Batteries are educationists, 
educational researchers or examiners, and that 
psychologists and psychometrists predominate 
among the reviewers of Intelligence tests. After 
many hours spent in contemplating the some- 
what frightening aspect of the Third Yearbook, 
this reviewer found himself musing about the 
practicability of another kind of volume. This 
“dream” book would not attempt to reproduce 
verbatim the literary efforts of the experts, but 
would edit and cull from them all the essential 
materials to fill out a standard outline. Spared 
the necessity of literary composition, reviewers 
could concentrate on specifics, and could handle 
more tests with no greater expenditure of effort. 
The outline itself would be drawn up by a board 


of outstanding specialists including both test . 


makers and test users. It would cover such points 
as: type of item ; sources of items ; nature of item 
analysis; descriptions of populations used for 
item analysis, factorial analysis, validation, 
cross-validation, standardization ; judgments of 
functional validity ; adequacy of "coverage"; et 
cetera, et cetera. This list is obviously not ex- 
haustive, and many readers may detect a statisti- 
cal bias in it. It is for precisely this reason that 
the board of experts would be used to insure 
the inclusion of all important dimensions of 
а test. Finally, there would be the main feature 
of the book—the board of experts’ “seal of ap- 
Proval” for tests which merited adoption and 
use, In this last connection, the reviewer is re- 
minded of the statement made by Sandiford in 
jamentipg on the earlier Educational, Psycho- 
ogical, and Personality Tests of 1936 ( Ameri- 
can Journal of Psychology, 1938, 51, 200): 
«Professor Buros’ annual publication would 
made much more useful if he would mark with 
‘prominent star those (tests) which were valid, 
reliable, and had satisfactory norms. Then busy 
үе could neglect the rest, or if they wasted 
uot money on ‘gold bricks,’ the fault would be 
us own.” This reviewer can think of no better 
b Y of achieving the objectives of fewer and 
etter tests, 
J Consult Psychol 13:146 Ap '49. Laurance 
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F. Shaffer. The long-anticipated revision of this 
standard reference seems to have been worth 
waiting for. * The new edition is not only bigger 
but also better. While the earlier versions were 
concerned mainly with educational tests, the sec- 
tions on character and personality (91 tests), 
individual intelligence (33 tests), and vocations 
(86 tests) especially show the increased scope 
that makes the book useful for clinical psycholo- 
gists and counselors. The bibliographies given 
for many tests are exhaustive; that for the 
Rorschach has now reached 598 titles and for 
the Stanford-Binet, 351. A good bibliographic 
feature is the inclusion of the Psychological Ab- 
stracts citation, by volume and entry number, 
for each reference, The selection of the 320 re- 
viewers was acute, and their comments main- 
tain a high level of constructive criticism. The 
indexes that end the Yearbook will be much- 
thumbed for their own reference values: lists 
of periodicals, of publishers, of test titles, of 
names, and of tests classified by topics. In short, 
every psychologist or educator who uses tests 
widely will want the Yearbook at his elbow, and 
graduate students may consider it an important 


early acquisition to their personal libraries. 


J Ed (London) 82:460-2 Ag 50. C. M. Flem- 
ing. * Such small collections of tests [Burt's 
Handbook of Tests and Stephenson's Testing 
School Children] for specific purposes are of 
considerable value. Their contribution is, how- 
ever, dwarfed by the variety and the thorough- 
ness of The Third Mental Measurements Y ear- 
book. Buros’ work is well known through his 
earlier volumes. It is sufficient to say that this 
new publication conforms to the same high stand- 
ards as before, (with the addition of a most com- 
prehensive index) and continues to present re- 
views and extracts from reviews of all recent 
publications in the field of tests and testing. No 
library can afford to be without a copy of this 
encyclopaedic reference book. * 

J Ed Res 43:230-1 N ’49. D. А. Grant. * In 
the tradition of the earlier yearbooks the reviews 
are “frankly critical,” which is to say, just notice- 
ably different from libelous in some entries. This 
monumental work shows the effects of intelli- 
gent and painstaking editorial work and tech- 
nically splendid production work, The bulk of 
the volume does not reduce its usefulness, as 
might be feared, because of helpful cross-refer- 
ences to this book and to its 1938 and 1940 
predecessors. The classified index, periodical di- 
rectory, and publishers index—which includes 


B71 


listing of all tests and books for each publisher 
—are most helpful. * It would be presumptuous 
of any single reviewer to attempt a detailed cri- 
tique of the individual entries in the Third Y ear- 
book. A few comments are, however, in order. 
In spite of very good editorial work there still 
remain numerous instances of repetition of test 
and book descriptions. Duplication of critical 
comment may serve some purpose but duplica- 
tion of descriptive material is unfortunate in 
such a bulky volume. Too many of the reviews 
contain no validity estimates, but this reflects 
not so much on the reviewers as on the test 
authors and users. Many, but not all, reviewers 
note carefully the kind of reliability coefficient 
they quote as well as its magnitude. This is a 
distinct advance over earlier yearbooks. The 
reviewer noted a distinct double standard in 
the reviews of achievement and intelligence tests 
on the one hand and personality tests on the 
other hand. The reviewers of the personality 
tests, particularly, the projective tests, simply 
do not demand that these instruments toe the 
line of scientific measurement standards as do 
the reviewers of most of the achievement and 
intelligence tests. In general the reviews are 
critical and good. The editor’s good judgment 
in selecting reviewers can be seen whenever 
excerpted reviews are contrasted with originals. 
It might be noted that, aside from the aim of 
improving the quality and reducing the quantity 
of tests, reviewers themselves are likely to im- 
prove their output lest they find their vapid, 
superficial comments juxtaposed with a care- 
ful, insightful, analytic review of the same test 
in some future Yearbook. Future yearbooks 
might attempt comparisons of tests on the same 
topics. This is almost too much to ask, but con- 
sidering how well Buros has done so far, he 
might be able to achieve such a goal. Another 
change of quite a different sort which might be 
suggested concerns the typography. The present 
typography is excellent, in fact it may be too 
good. Most readers will be using the Yearbooks 
as reference only, and a small type would per- 
mit a saving in bulk. Possibly Buros’ aims of: 
(1) reducing the sheer quantity of tests, and 
(2) producing a review volume every other 
year, would prove a more satisfactory solution 
to the bulk problem. All in all, psychologists 
the world over owe a debt of gratitude to Buros 
and to Rutgers University for the Third Year- 
book. Each successive book has been a great 
improvement over the last, but the next will have 
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to be remarkable indeed to be an j 
over the present volume. 

Occupational Psychol 23:128 Ар 5р. д 
Rodger. For most of the war, the last edition o 
this work, which was called The N: ineteen-Fori 
M ental Measurements Yearbook, sat on the | 
viewer's office mantelpiece within the reach of 
his left arm. It did so partly because its size made 
it inconvenient for ordinary receptacles, and a 
mantelpiece seemed the obvious place to keep 
it, and partly because it was a valuable and 
highly interesting book. It could be depended on 
to provide factual information which would 
never be found wrong (or so it appeared), and 
to give opinions worth reading, if only becauseit 
usually presented the opinions of several people 
about every device it covered. Here, beautifully 
printed and bound, and put out with all the 
characteristic Buros efficiency and courtesy, is 
its successor. (“Ѕиссеѕѕог” is perhaps the wrong 
word. Buros Yearbooks do not succeed one an- 
other so much as supplement one another.) A 
useful, well-ordered preface, telling the reader 
how to extract the maximum value from tle, 
book * 

Occupations 28:137-8 N ’49. Donald E. 
Super. * The standard source of information o 
all types of tests had become, in the late 19305 
the Mental Measurements Yearkooks * Tit 
monumental Third Mental Measurements Yar- 
book was well worth waiting for. * а valuable 
feature often forgotten....[is the inclusion d] 
evaluative excerpts from published reviews 0 
original monographs and textbooks dealing ү 
measurement topics. One good indication of the 
coverage is the fact that it includes, for examp^ 
three reviews of a monograph by this bafi А 
which reviews this writer had never seen: { 
Buros’ Yearbooks are already established 2$ " 
essential item in any professional library 01 jd 
ing or vocational guidance, and since ћеї Е s 
eral lines are already familiar to ene ihis 
there is no need to describe other features 0 ж 
edition here. But а few evaluative сое, 
in order. Perhaps the most important сп 
of the Yearbook, and caution for ће 187 : 
to do with the unevenness of the rere lis 
original and reprinted from the јан the 
is probably an inevitable defect even it 6ш" 
material is prepared by a variety of an viens i 
der editorial direction (the original їе” 
When in many instances, they are WT! t wiit 
authors not selected by the editor and eni 
ing under his supervision (the journal г 


Mprovemen 
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the unevenness is even greater. More editing 
and selecting by the editor would give the year- 
books a more even quality and make them more 
easily and safely used by the less sophisticated 
test users. To illustrate, one might cite the re- 
views of the Gentry Vocational Inventory, three 
of which condemn it (original reviews by Bor- 
din and Super, journal review by Brayfield), 
and one of which is moderately favorable (orig- 
inal review by Carter). What is the high school 
counselor who has had one or two courses in 
measurement to deduce from this apparent dis- 
agreement among specialists? What would the 
four men concerned conclude if they were to 
read one another’s reviews, correspond about 
them, and prepare a joint statement? Obviously, 
the Third Mental Measurements Yearbook is 
not the last word on testing. Eventually a more 
easily used and more manageable method of 
pooling the best current knowledge about tests 
will be found. But in the meantime Buros is to 
be congratulated on having performed a com- 
mendable feat in bringing together current think- 
ing on tests, and counselors will find his volume 
an indispensable manual worthy of frequent pe- 
rusal and consultation. 

Personnel Psychol 2:243-8 su '49. Donald G. 
Paterson, Oscar Buros and Rutgers University 
are performing a signal service for the whole 
field of tests and measurements. As the field 
expands, we also find the Third Mental Meas- 
urements Yearbook expanding. And with this 
expansion, the need for just such an encyclo- 
pedic reference book becomes increasingly ur- 
gent. It is a must book for every personnel psy- 
chologist worthy of the name. * The stated 
objectives are: 1) To provide up-to-date bibli- 
ographies of older tests and of recent tests; 2) 
To supply frankly critical reviews of available 
tests; 3) To impel authors and publishers to 
Place fewer but better tests on the market ; and 
4) To provide test users with sufficient informa- 
tion to enable them to choose wisely tests that 
Will be sufficiently reliable, valid, and appropri- 
ate for their needs. The book makes a valiant 
attempt and a reasonably successful one with 
respect to the attainment of objectives І, 2, and 
4. But the preceding Yearbooks have failed with 
respect to objective 3, and the present book is 
no more likely to succeed. In this reviewer's 
opinion, it becomes increasingly clear that the 
tendency is to put on the market more and 
poorer tests rather than fewer and better tests. 

est authors and test publishers deserve to be 
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indicted for this state of affairs. Although some 
test publishers are more ethical than others, 
even those with the best reputations need to 
overhaul their offerings and to take off of the 
market those that cannot meet reasonable stand- 
ards of acceptability. And such test publishers 
ought to discontinue the practice of putting on 
the market tests and test manuals that fail to 
meet reasonable standards with reference to 
principles of test construction, item analysis, re- 
liability, validity, and norms. Here is a virgin 
field for the recently established Sub-Committee 
on Tests of the American Psychological Asso- 
ciation on Ethics. If this committee really goes 
to work on the problem, Buros’ books will be a 
gold mine of information. If this were done, 
Buros’ next Yearbook might be considerably 
reduced in size. It could be done by the judicious 
use of panels of experts who would vote on tests 
and manuals published prematurely and on tests 
and manuals disclosed to be inadequate. These 
might be separately listed in a sort of psycho- 
logical dog-house. A test so segregated could 
emerge in future editions only in case additional 
research provided adequate evidence that re- 
lease were warranted. But, of course, such pro- 
ceedings would take courage—great courage 
indeed—since the axe would fall on many of our 
outstanding and influential leaders in this whole 
testing business. Just where the line between 
professional competence in the field and a deep- 
seated interest in the business side of the field 
is to be drawn seems to have become exceed- 
ingly hazy in the minds of some of our keenest 
intellects. If all of this be true of commercially 
available tests, what about the home-made (so- 
called tailor-made) tests? These are being ped- 
dled to business and industry by incompetent or 
unscrupulous personnel psychologists, person- 
nel directors, employment managers, and others 
who develop or install testing programs with the 
greatest of ease. Few, if any of these, are sub- 
jected to the light of “frankly critical evalua- 
tions.” If one shudders when he reads Buros’ 
Yearbook, he would surely die of shock if ће 
could take a good look at what is actually going 
on in business and industry. Even well-inten- 
tioned employers, comptrollers, purchasing 
agents, production managers, sales-managers, Р 
{о say nothing of employment interviewers, etc., 
do not hesitate to make up their own tests blithely 
ignorant of the simplest A, B, C's of test con- 
ion. What is worse, so-called consultants 


structi { 
(many of them also ignorant of the simplest A, 
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B, C's of test construction) are selling gullible 
management men and poorly trained personnel 
men tests and “test packages” that are as poor 
as, if not poorer than, the poorest of the tests 
included in Buros’ Yearbook. All of this is likely 
to bring widespread discredit on the testing 
movement in business and industry. But we 
must get back to the business at hand—namely 
what is there about the Buros’ Yearbook that 
makes it an important publication? In the first 
place, no one can keep abreast of the field who 
does not carefully scrutinize the vast array of 
tests and measurements included in each special 
field. The personnel psychologist will wish to 
turn to character and personality tests, intelli- 
gence tests, tests in art, music, business educa- 
tion, computational ability, engineering, indus- 
trial arts, safety, sensory-motor skills, clerical 
aptitude, vocational interests, manual dexterity, 
mechanical ability, and tests for specific voca- 
tions. Well nigh 200 pages are devoted to char- 
acter and personality tests, a field markedly ex- 
panded especially since 1940. In studying the 
available tests in these fields the personnel psy- 
chologist will be richly rewarded even though he 
will emerge with few tests which seem to meet 
reasonable standards and with practically no 
tests that meet the standards of the most critical 
evaluators. To substantiate this charge, one can 
locate a typing achievement test, a stenographic 
proficiency test, a brief intelligence test, a vo- 
cational interest test, a clerical aptitude test, a 
manual dexterity test, a mechanical aptitude 
test, a personality test or what have you. But he 
cannot locate quickly a reasonably satisfactory 
test in any of these fields because he will have 
to weigh the evaluative reviews of a large num- 
ber of tests in each field and, if a conscientious 
job is done, he will have to read the references 
accompanying each test. In other words, this 
book is no short-cut to ready-made knowledge. 
It is, in truth, no more than an encyclopedic ref- 
erence book, though a .mighty handy one and 
probably the most important one that a test 
user can haveat hand. Careful reading of reviews 
of a given test (the more important tests may 
have as many as four or five independent re- 
viewers) turns up an interesting fact, namely, 
each reviewer reveals quite a bit about himself 
as well as about the test being reviewed. This is 
in spite of the fact that Buros supplies each re- 
viewer with the same set of instructions and with 
the same material in the form of a copy of the 
» test and the accompanying manual. The result 
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18 a sort of projective test of the evaluators, This 
is both a weakness and a strength for the pro- 
spective test consumer. At least he can form his 
own judgment about the judgment of the Te 
viewers as well as about the merits of the test 
under review. Some reviewers are Critically 
evaluative in the best sense of that term, Others 
seem to believe that criticism implies nothing but 
derogatory remarks. On the whole, Buro 
achieves his objective of judicious praise and 
blame. One technique that appears to produce 
this result is the selection of authors of competi- 
tive tests as reviewers of each other's work, To 
select one review and call it the best is, of course, 
impossible. But this reviewer was tremendously 
impressed with the thoroughness and the con- 
petence displayed by H. S. Conrad in his evalu- 
ation of the General Educational Development 
Tests which have been used so widely in grant- 
ing educational credit at the college level to those 
returning from the armed forces. E. В. Wes 
ley’s review of the more recent “progressive 

tests in the social sciences is a refreshing te 
minder that thinking cannot take place in 
vacuum and that “facts” are still useful and are 
still necessary even though our most rabid at- 
vocates of general education believe that the 
learning of “facts” is no longer significant 
modern education. The importance, if not the 
value, of any given test can be ascertained ду 
the length of the excellent bibliographic 1 
erences accompanying each test listed as we 
as the number of reviewers assigned to à gmi 
test. For example, a given test may be lis : 
without a bibliographic reference becatse tee 4 
exists, or there may be no reviews of a vn a 
cause it is not judged sufficiently valua sem 
because the test has just been put on wr ài 
(probably prematurely). On the other je 

poor test may be adversely reviewed by biis 
persons because it is put out by a good ge other- 
providing a warning to those who mig bist 
wise buy the test simply because the е m 
is prominent in the field. Or a test like t ns 
schach, with its variants and books and m um 
will reach the other extreme with peer eed- 
ning now to 598 (147 were listed in ag za 
ing edition) and with an even 100 rev! 


д is therwise 
ews, critical and 0 f 
excerpts from review: his highly te 


den of printer’s ink. Furthermore, 
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purporting to provide not only a highly reliable 
and valid device for the study of personality but 
claiming that it is an invaluable selection tool 


differentiating between a large number of hold- ' 


ers of jobs from top management executives, to 
salesmen, to foremen, to skilled workers as well 
as differentiating between the sheep and goats 
within each of these many job families. And the 
end is not in sight. The Yearbook also contains 
excerpts from published reviews of books on test- 
ing which have appeared since 1940. Journals 
and monograph series dealing with testing are 
listed in seven pages and there is a six-page di- 
rectory of test publishers and distributors. A 
mere reading of these directories is enough to 
convince anyone that testing is Big Business. 
Unusually complete indexes complete the vol- 
ume. In spite of the tremendous coverage and 
the prodigious work involved in compiling this 
Yearbook, there are some areas that have been 
slighted. For example, attitude tests or socio- 
metric devices and techniques for measuring the 
home, the neighborhood, other status factors, 
and inter-personal relations. These tests and 
measurements are already on the scene in in- 
creasing numbers and deserve treatment in fu- 
ture editions. One final word, this Yearbook is 
keyed in with the Nineteen Forty Yearbook so 
intimately that one should possess both books 
v ready reference. Fortunately, the Nineteen 
torty Yearbook continues to be available. Both 
are indispensable. 

Psychol B 47:177-8 Mr’50. D. A. Worcester. 
* the Yearbook has established itself as one of 
the necessary volumes in personal and depart- 
mental libraries * The whole series....constitutes 
the most complete bibliography available, and so 
far as this writer knows, the only extensive one 
which gives critical evaluations. * Those who 
contributed to the Yearbook were charged to 
be “frankly critical with both strengths and 
weaknesses pointed out ina judicious manner.” * 
While the invitation emphasized strengths as 
much as weaknesses, the prevalent connotation 
of the word, “critical” has led many of the re- 
viewers to give the major portion of their at- 
tention to short-comings. Perhaps this is desir- 
able in order to attain the objective of securing 
better performance in the making of tests. * in 
Seneral, the....[reviewing] has evidently been 
done conscientiously. If one were to take the 
amount of space devoted to given tests or books 
as evidence of their relative importance, it might 
lead to erroneous conclusions. Much discussion 


2. EAA Ыы „1. 20 
may mean importance; or it may mean great 
controversy, or unestablished techniques, or cur- 
rent interest. Rorschach testing and projective 
techniques generally command many pages of 
the Yearbook. Stoddard's Meaning of Intelli- 
gence, judging from the nature of the reviews, is 
given much more attention than it deserves ; and 
the editor might have been wiser, as well as more 
modest, if he had reduced the number and length 
of excerpts of reviews of The 1940 Yearbook. 
Here, as in many other instances, the duplica- 
tion of matter saying the same thing does not 
add to the value. The Yearbook is carefully in- 
dexed in several ways which enchance its merit, 
both in referring to its own subject matter and 
as sources of information to test users. * 

Psychosom Мей 11:247-8 JI-Ag "49. J. 
Brožek. * The practice of including more than 
one review is a healthy countermeasure to care- 
less or colorless review writing, and should stim- 
ulate the reviewers to develop a more critical at- 
titude toward their subject. The sum total of 
information and critical evaluation provided by 
the Yearbook represents, better than any other 
single publication, the present status of mental 
measurement, * 


For reviews of earlier volumes in this series, 
see Вўо, 3:788, 40:B856-8, 38:B325-6, and 
36:B45-6. 
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XBunr, Cvm, The Factors of Mind: An Intro- 
duction to Factor-Analysis in Ps срод, Lon- 
don: University of London Press, Ltd., 1940. Pp. xiv, 
509. 21s. (New York: Macmillan Co., 1941. .00) 
(PA 15:3208). For reviews, see 52 in Statistical 
Methodology Reviews, 1941-1950; ste also 2: 48. 
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ЖВовт, Сукш. Handbook of Tests for Use in 


d Edition. London: Staples Press 
depen poen 110, * (New York: Staples Press 


Inc., 1949. $2.00.) 


Brit J Psychol, Stat Sect 1:222 N '48. E. K. 
This is the fourth impression of Burt's well- 
known book of standardized tests for assessing 
the intelligence and educational attainments of 
school children. A valuable addition to the new 
edition is the list of tests for the Terman-Merrill 
Revision of the Binet scale: the list Shows the 
new age-assignments for English children, ob- 
tained by the investigations of the Binet Com- 
mittee. As is rather obvious from the type and 
paper, the book has been “made and printed in 
Holland." But teachers, educational psycholo- 
gists, and school medical officers will be glad t 


B74 


have once again so useful a volume, which un- 
fortunately has been out of print since the be- 
ginning of the war. 

J Ed (London) 82:460-2 Ag ' 50. C. M. Flem- 
ing. * a very welcome reprint of the tests from 
Professor Sir Cyril Burt's scholarly volume of 
Mental and Scholastic Tests (with the appro- 
priate addition of an analysis of the age assign- 
ments of the Binet-Simon tests in Terman's 
Stanford revision of 1916, and in the Terman- 
Merrill version of 1937) * 


[B74] 
*Burt, Cyr. Mental and Scholastic Tests, Sec- 
ond Edition, Preface by Robert Blair. London: Sta- 
ples Press, Ltd., 1947. Pp. xxi, 467. 35s. * (New York: 
Staples Press, Inc., soak $7.00) 

Brit J Ed Psychol 18:118-9 Je '48. M. S. * 
This book, originally published in 1921, made 
history. In 1913 the London County Council 
appointed Cyril Burt as psychologist in the Edu- 
cation Department. This in itself was an innova- 
tion. The primary motive for the appointment 
was the need for a more scientific method of 
selecting those children who could not profit by 
the ordinary methods of education, and were 
consequently certified as mentally defective by 
the school medical officers, and transferred to 
special schools. A problem of almost equal im- 
portance was that of allocating pupils for further 
education, whether at secondary schools with 
an academic bias, central schools, trade schools, 
schools of art and the like—a problem which is 
once again very much to the fore. * At that date, 
it must be remembered, what is now called child 
guidance was almost unknown, and its tech- 
niques at best uncertain. Its present develop- 
ment is to a large extent the result of the early 
researches of the author of this book. The book 
itself consists of three memoranda. The first 
gives in detail the London Revision of the Binet- 
Simon tests, with full instructions for their use. 
The second memorandum reports the results of 
a general survey of schools, determines the line 
of demarcation between normal pupils and the 
mentally defective, assesses the influence of sex, 
social status, and fertility on tested intelligence, 
and examines the diagnostic value of the tests 
themselves. Here Burt further proposed a num- 
ber of original tests, of a type then new to teach- 
ers—non-verbal as well as verbal, group as well 
as individual, most of which (like the analogies, 
the maze tests, matrix tests, reasoning tests, etc.) 
have since come into regular use. The last memo- 
randum gives a complete set of standardized tests 
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for the chief subjects of the elementary curricu 
lum—reading, spelling, arithmetic, writ 
drawing, handwork and composition, * The wj. 
ume, however, is concerned quite as much with 
the theoretical investigation of methods as with 
the practical construction of serviceable tests, As 
the preface points out, a number of the techni. 
cal procedures that have since become common- 
places in psychological investigations were in 
troduced for the first time—item-analysis, par- 
tial regression, representative sampling, Scaling 
tests by percentage scores, and various methods 
of factorial analysis. Most of these were based on 
procedures recently developed by Galton and 
Karl Pearson; but others, like the centroid 
formula for factor analysis, the use of correls 
tion between persons, and of graphs for tetra 
choric correlations, were of the author’s own de- 
vising. * The methods of investigation worked 
out in this volume have been widely adopted in 
various branches of individual psychology sine 
the book was first published, and call for stil 
further applications. In the new edition numer 
ous modifications or additions have been made 
to bring the work more up to date. Several 
shorter memoranda taken from the Reports o 
the Psychologist to the Council and now m 
longer available, have been incorporated. * Un 
like many books on the subject, it is really human 
as well as scientific. The problem of the ing 
vidual is never lost in the statistics of the group; 
and the author never mistakes the means for ж 
end, which is the diagnosis and treatment of 
individual child. As Sir Robert Blair remar : 
in his Preface: “Burt’s results provide, "e 
were, a first sketch of the intellectual and s 
cational progress of the average child Mae 
out the years of elementary school life. ie 
view of the increasing use of standardize e 
throughout our schools and the need to gis iis 
and standardize new tests, the re-issue 0 
book is most timely. % 
Ed R 3:149-52 Е ?зт. G. E. R. B.* ip 
book is, of course, a standard work, indee bo 
of the classics in the field, and can stand api 
of little introduction to most teachers, igs 
college lecturers and educational he a 
It is divided into two parts, the first dealing 
the author's revision of the Binet-Simon " 
—the London revision—and the Sere auti 
tests of educational attainment, which t ү psy” 
constructed and standardised when he ki of the 
chologist to the Education Depart | dre 
L.C.C. These latter were the first compr 


873 ] BOOKS & REVIEWS: BURT 
set of attainment tests produced for English chil- 
dren and many of them remain in use to-day. 
The outstanding features of the book are its 
thoroughness and its readibility. Complete de- 
tails of the tests, their administration and their 
marking, are there for all to read, as is also, in 
the case of the Binet-Simon Tests, the data from 
which they were standardised. As an example 
of the scientific manner in which tests should be 
constructed and evaluated and as an example 
of the manner in which the whole should be pre- 
sented, it could hardly be bettered. Time has in- 
evitably dealt some blows; the first edition was 
published in 1921 and the present third edition 
has received little amendment except in the way 
of appendices. It is hard to understand, never- 
theless, why time alone should be responsible for 
the swing away, even in this country, from the 
London revision to the more recent and Ameri- 
can Tecman-Merrill scale, yet this is the present- 
day tendency. It needs perhaps some other giant 
to undertake a new English revision. A move 
away from some of the attainments tests is more 
comprehensible for with changing educational 
philosophies and methods, the content of edu- 
cation changes and the attainments thought 
necessary in one decade will not be the same as 
those of another. * The book still stands as a 
model which somebody with the genius of Sir 
Gyril Burt may some day attempt to copy in re- 
lation to present-day educational needs. * 
Mental Health 7:106 My ’48. G. H. K. This 
volume, one of vital importance to all interested 
in the problems and implications of mental test- 
ing, makes a timely reappearance in a second 
and enlarged edition. * The volume is divided 
into three main sections or memoranda—the first 
dealing with Burt’s revision for English chil- 
dren of the original Binet-Simon Scale of In- 
telligence Testing ; the second with the theoreti- 
cal validity of the results, and the third with 
Burt's own tests of educational achievement. In 
the new edition of this book, the main bulk of 
the work is unchanged, but several additions or 
appendices of great importance and interest to 
Present day practice have been inserted. In the 
first appendix the author discusses the nature of 
intelligence, with special reference to the dis- 
tinctions between various types of theories. In 
the second appendix he gives a detailed factor 
analysis of the Binet-Scale. In the remaining 
Ones he deals with such valuable topics as the 
accuracy of his own supplementary tests; the 
Contribution of schooling to the scores on the 
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Binet Test ; the concept of mental age and some 
of the difficulties inherent in it; and the vexed 
and very pertinent problem for present day psy- 
chologists and educationists of special abilities 
and the extent to which such abilities can be as- 
sessed in mental tests, and at what specific ages 
this assessment can be most accurately carried 
out. It is a pity that Professor Burt has not yet 
been able to complete his revision of the Terman- 
Merrill Scale, and to include his results in the 
present volume. Let us hope that by the time the 
next edition appears, this will have been com- 
pleted, for it will form a very valuable addition 
to what is still the most outstanding treatment 
of the whole subject of mental testing. This is a 
book which should be in the hands of all teach- 
ers, school medical officers, education officers, 
psychologists and those concerned in the present 
day problem of “educating the child according 
to his age, ability and aptitude.” 
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J Bus Ed 25:41 Ja ' 50. * The list of tests... 
should be of interest to most teachers. 
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Psychol В 46:515-6 N '49. John L. Kennedy. 
* One of the most interesting jobs described in 
the book is the development of proficiency tests, 
both on the ground and in the air, by means of 
which a more adequate criterion of performance 
than school grades alone could be obtained. * 
The Project evaluated a 48-student training de- 
vice which simulated on the ground the condi- 
tions of navigation in the air. An extensive ex- 
periment was designed to discover whether or 
not the trainer "trained." As is the case with 
many training devices possessing high “face 
validity," the G-2 Trainer proved to have no 
special efficacy over equivalent time spent in 
the class room. The principal new contribution 
to method in applied psychology is the use of 
factorial analysis and niultiple correlation tech- 
niques in scientific job analysis. * 


For an additional review, see 3:793; for a 
review of the series, see Вә. 
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Carrett, RaymMonp B. Description and Measure- 
ment of Personality. Editor’s Introduction by Lewis 
M. Terman. Yonkers, N.Y.: World Book Co., 1946. 
Pp. xx, 602. $4.48. * (London: George G. Harrap & 
Co., Ltd., 1947. Out of print.) (PA 21: sor) 
Am J Psychiatry 104:591-2 Mr '48. И. Line. 
* [the] approach is bold and, at times, overpow- 
ering. It leads to forthright attack on many dif- 
ficult problems, and to tentative statements pre- 
sented in crystallized form before we are com- 
pletely ready to accept them. * The outcome 
(tentative, of course) is a list of 12 traits, ranked 
in order of certainty, and interpreted in the 
light of existing evidence. Presumably tests for 
measuring these traits will follow shortly. * The 
list itself tends to detract somewhat from the 
systematic aspirations of the approach. For there 
is nothing to suggest any greater scientific clar- 
ity or purity in the resultant classificatory typ- 
ing. The dominant interest, directing the trait 
field and hence, by the author's method, the 
ptom areas of observation, is that of psy- 
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(with one or two exceptions—eg., t 
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chiatry, with a modicum of educational and 
socially-moral influence. The reviewer is con. 
vinced of the potential value of Cattell's tech. 
niques of assessment and interrelating of data; 
of the significance of "clusters"; of the adher- 
ence to scientific measurement as the only safe 
check on conceptual structure. But something is 
sadly lacking in any discipline that permits its 
observations to be classified beforehand, unless 
the classification inheres in the very postulates 
and purposes of the discipline itself. This is not 
a criticism of Cattell, so much as of psychology. 
Brit J Psychol, Stat Sect 1:134-6 Jl *48. 
C(yril) B(urt). * In his endeavour to present a 
synoptic review of existing results, Dr. Cattell 
has carried out an amazingly thorough search 
of the available literature; and has sought to 
catalogue, in the form of a detailed summary, all 
the principal "surface traits" and "source traits" 
that have been discovered by the various meth- 
ods at present in vogue, namely, “behaviour rat- 
ings," “self-inventories,” and “objective tests.” 
From this catalogue he eventually deduces a 
short series of twelve “primary” source traits, 
which he considers to be now well established. 
For admission to this final list, a factor must 
have been found in at least three independent 
researches. In his scheme of the general struc- 
ture of personality, Cattell appears to agree with 
most workers in this country. He distinguishes 
(a) innate or constitutional traits from (b) 
those that are acquired, and recognizes three 
main aspects, namely, (i) cognitive abilities, 
(ii) temperamental characteristics, and (iii) 
conative or motivational tendencies. His twelve 
primary traits include so-called "general fac- 
tors,” such as “general intelligence” and “get 
eral emotionality,” and a number of tempera- 
mental, emotional, or psychoneurotic factors 
of narrower range, such as “cyclothymia, 
“surgency,” “dominance,” “anxiety,” “пеша” 
thenia,” and their opposites. The descriptive 
labels given to each indicate an attempt to a 
visage them as essentially bipolar in form, a" 
suggest that much of the material has been core 
ered with the psychoneurotic rather than us 
normal personality in mind. The person?" 
traits, however, do not include “special aptitudes 
or abilities,” Of these, Cattell believes t 
twelve are “more basic”; but he thinks another 
half dozen or so should be recognized as d 
tablished by present research." Both the scher 


and the list which he gives in his book coincide 
e list which he gives he omission 
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imagery) fairly closely with those of Burt in 
The Measurement of Mental Capacities (1926). 
His whole review of the factual data, supple- 
mented as it is by a bibliography of nearly 300 
references, constitutes а mine of information, 
which will be invaluable to future investigators. 
The statistical psychologist will doubtless be still 
more interested by Dr. Cattell’s discussion of 
methods than by his provisional summary of 
results. Here he shows himself to be far more 
eclectic and impartial than most writers on this 
subject. Further, he is one of the few psycholo- 
gists who seem equally well acquainted alike 
with British and with American work. Instead 
of appearing as the champion of any one school, 
he takes what is most plausible from each of 
them in turn, adds much that is both original and 
suggestive, and endeavours (with the aid of a 
somewhat formidable terminology of his own) 
to combine them into a “coherent but critical 
synopsis.” His main suggestions in regard to 
mathematical procedure are brought together 
towards the end of his book in the shape of a 
systematic presentation of theorems.” He be- 
gins with what he terms the “basic theorem,” 
namely, that “every behaviour manifestation of 
personality, PR (Personality Response), for 
an individual i, can be predicted by the following 
type of equation : 


РК = 5,,Ти + SojTos ... + Si jT» i 
poa Tja ej () 


where Si, Spy, etc, are factor loadings, and 
Ш T, etc., are source traits." Of the T's, some 
are “general to many performances," others ap- 
parently limited to groups of performances ; and 
at least опе, Т, is “peculiar to the situation in 
Which PR; is measured”; while e is “some ex- 
eui error of measurement, not function 
foo It will be observed that, unlike 
«_tstone, he has no hesitation over accepting 
ha factors; and, unlike Spearman, he is 
or «i to recognize a large number of “group” 
“the Vot factors. But he rightly insists that 
ü nunt between general and group factors 
Pu tive rather than absolute,” and that, ех- 
"t in abilities, we should not expect either gen- 
or group factors to be wholly positive." Con- 
айу » he does not reject bipolar factors 

d ng negative saturations. On the other 
sonali he fully agrees with Thurstone that “рег- 
ind ity source traits seem themselves to be not 

pendent, but correlated.” Now I venture to 
“uggest that for the investigator the basic prob- 


= 


B83 


lem is not so much to “predict” the “personality 
responses,” as to predict, or rather to estimate, 
the “source traits.” This Dr. Cattell discusses 
more briefly, He describes two modes of calcu- 
lation—a theoretical or “precise” procedure, and 
a rough and ready practical procedure, For 
theoretical purposes he gives a regression equa- 
tion of the usual type, namely, 


Тн = Boa 2 PR; + Bp» * PR, Spr 
+ By: PR (ii) 


where Т represents "the personal endowment 
of individual i” in regard to the pth “source 
trait,” and PR, etc, аге described as the in- 
dividual's “endowments in each of the perform- 
ances”: (unless I have misunderstood him, 
however, “endowment” as applied to PR must 
be a slip for "achievement" or *measurement"). 
Then, turning to the more practical methods of 
assessment, he observes that "in applied psy- 
chology, such estimation by weighted scores 
usually involves more trouble than is justified 
by the slight gain in accuracy. It is sufficient to 
find half a dozen or more variables heavily 
loaded in the factors concerned, and little in any 
other, or with mutual suppressors in others: 
these are weighted with unity, and all others 
zero" (except, of course, the “suppressors,” 
which will presumably have weights of minus 
unity). His discussion of these principles, and 
the corollaries that he draws from them, have 
provoked considerable criticism both from а 
theoretical and from a practical standpoint. The 
theoretical critic objects that many of the in- 
ferences deduced hold good only if the factors 
or “source traits” are assumed to be orthogonal, 
which Cattell has explicitly denied. Certainly, 
his somewhat condensed account of “factor pat- 
terns," "factor structure," and the requisite 
“loadings” (pP. 559-560) is exceedingly dif- 
ficult to accept as it stands. But Cattell himself 
warns us that here he is writing for “readers who 
will not be primarily statisticians” ; and it may 
be that some of the apparent confusion is due 
to a desire to present а technical argument in à 
non-technical form. His suggestions for prac- 
tical assessments have invited criticism from an- 
other quarter. If after all it is sufficient just to 
“average the group,” why the preliminary math- 
ematical calculation, and why the elaborate 
theorems? If, on the other hand, the preliminary 
mathematical analysis is essential to scientific 
rigour, why fall back, at the end of it all, upon 
simply totalling or averaging the measur 
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just as they stand? Cattell’s reply apparently is 
that the preliminary factorial calculations are 
needed to show which traits deserve selection 
for the final process of averaging: "the mathe- 
matical work reminds us that the choice is not 
arbitrary, but directed by factor analytic find- 
ings." But he might, I fancy, carry his defence 
still further. His answer appears to concede that 
the use of simplified or even arbitrary weights, 
and the acceptance of unstandardized marks, 
necessarily entail a sacrifice of mathematical 
rigour. But this, I would suggest, need be by 
no means true. Such a view perhaps demands a 
brief demonstration. We may fully agree that 
the “basic theorem" should deal with the factori- 
zation of the measurement matrix rather than 
of the correlation matrix. Moreover, if we re- 
write his equation (i) (adopting matrix nota- 
tion with the usual symbols) in the form 


PR=My)=M,+M2+ зуе (в + М, (ia) 


where 
Mi =; Ын (his STi), 


then (as I have shown elsewhere) we may take 
pi=WiaMo, fi = Mow, and vi = wi Mown 

(iii) 
where the wia and wi» may each be any column 
of weights or “loadings” (provided, of course, 
they do not reduce the divisor, v,, to zero). M, 
will thus be "hierarchical" (of rank one). And, 
on applying the same procedure to each of the 
residual matrices in turn, we shall obtain r 
linearly independent factors. According to our 
aims we may seek to maximize the ws; or we 
may treat them, as consisting of --1 only; of 
++I or —1; or of +1 and zero; and so on. Again, 
we may prepare M for the weighting process by 
subtracting means, by normalizing the devia- 
tions, or by first reducing M to a square sym- 
metrical form by working with R — MM". The 
omission of these familiar restrictions in the in- 
terests of practical simplification may or may 
not impair the theoretical interpretation or the 
scientific accuracy of the measurements as fi- 
nally computed. But it is clear, from the freedom 
allowed us in choosing our w’s, that it need not 
destroy the mathematical rigour of the analysis. 
In identifying factors, and in rotating the initial 
factors to obtain a consistent structure, he con- 
siders the most fundamental principle to be that 
of "proportional profiles." Factors are the same 


when their saturations are proportional. We are 
therefore to rotate axes until this proportionality 
is secured. But, when several factors are in- 
volved, this requirement, he believes, will be 
difficult to secure without “harassingly numer- 
ous" equations. Here I venture to suggest that 
the "symmetry criterion" (which was originally 
deduced from the proportionality principle) 
might serve his purpose. Although at times his 
work shows signs of haste and has apparently 
been somewhat hurriedly revised, it is unques- 
tionably as full of ideas as it is of data. It forms 
a valuable contribution to its subject; and can 
be neglected by no writer or investigator who 
desires to enter the field of individual psychol- 
ogy. 

J Abn & Social Psychol 43:114-8 Ja 48. 
J. P. Guilford. This is an important book. No 
one who proposes to deal with personality in a 
significant manner, in theory, in research, or in 
practice, should fail to read it, furthermore, to 
study it. In the treatment of personality, it at- 
tempts to put first things first. Fortunately, it 
was written by one who encompasses within his 
preparation and point of view a clinical back- 
ground and a strong bent toward rigorous types 
of thought and technique. * The author's sys- 
tematic position.is represented by his definitions 
of crucial concepts. À trait is defined as "a col- 
lection of reactions or responses bound by some 
kind of units" (p. 61). It would probably avoid 
some logical problems not to identify a trait with 
the reactions themselves, but to regard it as 
something inferred from reactions. This latter 
view regards reactions as indicators (à la Stag- 
ner) but does not necessarily require the p 
ception of a trait as a determining tendency (à : 
G. W. Allport). * Unusually sharp distinction 
are made between constitutional (hereditary 07 
congenital) traits and environmental 
traits. It is doubtful whether many will go along 
with Cattell on this. The distinction comes | 
abstraction, for the vast majority of Dae 
not all, have their joint contributions un 
reditary and environmental souces, nd 
tively more from the one and others re ай : 
more from the other. The distinctions pape 
three major categories of behavior trait 
ties, dynamic traits, and temperament Sn de 
he regards as practical but artificial. Any її. 
mental trait has three different cipe ҮШ 
states, “All behavior, in fact, can just as sive: 
be said to manifest ability as to manifest € ted” 
all modalities are present, ready to be abstra 
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(p. 165). The reason why factorists find modal- 
ity factors is that they set up research designs 
which abstract them. Dynamic traits are meas- 
ured best when incentives vary and abilities are 
measured best when complexity of tasks varies. 
No coordinate condition is set up for abstract- 
ing temperament traits; they are found, says 
Cattell, by exclusion. "Under general, unspeci- 
бей, natural conditions of trait-element sam- 
pling, a source trait is not of any one modality 
alone” (p. 181). This is quite novel doctrine. It 
is true that behavior is nonmodal; it is neither 
pure ability or motive or temperament. Traits 
are descriptive categories, however, from which 
predictions are to be made. Descriptions must 
analyze in terms of aspects as well as in terms 
of elements. Cattell clearly recognizes this in his 
stress on factors, which are variables. If factors 
tend to fall into natural groups depending upon 
major aspects, we should not try to ignore the 
fact. Factors that fall into modalities Cattell calls 
“conditional” factors. The underlying factors, 
without modality, he calls “naturalistic” or 
“wholistic.” It is hard to imagine what these are 
like, and if they can be described only through 
their modality aspects we must be ready to face 
the fact. It must be admitted that there are a 
few factors which can be classified either as in- 
terests or as temperament traits, e.g. sociability. 
Any classification system has its borderline 
cases, It is not so easy to conceive of factors that 
are on the borderline of temperament and abil- 
ity. There have been frequent suggestions of 
Pairings of interests and abilities, but the find- 
ings in this area generally show rather low inter- 
Correlations between corresponding pairs. Cat- 
нв distinguishes two kinds of traits in a manner 
àt promises to be logically and operationally 
i very useful—surface traits and source traits. * 
Incidental to the carrying out of his main ob- 
Jectives, Cattell makes some very astute sug- 
ee concerning a number of logical prob- 
m of measurement and research. His discus- 
= of types „of measurement—interactive, 
ате and ipsative—is thought provoking. 
the m worth reading by those concerned with 
x оріс of measurement, though not all will 
$ = With all of his conclusions, particularly the 
ШЕ. Tse status that he assigns to psychophysical 
теш His discussion of the design of 
E investigations presents many novel 
those S and it should extend the horizon of 
i то, intend to approach personality from 
irection. He recommends the use of inter- 
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correlations not only of tests but of individuals 
and of occasions ; and not only the intercorrela- 
tions of status scores but also of concomitant 
changes in status. These suggestions should 
broaden the base of research opportunities con- 
siderably. In places, the author shows a tend- 
ency to straddle on issues, if not an ambivalence. 
For example, he subscribes to Allport's dictum 
that most traits are unique to persons, even go- 
ing so far as to say that all traits are unique. 
Having made this bow to Allport, he properly 
points out that individuals can be unique with- 
out there being unique traits. He then hedges 
further by saying, "It is therefore feasible to 
say, with respect to some traits, that different 
individuals have more or less of the same trait" 
(p. бт). Still later, but not much later, he says 
“Mathematically....a common source trait is ab- 
solutely common, its uniqueness being expressed 
through additional traits" (p. 61). The author's 
position on this issue is therefore not clear. It 
is unfortunate that in the direct quotation just 
cited the last phrase was added, for the sentence 
seems to say that a source trait is both common 
and unique. What the author probably means is 
that, when common source traits occur in com- 
bination, the composite is unique. Adding a 
second source trait to the first does not change 
its nature. Ambivalence is even more apparent 
in his treatment of personality inventories ог 
questionnaires. His criticism is at first quite 
scathing, e.g. “...а5 а serious approach to dis- 
covering the structure of personality it (ques- 
tionnaire) represents the nadir of scientific in- 
ventiveness and subtlety" (p. 341). Again, 
“Like most of the lower forms of life, the self- 
inventory is very prolific" (p. 343). Having 
made such statements, he devotes a full chapter 
of 54 pages to factor-analysis studies of ques- 
tionnaires and leans heavily upon their results 
in drawing up and in defining his final list of 
“the established primary traits.” His chapter on 
results from objective tests is excellent. It makes 
one regret that investigations of this type have 
not gone farther. Their limitation is no fault of 
the author, of course. It is regrettable, however, 
that he did not weight relatively more these re- 
sults, as compared with those from behavior 
ratings, for example. Readers interested in the 
performance-test approach will find masterly 
summaries on tests of different kinds and dis- 
cussions of certain important problems, such as 
speed vs. power and speed, fluency, tempo, рег 
severation, and psychosomatic factors. Toward 
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projective techniques the author is quite cool. * 
It has previously been indicated that too much 
attention was given to behavior ratings. This is, 
indeed, a weak point in a generally excellent 
book. Cattell properly objected to semantic er- 
rors in naming traits, in Chapter 4. He sets up 
popular terminology as the best source of de- 
scription of the complete personality in Chapter 
9. In connection with this chapter, it will pay the 
critical reader to turn to the primary source ma- 
terial in journal articles as a proper background 
for evaluation. Ratings, as is well known, are 
beset with biases of many kinds. The halo error 
and the logical error, in particular, make ratings 
unsuited as material for factor-analysis studies, 
unless means are employed for identifying and 
allowing for those sources of error. To a large 
extent, a factor analysis of ratings is a study of 
language and meanings rather than of psycho- 
logical traits. The one analysis upon which so 
much depends is questionable from several points 
of view. In condensing trait terminology from 
several thousand concepts to 35 variables to be 
rated, each of the 35 variables came out with 
several pairs of trait names attached to it, many 
of which are not synonyms. What is worse, 
many trait names were listed as describing more 
than one variable, e.g., friendly, deliberate, soft- 
hearted, optimistic, aloof, and assertive. These 
facts serve to muddy the waters as well as to 
favor the halo effect, which is present even when 
exclusive terminology is used for each variable 
rated. There is also the question of the number 
of source traits involved. Should this number 
approach 35 (and in the total personality it 
probably exceeds that number) and should that 
many be represented in the rated variables, a 
factor solution would be precluded. Even if the 
number of factors actually present were small 
enough to permit a solution, the probable fac- 
torial complexity of each rated variable would 
work against satisfactory psychological inter- 
pretation of obtained factors. It is difficult 
enough when there are a few factorially pure 
experimental variables in an analysis; when all 
experimental variables are complex the task of 
interpretation becomes quite precarious if agree- 
ment of observers is to be reached. In the analy- 
sis in question, which seems most basic to the 
final list of "established primary traits," the gen- 
erally low communalities present another handi- 
cap. Of the entire table of 420 rotated factor 
loadings, only four were greater than .50; only 
IO more were greater than .40; only eight of the 
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factors had any loadings at all greater than 39; 
and three factors had no loadings greater than 
.30. Yet all factors were given interpretations, 
It is exceedingly difficult to discern the nature 
of an element common to three or four variables 
when that common element does not make up 
more than то to 20 per cent of the total variance 
of each. The reviewer made an independent at- 
tempt to identify the 12 factors derived from 
the analysis of ratings, with the result that he 
could interpret three of them and gave up on 
the remaining nine. On two of the three (A and 
C) he reached different interpretations and ti- 
tles than those offered by Cattell. Factor B, 
which was called g, he cannot accept as a single 
source trait, in view of the mass of evidence 
against referring all intellect to a single com- 
mon factor. Most regrettable is the discussion of 
“the established primary traits” in the first part 
of Chapter 12. In the first place, these 12 are 
built mainly on the foundation of the weak 
analysis just referred to. In the second place, the 
number of relatively independent source traits 
must certainly be much greater than 12. There 
are more than that number of ability traits alone 
that apparently did not find any representation, 
direct or indirect, in the list. The 12 primary 
traits seem to be either caricatures of genuinely 
distinct factors or to be composites of several 
others (e.g., the g factor). The terminology used 
in naming the 12 primaries is also open to ques 
tion, although this is a minor issue. It eed 
that too much attention was paid to pathologic 
syndromes, in naming and defining the fina 
factors, after earlier recognition that syndromes 
are complex surface clusters. Certain terms e 
used in naming several factors; бусаи 
emotionality, and character, each three time 
and schizothyme twice. This replication ш) 
not contribute to clarity of the picture x oo 
sonality structure. It is highly destro : 
attempt to identify factors found from one xl 
proach with factors found from another vd 
proach. The same factor, if genuinely desc ү 
tive, should be sensible by different perit 
To prove any such identity is no mean ас 


t the author 
plishment. It should be noted tha Шет em- 
my have 
pirically. It would have been Deni avail- 


able before attempting very much i 


PEE Cy ` tity 0 
identifications. In view of the great aia 
intricate detail mentioned, the volume font 


: 0 
markably free from errors. Two errors 
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gorization were noted, however. The identifica- 
tion of Thurstone's N (number) factor with 
mathematical ability is unwise, for mathematical 
ability is surely much more than facility with 
number operations (p. 219). The combining of 
physical strength with endurance in a single 
factor runs counter to experimental evidence 
that calls for separation of the two (p. 220). The 
insistence of a Spearman g factor is a definite 
bias, and one of far-reaching consequences 
which the reviewer believes are unfortunate. 
One more weakness might be pointed out, and 
that is the author’s dependence upon published 
factor analyses based upon very small samples 
(sometimes до to 70), or at least his apparent 
failure to weight such results properly as com- 
pared with studies in which N amounted to sev- 
eral hundred cases. To recapitulate, this is an 
important book. It should be read for its presen- 
tation of a rigorous and objective conception of 
personality ; for its systematic treatment of traits 
and syndromes ; for its lucid definitions of such 
concepts as ability, interest, attitudes, and tem- 
perament; for its provocative discussions of 
measurement theory and methods of research; 
for its clear exposition of factor-analysis theory 
as applied to personality and for some new finer 
Points in factor solutions ; and for comprehensive 
summaries of factor analyses of behavior traits, 
ну inventories, and performance tests. It 
nm ih be taken seriously for results pre- 
ib i factor analyses of ratings or for its 
be list of established primary traits." The 
ine established source traits is almost certain 
ао longer than 12, and will require con- 
achi y more investigation before it can be 
ieved, 
Y & Mental Dis 110:177-8 Ag 49. 
ШШ sa M. K (elley). Any review of this book 
йор begin with hearty agreement with the 
"ti Vena ies There is no doubt but that 
eil of the issues discussed are admittedly 
dre and. the critical reader may ос- 
шоп y find himself in disagreement with 
бог” VER conclusions expressed by the au- 
did im is reader finds the above statment mild, 
е еге seem to be many occasions for dis- 
by the ci An outstanding example, as selected 
Кем itor, is the author's statement that the 
analo ach test can be dismissed “as something 
dh i to a patent medicine," or even that the 
the nd as suggested by the author represents 
dd 22 in psychologic advancement. We do 
› however, with Doctor Terman 1n his in- 
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troduction when he states that the reader "should 
be compelled to think, as he certainly will be by 
this book." To begin with, thinking is not only 
of value but essential in the reading of this com- 
plicated volume which introduces an approach 
to the problem of personality through techniques 
of factor analysis. Superficial survey of the vol- 
ume leaves one completely confused and yields 
the notion that the author has attempted to re- 
duce human function to a series of traits which 
may appear statistically valid but are hardly 
worth the effort when applied to the individual. 
Detailed re-reading, and the book will require 
several re-readings to be adequately understood, 
emphasizes the point that statistical and clinical 
procedures must be brought into close juxta- 
position if psychology is to function in a sound 
scientific fashion. Clinicians will take an ex- 
tremely poor view of the requirements for vali- 
dation of various clinical methodologies widely 
used and apparently with some success. Stat- 
isticians will gleefully agree that the methodolo- 
gies employed are most accurate. Somewhere be- 
tween these extremes probably lies the truth. 
While there is no.doubt that, to date, many clini- 
cal procedures are not adequately validated from 
a point of view of scientific methodology, there 
is equally no doubt that such procedures seem 
to be of value when one gets down to the actual 
specific problem of handling an individual case. 
That they are not adequately tested is no reason 
to discard them. It would seem better to concen- 
trate on their validation «than to consign them 
willy-nilly to the ash heap. The author of the 
nt volume points out that this book repre- 
sents Volume I of a two-volume series. The 
present volume presents а complicated discus- 
sion of personality traits and expert knowledge 
is required to follow him through the maze of 
tabular analysis employed. He eventually pro- 


duces twelve primary source traits of personality 


which are an excellent approach to the compli- 


cated problem of what is personality. There is 


nothing in the book, however, to demonstrate 
that these twelve basic traits are any better in 
the individual clinical case than anybody else's 
concept of personality. 'The author's discussion 
of the principle pathologic syndromes seems to 
lack a genuine feeling of knowledge born of 
long experience with psychopathology. One 
wonders whether such a lack has not colored 
the entire volume. Finally, while the book is 
written for practicing psychologists in guidance 
education, industrial and other personnel work, 


prese 
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it is exceedingly doubtful if many of these busy 
practitioners will have the time or energy to 
plow through the complicated writing habits of 
the author. Much of the material contained in 
this book is of extreme value. The effort neces- 
sary to secure these nuggets of value makes it 
hardly worth while. 

J Social Psychol 26:269-79 N "47. Robert P. 
Fischer. * 'The first section presents the statis- 
tical methods and mathematical argument neces- 
sary to discover and delineate the fundamental 
personality traits. Here Cattell presents for the 
first time some extremely ingenious statistical 
techniques which should prove useful if not es- 
sential in all types of psychological measure- 
ment. The second section....consists of a critical 
summary of all the factor analytic studies of 
abilities and personality traits attempted up to 
1945. Applying his newly developed techniques 
to these studies, Cattell gives a contingent solu- 
tion to the probable number and nature of the 
factors in each research. By a careful matching 
technique he then extracts the common factors 
and catalogues these source traits in a convenient 
and usable taxonomy. As a sort of epilogue, 
Cattell discusses in an incisive and sometimes 
iconoclastic way what he considers to be the 
shortcomings of much current research and as 
an antidote develops a vista for future study. 
Cattell clearly recognized that any satisfactory 
solution to the enigma of personality must be 
founded in a precise set of operational defini- 
tions. * The great confusion as to what consti- 
tutes a personality trait seems to have grown 
out of the notion that surface traits constitute 
the ultimate deterministic irreducibles of per- 
sonality. By presenting clear-cut operational 
concepts of surface and source traits, Cattell 
has done much to straighten out the hitherto 
existing chaos. Indeed, in the reviewer's opinion, 
by handling personality description under sys- 


tematic concepts of source and surface traits, 


Cattell has superseded the debates of Allport 
and others on what constitutes a trait. Should 
the reviewer's optimism on this point prove to 
be substantially correct, this contribution alone 
would justify the book. * The outstanding con- 
tribution of the covariation chart lies in its syn- 
thesis of all the methods of covariation pre- 
viously considered only in isolation. It puts into 
a readily seen perspective a group of useful tech- 
niques and should prove especially helpful to 
those who are engaged in teaching the intrica- 
cies of factor analysis. It has been the experience 


of the reviewer, in teaching the material, that 
only by the chart are the various methods read- 
ily grasped by students. Among the 18 designs 
of covariation elaborated upon in the book, the 
notion of “P-Technique” has special value for 
clinical psychology. “P-Technique” is the 
method of covariation which studies the varia- 
tion of time and test within one person. * The 
*P-'Technique" permits factorization within the 
individual and provides a method whereby dy- 
namic trait functional unities can be put ona 
measurement basis and done so on a footing oí 
unique traits and of common traits. The factor 
obtained with this method is essentially a meas- 
ure of the interactions of a constitutional tem- 
perament factor with the level of character inte- 
gration. Thus there is presented here a new tool 
for clinical analysis which is more objective than 
other methods currently used for the same pur- 
pose. Essentially it is a very usable device for 
studying a given person over a period of time 
under a variety of circumstances. In an inter- 
esting and speculative discussion of the predic- 
tive convenience of modalities Cattell develops 
an argument for the use of conditional modali- 
ties from which he derives a conditional factor 
pattern or conditional factor regression equation 
which he feels is more useful than the wholistic 
or natural factor pattern. * After designing two 
clever means for obtaining the predominant 
modalities (too detailed for review here), Cat 
tell develops a conditional factor pattern ог ү 
ditional factor regression equation by which the 
equation for predicting a given variable 80% 
for a given individual has the wholistic of s 
controlled factor replaced by the three con l 
tional factors, each with a distinct and ap 
priate loading. * Cattell’s concept of ergs dw 
metanergs is interesting but may be regarde Р 
some psychologists as simply new Wine in bile 
bottles. Close inspection reveals certain s 
differences between these two terms and xdi 
near-equivalents in classical American de 
ogy, viz., drives and motives. For inpr di 
definition ergs have goal value while driv nij 
not. Whether the distinctions are real of hich 
semeiotic will depend on the context 11 arise 
they are used. Furthermore, there may 0-4 
some justifiable argument that Cattell disp 
ence to ergs as innate psychophysical pre acquire 
tions, which permit their possessors to e read 
reactivity to some classes of objects MON”. f- 
ily than others, assumes more hereditary i | 
ences on behavior than the data warrant. 
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the reviewer takes sharp exception to Cattell’s 
notion that there are some 20 appetitive and 
non-appetitive ergs in man. This sounds too 
much like the old instinct naming which has 
had a well deserved demise. As to the 
derivation of metanergs from ergs, here again 
the reviewer feels forced to part company tem- 
porarily with Cattell, both on the ground that 
the term metanerg is really not greatly different 
from our term motive, and secondly on the 
ground that there is no clear-cut evidence which 
supports the view that metanergs (motives?) 
are directly derivable from ergs (drives). As 
Cattell sees it, a metanerg corresponds to a con- 
ditioned reflex, especially when applied to the 
learned modification of drives which are not 
simple reflexes. Indeed, social psychologists have 
long concerned themselves with attempts to de- 
rive motives from drives, but thus far there is 
no clear-cut evidence (save perhaps the ques- 
tionable clinical evidence of psychoanalysis) 
that such an ambitious program has yielded posi- 
tive results. On the other hand, if ergs and 
metanergs are really different from drives and 
motives, as Cattell maintains they are, then of 
course the above argument is no longer cogent. 
* The reader especially interested in factor 
analysis will find Cattell’s summary of the prin- 
ciples for determining factors extremely useful. 
They are: (a) Rotation to agree with clinical 
and general psychological findings, (5) rotation 
{о agree with factors from past factor analyses, 
(c) rotation to put axes through the center of 
clusters, (d) the principle of orthogonal addi- 
tions: rotation to agree with successively es- 
tablished factors, (е) the principle of expected 
Profiles: rotation to produce loading profiles 
потап with general psychological expecta- 
ions, and (f) the principle of proportional pro- 
jn or "simultaneous simple structure." This 
s» principle, the principle of parallel propor- 
‘onal profiles, is clearly the most basic. This 
m e is based essentially on the assumption 
5 at the principle of parsimony in rotation should 
ot apply to a single matrix as does Thurstone’s 
ше of simple structure but to all matrices 
e together, especially when the corre- 
Fn making up the matrices are derived from 
oen een fields of psychological observa- 
ms The criterion is then no longer that the 
bs lon shall offer fewest factor loadings for 
. uU matrix, but that it shall offer fewest 
mr (and therefore fewest distinct total) 
ings in all the matrices together." Simple 
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structure does not demand this condition but 
neither does it preclude it. Even though Thur- 
stone maintains that the weights of a primary 
ability for a test must remain invariant when 
changed from one test battery to another, this 
condition is expected to follow from simple struc- 
ture rather than vice versa. The rationale of Cat- 
tell’s argument is that real psychological func- 
tional unities will appear as mathematical fac- 
torizations in many different kinds of situations 
whereas the artificial mathematical factors 
(which might easily result from simple struc- 
ture) will be contingent on the peculiar matrix 
in which they appear and thus will not be found 
in all possible matrices. The method of parallel 
proportional profiles is most ingenious and 
would prove at once useful were it not for the 
prohibitive mathematical labors involved in its 
application. It involves the solution of sets of 
simultaneous equations and since the number 
of such equations becomes tremendous when 
more than three factors are involved, the labor 
of computation is too much for any practical situ- 
ation. The use of determinants or Doolittle’s 
method for the solution of simultaneous equa- 
tions both require vast amounts of time and 
until more expeditious methods of solving these 
equations are devised, the principle of parallel 
proportional profiles can expect to find only very 
limited applications. A mathematical colleague 
of the reviewer’s informs him that no short-cut 
method is at the present in the offing and he is 
somewhat pessimistic about one appearing soon 
since mathematicians have long labored over 
just this problem. Furthermore, as Cattell points 
out, until the special difficulties connected with 
oblique axes and variable errors in the given 
data are ironed out, the method must be with- 
held from routine practical application to factor 
solutions. Thus far this review has been con- 
cerned chiefly with the more general theoretical 
methods in Cattell’s exposition. The remainder 
of the review deals for the most part with the 
results obtained by the use of these methods. It 
is this section which should appeal most to the 
applied psychologist, since it presents for the 
first time, an intelligible and meaningful coali- 
tion of hundreds of studies which, when con- 


sidered alone, are not only confusing but often 
meaningless. The two chapters of results of 
most importance are the ones in which the princi- 
pal surface traits and source traits, discovered 
through behavior ratings, are presented. * The 


discovery of the primary source traits (factors) 
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derived from behavior ratings constitutes the 
most important of the findings in the book. * By 
use of centroid analysis into simple structure, 
12 factors finally emerged. * After contrasting 
the -above 12 factors with those of other re- 
searches and clearly marking all cross-research 
identities, Cattell presented them in a convenient 
taxonomic catalogue, giving the principal clus- 
ters in each. * Of clinical importance is the fact 
that there is more than one factor in each of 
several realms. For example, there are clearly 
two factors (A and H) in the schizothyme realm 
and maybe even a third factor in this realm, 
Factor (L). Similarly there appear to be two 
factors in the surgency realm, Factors (F and 
L), and two factors in the character realm, Fac- 
tors (C and G). Doubtless much of the confusion 
obtained in attempting to get clear-cut syn- 
dromal distinctions has grown out of the fact 
that these syndromes have more than one factor 
contributing to their variance. * Another inter- 
esting and quite promising development is the 
concept of trait interaction or catalysis. Certain 
traits determine the presence or absence of other 
traits but do not enter into them. Cattell, in de- 
veloping this point refers to them as catalysts, 
a term which seems to describe them well. Be- 
fore the real significance of trait catalysis can 
be fully comprehended, much more research 
must be directed at this concept. From Cattell's 
results, it would no longer appear to be correct 
to refer to distinct syndromes: e.g., this man a 
schizothyme, this one an anxiety neurotic. What 
is now called for is a statement of how much of 
this individual's behavior is determined by Fac- 
tor 4 and how much by Factor H for example 
or how much of the second individual's behavior 
is neurosis and how much is surgency factor. * 
Unfortunately this work will be read with under- 
standing by only a very limited number of psy- 
chologists, because it assumes that the reader is 
proficiently familiar with the fundamental intri- 
cacies of factor analysis. The absurdity and char- 
latanry so rife in the field of personality bears 
adequate witness to the need for some rationale 
for the sane construction and application of per- 
sonality tests. Yet it is probably too much to 
hope that the psychologists who most need to 
read this book of Cattell's will either be able or 
inclined to do so. Those who will read the book 
will likely not remain indifferent about it. It is 
a book, both because of the nature of the ideas 
developed and the iconoclastic vein in which 
much of it is written, that is calculated to in- 


to have distinct advantages over the me 
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cite extreme reactions. For example, Cattell’s 
criticism of objective personality tests quite 
probably will stir up resentment. But it will also 
provoke some critical evaluation of these devices, 
Cattell’s insistence that the statistical analysis of 
individual differences offers a more fertile field 
for the study of personality than does the labora- 
tory is bound to be met with sharp criticism, 
What he appears to be driving at is that the 
artificial nature of the experimental situation 
frequently leads to isolated prediction of a func- 
tionally sterile sort, whereas observations in 
situ (principally behavior ratings by psycholo- 
gists acquainted with the subjects being rated), 
when properly analyzed by the methods of cor- 
relation and factor analysis, are almost unlimited 
in their predictive value. There can be no doubt 
that there is much truth in his assertions. The 
mere summation of many isolated bits of be- 
havior by no means creates a psychologically 
meaningful whole or unity. Cattell’s ultimate 
aim would be to analyze, by appropriate statis- 
tical methods, the fundamental processes of per- 
sonality and once discovered to attack these 
processes and concepts by experimental meth- 
ods. The reviewer holds the view, however, that 
better techniques for observing behavior than 
those specifically used in his book are necessary 
if Cattell’s ingenious methods are to have max 
mum value. After all, the sophisticated refine 
ment of observations by advanced statistical 
techniques cannot improve the accuracy of the 
observations one whit. It can only extend their 
meaning and use. The use of laboratory tech- 
niques, designed to conceal their purpose from 
the person being observed, and sufficiently Se 
tensive to avoid the errors of predictive limita- 
tions noted above, would seem to the reviewer 
пой of 
behavior rating. Cattell's own cursive шшш 
situation tests, especially in their appara » 
form, would be the sort of thing the (p 
would like to have seen more generously use ei 
the collection of the original data. Undoubte Л 
however, the use of all conceivable methods 
observing behavior, when properly ini 
by factor analysis, will serve best the йш 
of data collecting. Probably most реоре ok 
agree that the principal contribution of би ibe 
is in the methodology developed. Certainly 
techniques introduced in this work are bo e 
far-reaching. It is the opinion of the Le be 
that the methodology constitutes the outs 


f the 
ing contribution. However, the discovery E 


a 
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basic source traits may provide a rational basis 
for test construction which may be considered 
by many just as important a contribution as the 
methods. Regardless of the position taken, one 
must certainly agree that the volume reflects a 
tremendous amount of work of a bold and schol- 
arly type. One of the outstanding features of 
any systematic treatment lies in the use that can 
be made of it in future research. We shall all 
anxiously await Cattell’s tests to measure the 
12 primary source traits. We shall also await 
the arrival of studies on larger samples and from 
different populations. For example, an applica- 
tion of his techniques to a psychotic population 
would probably yield a meaningful genotypical 
taxonomy. Furthermore a study of children, es- 
pecially by the “P-Technique” would doubtless 
throw much light on the genetic development of 
personality. Whether Cattell’s arduous task will 
bear fruition in future research is, of course, the 
final test of its merit. 
Psychiatry 11:215-6 My ’48. Margaret Ives. 
* This book....presents naively the accepted syn- 
dromes resulting from the “common sense ob- 
servations of clinical psychology and psychiatry” 
* Cattell's approach to the study of personality 
is mathematical, particularly through the tech- 
niques of factor analysis. He believes it ex- 
tremely unfortunate that so few psychologists 
(and even fewer psychiatrists) are prepared to 
master the technical difficulties of factor analy- 
sis; yet “the human mind is certainly no less 
complex than the finest of engineering struc- 
tures or the most intricate of man-made ma- 
chines.” He quotes Thorndike for his first 
axiom, as follows: “Whatever exists, exists in 
some quantity and can—at least theoretically— 
be measured.” Granting the truth of this, this 
teviewer would stress Thorndike's parentheti- 
cal insertion; would change Cattell’s “no less 
complex” to “so much more complex”; and 
Would conclude that even the most vigorous 
mathematical training as yet permits only rudi- 
mentary beginnings of measurement of the sub- 
CN of human personality. This does not mean 
thy aa approach of this book is not valuable and 
all t-provoking. * Whether one agrees with 
i, 5 Opinions here expressed or not, the book 
5 а challenging one and lays a foundation for 
ather research along this line in confirmation 
1006 сайоп of the list of traits. However, al- 
id Dr. Cattell pays lip service to other 
is: 8 of approach than the mathematical, his 
in that direction is attested by his relega- 
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tion of the Rorschach to the level of a “patent 
medicine” regardless of its great empirical value 
when administered by experts. This is because it 
has not lent itself to satisfactory factor analysis 
except as a measure of “fluency of association, 
oscillation, and possibly persistence, but none... 
with sufficient items to be reliable.” Also the in- 
clusion of environmental mold traits, though it 
is undoubtedly necessary at present, makes it 
evident that the source traits discovered are 
valid only for the culture in which they are 
studied and need not at all be basic human per- 
sonality traits. For example Factor В, Intelli- 
gence, general mental capacity, versus Mental 
Defect, includes certain character traits, such as 
“persevering vs. quitting" and "stable vs. 
changeable," which need not necessarily be as- 
sociated with intelligence at all. Thus the mathe- 
matical criterion of unity—co-variation of parts 
— need not be synonomous with true identity. 
Psychometrika 12:233-5 S 47. Roswell H. 
Johnson. * This is something to welcome, їп- 
deed, for nothing approaching this scope has 
appeared since Gordon Allport's "Personality: 
A Psychological I nterpretation." That came out 
in 1937 and thus followed after the earlier im- 
pressionistic writers such as Jung, but before 
the rich surge of statistical inquiries which fol- 
lowed. It is high time that such a book as Cat- 
tell’s appeared and that it should be that author 
to write it, because his work has been broad 
enough to do what is needed, viz: bring together 
the results from behavior ratings, personal ques- 
tionnaires, objective tests, and clinical observa- 
tions. It is also important that it should be done 
by one devoted to rigorous statistical methods, 
especially factor analysis. As the preface warns 
us, however, the title given is not really the title 
for this book, but rather for a series of two vol- 
umes of which this is the first. A more modest 
and accurate title for this volume would be "The 
Classification of Temperament.” The next will 
be on the development of temperament. There 
will be little help in this volume, and probably 
in the second, for one who wants to measure рег- 
sonality. For that, one must look to the new 
Buros handbook now in preparation, and one 
will obtain some help from Rapaport. One short- 


coming....i.... 
written long hers 
a fresh contemporaneity їп every part. As an 
illustration, no papers 


by Humm since 1933, ап 
al are cited. Fortunately, the volume to follow 
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can make some amends; but probably not 
enough, since it is to deal specially with develop- 
ment. In spite of this limitation, it is very valu- 
able achievement to collate the studies in four 
fields. Let us hope that the author will favor us 
periodically by gathering these studies, es- 
pecially those on objective tests, since summaries 
of this type are so meagre and so greatly needed. 
So far, the results of objective testing are few 
and fit together poorly. Such work will take 
years to make a coverage comparable with the 
other approaches. To the methodological psy- 
chometrist, the writer’s discussion of factor- 
analysis techniques is likely to claim special in- 
terest. While somewhat eclectic, he makes much 
use of the Thurstone centroid method and of his 
own “principle of proportional profiles.” The 
book is to a great degree devoted to discussions 
of nomenclature. The word temperament is used 
more narrowly than by many students of per- 
sonality. It is limited to behavior tendencies that 
are devoid of the dynamic, that is, that are the 
result of a drive rather than of a purpose, and 
so it is dependent upon physiological character- 
istics of the individual. It will be difficult to nar- 
row thus the use of the word temperament for 
those who wish to include attitude and interest 
as well as other behavior tendencies in the term. 
Objection may well be made to the stress the 
author places on the terms “surface traits” and 
“source traits," meaning observed traits and 
those statistically derived by factor-analysis 
studies. As new factor-analysis results appear, 
“source traits” become adopted by test devisers 
as observed traits, so the distinction becomes 
less and less significant. The strong preferences 
that the author shows for behavior ratings and 
objective testing as against the personal ques- 
tionnaire are not supported by experience. It is 
reported that the U.S. Army Air Forces found 
as much validity for questionnaire scores for 
prediction of the pilot pass-fail criterion as for 
all the behavior ratings and objective tests (ex- 
cept personal-history data) taken together. This 
finding should do much to alter the author's 
prejudices. He has given undue weight to the 
sources of error in the questionnaire and inade- 
quate attention to very grave limitations in the 
other methods. Unquestionably, all three meth- 
ods will be and should continue to be used, since 
none can be safely discarded. Insufficient recog- 
nition is given to the validating procedures in 
the use of questionnaires, notably Hathaway’s 
“К” score, which on further development will 


put questionnaire work on a better basis, The 
author finds little merit in the projective meth. 
ods and is indeed quite severe in his estimation 
of the leading test of this sort, the Rorschach 
ink-blot test. * The author ambitiously presents 
a scheme based....on his study of collected be- 
havior ratings which he hopes may bring more 
agreement in the classification of temperament 
than now prevails. There would be more hope 
of getting fair agreement if it were based statis- 
tically on behavior ratings and questionnaire 
studies, but it is also based upon objective test 
studies and clinical impressions. With this di- 
versity of sources, the conflicts and disparities 
become so great that the result is largely the 
author's guess rather than an objective finding. 
For this reason, the present classification has 
no chance of adoption as it stands. It is useful 
as a basis of discussion, however, and as some- 
thing to be given statistical test. Concerning this 
classification system the reviewer offers a few 
comments: (1) The inclusiveness is excellent. 
One will.see among the adjectives listed in the 
summaries nearly all the traits one is likely to 
find or need. (2) The greatest fault is a too 
ready willingness to set up a pair of traits as op- 
posite poles, where one pole is something more 
than a lesser degree. It is better to let assumed 
polarity await adequate correlation studies. The 
outstanding case is cyclothymia vs. schizothymi. 
Each of these poles is a well accepted trait, but 
there is not such a high negative correlation be- 
tween them as to justity a polarity. One finds 
schizoids with what he calls cyclothymia, those 
with middling scores, and those with low scores. 
There is a negative correlation, but not the cor 
relation close to — 1.00 needed for polarity. (8 

In cyclothymia, as described by Cattell, the T 
clic quality is one of its least important. Tn fact, 
it is quite swamped by other traits centering 
around cordiality. It would be better rename” 
(4) The trait “dominance-ascendance vs. St З 
missiveness" as described is dual. Оп the 0" 

hand is included a mere activity, the ороно 
which is not submissiveness. With this 1 Ж 
dominant-ascendant trait which is truly a 
sive, aimed at another person. It would | j 
best to treat this aggressive-toward-others xr 
pole opposite to submissiveness and to КЕЙ 
activity аз a separate trait as is done by Gui this 
in his “general drive.” All in all, we have нн vill 
book a very valuable contribution, which Y. 
aid significantly in the next steps to be step 
this important field. It is a real service to 
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up the progress so much needed in the classifi- 
cation and measurement of temperament. We 
may well all look forward with eagerness to the 
volume to follow on the development of personal- 


ity. 
hosom Med 10:183-4 Му-Је '48. Josef 
Brogek. Had the author been as ambitious in 
the title as is the stated purpose of the mono- 
graph, he might have christened his opus Prole- 
gomena to All Future Studies on Personality, 
paraphrasing Immanual Kant's famous Prole- 
gomena ги einer jeden kuenftigen Metaphysik. 
The aim of the book is to reconsider the funda- 
mental problems of personality description, to 
systematize and bring into perspective the 
knowledge already available, and to indicate di- 
rections and methods for future organized at- 
tack on the complex problem, a problem which 
is of direct concern to students of psychosomatic 
medicine. * The present book was written by a 
British-American psychologist, trained in psy- 
chology as a laboratory science, having a firm 
grasp on highly refined technics of statistical 
analysis, and seasoned by clinical work. This is 
a highly favorable background of training and 
experience for undertaking the project. * The 
monograph will require, even on the part of a 
sophisticated reader, a full concentration and 
much patience. In laying methodologic founda- 
tions to a field heavily laden with a ballast of 
confused terminology and fuzzy concepts, Cat- 
tell could hardly avoid coining’ а considerable 
number of new terms but the need for acquiring 
à new vocabulary slows down the assimilation 
of the material. The author approaches the ideas 
basic to a refined description of personality by а 
Series of successive approximations. Such a 
technic enables the reader to follow the author 
in his struggle toward light. At the same time, 
H may actually exhaust all but the most inde- 
atigable ones by making the reader share too 
much of the arduous effort to develop appro- 
Priate concepts and procedures, and to achieve 
СысшаНоп and logical coherence of thought. 
ае places a considerable reliance on the 
rating of behavior manifestations. Some readers 
(v feel that the projective technics, particularly 
е Rorschach test, were brushed off somewhat 
ые» and workers concerned with the 
easement of clinical personality deviations 
ks be perplexed by the omission of the Minne- 
win Multiphasic Personality Inventory which, 
i all its weaknesses, satisfies many of the cri- 
ria of test adequacy as formulated by Cattell. 
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All will await anxiously publication of the test 
battery being developed by the author. 

О R Biol 23:88-9 Mr '48. A. Chapanis. Psy- 
chology as a profession has undergone signifi- 
cant changes in the last few years. The impact 
of the war produced a need for psychologists to 
predict the behavior of large numbers of people 
in a variety of real-life situations. This need was 
both genuine and immediate. Many psycholo- 
gists left their laboratories in response to it, and 
the field now teems with clinical, consulting, per- 
sonnel, vocational, and applied psychologists. 
But there is cause for serious concern in this 
turn of affairs. Too much recent research is 
devoted to the practical, superficial prediction 
of behavior of specific kinds; too little to the 
integrative study of the dimensions of human 
abilities and personalities. Too often is there a 
slavish devotion to published psychological tests 
because "they seem to work" ; too seldom is there 
any critical inquiry into precisely what these 
tests measure. Though a clinical psychologist 
himself, Cattell is a severe critic of the empirical 
“ad hoc research finding, permitting no generali- 
zation to other situations and requiring to be 
re-proved in each of a thousand situations." For 
this reason alone, his book is a welcome and 
refreshing addition to this field. * The first part 
of the book is a methodological introduction 
which amplifies in great detail the use of factor 
analysis for discovering and isolating personality 
traits. This is heavy reading—it demands fa- 
miliarity with correlational procedures, ad- 
vanced general statistics, and the basic devices 
of factor analysis. But the methods treated here 
are well worth careful study; for, in the review- 
er’s opinion, they represent about the only way 
of achieving any coherence and system in the 
description of personality. The second part of 
the book will probably interest more readers. In 
it, the author has attempted to review all earlier 
work on the description of personality. * All 
in all, this book is an important contribution to 
the field of personality measurement. It is a 
fairly difficult book, highly theoretical, and suit- 
able as a text only for advanced courses and 
seminars in personality. The clinician and prac- 
tical psychologist will probably be disappointed 
in it, for, despite its title, there is virtually no 
discussion of specific tests or measurement de- 
vices for assessing personality. Even so, its Sys- 
tematic contribution is so great that it must 
certainly be classified as required reading for 
psychologists in this field. 
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Times Lit Sup 47:138 Mr 6 48. * а monu- 
mental work * Cattell admits that some chapters 
may need to be read more than once; and this 
seems to be an understatement. It is not only the 
mass of material that baffles one—the “moun- 
tain ranges of accumulated experimental re- 
sults,” as he himself calls them, the serried cata- 
logues, the arrays of alphabetical symbols—but 
the slippery tracts of statistical theory, the fog 
of new terminology and elusive concepts, and 
the haunting fear of falling into a philosophical 
crevasse. Despite an apparently orderly arrange- 
ment of topics the author’s enthusiasm leads him 
at times to talk of everything at once, and the 
difficulty of finding one’s way about is not much 
lightened by an inadequate and rather unintelli- 
gent index. Professor Cattell discusses various 
abstruse questions of statistical technique at 
intervals throughout the book: and though he 
sometimes makes an effort to be intelligible to 
the layman he generally assumes that oblique 
and orthogonal axes, rotation, hyperplanes, 
“Thurstone’s multifactor centroid method” and 
the like are more or less household words. At 
the same time his terminology is none too pre- 
cise, and the reader is liable to be confused be- 
tween traits, trait elements, trait clusters, syn- 
dromes, patterns, types, any or all of which may 
apparently be regarded as “factors” if they are 
found to be measurable. Such a statement as 
that “the particular overlaps and interferences 
of factors, which throw clusters into relief and 
even bring about their existence, are likely to 
be more fortuitous and specific to situations and 
samples than are the factors (causes) them- 
selves” leaves one guessing ; and a brief discus- 
sion of the philosophical “reality” of the various 
concepts in Chapter V seems to lead to the con- 
clusion that it all depends on what you mean by 
reality. After long preliminaries Professor Cat- 
tell takes up in turn the principal traits revealed 
by clinical research (neuroses, psychoses and 
so on), by “behavior ratings,” by questionnaires 
and by objective tests, each on a truly heroic 
scale; and in his twelfth and last chapter he 
matches his “twelve principal source traits” with 
corresponding traits discovered in each of the 
other three areas of observation, so as to obtain 
a complete picture. Thus “Factor V or F. Sur- 
gency—v.—Agitated Melancholic Desurgency" 
is practically synonymous in three of the areas, 
but in the “test measurement” list it appears, 
` somewhat surprisingly, as “Fluency of Asso- 
ciation.” * The extreme generality of some of 
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the concepts makes us wonder whether, if this 
line of inquiry is pursued much farther, we shall 
not find ourselves back at the old "faculty psy- 
chology" and trying vainly to measure intellect, 
will and emotion. Description and Measurement 
of Personality is the fruit of immense labour and 
immense research—the author himself has cal- 
culated “over 20,000 correlation coefficients,” 
and he gratefully records the labours of numer- 
ous assistants. The total effect is rather over- 
whelming, and the bemused reader will some- 
times rub his eyes and ask himself what it is 
really all about. A clinical psychologist is per- 
haps naturally prone to regard human behaviour 
as the product of so many hereditary and en- 
vironmental coefficients: certainly Dr. Cattell 
seems to be carrying the art of taking robots to 
pieces about as far as it will go. * 


For additional reviews, see 3:796. 
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*Carrett, RAvwoNp В. А Guide to Mental Test- 
ing: For Psychological Clinics, Schools, and In- 
dustrial Psychologists, Second Edition. Forewor 
by William Moodie. London: University of London 
Press, Ltd., 1048. Pp. xvi, 411. 25s. * (Toronto, Can- 
ada: Clarke, Írwin & Co., Ltd. $6.80.) (РА 25: 3807) 

Brit J Ed Psychol 19:146 Je ’49. In the new 
edition of this handy “Guide,” there is a new 
chapter on “The Measurement of Intra-familial 
Attitudes,” and considerable space is given now 
to a questionnaire for personality factors and to 
a general information test. There are also re- 
visions and additions elsewhere. It should be 
noted that the book is not primarily concern 
with tests of general intelligence, to which 1m- 
deed only 51 pp. are devoted ; much more spat 
is given to tests of interests, attitudes, temper?" 
ment and character. Professor Cattell still does 
less than justice, we think, to the Binet type 
test, though his criticisms are applicable to the 
original forms of the Binet test. To many tests 
the reference is little more than one would get s 
a catalogue of tests, but the newer types of ni 
are much more fully discussed. Among the "n А 
of musical abilities, it is surprising to ПЛ Dr 
reference to those of Dr. Н. D. Wing a? hes 
J. Mainwaring. Again as to verbal ability, ШЕ 
is no reference to the further analysis by tele 
Cyril Burt and Miss Enid John. The first jn) 
erence I looked up had a serious error S ба 
and it was to опе of Cattell's own papers fully, 
hopes others have been checked more care 
as it is precisely as a guide and reference 
that this volume is most useful. 
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Brit J Psychol, Stat Sect 2:68 Mr "49. J. W. 
"Prof. Cattell’s well known Guide has long been 
"recognized as almost the only handbook of tests 
“which covers, not merely intelligence and educa- 


Попа! attainments, but all the more important 


aspects of personality. For the new edition, most 
of the chapters in the 1936 issue have been en- 
Jarged and revised. The previous edition began 


by accepting Spearman's two-factor theory : our 


"abilities," it was said, consist of one general 
factor called “g,” and a host of specific factors 
or ^s"s (p. т). The new edition substitutes the 


three-factor theory, viz., that "abilities may be 


conceived of as (т) a general ability, entering 
into almost all performances, (2) certain group 


_ factors, covering an area such as verbal, spatial, 


"numerical, musical, etc., performance; and (3) 
certain abilities which are absolutely specific 
to one performance" (p. 6). In passing it is a 
little surprising to see this hypothesis attributed 
to Spearman and Thurstone, since Spearman's 
two-factor was expressly intended to exclude 
group factors, and Thurstone as late as 1938 de- 
clared that “зо far, we have not found a general 
factor." * The chapters on tests of temperament 
and character are now based mainly on Prof. 
Cattell’s own suggestive work on the measure- 
ment of personality; and chapters V and VI 
open with a very clear statement of the scheme 
of "primary personality factors" that he has put 
"forward. It is disappointing therefore to find 
that, in the *Notes on Mathematical Formulae," 
the brief section dealing with "Factor Analysis" 
Still treats only of the “tetrad difference" method 
and the “division of abilities into a general factor 
and factors specific to each test,” that is to say, 
with the discarded two-factor theory only. A 
Suggestive chapter on “The Measurement of 
Intra-Familial Attitudes" and two new appen- 


_ dices giving a “Questionnaire for Personality 


actors" and a “General Information Test" 
have been added; and the collection of new and 
Old tests now furnish a most valuable source of 
Material not only for the practical psychologist 
in the child guidance clinic but also for those en- 
Baged in research. 
_ J Ed (London) 81:232 Ap '49. C. M. Flem- 
ing. * full of fresh and stimulating material on 
the assessment of personality and attitudes, but 


1Thurstone, L. L. : AA ii 
i » L. L. ntal Abilities, 1938, p. УП, 
a esp dd a Мена а factor doy (in- 
ing the four group factors explicitly named) seems taken 
'Sychol, rbatim from the Board oF 24 Report on 
hardly 004001 Сок (p. 19), the theory as he cites it can 
hurstone; certainly it was strong y 

later (Abilities of Man, р. 
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it is somewhat lacking in accurate bibliographical 
references and in the citation of recent tests on 
intelligence and attainment. The background of 
thought seems also to be still that of the old- 
fashioned psychoanalytic approach. 

Psychosom Med 11:319-20 S-O ’49. Josef 
Brožek. * The interest in the needs of the Brit- 
ish users of this handbook....will somewhat 
limit the practical usefulness of the book in 
North America. This handicap is more than 
fully compensated for by the fact that Dr. Cat- 
tell could draw upon his extensive original re- 
searches and his arduous efforts (see his De- 
scription and Measurement of Personality, 
World Book Co., 1946) to clarify the funda- 
mental problems of theory and technics of quan- 
titative analysis of personality. Actually, the 
volume fits better the needs of a research worker 
than those of a practitioner. Many of the pro- 
posed tests lack norms and although (objective) 
personality tests are described, when available, 
the bulk of the methods for assessment of the 
different facets of personality depend on ob- 
server's ratings. The rating scales, most of them 
developed by Cattell on the basis of elaborate 
statistical analysis, are very extensive and con- 
stitute an important contribution. However, sit- 
uations in which ratings yield valid measures of 
personality are the exception rather than the 
rule. Cattell’s unconditional distrust of self- 
ratings is not justified, even though it is obvious 
that the lack of self-knowledge and of honesty 
in “critically motivated situations,” such as pre- 
employment examination, will invalidate the re- 


sults. The reviewer feels that it is not much easier . 


to find disinterested but informed observers — 
by the subjects’ peers) — 


(Cattell prefers ratings х 
than to find disinterested subjects. In spite о 
the unevenness of coverage and an almost com- 
plete omission of the “projective” (pictorial) 
technics, the volume represents а welcome addi- 
tion to psychometric literature. Much of it has 
the characteristics, positive and negative, stimu- 
lating and exasperating, of all work carried out 
on an expanding frontier of scientific knowledge. 
Scottish Ed J 31:488 Ag 27748. А reading of 
the first few pages of the new edition of “A Guide 
to Mental Testing” will readily disabuse a be- 
ginner’s mind of the idea that this work is in- 
tended as an introduction to mental testing. For 
those who are already somewhat acquainted 


with the subject it will nevertheless serve as а 


useful compendium ; the demand for a new edi- 
tion is sufficient evidence of this. The new ver- 
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sion contains additional material, but the re- 
vision is hardly sufficiently thoroughgoing, for 
there are references to “recent” research which 
appeared in 1930, and it is disappointing to find 
that the issue regarding the constancy of the 
І.О. is dismissed іп a few lines with a reference 
to an article dated 1926. An alphabetic bibli- 
ography of the tests mentioned throughout the 
text would have added considerably to the value 
of the work. 

Times Ed Sup 1732:386 Jl то '48. * Cattell 
was in some ways a pioneer in this country in 
the use of psychological tests applied to educa- 
tional problems, and it is interesting to have 
this second edition of what has proved a useful 
reference manual. The second edition contains 
an additional 100 pages, but these have not been 
used to full advantage to bring the book up to 
date as regards British tests and materials. This, 
no doubt, is largely due to the fact that for the 
past decade the author had been resident in the 
United States. On the other hand, certain chap- 
ters in the book have been largely rewritten and 
should be of value to psychologists. It is inter- 
esting to note that Dr. Cattell still regards the 
Binet-Simon test as a relatively poor test of in- 
telligence because “its component test items are 
frequently more tests of scholastic attainment 
and life experience than of ‘g,’ and the pass or 
fail items are far too few.” He prefers to use 
a group test and gives some evidence to support 
his preference. * The chapter on attainment tests 
is decidedly weak and is far from complete in 
respect of British tests and materials which have 
been made available during the past 10 years. 
Chapters V and VI on tests of temperament and 
measures of character have been extended by 
the addition of an account of some of the au- 
thor’s recent work in this field and by reference 
to new American tests. The new edition con- 
tains a chapter on the measurement of intra- 
familial attitudes, and this should be of interest 
to workers in fields in which the assessment of 
the emotional structure of the family situation 
is important. On the whole the manual should 
continue to prove useful, though there are still 
instances where the author claims too much for 
tests before they have been sufficiently tried out. 


For reviews of first edition, see 36:B48, 38: 
B329, and 40:B860. 
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*CarrELL, Raymonp В. An Introduction to Per- 
sonality Study. London: Hutchinson's University Li- 
brary, 1950. Pp. 235. 75. 6d. * (New York: Longmans, 


Green & Co., Inc. Text edition, 
$225.) (PA 25:2901) 

Am J Psychol 64:643-4 О '51-Ј2Р. Guilford, 
* written for “the intelligent, analytical student; 
desiring to come to grips with basic laws.” The 
student will indeed have to bring to his reading 
of this volume a fairly good background of psy- 
chological concepts. It is much more compre- 
hensive than its size would indicate. Cattell has 
made a system-building effort in writing this 
book. The goal was to create a psychological 
system in which personality is the central theme. 
The system falls into the category of “dynamic” 
psychologies, in which the concepts from psycho- 
pathology and from freudian psychology pre- 
dominate. The author swings from the lan- 


$1.80; trade edition, 


guage of Freud, to that of McDougall, and іо. 
that of factor analysis with great ease, effecting 


considerable integration. * In his listing of the 
source traits or factors in personality, Cattell 
persists in limiting them to a few broad cate- 
gories. Many of these are probably in the nature 
of second-order (or even higher-order) factors; 
since the number of primary traits is undoubt- 
edly much greater than a dozen. It is unfortu- 
nate that some of them are named for pathologi- 


cal syndromes, since they are applied to the 


whole range of normal personality, and since 


Cattell himself properly regards pathological * 


syndromes as surface traits, not source traits. 


Clinical psychologists may find his system for. 


inter-relating the freudian mechanisms into à 
succession of "dynamic crossroads," in which 
neurosis-formation and psychosis-formation find 
places, quite thought-provoking. Probably few 
efforts to bring logical clarity into this field have 
succeeded so well. Cattell’s style, as usual, 15 
vigorous and positive, and reflects not only €x- 
perience and erudition, but literary taste as well. 
The volume, at least in its first printing, СОП 
have been more carefully edited. Some ПЕШ, 
need minor corrections and the text could be 
made more consistent with them in places. 


Brit J Psychol, Gen Sect 42:192 Mr &My 


’51. P. E. V (ernon). Cattell’s work оп personal- 
ity has hardly received the attention it psa on 
in the country of his origin. Thus it is partc k 
larly useful to have a short and inexpensive boo 
which summarizes most of the arguments Ж 
Personality, A Systematic Theoretical and e 
tual Study and some of the data from his jit 
scription and Measurement of P ersin, 
Though somewhat condensed, it should no aed 
too difficult for Honours students. His geome 
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explanation of multiple factor analysis is very 
dear. The book is well written and adequately 
` printed, but there are some slight mistakes in 
names—Porteous for Porteus, Jeansch for 
Jaensch, and G. B. for J. B. Watson. Cattell's 
major contribution is his attempt to integrate 
McDougall’s, Freud’s and others’ clinical and 
observational studies of personality with the em- 
pirical findings of factor analysis. His stand is 
frankly “biophysical” ; personality is based on 
the individual’s instincts (ergs) and other con- 
stitutional factors, modified through interaction 
with environment into a dynamic network of 
attitudes, sentiments, repressed complexes and 


other Freudian mechanisms. He has no use for - 


“culture pattern” theories, or for the view that 
personality depends in part on the observer. The 
main psychopathological syndromes, and such 
concepts as the Superego and Ego are identified 
with the so-called source traits, derived from 
factorization of personality ratings. One may or 
may not agree with all his arguments, but one 
cannot but admire the extent of his knowledge, 
and the wide range of phenomena from normal, 
abnormal and social psychology that he brings 
together in a consistent theory of personality 
structure. 

Brit J Sociol 2:87 Mr '51. W. Н. М. Hotopf. 


"^ The chief inflüences by which the author of this 


Po e appears to have been affected, are 
those of McDougall, Freud and factor analysis. 
a author does not so much integrate these 
ege schools of thought as lay them care- 
ET end to end. The presentation is on the 
m very clear; terms which are often con- 
sd ye carefully distinguished, and a great 
Sith of matter is compressed into a small space 
А out overcrowding. From these qualities of 
сонне there must however be subtracted 
à s in evaluation. Very little mention is made 
the ny criticisms of these theories, whether on 
а ground that the evidence is inconclusive, the 
"ie oach mistaken, or the results trivial ; and 
ea n it is, the criticism is dismissed with abuse 
ae argument. On whatever side truth lies, 
ul oy attitude detracts from the author's con- 
Ewa сш of the virtues of strict scientific 
dent There are some deficiencies іп the pro- 
the ion of the book. Misprints and omissions in 
WI diagrams and discrepancies be- 
(Чеш; the diagrams and the text make some of 
is. : hindrance rather than a help. Far too 
i references are given of studies reported in 

€ text, but even those that are, are often of no 
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use. This is because nearly all cross-references 
from one chapter to a list of books at the end of 
another are incorrect in the sense that some other 
book than the one apparently referred to is in- 
tended. And so the uninstructed reader of this 
very readable book is robbed of an important 
safeguard against the author's tendency to over- 
simplification and too great optimism in report- 
ing the results of research. 

Eug R 43:50-1 Ap ’51. Н. J. Eysenck. This 
new small book on personality, compressing into 
some 230 pages much of the material which 
went into the same author’s larger textbook on 
Personality: A Systematic Theoretical and Fac- 
tual Study, shares most of the excellencies and 
some of the defects of the larger book. The chap- 
ter headings are fairly conventional, but the 
order of treatment is distinctly unusual. * Chap- 
ters 7 and 8 perhaps constitute the core of the 
book, dealing as they do with the description and 
measurement of personality in a fashion similar 
to that used in Cattell's book of that name. Then 
follows a chapter on psychosomatics, including, 
which is somewhat unusual but perfectly justi- 
fiable, a discussion of the relation between per- 
sonality and body build. A brief discussion of 
personality and the cultural pattern, and of the 
life course of personality closes the book. The 
writer himself draws attention to three innova- 
tions which mark this book out from the main 
run of personality textbooks. In the first place 
he has attempted to present in its simplified spa- 
tial, geometrical form, the essentials of the factor 
analytic approach to describing and measuring 
personality. As he points out, “Many psycholo- 
gists, accepting а traditional estimate of the 
difficulties of this technique and perhaps un- 
aware of the advances and simplifications made 
in this generation, avoid the whole subject and 
try, with some superficial and misleading talk 
about ‘types,’ to ‘get by’ on the subject of per- 
sonality description.” In his insistence on pre- 
senting the methodology of factor analysis, even 
in an introductory book, Cattell is indubitably 
right. If we want to get away from the purely 
semantic, intuitive, and popular method of treat- 


ing this subject and to adopt a scientific point of 


view, we cannot neglect a discussion of those 
methods which alone can give us meaningful 
answers to our problems. As Cattell points out, 
if simple algebra or geometry is too difficult for 
the student he certainly ought not to be study- 
ing so complex а subject as psychology. Cattell's 
second innovation is a substitution of the term 
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“erg” for the concept of “drive,” “need” or “in- 


stinct.” Cattell believes this to be a more precise 
concept, but the reviewer is doubtful whether 
neologisms of this type are really helpful to the 
beginning student, who, after all, is not bur- 
dened so much by connotations given to older 
terms like "instinct" by past psychological writ- 
ers whose work he would not be expected to 
be familiar with in any case. Cattell's third inno- 
vation is the introduction of a system of what he 
calls “adaptation-adjustment analysis," which 
integrates psycho-analytic and other clinical and 
experimental studies of dynamic processes. He 
claims that while “our observations on the de- 
velopment of dynamic structures in personality 
necessarily remain to-day still largely at a clini- 
cal level, they are certain enough to permit us 
to go forward to a systematization of this kind." 
The reviewer does not feel that this optimism is 
justified. These unverified, and possibly unveri- 
fiable, clinical observations may have a place in 
the psychiatrist's consulting-room ; they should, 
however, be sharply distinguished from opera- 
tional concepts firmly established and isolated 
by means of proper scientific methodology. It is 
to be regretted that Cattell would appear to give 
them equal status after having insisted through- 
out the rest of the book on the essential nature 
of proper validation and verification of the- 
oretical concepts and constructs. The book would 
certainly be considered difficult by most stu- 
dents who are not already familiar with the 
subject. It is probably made more difficult than 
it need be by the use of certain terms and con- 
cepts (“factor loading," “correlation coeffi- 
cient”) before they are explained on a later 
page. Nor is understanding made easier by the 
author’s frequent use of neologisms, to one of 
which attention has already been drawn. These 
may have their place in an advanced discussion 
for students already familiar with the field ; their 
place in an introduction to personality is more 
debatable. Also, the tendency of the writer to 
present concepts in number sequences, “cross- 
roads 1-6,” for instance, or “personality factors 
a, b, с,” etc., makes the text somewhat more 
difficult to read than it might otherwise be. 
These defects ought, perhaps, to be blamed more 
on the state of the subject on which Cattell is 
writing than on the author himself. The task of 
compressing the enormous theoretical and ex- 
perimental material which is relevant, and which 
seems to yield to no easy systematization or in- 
tegration into 230 small pages, is a manifestly 
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impossible one, particularly when the reader 
cannot be assumed to have any previous knowl- 
edge of scientific method, of mathematics, or of 
statistics. Cattell's is a brave effort, but the re- 
viewer fears that only a very intelligent and per- 
severing student would be able to extract from 
this book all that Cattell has tried to put in, 

Int J Psycho-Analysis 32:252-4 pt 3 'sr. 
Marjorie Brierley. This book admirably fulfils 
the author’s aim, which, he says, “has been to 
bring the general problems of personality de- 
scription and development, normal and ab- 
normal, into a single perspective and to integrate 
the principal fields of observation in clear-cut 
generalizations” * Each chapter is a concen- 
trated meal but too well written to be indigesti- 
ble. A fuller Subject Index would have been an 
advantage in a volume covering so much ground, 
* Although the book is intended primarily as а 
small standard textbook for University students, 
each chapter ending with a Summary and Sug- 
gestions for More Advanced Reading, it offers 
relatively easy access to the author's basic princi- 
ples and methods of research and may be read 
with advantage by psycho-analysts who may 
have little time or inclination to study the 689 
pages of his larger work, Personality: A Syste- 
matic Theoretical and Factual Study (McGraw- 
Hill, 1950). * He adopts some familiar terms, 
but often prefers to introduce new ones, and has 
re-cast the probable story of development for 
himself. He uses the term “erg” in place of 
primary instinct-impulse or drive, a choice pro- 
voking to natural scientists, because “erg has 
long been used in physics with a precise quanti" 
tative definition referring to work done under 
specific conditions. Psycho-analysts will some- 
times think the author’s transcriptions unen 
sary, e.g. that the term "motivation" would hav? 
served as well as the new-minted "subsidiatio 
and have avoided another addition to the UP 
fusion of tongues in psychology. However, e 
tell uses his preferred terms to good inne 
clear and logical exposition. * The present m 
viewer is not competent to assess the mue 
cal and technical validity of these methods P 
judging by their results, they appear to a 
most promising yet employed in рзубНоше е 
It is by no means easy to devise adequate 0 2 ei 
tive tests for all suspected traits. If the rely 
state of an individual and even what he 1s det 
to become under given conditions prove @ v 
able to measurement, the difficulties in the á 
of mathematical expression of the course 9 
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velopment may be very great indeed. In short, 
the difficulty of testing "genetic propositions" 
experimentally, discussed by Hartmann and 
Kris in relation to "field theory," appears to 
hold good of “factor analysis." Inference from 
observation still plays a large part in Cattell's 
own version of the story of development and in- 
deed understanding of the character of a given 
individual depends greatly on interpretation of 
“measurements.” About a dozen unitary source 
traits are thought to exist, but only some four 
of the best established of these personality fac- 
tors are dealt with in this book. One of the most 
important of these is Factor (C), the factor of 
dynamic integration, whose quantitative varia- 
tion, positive to negative, affords a measure of 
emotional integration or character stability. Cat- 
tell is unwilling at this time to venture further 
interpretations of this factor, but suggests it 
may evidence the normal integration of ego and 
conscious super-ego in the “self-regarding senti- 
ment" (p. 88). Psycho-analysts may surmise 
that it may prove to be a measure of integration 
of the total personality, rather than of ego sta- 
bility as such. If this should be the case (C) 
measurement might prove one useful index of 
degree of mental health or disorder and an aid in 
diagnosis, e.g. in cases where concealed psycho- 
sis is suspected in apparent neurosis. * Factor 
(E), Dominance-Submission, and Factor (1), 
Sensitive Emotionality-Hard Poise are con- 
sidered together. A satisfactory objective test 
for (E) is still lacking. Cattell thinks this may be 
because this trait is “зо purely social in charac- 
ter” and that it is the ego which is evidenced by 
it. This may well be so, but psycho-analysts are 
bound to think that intrapsychic relationships 
are also much involved here, particularly the 
distribution of aggression and the modes of its 
disposal, e.g. the balance of introjective and pro- 
jective defence-mechanisms. (I) is said to be a 
a but powerful contributor to personality 
oad possibly identifiable with hyperthy- 

idism or the hormone determinants of mas- 
Cülinity or femininity. The three source traits 
(0), (F) and (C) are slightly positively cor- 
id ed, compatibly with the existence of a single 
widerlying factor or temperamental disposition, 
wee rigidity," which psycho-analysts know by 
ii opposite, “lability.” This underlying factor 

called a factor of the second order, which is 
ашу confusing to readers accustomed to 
Mg PSycho-analytic usage of “primary” and 

econdary.” Cattell himself recognizes fully that 
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the measurement of personality is only in its 
initial stages, and he is in no hurry to force in- 
terpretations or to preclude future modifications 
of present conclusions. It is greatly to be hoped 
that his co-workers and students in this field 
will show equal restraint and not lend them- 
selves to premature standardization or rigidity 
in the interpretation of their results. The book 
is full of matter for discussion, but it is hoped 
that this brief review may indicate that the work 
in progress in this field is of considerable interest 
to psycho-analysts, not least because Cattell's 
technique seems to offer prospects of “experi- 
mental validation” in some directions, if not in 
all. His own conclusion is that “personality is 
the integration of the partial reactions of the 
organism, but it promises also to be the integra- 
tion of psychology.” 

J Consult Psychol 15:167 Ap '51. Laurance 
F. Shaffer. This small book is an able summary 
of Cattell's theories and researches in the field of 
personality. His theoretical position is a combi- 
nation, blended with considerable harmony, of 
McDougall's instinct theory, Freud's dynamics, 
stimulus-response psychology, and factor analy- 

. * 


sis. 

Mental Health 10:76-7 sp '51. О.Р. * a stu- 
dent who approaches the subject for the first 
time...may feel embarrassed by the thousands 
of volumes that lie between him and the secrets 
of the psychological science. For an intelligent 
student, Professor Cattell's book An Introduc- 
tion to Personality Study not only removes this 
embarrassment but also safeguards him from 
the disappointment which usually follows ex- 
tensive reading not rewarded by the acquisition 
of certain factual knowledge. * 'The author pur- 
posefully puts aside the speculative part of the 
subject and deals chiefly with what can bemeas- , 
ured and verified. The author gives statistical 
evidence for his statements or, in the absence of 
such evidence, indicates the way in which it 
could be obtained. No doubt in this respect he 
wants to be a thorough going behaviourist and 
determinist. This position, however narrow it 
may appear to philosophically minded psycholo- 
gists—for it may conflict with the notion of free 
will, moral action etc.—is perhaps the only one 
which renders some practical advantages. These - 
advantages cannot be secured by metaphysical 
speculations nor by biased personal judgments. 
1 the progress of psychology is to be estimated. 
by the number of discovered facts and empirical 
laws, the scientific research ought to be the omiy 
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legitimate method in use. It is encouraging to 
find that Professor Cattell's outstanding ability 
once more suggests this method. * The reviewer 
recommends the book to the beginners who want 
to have a quick grasp of the subject and also to 
those advanced students who have found psy- 
chological speculation unproductive and who 
consequently may feel more interested in the 
factual study. 


[ B86] 
Ж*Слттк„, Клумохр B. Personality: A Systematic 


Theoretical and Factual Study. New York: Mc- 


Graw-Hill Book Co., Inc., 1950. Pp. xii, 680. $6.50. * 
(London: McGraw-Hill Publishing Co., Ltd. 55s. 6d.) 
(PA 25: 4420) 

Am J Orthopsychiatry 21:429-31 Ap ’51. 
Richard L. Jenkins. The author....states that 
multiple factor analyses of human personality 
traits have yielded fourteen or fifteen factors, or 
source traits, as he calls them. Of these he lists 
twelve, but considers that only six are repeatedly 
confirmed and unmistakable. To these he assigns 
the following names : source trait A, cyclothymia 
versus schizothymia; B, general mental capac- 
ity vs. mental defect; F, surgency vs. desur- 
gency} H, adventurous cyclothymia vs. with- 
drawn schizothymia; K, socialized cultured 
mind vs. boorishness; and M, Bohemian uncon- 
cernedness vs. conventional practicality. While 
the naming of such factors is necessarily arbi- 
trary, the reviewer is troubled by a feeling that 
factors are given names without sufficiently 
careful reflection, but from that point on are 
treated as though the name really represented 
the factor. For example, “source trait A” is 
named cyclothymia vs. schizothymia and “source 
trait H” is named adventurous cyclothymia vs. 
withdrawn schizothymia. Examination of the 
author's previous publications reveals that these 
two factors are practically at right angles to 
each other; that is, they are practically uncor- 
related. It seems loose and confusing practice at 
best to give such similar names to two practically 


independent dimensions of the personality. Fur- - 


ther examination of Dr. Cattell's earlier writing 
persuades the reviewer that "source trait A" 
would better be described as social adaptability 
versus social inadaptability and "source trait Н” 
as free versus inhibited. If a cyclothymic- 
schizothymic dimension is to be used, it had bet- 
ter be a diagonal of these two, from the inhibited 
socially inadaptable person who may fairly be 
called schizothymic to the free, socially adaptable 
person, who is of the type described as cyclo- 


thymic if, like the author, we are willing to pre- 
sume that a person of this type will have cycloid 
mood swings. It is at least debatable whether it 
would not have been better to use such a dimen- 
sion rather than break this component into the 
two projections to which the author has given 
such confusingly similar names. The matter be- 
comes even more perplexing because of the list- 
ing of a third factor essentially at right angles to 
the other two, "source trait L,” trustful cyclo- 
thymia versus paranoid schizothymia. It would, 
of course, be possible to regard this as another 
projection of a diagonal—the diagonal of a cube 
the far corner of which would be “true schizo- 
thymia,” which in this conceptualization would 
have a component of suspicion added to com- 
ponents the reviewer has called social inadapta- 
bility and inhibition. This does not, however, 
seem to be Cattell’s thinking, for he writes, “The 
independence of this factor, in relation to the 
other two schizothyme factors, raises the ques- 
tion of whether the abnormal extreme of thts 
factor—presumably paranoia—should be recog- 
nized as a distinct disease entity from the schizo- 
phrenias.” If the same implication were read 
into the independence of the two other schizo- 
thyme factors it would imply two distinct schizo- 
phrenias, an inference the writer does not draw. 
Nor does he consider the question of whether 
there might be implied two or three distinct 
cyclothymias. This selective discovery of ele- 
ments which may be interpreted to support 
dominant ideas, with corresponding failure to 
discover or consider elements which would chal- 
lenge them, impresses the reviewer as a repeti- 
tive defect of the book. It appears to be a re 
sult of a failure to consider relations and impli- 
cations as sharply and critically as they deserve 
and relates to the amount of ground the author 
seeks to cover. It seems to the reviewer that the 
writer overworks useful concepts such as one 
thymia and cyclothymia by requiring n 
ride off in so many directions simultaneous 
Conceivably the integrity of the concept сус i 
thymia might be maintained even under d 
treatment by considering it the normal and p vs 
ing it at the origin ; but, even so, there Ts n 
the confusion resulting from the fact that t ih 
independent and unrelated deviations from “4 
normal are all named schizothymia. i i 5 
er's energy in presenting and cataloguing $ is E 
in the field is impressive, yet the reviewa iing 
veloped an increasing and disquieting зе 
that there was greater motivation to cover 
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amount of ground than to clarify meanings. The 
writer is creative, versatile and stimulating, but 
while many individual items are very well pre- 
sented, the material as a whole seems too fre- 
quently thrown into juxtaposition by careless 
approximations of meanings rather than careful 
fitting ог consideration of fit. What results lacks 
effective unity of structure and is less satisfying 
than а smaller, better-integrated undertaking 
‘would have been. The book cannot be recom- 
mended as a text on personality, for it does not 
achieve the clarity of unified systematic thinking 
a text should have. It is a stimulating cata- 
loguing of the results of various factor studies 
so far undertaken, together with a description 
of much of Dr. Cattell’s thinking. These are of 
substantial interest. Factorial methods will 
doubtless become a major tool in the under- 
standing of personality. 
Am J Sociol 57:192-4 S "51. Tamotsu Shi- 
butani. * Because of the complexity of factor 
analysis and the enormous range of traits cor- 
related, one can readily appreciate the stagger- 
ing labor that must have gone into the compila- 
tion, However, a careful examination of the 
procedure, especially in the gathering of the orig- 
inal data, leads one to question the validity of 
the product. The traits—which are initially 
measured through a wide variety of tests, scales, 
and observations in situations that may not be 
comparable—are organized in terms of trait- 
names, such as “austere,” “arrogant,” “епег- 
getic,” “sociable,” “aloof,” “carefree,” etc. There 
are no clearly stated indexes for the identifica- 
tion of such traits as arrogance and no assurance 
that all instances entering the computations as 
arrogance are of the same generic category. 
Hence the precision of the mathematical calcu- 
lations is unfortunately not matched by equal 
Precision in the original observations. Without 
in any way deprecating the great value of factor 
analysis as a tool, one can raise the question of 
Шш more fruitful results might not have 
хеп achieved had the author not been dazzled 
Y its potentialities and so put it to such un- 
critical use, Although the author is careful to 
c the limited evidence wherever it is relevant, 
ad Vene is essentially speculative. Even a 
tell he perusal of the volume will show that Cat- 
tion urs much from the formal declara- 
ета ү | Freud and McDougall. He devotes con- 
of P space to the promotion of his concept 
“dri е "erg," to replace terms such as "instinct, 
rive" "need," and "tension," However, his 
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protests to the contrary notwithstanding, the геї- 
erent of the concept of the erg appears identical 
to McDougall's “instinct,” with its three com- 
ponent processes, The six “dynamic crossroads” 
turn out to be nothing more than a thoughtful 
classification of the adjustive mechanisms fol- 
lowing blockages, with which psychiatrists have 
long been concerned. One odd feature of Cat- 
tell's discussion of the interruption of activity is 
the complete failure to mention reflective 
thought. * [See original review for additional 
critical comments not excerpted. ] 

Brit J Psychol, Gen Sect 41:197-8 D ’50. 
L. S. Н. * To cover so wide a field systemat- 
ically, theoretically and factually is an ambitious 
programme for which very few psychologists 
indeed are equipped. Nevertheless, the task of 
synthesizing the data and theories of psychology 
into some sort of synoptic view is one that must 
from time to time be undertaken, and Cattell, 
who has contributed prolifically to various fields 
of psychology has many of the necessary quali- 
fications for the job, including a lively apprecia- 
tion of the need for "precise methodology and 
critical evaluation of concepts." We should, 
therefore, be grateful to him for undertaking so 
gigantic a task. As a book of reference Cattell's 
Personality should be useful to most students 
and teachers of psychology. It contains, too, 
numerous pointers for research, and some of the 
author's vigorously stated and somewhat con- 
troversial opinions should awaken stimulating 
discussion. * 

J Abn & Social Psychol 46:444 Ji'sr; P. B. 
Vernon. Four major books on Personality have 
been published in America in the past 13 years. 
Their differences are considerable, yet probably 
no greater than those of four textbooks on gen- 
eral psychology picked at random. Cattell’s is 
indeed not unlike Stagner’s in the main fields of 
work with which it deals; and although poles 
apart from Allport’s, yet, like his, it aims to pro- 
vide a systematic and unified theory, rather than 
an eclectic survey. As might be expected from 
his previous publications, factor analysis plays 
a larger part in this than in other theories, since 
he hopes “to treat personality by the same sci- 
entific standards as are maintained in experi- 
mental psychology." He admits, however, the 
necessity of drawing largely on clinical, genetic, 
social, and other branches of psychology which 
are as yet chiefly at the descriptive stage. Per- 
sonality is defined as "that which permits a pre- 
diction of what a person will do in a given situa- 
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tion.” At the moment, our knowledge is only 
rudimentary, our map of it as sketchy as a fif- 
teenth-century map of the world. Nevertheless, 
one must agree that Cattell has done much to 
improve it by bringing together a very wide 
range of material from psychoanalysis and psy- 
chopathology, and from objective psychometric 
and sociological research. His standpoint is 
frankly “biophysical” ; the view that personality 
depends largely on the observer, also the culture- 
pattern type of theory, are given short shrift, 
though several chapters are devoted to the in- 
fluence of the family and other social groups on 
individual development. Little recognition, too, 
is given to the situational factors in personality 
or to the field theory approach. Cattell is a 
strong adherent of McDougall’s instinct and 
sentiment theory, but, more than most nonmedi- 
cal psychologists, he draws on psychoanalytic 
concepts to supplement this. (His hereditarian 
trend is further shown by his claim that intelli- 
gence is more than 80 per cent genetically de- 
termined. One could pass this if he also men- 
tioned that many psychologists believe the figure 
to be less than 50 per cent. But here and else- 
where his references to the literature, though 
very extensive, are somewhat selective.) One 
of the most valuable parts of the book describes 
the numerous ways in which constitutional fac- 
tors become modified, so that a complex "lattice" 
or network of attitudes, sentiments, and com- 
plexes develops, many of whose strands may be 
unconscious. The final chapter summarizes these 
principles of personality formation in 17 laws, 
which are reminiscent of Hull's postulates. The 
end results of this process, including the Freud- 
ian Ego and Superego, the self-regarding senti- 
ment, and the main syndromes revealed in the 
neuroses and psychoses, are believed to corre- 
spond to the "source traits" obtained by factor 
analysis. The early chapters contain a summary 
of the material from the author's Description 
and Measurement of Personality, including the 
derivation of his 12 source traits from factoriza- 
tion of personality ratings. While the reviewer 
would agree that several of these represent im- 
portant dimensions, which have often been found 
in other independent researches, he cannot see 
any good reason for accepting them all as valid 
descriptions of the fundamental traits of per- 
sonality. Even though they have been rotated to 
Simple Structure, they still seem to him to con- 
sist of the underlying components of the sub- 
jects’ reputations in the eyes of the raters; and 
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the arguments for the biophysical status of any 
but B (intelligence) and C (emotional stability) 
do not convince him. Nevertheless, Cattell's 
scheme deserves very serious consideration, and 
its ramifications into psychosomatics, delin- 
quency, changes due to age or to drugs, etc. are 
traced with great ingenuity. He also allows for 
the uniqueness of each individual, not only 
through the multidimensionality of source traits, 
but also by his P-technique which provides for ' 
factorization of unique traits within a single 
personality. The book is well written, and il- 
lustrated with numerous helpful diagrams. Even 
the more technical sections should be clear to 
the graduate students for whom it is intended. 
Psychiatric О 25:185 Ја 51. * Unfortunately, 

Cattell has tried to cover too wide a field at the 
expense of thoroughness. He admits that it has 
been necessary to condense the material and to 
*presuppose that the teacher will provide proper 
familiarity with the illustrations and facts which 
are often mentioned only by name in the dis- 
cussion of the principles." Had he narrowed his 
field and given more detailed material in fewer 
areas, the text would have been more valuable 
in a non-academic setting. 

[B87] 
*Cavan, RurH SHoNnLE; Burcess, Ernest W.; Ha- 
VIGHURST, ROBERT J.; AND GOLDHAMER, HERBERT. Per- 
sonal Adjustment in Old Age. Chicago, Ш.: Sci- 
ence Research Associates, Inc. 1949. Pp. xiii, 204 
$3.25. * (РА 25:2340) For reviews, see 101. 

[ B88] ; 
* CawrEY, ANNE Mary. A Study of the Vocational 
Interest Trends of Secondary School and Col ege 
Women, Washington, D.C.: Catholic University o 
America Press, [1947]. Рр. 185-247. Paper. boe 
(PA 22:450) [Reprinted from Genetic Psycho 
ogr 35:185-247 My ’47. *] 

[ B89] / 
*CHAppERDON, Hester; Brown, SARA ANN} р 
ING, CATHERINE; AND KELLEY, KATHRYN E 
Development of Paper-and-Pencil Tests te as in 
uate the Ability to Apply Generalizat ES Ba 
Home Economics. Foreword by Edna p o: 
Florence Fallgatter. Des Moines, Iowa: Boar! s Ше, 
cational Education, State of Iowa, 1947. Pp. 69: 
Out of print. * ; 
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Attainment Examinations in the Forel ite vt 
guages, Past, Present and Future: Cre consin, 
Achievement at the University, of conti 
1931-1947. Bulletin of the University of М О, 
Serial No. 2952, General Series No. 2720. 7 cary 
Wis.: Bureau of Guidance, the University, 


1948. Pp. 59. Paper. Gratis. * 
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KCHEYDLEUR, FREDERIC D., AND SCHENCK,, 


Lr 
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Results in Foreign Languages at the University 
of Wisconsin, 1943-1949. Foreword by Kenneth 
Little. Bulletin of the University of Wisconsin, Serial 
No. 3194, General Series No. 2968. Madison, Wis.: 
Bureau of Guidance, the University, June 1950. Pp. 63. 
Paper. Gratis. * (PA 25: 5633) 

[ B92] 
Carre, H. Harrison. The Application of Meas- 
urement to Health and Physical Education. New 


York: Prentice-Hall, Inc., 1945. Pp. xvi, 415. For lat- 
est edition, see Воз. * (РА 19: 1910) 

J Appl Psychol 33:98-9 F *49. С. H. McCloy. 
* There is a well-written and sound treatment 
of the problems and possibilities associated with 
the technique of somatotyping. This is followed 
by an excellent discussion of measurement in 
the field of posture. The author gives a some- 
what undue amount of space to the so-called 
“physical fitness index,” which is the percentage 
residual from the regression value of a general 
strength test. In the opinion of the present 
writer, far too much value is assigned to this 
test, and it is assigned attributes which it does 
not deserve. While strength is important in 
physical fitness, it does not deserve the reverence 
given here. * In the part of the book given over 
to tests of social efficiency, the author has in- 
troduced a section that is new to testing text- 
books in this field. He discusses—and gives 
numerous references to—a number of tests in 
the field of personality studies. This marks an 
Advance in this field and these tests should be 
given more prominence in physical education 
and health studies as time goes on and as these 
tests improve. In the field of skill tests, because 
of the large numbers of such tests, the author 
has had to choose a few of the more important 
ones to describe, and to give bibliographical геї- 
erences for the others, His choices have, on the 
whole, been good. The same has been true of 
his presentation of knowledge tests. * On the 
whole, the book presents a number of fresh view- 
Points and is a useful text and reference book in 
Its field, 


For an additional review, see 3:805. 


Л [B93] 
mr H. Harrison. Application of Measure- 
did to Health and Physical Education, Second 
xvii ion. New York: Prentice-Hall, Inc., 1950. Pp. 
(PA 493. Text edition, $4.95; trade edition, $6 м 
Hof, 25:3384) For reviews of earlier editions, see 
92 and 3: 805-6. 
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Examination Board. Forty- 
Director. New 
Paper. $0.50. * 
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[ B95] 
KCOoLLEGE ENTRANCE EXAMINATION Boarp. 48th An- 
nual Report of the Director. New York: the 
Board, 1948. Pp. xv, 112. Paper. $0.50. * 

[ B96] 
KCoLLEGE Entrance Examination Волвр, 49th An- 
nual Report of the Director. New York: the 
Board, 1949. Pp. iv, 83. Paper. $0.50. * (PA 25: 4848) 

[ B97] 
*Cottece ENTRANCE Examination Boamp. 50th An- 
nual Report of the Director. New York: the Board, 
1950. Pp. iii, 83. Paper. $0.50. * 

[ B98] 
*Conrap, Hersert S. Characteristics and Uses of 


Item-Analysis Data. American Psychological Asso- 
ciation, Psychological Monographs: General and Ap- 
lied, Vol. 62, No. 8, Whole No. 295. Washington, 

.C.: the Association, Inc., 1948. Pp. vi, 49. Paper. 
$1.00. * (PA 24:2937) 

[ B99] 

Соок, Stuart W., EDITOR. Psychological Research 
on Radar Observer Training. Army Air Forces 
Aviation Psychology Program Research Reports, Re- 
port No. 12. Washington, D.C.: U.S. Government 
Printing Office, 1947. Pp. x, 340. Out of print. * (PA 
22: 4139) à 

J Consult Psychol 12:123 Mr-Ap *48. Lau- 
rance F. Shaffer. * 'The report contains a clear 
picture of the work of the radar observer, and 
of the aptitude and achievement tests used 
in connection with his selection and train- 


ing. 

Psychol В 46:518-9 N *49. C. S. Bridgman. 
* Apart from having value as a record of war- 
time psychological contributions, the report will 
provide information for those interested in the 
development of selection tests and standardized 
proficiency tests, since it was in these two fields 
that the project concentrated its efforts. * The 
complexities of validating selection tests are 
thoroughly discussed in a penultimate chapter 
describing the results of two studies employing 
three batteries of selection tests validated against 
training course grades, against an air-crew clas- 
sification “radar stanine,” and, in part, against 
radar bombing error calculated from camera 
records obtained during aerial training flights. 
In spite of the besetting difficulties, a number of 
significant correlations were obtained. Since the 
results of these studies were available too late 
for application in the wartime selection pro- 
gram, their greatest value will lie in indicating 
to the uninitiated the practical problems which 
will be present in developing and evaluating 
tests of this sort. eon i 
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[ B100] 
ЖСовхк. Untversiry, UNiversiry TESTING AND 
Service Bureau. Cornell University Test List, 
1950 Revision. Ithaca, N.Y.: the Bureau, 1950. Pp. 
ii, 63. Paper, mimeographed. Supplement, January 
1952. Pp. 5. Paper, mimeographed. $1.00. * 

[ B101] 
*CnoNBACH, Lee J. Essentials of Psychological 
Testing. New York: Harper & Brothers, 1949. Pp. 
xiii, 475. $5.00. * (PA 24: 647) 

Am J Orthopsychiatry 20:848-9 О * 50. Adolf 
G. Woltmann. * Cronbach never loses sight of 
the fact that both theoretical psychometrics and 
practical psychology are involved. Although in- 
tended primarily for students, his book offers a 
penetrating analysis of the topographical anat- 
omy of mental testing which the worker in the 
field has forgotten or perhaps never mastered 
completely, and is thus a refresher source for 
the practicing psychologist. * a special merit of 
... [this book is that it reminds] the psychologist 
that his real and undisputable strength lies in a 
sound scientific training. The present trend in 
clinical psychology is toward mastery of pro- 
jective methods and the practice of psychother- 
apy. This new development is certainly wel- 
comed and desirable, but will be doomed to fail- 
ure if the psychologist foregoes his rigorous 
training in the field of mental testing. The sci- 
entific basis of mental testing serves as a check 
and prevents intuition from getting lost in hypo- 
thetical speculations. 

Am J Psychol 63:155 Ja’50. Madison Bentley. 
* far more than a descriptive catalog of tests * 
It is impossible that an intelligent student of this 
well-written book should be or remain a mere 
routine “giver of tests and computer of results." 
* Brief but intelligent expositions are given of 
Factor Analysis and Projective Techniques. The 
work appears destined to serve many persons 
and problems in the wide fields of testing. 

Austral J Psychol 2:63—4 Je ' 50. P. Lafitte. * 
The variety of psychometric instruments of the 
usual kinds is well represented, and the com- 
parative discussion of the functions and advan- 
tages of each kind is sound. The psychometric 
techniques mentioned are treated in a rather 
elementary way, which is adequate for the pur- 
pose of the book, except for the discussion of 
multiple correlation which, though sensible, is 
unduly short. Among the detailed critical evalu- 
ations of tests, the chapters on the Binet and 
Wechsler scales are unusually clear and compre- 
hensive. The chapter on other tests of general 
ability is also good, as are two chapters which 
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break new ground in books of this sort: the one 
on observation in the kind of miniature real life 
situations developed as a testing method in re. 
cent years, the other on projective techniques, 
Both these latter, and the chapter on the use of 
test results in counselling, are superficial in 
comparison with the complexity of these topics, 
but it is an achievement to have included such 
brief and sensible discussions, which fit well 
into the general argument. The chapters which 
cover most ground are less satisfactory. There 
are four of these, on tests of special abilities, per- 
sonality questionnaires, interest inventories and 
attitude surveys. All try to incorporate rather 
too much to leave room for critical evaluation. 
'The treatment of criteria and statistical predic- 
tion (chap. 11) is rather scanty and superficial, 
Chap. 9, on factor analysis, is particularly weak. 
The author tries hard to avoid the dangers of 
reifying factors, but with indifferent success: 
in consequence, he considerably over-rates the 
usefulness of factor analysis as a means of classi- 
fication. Generally, the book would be better if 
it introduced more of the basic principles of 
psychology, in order to relate these to the basic 
principles of testing. While it is more than a 
compilation of test information, it has two seri- 
ous limitations of scope. It is restricted almost 
entirely to American work (only 8 of the 654 
references come from other countries), so that 
the critical evaluation takes almost no notice of 
the work of several important schools. This is 
particularly notable in the chapter on factor 
analysis, in which the British school has only 
one reference, and that to Spearman. Further, 
the author, in common with most psychological 
testers, glances at the difficult problems of socia 
interaction which are raised by the use of p^ 
especially selection tests but rather жш » 
these problems will usually solve thema 
Here, if anywhere, a solid critical discussion 
would have been of use. Despite thes mee 
nesses, the book is largely successful in its al А 
and will doubtless be found of considerable ЧУ 
in teaching. The price, though not unusual 
high. 4 
El Sch J 50:414-5 Mr ’50. Arthur E. dd 
ler. * The extensive field of achievement tests i 
treated in somewhat sketchy fashion 10 à e 
chapter. * Most of the book is very well WE 
An informal, conversational, almost folksy | М 
which characterizes the brief first chapter 
placed in the rest of the book by a stralg effec- 
ward, expository style which is clear an 
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tive. * The book, on the whole, is hardly one to 
be read independently by a person with no pre- 
vious experience in the field. It contains much 
that would be of interest and value to an inexpe- 
rienced person, but portions of the chapters on 
interpreting test scores, how to choose tests, fac- 
tor analysis, and problems in prediction would 
probably be somewhat beyond him. * should be 
helpful to the test specialists who occasionally 
need to turn to reference material on basic con- 
cepts or technical questions. * admirably suited 
for use as a textbook in measurement. The 
thought-stimulating questions found at frequent 
intervals throughout the book should be espe- 
cially useful in promoting class discussion. * 
Criticisms of a few minor, or isolated, points 
may be mentioned. * there are tables in which 
representative tests are named and briefly char- 
acterized. A feature of these tables is brief com- 
ments of reviewers. Many of these comments 
are exceedingly short excerpts from reviews 
published elsewhere. The use of such brief ex- 
cerpts from reviews that are often uneven in 
quality is a risky procedure at best, for the ex- 
cerpts are more likely to leave a mistaken im- 
pression with the reader than would be the case 
ifthey were read in context. For instance, among 
the comments of reviewers of reading tests are 
quotations from reviews of the Iowa Silent Read- 
ing Tests and the Sangren-Woody Reading 
Test (p. 293). These quotations would cause 
the uninformed reader to think that the Sangren- 
Woody test is a much more dependable diagnos- 
tic instrument than is the Iowa test, although re- 
search data would rate these two tests about the 
same in diagnostic value, with the Iowa test 
Probably having a slight advantage. A number 
of instances of this kind suggest that the book 
gains little from the abbreviated comments of 
test reviewers, * The discussions of types of 
tests and of specific tests are helpful and fairly 
authoritative, but these are not the book’s main 
Contribution. Cronbach’s primary purpose was 
to present an organized, professional treatment 
of the basic concepts and techniques of psycho- 
logical measurement. He has achieved this pur- 
Pose very well indeed. The dependability, com- 
Petence, and clarity with which Cronbach has 
Presented complicated concepts and techniques 
should commend his book to numerous workers 
engaged in testing and counseling persons at all 
age levels, 

J Abn & Social Psychol 45:783-5 О'50. Clare 

right Thompson. This book....deserves its ti- 
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tle, for here are to be found the real essentials 
of psychological testing. Written on the under- 
graduate level, presumably for the beginning 
testing course which serves both psychology and 
education students, this text contains remark- 
ably little that will have to be unlearned in more 
advanced study. Dr. Cronbach’s position is that 
the day has passed when a text on testing can 
handle all published tests. Rather, he gives as 
his goal, that this book prepare the student to 
evaluate for himself any test in which he becomes 
interested. Actually, it does more than this. 
There are few, if any, omissions of important 
tests; fewer omissions than in many texts of 
the compendium type. And, although he does 
acquire tools for making his own evaluations, 
the student is both told the names of additional 
instruments in each area, and given specific 
references to them. Discussions of particular 
tests are excellent, focusing upon the primary 
problem posed by each in contemporary think- 
ing. The section on the Wechsler-Bellevue, for 
example, contains an exemplary discussion of 
the pitfalls of profile analysis in general. De- 
scriptions of “self-report” tests of the question- 
naire variety are preceded by a differentiation 
between empirical validity and face validity. 
This becomes important in contrasting the Min- 
nesota Multiphasic with the Bernreuter Per- 
sonality Inventories, or the Strong Vocational 
Interest Blank with the Kuder Preference 
Record. These two sections, among several oth- 
ers, might be read with profit by most practis- 
ing psychologists of adequate training and ex- 
perience. * In trying to present impartially 
many different points of view, violence is done 
to some. The section on the use of.tests by the 
client-centered counselor presents the method of 
Bixler, as is made explicit for some but not all 
of the presentation. That this method deviates 
quite widely from the way in which most client- 
centered counselors would handle the question 
of tests is never stated. Following this section 
we find the statement that “When a case 15 re- 
ferred for counseling, rather than coming in vol- 
untarily, the counselor cannot apply client- 
centered methods fully.” (р. 363.) The litera- 
ture, from the earliest references on this method 
to the present, is replete with instances where 
exactly this has been accomplished. Covering so 
wide a range in less than five hundred pages 
necessitates condensing some sections until the 
es as to whether their inclusion 


question aris 5 
serves any useful purpose. Such a question might 
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be raised about the twenty pages devoted to the 
projective techniques. Table 68, listing Ror- 
schach scoring, is extremely confusing yet is 
already just short of unwieldy. Dr. Cronbach 
has obviously covered the appropriate literature 
with critical care, quoting only studies of merit. 
Among the exercises for the student are several 
good examples of errors in the literature, such 
as #6 on p. 52 and #9 on p. 55. Also among 
the exercises for the student, however, are many 
which require the formulation of an hypothesis 
where such an hypothesis can be little short of a 
guess. A series of questions asks the student, 
after seeing from one to six items of a test, to 
“decide what mental activities it calls for" (#17 
p. 289), to "describe the abilities required" 
(#22 p. 179) or to guess “which of the primary 
abilities seem to be represented" in the test in 
question (#18 p. 207). Again, the student is 
asked to “prepare a set of seven statements re- 
flecting approximately equally spaced positions 
along the attitude continuum for 'travelling by 
commercial airlines’ " (p. 370 #4) ; to decide, 
after reading nine comments made by a boy about 
his drawings, “Which pictures seem to be most 
highly individual in content, unlike those of 
other seven-year-olds?" (p. 407 #:28a) ; and to 
answer of a “test of self-control, requiring the 
pupil to work in the presence of attractive dis- 
tractions....Does this measure the same motiva- 
tional factors as the story-completion test of per- 
sistence?" (p. 419 3£7). Obviously, although 
one can advance a working hypothesis in the 
first of these items, none of them can be de- 
cided without experimentation, and, in the case 
of the last two, without experimental evidence, 
it is fruitless to do more than pose the questions 
raised. The instructor might well add to many 
of the exercises the additional assignment of 
designing a critical experiment to test the arm- 
chair thinking the student is asked to do. These 
are, of course, not major criticisms. As it stands, 
Essentials of Psychological Testing seems to me 
to be the most adequate text on the subject. With 
suitable explanations, omissions, and additions, 
it would leave little to be desired. Dr. Cron- 
bach's own teaching experience is apparent 
throughout and has stood him in good stead. 
The book is not an easy one; the writing is suc- 
cinct, and a high level of motivation and intelli- 
gence on the part of the reader is assumed. It 
should be a challenging text to both student and 
teacher, primarily because it should lead the 
able student to study more intensively many of 
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the areas covered. Because of the breadth of 
topics included, the instructor can hardly help 
but increase his own fund of information in guid- 
ing such study. 

J Ed Psychol 42:124-6 F "51. Roger T. Loy. 
non. * Cronbach has set himself the task of writ- 
ing a textbook for the beginning measurement 
course, in which emphasis is on basic concepts 
and principles of testing are illuminated by in- 
tensive study of certain selected tests, He wisely 
refrains from attempting to introduce the student 
to all, or even a great number, of the available 
instruments, preferring to inculcate general 
principles that will enable the student to evalu- 
ate any test. Despite this approach, the book is 
in no wise overly theoretical or abstract; the 
principles typically emerge from concrete, de- 
tailed discussions of specific tests or studies, * 
The chapters vary in quality; those on Binet- 
type tests and diagnostic tests (represented by 
the Wechsler-Bellevue test) come much closer 
„to fulfilling the author's purposes than do the 
chapters on tests of special abilities and on inter- 
est measures, which tend to be more nearly of 
the cataloging type. Cronbach's approach gives 
rise to problems of organization. Thus, the chap- 
ter on tests of general ability includes a form 
for analyzing tests which can be used with equal 
appropriateness for appraisal of almost any type 
of test. Similarly, the chapter on tests of special 
abilities includes a section on diagnostic test 
batteries which would appear to be more nearly 
related to the tests of general ability covered in 
another chapter. The discussion of the Wechsler- 
Bellevue test gives rise to certain general obser- 
vations on reliability and validity of diagnostic 
measures which are equally pertinent for many 
of the other types of tests discussed. It may be 
that the particular location within the book of 
the discussion of a topic is not too important, 
provided that the entire volume is systematically 
covered during a course. The reviewer hazards 
a guess, however, that this type of treatment 
does not facilitate generalization of principles 
concerning validity, reliability, norms, etc. п 
one type of test to another. The reviewer foun 
himself uneasy about some of the defit 
proposed by Cronbach. Specifically, to say hat 
test is valid to the degree that we know W s 
it measures or predicts" (page 48) seems 4 
improvement over the usual formulation; e 
is the definition of a test as “а systematic Pt 
cedure for comparing the behavior of tw? à 

» > et WI 
more persons" (page 11) likely to me 
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universal approval. Cronbach accepts the point 
of view that an attempt to predict underlies every 
use of testing, and his treatment of validity in 
several places reflects acceptance of this postu- 
late. The book is readable by the beginning stu- 
dent in measurements, and the student who mas- 
ters its contents will have a wide acquaintance 
with various types of tests as well as sound 
bases for evaluating psychological measures. 
There is an abundance—perhaps an over- 
abundance—of research data and citations from 
the literature. Cronbach has favored the more 
recent literature (more than half of about five 
hundred twenty studies cited have been pub- 
lished since 1942), which fact may tend to give 
the beginning student a distorted historical per- 
spective in the field. Particularly praiseworthy 
are the thought-provoking questions liberally 
provided throughout the text. These questions 
are well calculated to further the author's goal 
of developing understanding. This reviewer, 
however, wishes that the questions were not in- 
terspersed as they are in the body of the text, but 
were concentrated at the ends of the chapters. 
In this respect and in others, the designer of the 
book has done the author no favor; typograph- 
ically, the book leaves much to be desired. Ap- 
pendices include a list of reviews of studies of 
testing (from which mention of the AERA 
periodical review of psychological tests is 
strangely missing), and a list of test publishers 
and distributors. 

J Nerv & Mental Dis 112:89-90 Jl.'50. 
(Zygmunt A.) Piotrowski. * not the usual 
type of statistical treatise. There are no logico- 
mathematical derivations of statistical concepts 
and formulae. The book describes the pro- 
cedures involved in determining the reliability 
and validity of tests, offers a description of the 
types of tests used most frequently without 
evaluating the extent and quality of psychologic 
conclusions deductible from each test. The au- 
thor is a statistician and not a clinician. His re- 
marks about the patients’ reactions to tests are 
infrequent, incidental and haphazard; they do 
not pertain to the common and important clini- 
cal problems encountered in clinical practice. 
Anyone wishing to acquaint himself with the 
Procedures necessary to determine the value of 
objective psychologic tests will find this tech- 
nical book useful. 

Occupations 29:386-7 F ’51. John C. Flan- 
адал. * There are two notable features through- 
out the book which appear to add much to the 
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presentation. One of these is the provision of 
carefully laid out step-by-step computing guides. 
The second is the presentation in the form of 
figures and tables of interesting representative 
information. * In general, the treatment is un- 
usually full of insight and is sound. Rather than 
list all of the many praiseworthy discussions, 
the few minor points to which this reviewer 
takes exception will be noted here. The treat- 
ment of standard errors of measurement of I.Q.’s 
is unnecessarily superficial in noting increases 
of these values for higher I.O.'s but failing to 
provide the simple mathematical explanation 
for it. The suggestion of scoring only the last 
five of 20 trials on motor tests runs directly 
counter to extensive Air Force experience which 
established the first trials as the most valid for 
a variety of such tests. Multiple cut-offs are 
strongly recommended over multiple correla- 
tion procedures solely on the basis of certain 
theoretical speculations and an anecdote. The 
reviewer's experience with data from large-scale 
studies does not support this recommendation. 
Certain sections, particularly the one on achieve- 
ment tests, suggest a lack of adequate first-hand 
familiarity and experience with certain types of 
tests. The last part includes discussions of per- 
sonality, interest, and attitude tests, and the ob- 
servation of behavior in normal and in test situ- 
ations. It closes with a chapter on projective 
techniques. This part to an even greater extent 
than previous parts provides a sophisticated 
view of techniques and principles. In this case 
they concern measures of personality or, in the 
author’s language, typical behavior. The prob- 
lems involved in techniques such as those used 
in the tests of Strong, Bernreuter, Kuder, and 
Rorschach are reviewed with a thoroughness 
and competence not previously available in texts 
in this field. One of the most distinctive things 
about the book is its current flavor. This is il- 
lustrated by the several hundred references cited 
and readings suggested. Only 5 per cent of these 
publications appeared before 1930; 24 per cent 
were issued in the period 1930-1939, and the 
remaining 71 per cent were published between 
1940 and 1949. The book may be regarded as 
a penetrating review of these materials. As such, 
it will be welcomed as an addition and supple- 
ment to books reviewing the earlier literature 
and to those reporting the results of extensive 
experience in the development and use of tests. 
Q R Biol 25:116 Mr "50. Alfred L. Baldwin. 
* the author has attempted to provide the stu- 
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dent with criteria for making intelligent choices 
of tests which will suit the particular needs of 
the situation. The book seems well designed to 
achieve the author’s purpose. Among other dis- 
cussions there is an excellent one on reliability 
and validity, a good section describing the im- 
portance of the subject’s perception of the test 
situation, and very competent concise reviews 
of material on'a number of specific tests. In dis- 
cussing tests of ability the contributions of fac- 
tor analysis are well described. * The discussion 
of projective tests seems somewhat weaker than 
the rest of the volume. 


[B102] 
ж[Сохміхснлм, Котн.] How to Construct a So- 
ciogram. New York: Bureau of Publications, Teach- 
ers College, Columbia University, 1947. Pp. iv, 37. 
Paper, lithotyped. $0.50. * 

Clearing House 22:565-6 My '48. G. Der- 
wood Baker. * a very useful guide * a how-to- 
do-it manual that describes this technique in 
sufficient detail to enable the teacher to approach 
the study of intergroup social relations with 
confidence. The diagrams, charts, and forms 
make the process readily understandable, and 
an excellent bibliography is included for those 
who wish to study the technique more fully. 
The sociogram is a convenient tool for elemen- 
tary-school teachers, homeroom teachers, and 
teachers of core groups who are concerned 
with establishing good personal-social relations 
within their groups or who wish to identify those 
pupils who need guidance assistance in their 
social relations. 


[ B103] 
Cureton, THomas Kirk. Physical Fitness Work- 
book: A Manual of Conditioning Exercises and 
Standards, Tests, and Rating Scales for Evaluat- 
ing Physical Fitness, Third Edition. St. Louis, 
Mo.: C. V. Mosby Co., 1947. Pp. xi, 164. Paper, spiral 
binding. $2.50. * 

Am J Pub Health 37:1193 S °47. Mabel E. 
Rugen. * Physical fitness is conceived as includ- 
ing: physique, organic efficiency and motor fit- 
ness—a concept associated with physical effi- 
ciency in an earlier era. Physique is determined 
by a series of anthropometric measures—girths, 
skeleton, fat, vital capacity, and muscle tone. 
Organic efficiency is determined by tests of re- 
spiratory fitness (e.g. breath holding), and car- 
diovascular efficiency (e.g. Schneider Test and 
adaptations). Motor fitness is determined 
through an elaborate battery of physical stunts 
and exercises designed to “measure”. flexibility, 


balance, agility, endurance, and strength. The 
soundness of this concept of physical fitness may 
be questioned since little attention is given to 
the health examination (required only before 
motor fitness tests are given), and the correction 
of remediable conditions, or to health practices 
other than exercise. In other words the condi- 
tions which must be present before the physio- 
logical benefits of exercise are likely to accrue 
are not considered in this reference. The work- 
book has greater value, albeit limited, in the 
improvement of the physical education program 
in the college than in contributing to the health 
program. 

О R Biol 23:391-2 Г '48. S. R. M. Reynolds, 
* designed as a personal record book for all un- 
dergraduates who participate in the Physical 
Fitness Program at the University of Illinois. 
The manual is rich in test procedures and scor- 
ing methods. * Except at the University of Il- 
linois, where this manual is integrated with a 
large program, this publication will be of most 
use to teachers of physical education. It should 
also serve as a stimulus to physiologists to de- 
termine in what way and to just what extent 
many of these tests do bear a relation to physical 
fitness. Their relation to age, previous personal 
history, and occupation might also be evalu- 
ated with interest and profit. 


[8104] 
ЖСоветох, ТномАз Kirk, Јв.; AsststEp BY Ё10\? 
Boys, Norris L. Ввоокемѕ, RrcHarp V. GANSLEN 
MaxwrtL R. Garret, ALFRED W. HUBBARD, PAUL 
HuNsickEm, FREDERICK W. KascH, WILLIAM Kas- 
TRINOS, BENJAMIN H. Massey, HENRY y: Моо 
NricoLAAs Srrypom, Јлсов С. Worr, AND nema 2 
WoLr. Physical Fitness of Champion Athle i 
Foreword by A. C. Ivy. Urbana, Ill.: University 
Illinois Press, 1951. Pp. xvi, 458. $7-50. * 

[ B105] y 
Cureton, THOMAS KIRK, JR.; ASSISTED BY PE 
W. Kascu, Јонм Brown, AND W. G. Moss. as 
cal Fitness Appraisal and Guidance. PM A 23! 
Mo.: C. V. Mosby Co., 1947. Рр. 566. $6.00. * ( 
4984) 

Am J Pub Health 38:1020-1 Jl 48. gis! 
E. Rugen. * Considerable attention is eien 
review of the literature dealing with metho a 
appraising body types, weight, cardio 
and respiratory fitness, strength and mom. | in 
formance. In fact the strength of the book pik 
the extensive bibliography it presents, e 
many basic studies are cited. The project eol 
ducted with rro first year medical gin 
stimulate their interest in physical MEET i 
and increase their understanding of phys!e 
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ness constitutes the “original” material for this 
book. Tests used with this group included an- 
thropometrical measurements, somatotype rat- 
ings, the Schneider and other tests of cardiovas- 
cular efficiency, vital capacity and other means 
of respiratory fitness, and a battery of motor 
performance tests including the one mile run. 
The discussion of these tests is distributed 
throughout the reference, making it difficult to 
discover the total results and conclusions of this 
study. Suggestions for the health and fitness 
guidance of the individual based on the results 
of these various tests also appear in a number of 
chapters. For the most part these suggestions 
stress the importance of exercise, especially “en- 
durance exercises." Generalizations, based pre- 
sumably on the data presented, to support this 
reurring suggestion may be questioned in a 
number of instances; for example, mental hy- 
giene guidance for body types (p. 129), dieting 
(р. 155); exercise to improve cardiovascular 
condition (p. 189), and a low basal metabolism 
(p. 304). From the quality of the writing one 
gains the impression that the literature cited and 
the data presented have not always been inter- 
preted too objectively and that the authors' ap- 
parent conviction of the importance of endur- 
ance exercise as a means of attaining optimal 
physical fitness has influenced particularly the 
suggestions given on guidance. Such expres- 
sion will lend strength to the criticisms, certain 
to be made, that the book presents а biased view- 
point and overlooks or passes lightly over inter- 
pretations that do not support the thesis that 
endurance exercise is the most important factor 
in physical fitness. Physiologists, physicians, 
public health workers, as well as educators and 
others who read this book should do so critically 
to gain the greatest value. 

Brit Med J 4557:883 My 8°48. Adolphe Abra- 
hams. What does physical fitness mean? This, 
the opening sentence of this remarkably com- 
Prehensive work, prompts the supplementary 
inquiry, Fit for what? In the past, attention has 

ееп particularly, indeed almost exclusively, di- 
rected to the circulation as the limiting factor in 
Physical endurance. It is clear that specific tests 
must be instituted to assess capability for а par- 
ticular purpose; for emotional, mental, and so- 
ү fitness are also important in adaptation to 
"s and living—problems which, the author in- 

lates, will be investigated in future. As we 
Must repeat ad libitum, perhaps ad nauseam, it 
is to the U.S.A. that we look for research in 
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this subject. Interest and opportunities there 
permit grand-scale organization and operations 
with unlimited subjects for investigation, luxuri- 
ous equipment, and staff to conduct the investi- 
gation. All of these are reflected in the present 
work—a stupendous compilation of details, 
page after page of charts and graphs and col- 
umns recording the results of tests of physical 
fitness and physique ratings. What are the con- 
clusions? To a great extent only an elaboration 
of the obvious, a reminder of what is already 
known, a repetition of the opinions expressed by 
workers in all parts of the world. We see, how- 
ever, the beginning of an important study of 
“body types.” “Just as we can purchase 10 dol- 
lar suits or 100 dollar suits, so some of us in- 
herit 10 dollar organs and others тоо dollar 
organs.” How far can we estimate virility and 
longevity on the basis of certain constitutional 
body types? We see in this work’s subtitle the 
claim that “appraisal” permits of "guidance" to 
utilize natural advantages to the utmost, to ac- 
cept the limitations imposed by natural deficien- 
cies and so avoid disappointments in attempting 
to achieve the impossible. * 

Lancet 254:328 Е 28 48. * All first-year stu- 
dents in the colleges were submitted to what is 
appropriately described as a "battery of tests" 
on a scale which we with our comparatively 
simple criteria can only regard as colossal. * It 
is unlikely that many readers will desire to 
peruse and study the numerous tables and charts 
that occupy the major portion of this encyclo- 
paedic production. Nor can it be said that such 
application can be of any value in itself. It is 
enough to browse, to seek for titbits to satisfy 
the palate from the heavy fare submitted for 
mastication and digestion. Furthermore, al- 
though the greatest credit must be accorded to 
the patience and industry that provided such 
elaboration, it cannot be said that much has been 
gleaned to replace or displace the conclusions of 
previous workers with their old-fashioned sim- 
plicity. * 

О R Biol 23:390-1 D’48.S. R. M. Reynolds. 
* The chapters that make up the bulk of this 
textbook are full of facts, methods of measure- 
ment, methods of tabulating data and for scor- 
ing fitness. These will be of utmost value to 
those who are concerned with physical educa- 
tion. The book ought to be of equal value to 
physiologists, psychologists, and sociologists, 
however, and should stimulate them to inquire: 
Are mere physical endurance, stamina, and 
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motor skill necessary conditions for a produc- 
tive, happy, and socially well adjusted life? The 
physiologist will look at many of these tests with 
a critical eye. There is no doubt that they meas- 
ure something, but what is it? Are they tests of 
tests? The discussion of somatotypes, for ex- 
ample, is complete and will satisfy those who 
must measure somatotypes. Yet thereis no doubt 
that many of the conclusions reached in this dis- 
cussion are empirical and lack a scientific basis 
of fact. The discussion of the Cureton technic 
for measuring body fat shows it to be empirical, 
too. Data are obtained by it and described in de- 
tail. Now no one will deny that extremes of fat 
and thinness are undesirable, but what consti- 
tutes a significant increment of fat? In this test, 
numerical scores are obtained and letter grades 
are assigned for fat in cheek, abdomen, hip, glu- 
teal region, and rear and front thigh. A critical 
reader will want to know how a score of D is any 
more indicative of poor “physical fitness” than 
one of C. Nowhere in the discussion of the re- 
lation of weight to physical fitness is there any 
mention or evaluation of the specific gravity test 
developed by Keyes. Is it useful or not? In a 
book on the appraisal of physical fitness, the 
student wants full discussion and evaluation by 
the experts. The section on tests for cardiovas- 
cular fitness are most complete, and all who need 
to apply them will find here a useful compen- 
dium. He will also find a fair account of their 
physiological basis. There is again lacking, how- 
ever, a critical evaluation of how the results of 
these tests are actually related to physical fit- 
ness in a fundamental sense. The pulse-scoring 
tests, for example, are among the most favored 
tests for evaluating physical fitness; yet it is 
known that frequently older individuals may ob- 
tain high scores, or that individuals without 
much endurance (e.g., those with rheumatic or 
other types of heart disease) may score high. 
This is not so much a criticism of the book under 
review as it is a plea for less emphasis generally 
on numerical scores as a means of judging fit- 
ness. The use of the heartometer provides a case 
in point. An excellent description of the use of 
this instrument for analyzing pulse waves is 
given, and many tests with it are cited. Numeri- 
cal data concerning the characteristics of pulse 
waves are quoted in extenso, with standard 
scores for excellent, very good, good, average, 
fair, poor, and very poor. The percentile distri- 
bution of these is given. If, then, we speak of the 
frequency of a score, that is one thing, but if one 
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goes on to state what is a “good” and what is 4 
“poor” score, he transgresses, it would seem, the 
limits of scientific objectivity. Other criticisms 
are equally apt. One might question the use made 
by the authors of the “angle of obliquity” in the 
systolic peaks of the heartogram pulse wave, 
since it is constructed arbitrarily with one 
straight side although the writing arm inscribes 
a radius of curvature. One might go on to ques- 
tion the significance of one or another aspect of 
these tests used to measure fitness. To do о, 
however, would detract unnecessarily from the 
value of a large job which has been done clearly 
and well. If a deficiency exists, it resides bas- 
ically in the failure of physiologists to define 
clearly what fitness and fatigue actually are. The 
psychological aspects of motivation are of ut- 
most importance, yet they do not, to judge from 
this volume, enter into ordinary procedures for 
evaluating fitness. 

Sci 107:403 Ap 16 '48. David B. Dill. This 
book represents an ambitious and laudable at- 
tempt, sponsored by the University of Illinois, 
to appraise physical fitness and to lay down prin- 
ciples for its improvement. A battery of fitness 
tests was applied to students entering the Uni- 
versity's Colleges of Medicine, Dentistry, and 
Pharmacy. The techniques and results are pre- 
sented, and the author, who administered the 
study, interprets the results and provides hints 
for the guidance of physical educators. The 
study signifies a growing interest in fitness and 
health; the "Committee on Physical Fitness 
that supported the study and encouraged the 
publication of results is to be congratulated. 
Since other universities are alert to the prob- 
lem, Physical fitness appraisal and guidance may 
well stimulate interest in other quantitative 
studies designed to evaluate the author's new 
techniques. It may be appropriate to refer to 
certain sections of Chapter 5, since these seem 
rather superficial. The author refers briefly d 
the major role played by excess fat depost 
in lowering fitness and favoring degenerativ 
disease. Convincing graphic evidence might W 
have been prepared from the morbidity pen 
of insurance companies. Seven methods * 
recommended for reducing, but these do a 
include a clear statement of the basic puer 
caloric intake must be less than caloric n ced 
Further, the idea that sweating and viia 
water intake modify fat deposition is, И? m 
nately, perpetuated. Finally, some rea 
question the recommendations that the 
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son should eat more bread and less cvy cereal, 
more turkey and less cheese. But these are 
minor points that can be corrected in later edi- 
tions. The important fact is that the author has 
attempted to provide a basis for greater physi- 
cal fitness and health. The crucibles of experi- 
ence and of physiological research will deter- 
mine the soundness of his approach. 


[B106] 
#Dant, Loraine Anson. Public School Audiome- 
try: Principles and Methods. Foreword by Wen- 
dell Johnson; Preface by Dean M. Lierle; Introduc- 
tion by Robert West. Danville, Ill.: Interstate Print- 
ers & Publishers, Inc., 1949. Pp. viii, 290. $3.00. * (РА 
25:2009) 

J Ed Psychol 42:64 Ja ' 51. Miles A. Tinker. 
Up to the present the measurement of hearing 
status among school children has not received 
the attention nor the support that it deserves. 
This is partly due to inconsistent techniques 
and standards in the various surveys and the 
programs for conservation of hearing. Never- 
theless, it is well established that there are a 
large number of children of school age who have 
serious hearing impairments. For purposes of 
medical care, and for aid in educational and 
personality adjustment, it is important that these 
children be identified. The material in the pres- 
ent book should both promote and facilitate more 
adequate measurement of hearing status. * pre- 
sents useful information for all those interested 
in the hearing problems of school children. No 
school nurse or other person concerned with 
audiometry can afford to be without it. 

J Excep Child 16:254 My ’50. Warren Н. 
Gardner. If a person planned to give hearing tests 
or start a hearing conservation program, the 
first step should be to read “Public School Audi- 
ometry” from cover to cover. In the first part, 
entitled, “Factors to be Considered in a Hear- 
ing Conservation Program,” the author has as- 
sembled limited but adequate information about 
oe : incidence of hearing defects, causes of 
ш, debunking of queer ideas about hear- 
sie ч deafness, success in modern treatment, 
d € socio-economic problems of the hard of 
mater These facts give the beginner working 
cae 5 that enable him to develop a hearing 

Prenon program with intelligence and con- 
tonside Although the average reader might 
ener many instructions trivial, the experi- 
stallin ester appreciates their importance in fore- 
other Е mishaps and misunderstandings that 

Se would prevent standardized proce- 
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dures and successful results. An important part 
of the book is the Appendices which are replete 
with forms, reports, supplies and letters neces- 
sary to the successful conduct and follow-up of 
a testing program. Although the incidences of 
hearing impairment submitted from the author's 
survey results are much higher than averages in 
most states, her analyses and classifications may 
be helpful to others in preparing reports. This 
manual, together with texts and writings pub- 
lished since 1944, should be most serviceable 
to hearing conservation specialists and students 
in the field of audiology. 

Nerv Child 9:93 Ja '51. From a rather un- 
known publisher comes this really excellent 
handbook in such a badly neglected field. The' 
book is very clearly conceived and very well 
executed, balancing sound theoretical concepts 
with practical advice. It is a little weak in ref- 
erences, probably due to the standing of the au- 
thor in the general field of work with handi- 
capped children. * 


[B107] 
XDairevy, Joun T., Ерток. Psychological Research 
on Flight Engineer Training. Army Air Forces 
Aviation Psychology Program Research Reports, Re- 
port No. 13. Washington, D.C.: U.S, Government 
Printing Office, 1947. Pp. vii, 227. Out of print. * (PA 
22:4141) 

J Consult Psychol 12:282 JI-Ag ' 48. Laurance 
F. Shaffer. * 'The report gives a job analysis of 
the flight engineer's functions, and accounts of 
the development of criteria of proficiency and of 
selection procedures. Predictive tests for select- 
ing men for training as flight engineers were ap- 
proximately as successful as those for pilots and 
navigators. There are implications for the selec- 
tion of other types of engineering personnel. 

Psychol В 46:519—20 N '49. G. Raymond 
Stone. * The present report is....notable for its 
account of work completed (chs. 4 & 5) in the 
face of what can only be called major difficul- 


ties. * 
For a review of the series, see B2. 
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un C. The Teachers’ Handbook o 

blogs bann Marking and Records. Lon- 

don: Crosby Lockwood & Son Ltd., 1949. Pp. 79. 47. 
6d. * 


Brit J Ed Psychol 20:206 N ' 50. * The princi- 
ples involved in the construction of tests are 


simply explained and examples are given of ob- 


jective tests in a wide variety of subjects. The 


simple statistics which are necessary for the 
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conversion of raw scores into standard scores, 
are handled lucidly at a level which will be un- 
derstandable by most teachers. There is a use- 
ful appendix of work sheets and tables which 
should reduce computation to the minimum. 
El Sch J 50:359-60 Е ?5о. C. L. Winters, Jr. 
* written primarily for teachers in the English 
school systems * will probably be of little in- 
terest to American teachers for its content value 
* presents an exceedingly elementary, oversim- 
plified, and often fallacious treatment of the 
areas listed in its title. The volume does have 
limited interest in that it indicates the level at 
which English educators are dealing with the 
. problems of testing and evaluation. Тһе author 
presents testing problems to his readers at the 
level at which they were presented to school 
teachers in this country about twenty years 
ago. The book consists of five short chapters: 
(1) “The Future of Examinations," (2) “What 
Is a Test Mark?” (3) “Interpreting Standard 
Scores,” (4) “Principles of Test Construction,” 
and (5) “Instructions for Calculating Stand- 
ard Scores.” Why an author of an elementary- 
school handbook would spend two-fifths of his 
time on standard scores when so many more im- 
portant and basic problems of testing could be 
discussed is not apparent to the reviewer. The 
major emphasis in the book is on testing or 
evaluation as a tool of certification. Many of the 
more important aspects of testing, such as evalu- 
ation of the objectives of instruction and diag- 
nosis of the needs of students, are stressed but 
slightly. The author also spends a good deal of 
time in extolling the virtues of the so-called “ob- 
jective tests” and frequently passes over or ig- 
nores some of the values of the essay-type test 
because of its low reliability. His quoted low re- 
liabilities for scoring of essay-type tests are 
partly due to the lack of clearly defined criteria 
for scoring the tests. * One expects from a hand- 
book a rather elementary treatment of the sub- 
ject for a novice in the field. However, one also 
expects the elementary treatment to be factu- 
ally accurate in the areas it covers and the sim- 
plifications that are necessary to be simplifica- 
tions of exposition and not simplification of 
ideas. This handbook does not meet these cri- 
teria. 
[ B109] 


*Dar ey, J. G.; ALEXANDER, W. B.; Barney, Н. W.; 
Соок, W. W.; Ерсевтом, Н. A.; AND VaAUGHN, К. W. 
The Use of Tests in College. Foreword by T. R. 
McConnell and E. G. Williamson. American Council 
on Education Studies, Series 6, Student Personnel 


Work, No. 9. Washi Gi i 
E Ba laser di d eral, ^m 
^ J Appl Psychol 33:96 F "49. Milton E. Hahn, 
1 Although....addressed to a college audience, 
it contains much of value for the users of tests 
at most educational levels. The frame of refer- 
ence centers upon five questions, “Who shall be 
admitted ?", "How shall students choose appro- 
priate curriculums ?", "How shall we counsel 
students ?”, “How shall we measure outcomes?”, 
*How do we measure behavior?" This method 
makes the material useful to college and sec- 
ondary school administrators, counselors, and 
thoughtful instructors. The recommendations re- 
garding the use of tests are properly cautious 
and practicable. A major value of the publication 
is its interpretative approach to test use in terms 
of generalizations, rather than use of a multi 
tude of specifics related to particular tests. The 
reader is urged to consider carefully Section V, 
How shall we measure outcomes? (pp. 43-57). 
The presentation of an examination structure 
for colleges on pages 45 and 46 supplies an ex- 
cellent general framework for considering test 
results in relationship to other types of data and 
several kinds of personnel workers. In the re- 
viewer's opinion, the objectives and implications 
stated in the Foreword....are well met. It is his 
opinion also that it is a somewhat restricted but 
generally excellent presentation. Strongly rec- 
ommended reading for student personnel work- 
ers and administrators in educational institu- 
tions. 

J Consult Psychol 12:123-4 Mr-Ap 48. Lat 
rance F. Shaffer. This brief report is а simple 
presentation of the part that tests may play in 
a college program, written for college adminis- 
trators and personnel workers who are not 
trained as psychologists. The subject matter 
and recommendations tend toward the conserva 
tive side. The writing is clear and convince 
with effective use of graphic materials and 0 
brief case studies. « tuse does 

Nerv Child 7:347 J1' 48. While this boo i 
not definitely state that it would be desiral in 
curb the testing nonsense which has become | 
academic pattern of our educators and psy¢ A 
ogists, we feel that its writers wish to AA я 
siderably the unintelligent use of test shee 
human beings. 

Fuerte 27:142-3 N "46. дите F 
Brayfield. * 'The quality of the report 18 Em К 
for by the presence as contributors pA 
nent technicians and practitioners as ": 
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exander, H. W. Bailey, W. W. Cook, H. A. 
Edgerton, K. W. Vaughn, and J. G. Darley. * 
a minimum of technical jargon * The point of 
view of the committee members is that tests, 
selected, administered, and interpreted under 
optimal conditions, and taken in conjunction 
with all the other available evidence, “do fre- 
quently give information of value in forming a 
judgment or reaching a decision and give it in 
an economical way.” The authors maintain such 
a moderate, practical position throughout. How- 
ever, despite their moderation, they have pro- 
duced а document which aims to disturb the 
complacency of teachers and administrators who 
give unquestioning acceptance to traditional 
methods of student appraisal and evaluation. 
This publication should find its greatest useful- 
ness with those numerous teachers and adminis- 
trators who have remained relatively untouched 
by the many changes in the theory and practice 
of the measurement of various aspects of human 
behavior (with perhaps particular emphasis 
upon achievement testing) which have occurred 
inthe past ten to twenty years. It is brief enough 
and sufficiently readable to hold their attention 
once they have been urged, cajoled, or goaded 
into opening its covers. 

Personnel Psychol 1:497-9 w '48. Harold B. 
Pepinsky. * When one keeps іп mind the non- 
technically trained reader to whom....[the bro- 
chure is] addressed, one is impressed by... [its] 
excellence in comparison with other books about 
tests. * Chapter 7, “How Do We Measure Be- 
havior?” (pp. 65-80), should be particularly 
useful even to the trained test builder, although 
lack of reference to the measurement of inter- 
personal relations (sociometry) is a minor omis- 
sion (p. 66). * the lay reader may find the going 
Tough because the discussion is on a more tech- 
"ial plane than that in preceding chapters. 
Chapter 6, on the organization of testing services 
(рр 58-64), is an excellent "selling job" to fac- 
i members and administrators who may be 
i ical of this function. A major criticism may 
е directed at Chapter 4, entitled “How Shall 
A aga Students?” (pp. 31-42). One could 
enis or greater stress upon the fallibility of the 
| alor as instrument, however precise and 
or rate the measurements at his disposal, and 
M iod emphasis upon the role of the coun- 
prob ш helping to direct inquiry into his own 

e | and in making decisions about himself. 

ers. { ogmatic assertions are made ; one won- 

ı lor example, whether “а test of general 
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academic ability" can indeed "tease out any dis- 
crepancies (italics mine) between ability as such 
and the use of ability in getting grades" (p. 35). 
How is "the adviser" to determine whether his 
judgments are "as precise as modern develop- 
ments can make them?" -(p. 37)—such a state- 
ment is of little help without explication since 
the average faculty "adviser" either cannot or 
will not be helpfully self-critical in this connec- 
tion. The discussion of “educational adjustment” 
(pp. 38-41) leaves much to be desired: e.g., 
such terms as “basic ability,’ and “under- 
achiever” or “over-achiever” may be semantic 
smokescreens raised by the “adviser” to cover 
insufficient or inadequate information about the 
counselee. In all, Chapter 4 appears as a weak 
spot in an otherwise excellent brochure. In con- 
clusion,....T he Use of Tests in College will make 
„.[а valuable addition] to the libraries of lay 
workers in guidance and personnel work. * the 
American Council on Education....[is] to be 
commended for...[its] sponsorship of....[this] 
important [book] * 

Psychometrika 13:193-4 5 48. Wilbur S. 
Gregory. * The authors do not exaggerate or 
minimize the function of tests in the college per- 
sonnel program. In spite of the brevity of each 
of its chapters, the bulletin presents an extensive 
review of the possible uses of tests in college 
personnel work. The booklet probably contains 
little or nothing that is "new" to those who have 
been supervising testing services in colleges or 
universities, but it will be of use to such super- 
visers as a concise description of the scope and 
objectives of their services. The bulletin will be 
of use to college administrators as a guide in 
evaluating the testing services of their institu- 
tions. It will be of use to teachers of courses on 
testing and personnel work as possible supple- 
mentary reading material for their students, An 
adequate testing service in any college should 
aid the college staff to answer the questions dis- 
cussed in this book regarding admission policies, 
counseling methods, and evaluation of objectives. 
Consequently, the book will also be useful as a 

ide to those who are in the process of organiz- 
ing a counseling and testing program for their 


institution. 
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Davis, Frepertck B. Utilizing Human Talent: 
Armed Services Selection and Classification 
Procedures. Prepared for the Commission on Impli- 
cations of Armed Services Educational Programs. 
Washington, D.C.: American Council on Education, 
1947. Pp. ix, 85. Paper. $1.25. * (PA 21:2775) 

Col & Univ 23:308-9 Ја ' 48. Verne C. Fryk- 
lund. * The report, designed for general reading, 
was written by an eminent psychologist com- 
missioned in the Air Forces for service in the 
intensive AAF selection program of War II. 
The civilian implications are clearly set forth 
in Chapter II of a three-part report. Leaders in 
guidance, school administrators, and test con- 
structors should take particular notice of this 
excellent report. There are two appendices con- 
sisting of technical data of value to specialized 
workers in pursuing study of test items for pre- 
dictive tests. * If there could be a criticism of 
the valuable report, it would be that the reader 
may be left with the thought that testing could 
be given primary consideration as a guidance 
device. It has been the writer's experience that 
much continued experimentation is necessary, 
and he believes that years may pass before great 
reliability can be placed on tests in connection 
with guidance service. It is rather difficult to 
match the constancy of statistical calculation 
with the variabilities of human nature. The most 
difficult characteristic yet to be measured is that 
intangible ideal of effort and industry. Psychol- 
ogists agree that tests have limited usefulness 
and should be used in connection with other per- 
sonal data on the individual. Unfortunately, 
however, guidance counsellors, and some psy- 
chologists, refer too positively to the tests in 
their counselling procedures. Aptitude tests are 
quite reliable in identifying abilities on the ex- 
tremes of the normal curve, but prediction for 
the great middle multitude is yet a problem. This 
volume should be on the shelf of all workers in- 
terested in the welfare of youth. There is much 
valuable evidence of success in a widely-known 
selection program of War II. It is an excel- 
lent report—readable, practical, forceful, and 
highly valuable to the professional advance- 
ment of guidance workers and psycholo- 
gists. 

Occupations 26:133 N '47. John G. Jenkins. * 
Only those who served in some phase of the 
programs maintained by the Armed Services 
are likely to realize the magnitude of the task 
accomplished in this brief volume by Davis. In 
order to secure the information needed for the 
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review, the author must have read many thoy. 
sands of pages of technical reports. Not infre. 
quently the contents of a single sentence in the 
review refer to a technique which was itself 
described in technical reports amounting to hun- 
dreds of pages. The fact that Davis’s grasp of the 
vast amount of source material is uniformly high 
can only reflect great credit on his labors, * It is 
difficult to find any serious fault with the first 
chapter. Since Davis himself was with the Army 
Air Forces psychology program during the war, 
it is natural that he should draw more heavily 
upon this than upon other programs for his basic 
material. The breadth and general excellence of 
the AAF program are noteworthy. The review 
of Navy contributions seems to the writer both 
adequate and significant, except for the omission 
of the important submarine program. He does 
not neglect selection programs maintained by the 
WAC, the WAVES, the Marines, or the Coast 
Guard. If the chapter has a major weakness, it 
lies in its failure to give adequate space to pro- 
cedures developed under the office of The Ad- 
jutant General. * The volume also contains two 
technical appendices of interest to those who 
work with tests. Viewed as a whole, Daviss 
monograph is a highly competent review of very 
widely scattered professional developments, to- 
gether with a thoughtful discussion of certain 
major implications of these developments. This 
is a volume to be bought by the psychologist for 
frequent use as a reference source. Pu 
should also place it in the hands of education 
administrators as an excellent statement of the 
case for vocational orientation through the use 
of validated predictors. j 
Psychometrika 12:292-3 D "47. Wilbur 2 
Gregory. * The book contains only tuo ad 
The first chapter contains a brief outline * it 
testing programs and classification proce Wat 
used by the armed services during World ble 
II. This chapter is so brief that it is шешш 
whether it can be very meaningful to n Mí Ў 
егапѕ. The primary objective of ће dM 
parently was to establish the fact that the vie i 
services used a variety of aptitude an ТШЕ 
ciency tests in addition to education and dif 
history data in classifying each person jon d 
Almost no data are presented regarding u^ 
lidity of the procedures and tests used. Cha te 
II, "Implications for Civilian Sum lica" 
gins with the following statement: , Sid ii 
tions for civilian education may be derive aie 
the selection and classification procedures 
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in the armed forces, but a consideration of them 
indicates that they are not new; they are, gen- 
erally speaking, principles that have been advo- 
cated for many years by educators and psychol- 
ogists.” * the person who is seeking for new ideas 
which originated during the war will be disap- 
pointed * The book will be of interest and value 
to three groups: (1) the appendices will be of 
interest to psychometrists and statisticians ; (2) 
the discussion of “implications” will be of in- 
terest to school and industrial administrators as 
a summary of arguments favoring the use of 
tests for guidance and placement purposes; and 
(3) the discussion of “implications” will be of 
interest to novices in the testing field as a con- 
cise statement of widely accepted principles to 
be applied in testing work. 

Q R Biol 23:86-7 Mr 48. J. M. Stephens. * 
The author has clearly made an effort to offer 
something to each of several very different 
groups. For the relatively uninitiated there isa 
brief factual description of the selection and clas- 
sification procedures employed by the various 
services. For those who want practical sugges- 
tions with a minimum of worry about theoretical 
background, there is a presentation of ten neat 
implications. For the more technical-minded, 
there is an appendix devoted to more technical 
problems. Although comprising only 31 of the 
85 pages, the section dealing directly with the 
implications is undoubtedly the meat of the con- 
tribution. The first section merely provides gen- 
eral orientation, and the appendix is truly an 
appendix, The hurried reader, by the way, could 
safely omit the first section with: its general 
description. There is very little specific reference 
to it, or use made of it, in the development of 
the argument regarding the implications. We 
are warned that only sparse and trite implica- 
iim for civilian education should be expected 
Tom the program of the services. The service 
Programs, after all, relied chiefly on techniques 
ànd procedures derived from civilian education 
^ the first place. For some of the implications 
ЖЕДЕ is well given. We are told, for in- 
fields that many more apt students in various 
P ien be discovered, and guided more ef- 
b ntly than at present into suitable training, 
d use of better (and longer) aptitude tests, 

that the progress of these students might be 
etter evaluated through tests having norms 
on from widespread regions. Rising one cut 
E, these rather platitudinous implications, 
* find some important, if not startling, sugges- 
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tions. There is no royal road to aptitude testing. 
Although a test of fundamental scholastic apti- 
tude is useful, it is not nearly enough. There is 
usually some specific test which excels the gen- 
eral test as an indication of a special aptitude. 
These specific tests, to be of maximum use, 
must be more complex than we may like to 
think. They must be longer than those now in 
use, and must give separate measures for a 
larger number of factors than we are accustomed 
to deal with. There are other important em- 
phases regarding the influence of a restricted 
range of talent, and the importance of realistic 
validity in a criterion test, even at the cost of 
reliability. We are also cautioned not to expect 
the insight of the clinician to add appreciably to 
the predictive value of test results. There is little 
attempt at a rigorous justification of any of the 
implications. Perhaps that would have been out 
of place. But the problem might have been 
raised, and an appendix might have given some 
hints, at least, as to how we might go about 
demonstrating some of the assertions made. 
What evidence is available, for instance, to 
demonstrate that “Effective (italics added) ed- 
ucational and vocational guidance can be pro- 
vided....” ? Even if clear demonstrations are not 
possible, we might be told whether a particular 
implication is based on the fact that the services 
merely used a given technique, or that some of- 
ficers liked it, or that there is objective evidence 
of its effectiveness. One might have expected a 
richer harvest of implications and on a grander 
scale. The unique thing about the Armed Serv- 
ices, after all, was not their use of novel tech- 
niques of selection, but the sheer size of the en- 
terprises. We remember the enormous number 
of cases, the appalling degree of control, the vast 
resources in money and in technical skill. There 
was also an unparalleled opportunity for bold- 
ness, as when a large number of apparent mis- 
fits was sent through a training program merely 
to test the validity of the selective procedures. 
What are the implications from these features 
of the program ? Can the civilian agencies expect 
satisfactory results with fewer cases and with 
less direct control? Must we be prepared to 
multiply our technical staff to parallel the tech- 
nical services used by the services? Along with 
a scarcity of implications of this nature, one 
notes relatively little distinct discussion of tech- 
niques for “selecting” emotional misfits or other 
persons in need of therapy. It is probable that a 
book which would have escaped these strictures, 
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would, by that very fact, have failed to accom- 
plish the purpose of those who originated the 
series. This contribution to the series probably 
does what it was supposed to do. It provides a 
clear unworrisome discussion of a minimum list 
of implications which, even when trite, can bear 
restatement. 


For an additional review, see 3:835. 
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Davis, ЕвкрЕвїск B., Ерток. The AAF Qualifying 
Examination, Army Air Forces Aviation Psychology 
Program Research Reports, Report No. 6. Washing- 
ton, D.C.: U.S. Government Printing Office, 1947. Pp. 
xvii, 266. Out of print. * (PA 21:3711) 

Psychol В 46:508—10 N ’49. Herbert S. Con- 
rad. * 'The volume is a meaty one, as evidenced 
by the 143 numbered tables. No thoughtful 
reader can plow through this book in an odd 
afternoon; nor can any conscientious reviewer 
hope to do it justice within a few pages. In brief 
overview, what do we learn? * Some more or 
less disturbing observations...may be worth 
mentioning: (1) Correlation coefficients tended 
to vary more than could be accounted for by 
random fluctuations of sampling; discourag- 
ingly, the variations were frequently in the 
“wrong” direction (i.e., the validity correlations 
turned out to be “lower....than substantial 
amounts of data had forecast....” (p. 39), or 
“the promising correlations obtained in earlier 
samples were not confirmed” (p. 187). A par- 
ticularly interesting example of excessive varia- 
tion is pointed out by the author on pp. 191—192; 
for other instances, see pp. 65, 70 (Table 5.10), 
123, 156, and 245-252. The author does not 
undertake to explain or track down the possible 
causes of variation. (2) For one reason or an- 
other—almost certainly practical, and not due 
to oversight by the author—certain desirable 
data were either not collected or not analyzed. 
Thus, applicants for aviation-cadet training 
could take the Qualifying Examination repeat- 
edly, “providing only that 30 days elapsed be- 
tween each testing, and that different forms of 
the examination were used” (p. 1) ; yet no data 
are reported on practice-effects. Similarly, no 
data are reported on the correlation between suc- 
cessive forms of the Qualifying Examination. 
Rather often, in the case of a research sample, 
the answer-sheets containing the necessary basic 
data became unavailable before statistical analy- 
ses could be completed—so that while validity 
coefficients were computed for a given sam- 
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ple, reliability coefficients and intercorrelations 
(among the predictor-variables) had to be com- 
puted for a substitute-sample of imperfect com. 
parability. Another shortcoming is the rather 
frequent calculation of validity-data for the indi- 
vidual parts of a battery of tests, but not for the 
total score on the battery. (3) A capital lack, in 
the reviewer's judgment, is the failure to explain 
in detail whatever strengths or weaknesses the 
criterion may have held. ( Not even an estimated 
reliability coefficient is offered.) The criterion 
generally employed is simply “graduation or 
elimination from elementary pilot training for 
reasons associated with proficiency in flying" 
(p. 2). Such a criterion seems extremely gen- 
eral or non-specific: not even the "academic" 
aspects of training are separated from the non- 
academic. Specific quantitative criteria would 
be more useful, not only for the validation and 
analysis of tests, but probably also as a means 
of stimulating better observation and evaluation 
by pilot-instructors. Granted that practical cir- 
cumstances prevented the use of anything buta 
simple pass-fail criterion, some brief acknowl- 
edgment and warning of the limitations of the 
criterion would still have been in order. (4) A- 
though the Qualifying Examination was vali- 
dated against the pilot criterion, this same ex- 
amination had to be passed also by prospective 
bombardiers, navigators, and (after June 1, 
1944) aerial gunners. The administrative соп- 
venience of a single screening test is obvious; 
on the other hand, it seems likely that the vat- 
ious groups of flying personnel have differenti- 
able aptitudes (see e.g., Table 11.2 on p. 220). 
(This, of course, is a criticism of an administra" 
tive practice, rather than of the present research 
report.) * There can be little doubt that the 
last series of the Qualifying Examination (the 
AC14 series) was more valid than the first ш 
АСто series). Because of the almost inevitable 
lack of comparability between the figures given 
for the first and last form, it is not possible P 
estimate exactly how much improvement in và 


ad ) à 3: 
lidity was accomplished ; the reviewer's va 
tive judgment is about .20 point of corre 

n. Part 0 


This appears a remarkably large gain. ган 
the gain may conceivably be due to pod 
ment in the reliability or validity of the criteri Tn 
the reviewer has no information about this. i 
conclusion, the reviewer would like to js 
what was stated at the outset: this book 18 


Я 0 
deed a solid, meaty work. The exigencies 
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wartime research have, to be sure, prevented as 
complete and close knit a series of studies as one 
might desire. But the author has striven con- 
gcientiously to present the information (both 
favorable and unfavorable) that has been 
gleaned. The task was immense, and the re- 
viewer cannot close the book without a profound 
feeling of gratitude for a significant accomplish- 
ment. The volume makes contributions both of 
methods and results. 

Psychometrika 13:284-5 D ^48. Andrew L. 
Comrey. * the AAF Qualifying Examination 
„has properly been characterized....as the most 
important single examination used by the Army 
Air Forces during World War II. * The prob- 
lems and techniques involved in the selection of 
appropriate kinds of items for the examination 
are discussed in conjunction with a short but 
excellent chapter devoted to such problems in 
test theory as item construction, item difficulty, 
item validity, intercorrelation of items, test pu- 
rity, and test reliability. Many technical prob- 
lems of statistics in relation to test construction 
and evaluation are discussed throughout the 
book, These discussions include some modifica- 
tions of older techniques and some new meth- 
ods such as one for determining the extent to 
which tests selected for their usefulness in con- 
junction with an established composite aptitude 
rating will increase the correlation between the 
aptitude rating and the criterion. * The discus- 
sion of factor techniques is almost entirely lim- 
ited to Kelley’s principal-axis method, which 
was applied to certain tests of practical judg- 
ment, reasoning, and mechanical comprehension. 
The author fails to indicate the limitations of 
z method and the advantages of others for 
"s purpose of discovering the underlying psy- 

ological variables involved in test perform- 
ШО Psychometricians will be interested in 
E andling of important statistical problems 
е in developing this examination as 
м. аз in the information reported concerning 
feat important mental skills. Psychologists 
i with problems of test construction for se- 
Чоп in purposes will find much useful informa- 

n this book. 


aM an additional review, see 3:836; for a 
"ew of the series, see B2. 
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and TO Watrer L., Jr, Eprror. Records, Analysis, 
Psych fae Procedures. Army Air Forces Aviation 

ology Program Research Reports, Report No. 18. 
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Washington, D.C.: U.S. inti 
Meee ca Өү КААН (EA IMMO E 

J Consult Psychol 12:124 Mr-Ap '48. Lau- 
rance F. Shaffer. This volume is a detailed re- 
port of the procedures used in the testing of 
hundreds of thousands of air crew candidates 
during the war, with emphasis on the keeping 
of records and methods of statistical analysis. 
Because of the magnitude and complexity of 
the Air Forces psychological program, prob- 
lems of the standardization of procedures arid of 
the compiling and analysis of records were en- 
countered on a scale never before known to 
psychology. Part I describes the standardiza- 
tion of testing procedures. Part II gives the 
machine (IBM) records in exhaustive detail, 
including, reproductions of forms, of machine 
writing diagrams, etc. Part III is on statistical 
analysis, including calculating machine proce- 
dures. 

Psychol В 46:526-7 N’49. E. К. Henry. This 
volume is probably the least interesting reading 
and at the same time the most important of the 
entire series of AAF Aviation Psychology Re- 
search Reports. * Part II, devoted, to “Records 
Procedures,” contains an excellent detailed his- 
torical description of the record keeping systems 
employed and should be useful to anyone desir- 
ing to use the records for research purposes in 
the future. * The volume is profusely illustrated 
with reproductions of record forms, punch 
cards, and pictures. This volume will be of in- 
estimable value to researchers of the future who 
may wish to make special studies using data al- 
ready available because of the minutely detailed 
descriptions of procédures and methods of rec- 


ord keeping. 
For areview of the series, see B2. 
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XDEPARTMENT OF THE Army, THE ADJUTANT GEN- 
ERAL’S OFFICE, PERSONNEL RESEARCH SECTION. Stud- 
ies in Visual Acuity. PRS Report No. 0. Wash- 
ington, D.C.: U.S. Government Printing ce, 1948. 
Pp. viii, 161. $1.00. * (PA 23: 2084, 24: 1636) 

[B115] P 

. Introduction to the Szondi Test: 

Ac Sus" Practice. Foreword by Lipot Szondi. 
Grune & Stratton, Inc., 1949. Pp. xiv, 354- 
$5.00. * (PA 23: 3748) For reviews, see 135. 1 


ism logical Test 

, SALVATORE С. Psychological Tests 

ID Adults in Vocational Rehabilitation. 
Foreword by Michael J. Shortley. Federal Security 
Agency, Office of Vocational Rehabilitation, Rehabili- 
tation Standards Division, Advisement, Training and 
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Placement Section, Rehabilitation Service Series No. 
29. Washington, D.C.: Office of Vocational Rehabili- 
tation, Federal Security Agency, May 1, 1947. Pp. ii, 
xiv. Paper. Gratis. * 
[ B117] 

*DowNAHUE, Wima T.; Соомвѕ, Суре H.; AND 
Travers, Rosert M. W. The Measurement of Stu- 
dent Adjustment and Achievement. Forewords by 
{ше P. Adams and Clark Tibbitts. Contributions 

rom the Institute for Human Adjustment, Bureau of 
Psychological Services, University of Michigan. Ann 
Arbor, Mich.: University of Michigan Press, 1949. Pp. 
xiv, 256. Out of print. * (РА 23:6416) 

J Higher Ed 21:220 Ap'50. Neal B. Andregg. 
* scholarly papers * Symonds and....Sherman 
describe a personality survey of a junior high 
school which might well give guidance workers 
leads for studying and improving their own 
school programs * This book is recommended 
as a valuable reference for workers in the field 
of educational measurement who are responsible 
for the guidance of students. 

Jun Col J 20:366—70 Е '50. L. A. Rutledge. * 
A timely warning is given with regard to the 
use of tests in the guidance program. * In view 
of the acknowledged paucity of effective guid- 
ance programs in junior colleges, teachers, 
counselors, and administrators should note with 
appreciation the efforts of the authors of this 
volume to raise the guidance program and those 
connected with it to the level of professional re- 
spectability and their suggestion of a specific 
training program to prepare those who expect 
to do guidance work. * 

Psychol B 47:181-2 Mr ’50. Harold Sea- 
shore. * In contrast with many educational con- 
ferences on the subject of guidance, the lecturers 
apparently were asked to assume reasonable 
training on the part of all members of the audi- 
ence so that the content of the lectures would be 
ata much higher level than is common. This vol- 
ume presents these lectures. * Travers....reviews 
research on the prediction of academic success. 
This chapter has a bibliography of 272 items 
which ought to be of interest to persons working 
in this field. * Dearborn delves back into his rich 
experience and comes up with some notes on 
the relation of a student's background to his 
School success. In the anecdotal manner, but 
quite pointedly, Dearborn extracts several ideas 
which should make those that think they know 
all the answers a little less smug. David Trout 
reviews attempts to correlate academic achieve- 
ment and subsequent success in life. * One of 
the most important points he makes is that col- 
leges and schools might do well to make re- 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 910 


searches as to why persons who fail in their 
institutions somehow succeed in life, * Coombs 
«emphasizes a new slant in the measurement 
of psychological traits and outlines a program 
of research to investigate his ideas * Two per- 
sons can have the same status score on the basis 
of passing different sets of items in the test 
used. Coombs suggests also that a dispersion 
score be developed as a meaningful concept to 
help describe persons. * The writer...heard 
all of these lectures and talked with both trained 
psychologists and educators and with graduate 
students and teachers in attendance. His im- 
pression was that such persons appreciate a 
conference in which the level of presentation is 
kept high enough to be stimulating to all and 
which gives one a feeling of having moved for- 
ward in one's own development. Naturally, not 
all of these chapters are major contributions, 
but there is enough in this book to warrant care- 
ful reading by specialists and practitioners in 
the field of measurement. 


[ B118] 

X*DoNAHUE, WILMA, AND DABELSTEIN, Donato, Eni- 
tors. Psychological Diagnosis and Counseling of 
the Adult Blind: Selected Papers From the Pro- 
ceedings of the University of Michigan Confer- 
ence for the Blind, 1947. New York: American 
Foundation for the Blind, Inc., 1950. Pp. vii, 173. $2.00. 
* (PA 26: 493) 

Outlook for Blind 44:268-9 N ’50. Kathryn 
E. Maxfield. * significant contributions to 
knowledge regarding problems associated with 
blindness. These papers....point up the need for 
a well-planned, integrated, yet broadly vus 
ceived, research program in the psychologic 
evaluation and adjustment of the blind. n 
portant reading for those whose job it is to help 
blind adults with their problems of personal P 
vocational adjustment, even though some uh 
more highly technical papers will be skippe ш 
social workers and others whose урш 
follow other paths. The book is of vos ie 
rect consequence to the trained psycho нд 
and vocational counselor but it should Bee 
thoughtful attention from graduate тш p х 
paring for work with the handicapped. aa 
does not permit discussing each of the uin 
stimulating, challenging papers here pres gy 
To this reviewer the most satisfying 85р° i 
the book as a whole is its freedom HR ү ^ 
type and superficiality. There is a rea du p 
acknowledge the present dearth of Jaco Ир 
chological techniques for diagnosis an po | 
ing for this particular group of the hand! 


gi ] BOOKS & REVIEWS: DONAHUE 
The magnitude of the task involved in adequate 
research is recognized. Suggestions are advanced 
loking toward the organization and imple- 
mentation of dynamic and original research. It 
is greatly to be hoped that the National Psycho- 
logical Research Council for the Blind which 
had its birth in the Michigan conference will re- 
ceive the support it merits, both professionally 
and financially. 
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XDoonzn, M. ]о5ЕРН, AND MARQUIS, VIVIENNE; EDI- 
rors. Rating Employee and Supervisory Perform- 
ance: А Manual of Merit Rating Techniques. 
pe by ewes i eee New York: Ameri- 
E n" ga euer ion, 1950. Pp. 192. Paper. 
B Ind Psychol & Personnel Prac 7:40-1 Je 
^51, M. Bucklow. * Persons with an interest of 
any kind in rating will find this book stimulat- 
ing and valuable, for the opportunity to read at 
one time a series of papers of such variety of 
approach and content must necessarily provide 
| new insights and suggestions. The papers gen- 
erally adopt a critical and cautionary attitude 
|. to rating problems, and the emphasis through- 
| Out is on the need to develop techniques which 
|. Will produce the most objective judgments. The 
book also provides useful background informa- 
tion, particularly in the article "Developing an 
| Employee Merit Rating Procedure" by Reign 
Bittner. In the article on the forced-choice per- 
formance reports it introduces a new develop- 
mént and in the final part provides examples of 

à number of different kinds of rating forms. 
Occupational Psychol 25:69-70 Ja *51. J. D. 
Handyside. * consists of reprints of papers by 
fourteen authors * As a consequence the book is 
somewhat repetitive on several issues and curi- 
ously inconsistent on others. It is not written 
Primarily for industrial psychologists, but to 
tons an introduction to rating techniques for 
dii Pw highly specialized practitioners of in- 
if» management, * Welcome emphasis is 
alia placed on the importance of a careful job 
Eres before any rating form is drawn up, m 
ЖЫП. to ensure that the traits included are 
d associated with the job. It is in the dis- 
il lon of the traits to be rated that the authors 
h w most inconsistency. Not sufficient em- 
Wed is placed on the need for rating on the 
ái one number of relevant independent traits 
nif е references to factorial studies are not 
| ciently detailed. There is а rather unsound 
mil of the “halo effect" and it is not made 
ar that effort to avoid halo should in the first 
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place be directed to the problem of designing 
the rating form in such a way that the scales 
relate to genuinely uncorrelated traits. The best 
paper on this topic is the first in the book, in 
which it is recommended that the rating form 
should have two scales only, for job perform- 
ance and for “promotability” * The sections on 
validation of ratings are more than disappoint- 
ing. It is, of course, true that the problem is a 
very difficult one, and that where objective in- 
dices are available against which to validate rat- 
ings there is in fact no point in setting up a 
rating programme. But this is no excuse for 
suggesting that the conformity of the ratings 
to a normal distribution is in any way an indi- 
cator of their validity. There is a short discus- 
sion of the forced choice type of rating form and 
of its merit in reducing the effects of conscious 
insincerity on the part of the rater, e.g. in the 
Army where an unsatisfactory officer rated 
highly in order to accelerate his promotion out 
of the unit. However, this paper is too short to 
explain the severe practical limitations of this 
rating method, its need for highly trained statis- 
ticians in its design and the infrequency of situa- 
tions in which jobs are sufficiently uniform in 
their demands to render the method valid at an 
acceptable level. * The book is important mainly 
because of its insistence on the duty of inform- 
ing employees about their ratings and the rea- 
sons for them by means of a carefully planned 
interview between the employee and his im- 
mediate supervisor who has carried out the 
rating. This is obviously a valuable if somewhat 
dangerous technique for the development- of 
improved industrial relations and for assisting 
the supervisor to carry out his responsibilities. 
It is clear that the use of this procedure calls 
for high qualities of social skills in the super- 
visory grades. The manual however, despite its 
title, unfortunately says nothing specific about 
the evaluation of supervisory performance. 


[ B120] Wia } 
ADOPPELT, JEROME EDWARD. The Organization о 
Mental Ates in the Age Range 13 to 17, Co- 
lumbia University, Teachers College, Contributions to 
Education, No. 962. New York: Bureau of Publica- 
tions, the College, 1950. Pp. х, 86. $2.10. * (PA 24: 
5111) 

Am J Psychol 64:157-8 Ja '51. Another step 
toward resolution of Garrett's proposal that the . 
integration of special abilities into “the general 
factor” (in this study determined by “power 
tests) changes with advancing age. * Results 
from a factor analysis (Hotelling’s method of 
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principal components) indicate that, under cer- 
tain specified conditions, "the general factor 
tends to maintain, rather than to lose, its im- 
portance as age increases." This fails to support 
the Garrett "hypothesis" of “а general break- 
down of an amorphous general ability into a 
group of fairly distinct aptitudes." A substitu- 
tion of experimental, in place of testing, methods, 
with trained observers, might well add a great 
deal of significant coloring to this statistical out- 
come. 


[B121] 
*Dorcus, Roy M., AND JONES, MARGARET HUBBARD. 
Handbook of Employee Selection. New York: 
McGraw-Hill Book Co., Inc., 1950. Pp. xv, 349. $4.50. 
* (London: McGraw-Hill Publishing Co., Ltd. 38s. 
64.) (РА 24: 5488) 

Am J Psychol 64:158 Ja "51. * The general 
index of tests (over 300 listed) taken together 
with the “jobs” to which the tests are applied 
(about 225) gives an intimation of range and 
variety among the arts and trades of practice 
within which the testing profession makes its 
living. The job-list is itself impressive. Here are 
entries under three letters of the alphabet. Let- 
ter I: inspectors (13 kinds) and insulating ma- 
chine operators. Letter F: fabricators (2 kinds), 
finishers, firemen, fitter's apprentices, fixers, fly 
makers, food processors, foremen, forge shop 
employees, foundry machine moulders, foundry 
workers, freight solicitors. Letter W: water- 
meter readers, weavers, welders, winders (2 
kinds), woodmaker’s apprentices, wrappers * 

B Ind Psychol & Personnel Prac 6:47-8 D 
*50. M. N. Oxlade. Psychologists and personnel 
officers who are concerned with the selection of 
employees by means of psychological tests should 
find this handbook extremely useful. * a valuable 
reference book on employee selection. * many 
of the weaknesses in the references examined 
are discussed in the introduction. Anyone who is 
about to undertake an investigation into the use 
of selection tests would be well advised to read 
this introduction, which considers such problems 
as the effect on correlation co-efficients of small 
numbers of subjects, the use of biassed samples, 
and the need to consider the length of serv- 
ice of an employee when determining criterion 
value. There is a timely warning, too, of the 
danger of using stock selection procedures or 
techniques developed by other firms without 
first making a careful check of their predictive 
efficiency in one’s own firm, 

Ed Adm & Sup 37:383-4 O ’51. A. S. Ed- 
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wards. * The book presents ап immense amount 
of work and the gathering together of widely 
scattered material, much of it with great dif. 
ficulty. Not only does it save time and help di- 
rect the efforts of those who wish to use the 
best and most available test, but from another 
point of view, there is indication of the many 
gaps in our present knowledge of testing for 
employee selection. * This scholarly work also 
includes excellent advice in the introduction * 
The reader is told....that care must be taken in 
the selection and application of tests; warning 
is especially given that one should not merely 
attempt to follow the method of another com- 
pany but should take previous work as a start- 
ing point to determine the tests properly avail- 
able for the problem at hand. “There is no easy 
way to obtain an adequate selection program." 
The authors are to be congratulated upon this 
valuable and useful publication. For certain 
courses in testing the book will be at least a 
reference of great value. It is hoped that un- 
trained people will not assume that they can 
consult it over night and be prepared in the 
morning to set up a testing program. 

J Appl Psychol 35:81-2 Е ’51. Harold F. 
Rothe. * the book satisfies a need and fulfills the 
authors' intentions in an admirable manner s 
Although the abstracts are non-critical, rigid 
requirements had to be met for inclusion in this 
volume. * The book may, of course be criticized 
for having omitted many references. On the 
other hand, the authors may be commended oa 
having done an excellent job of what they starte 


to do. Practically all of the useful references аге 


included. And the elimination of scholastic and 
military studies does make the book more useít 
for those who are interested primarily in m 
dustrial testing. * The elimination of about d 
per cent of the references that the authors _ 
vestigated is a reflection upon the шай? 2) 
original papers rather than upon the аш tiat 
selection of papers. Another criticism 1$ E 
there is not enough detail, particularly pe it 
ing the criteria used. For example, where cds 
data are used as the criteria, the ore a 
want to know just what kind of data. : a. 
incentive system in operation on the jo 55 
cerned? Were the operators producer. 
high or low rate, for their industry Can we 
what conditions were ratings obtained? Ea 
be sure the ratings are valid? But once 3 
these criticisms are not very practical D. wert 
the answers to all questions of this kin 
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demanded, the book could probably not be writ- 
ten, Or at least it would be only a pamphlet and 
mot a usable book. The third possible criticism 
is again a weakness of published results and 
not of this book. That is, an inexperienced reader 
might conclude that tests always “work.” There 
are few studies that report low or useless validi- 
ties. This is again more truly a criticism of the 
original papers and not of this book. In con- 
clusion then, this book is an excellent summary 
ofthe available sources. It is an extremely handy 
reference book that will provide a convenient 
first step for anyone interested in seeing what 
has been done in using tests for selected indus- 
trial jobs. It is heartily recommended as such. 

J Consult Psychol 14:416 О "50. Laurance 
F. Shaffer. This unusual book consists of uni- 
formly arranged abstracts of 426 research stud- 
ies on the selection of workers for specific jobs. * 
Although the scope of the volume may not 
justify calling it а "handbook," it will prove a 
convenient reference source for workers, teach- 
ers and students in the field of vocational psy- 
chology. 

J Ed Psychol 41:502 D '5o. C. M. Louttit. * 
To be included...a report had to meet five cri- 
teria: the type of employee specified ; the num- 
ber of subjects given; tests used named or de- 
scribed ; the criterion of job proficiency explic- 
itly stated ; and the actual (quantitative) results 
obtained reported. It is an interesting commen- 
fary on research in this field that barely one- 
fifth of the reports examined could meet these 
minimal criteria. * This volume should prove 
useful for personnel administrators and for vo- 
cational counselors. 

Occupational Psychol 25:69 Ja '51. N. F. 
Kristy. This should prove to be a most useful 
hook for everyone interested in personnel selec- 
tion. * The standards which they set certainly 
do not seem to be excessively high. * Though it 
would be difficult to challenge with more than 
Personal bias the rules by which the authors 
Selected the studies for inclusion, the handling 
of the included material deserves some comment. 
Perhaps the usefulness of the book would have 
been unaffected had the authors eliminated the 
least interesting of the earlier studies. Also, 
many personnel selection people may feel that 

е abstracts are too short and do not provide 
enough information to preclude the necessity of 
referring to original sources. More information 
Would have been valuable concerning the selec- 
tion and differentiation of subjects, the way 1n 


Biz 


which tests were used, and finally the way in 
which the criterion was developed and corre- 
lated to the other data, The volume contains 
also a job index which permits quick reference 
to all studies relating to a particular job, the 
usual author index, and an index of the tests 
used in the studies, The authors promise fur- 
ther editions of this useful handbook whenever 
the accumulation of new research makes this 
advisable. 

Occupations 29:150-1 N ’50. Joseph Tiffin. * 
gives in succinct abstract form the essential data 
from 426 studies on the validation of personnel 
tests. It is “intended as a reference book for 
personnel executives, psychologists, vocational 
advisers, psychology students, and other persons 
interested in the selection of employees by means 
of psychological tests.” The book accomplishes 


' this objective in a very creditable manner. * The 


abstracts are not evaluative. * This volume will 
be a very serviceable addition to books kept 
“within reach” by the personnel research worker 
in business or industry. Certain criteria of mini- 
mum adequacy were established by the authors 
and only articles meeting these criteria were ab- 
stracted. These criteria were that the type of 
employee investigated must be specified; the 
number of subjects must be stated; the tests 
used must be specified ; the criterion of job pro- 
ficiency must be stated explicitly, and the ac- 
tual results obtained must be reported. * Even 
though such articles evidently were given the 
benefit of the doubt, it is interesting to note that 
of the 2,100 references examined, the criteria 
were met by only 426, i.e., 20 per cent. (The in- 
troduction states 427 but only 426 are included 
in the book.) Calling attention to this rather 
glaring inadequacy in the case of 80 per cent of 
published articles in the field may result in more 
carefully prepared research reports in the fu- 
ture. * The authors deserve much credit for or- 
ganizing and assembling in this volume a great 
deal of material that previously has been avail- 
able only in original articles, many of which 
often are not conveniently available. 

Personnel J 29:389-90 Mr Ara E: W. Fass- 
berg. This is a book for which psychologists in 
industry, personnel research workers and other 
persons interested in the use of tests in selecting 


employees will have frequent use. * Some may 


feel the greatest service the authors have ren- 


dered is to obviate the need for going to certain 
erences one might have in mind, for 


original ref e in mint 
an article that fails to meet. their criteria for 
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inclusion in this Handbook would be of doubtful 
value. It is astounding, indeed, that the authors' 
investigation revealed that, of 2,100 references 
they examined, only 427 met these criteria of 
minimum adequacy. More careful reporting of 
research in psychological testing is needed. 
Personnel Psychol 3:380-4 au ' 50. E. Donald 
Sisson. * a compendium of statistical findings 
relating to the validity of tests in actual use. 
Even this purpose is further narrowed to the 
validity of tests in the selection of employees; 
thus omitted are data on military jobs and mili- 
tary personnel, and on students, trainees and 
'apprentices. * Even with this narrower purpose 
of reporting objective findings relating to the 
validity of tests in the selection of employees, this 
reviewer was perhaps naive enough to antici- 
pate a truly tremendous volume of encyclopedic 
proportions. Purporting to cover all published 
studies, domestic and foreign, and a number of 
unpublished studies as well, from the birth of 
applied psychology up through the year 1948, 
the final number of entries, 426, does seem 
rather bleak—especially when it is considered 
that, even if all studies reported were American, 
it would mean less than two per current mem- 
ber of the Division of Industrial and Business 
Psychology. This relatively unproductive record 
of "pay off" studies in this area is an indictment 
which is not mitigated by the fact that a consid- 
erably larger number of studies was screened 
(more than 2100, all told), and only these 426 
measured up to the standards for inclusion. For 
these standards of minimum adequacy were not 
exacting. In the words of the authors, a study 
to be included must: (1) "specify to some de- 
gree the type of employee investigated"; (2) 
state “the number of subjects used" ; (3) specify 
"tests which were used as selective devices"; 
(4) state explicitly "the criterion of job pro- 
ficiency used”; and (5) report “the actual re- 
sults obtained." That such a large number of 
the references reviewed failed to conform to 
these elementary requirements of scientific re- 
porting is professionally embarrassing, to say 
the least. * in only about one of five references 
included is there any information reported about 
the reliability of the criteria of job success used. 
The apparent willingness of investigators to 
accept criterion measures—ratings, sales, pro- 
duction—without critical examination does not 
seem justified, especially since, in the small mi- 
nority of studies which reported such informa- 
tion, criterion reliability generally falls in the 
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-50-.75 range. In organizing their material, the 
authors of the Handbook presumably had sey. 
eral possibilities for arrangement of the abstracts 
—by type of job, by test, alphabetically by ay: 
thor, or chronologically by date of publication, - 
For some reason they chose the last of these, and” 
provided indices of the others—jobs, tests and 
authors—with references to the serially num- 
bered abstracts. While the arrangement dis- 
courages complete reading or study of the vol- 
ume, since there is little interest or coherence in - 
a temporal succession of studies of different jobs 
and different tests, it is obvious that such use 
of the book is not intended. As a reference work, 
however, it is a mine of information. Unfortu- 
nately the rich deposit is not always easily ac- 
cessible. The prospector will still have to do a 
lot of digging. The job index is most helpful in 
this regard. Where the user's purpose is to dis- 
cover what tests, if any, have been tried with 
various types of employees, he will find informa- 
tion, some very sketchy, to be sure, on over 200 
jobs. The other two indexes are somewhat less 
useful, particularly the test index. The user can 
look ир specific tests, if he knows their complete 
designations, and discover with what types of 
employees they have been tried and with what 
results. But if he wants to know what experience 
has demonstrated about the value of biographical 
information blanks in selecting for various jobs, 
where intelligence tests work and where they 
don't, or how valuable the newer projective 
techniques are when subjected to the critical 
test of validation, he will not find this volume 
very helpful. A listing of tests by broad cate- 
gories, rather than alphabetically by title, might 
have been of more value. * The book merits, ant 
will no doubt receive, much more than shelf- 
space. It can serve as a practical source of spe- 
cific information ; and it can serve as a а 
and a guide to more research. Researchers W | 
fail to find in it what they look for will do y 
to consider that all of its contents and theretor 
all of its major limitations are theirs. am 
Psychol В 48:537-8 N ’51. Carol 5. Bello ^ 
and Roger M. Bellows. * A few of the ше 
аге from foreign publications. The пен 5 
these latter is somewhat open to question: tific 
difficult to judge the applicability of a pon? 
procedure or test battery from one indus 
situation to another ; this difficulty is таг 


kedly 
ke suc 
other. How- 


j untry to an 
judgment from one country x research 


ever, it is of interest to the person 
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iker to see what has been done outside of th 
United States in personnel testing. * Studies 
conducted by such competent workers in the 
field of industrial psychology as Ghiselli, Harrell, 
Tifin and Viteles are well represented. One feels 
^ a bit uncomfortable, however, in the presence 
of the apparently glittering results obtained by 
Chapple with his Interaction Chronograph, re- 
sults which other workers have failed to dup- 
liate following Chapple’s procedures. In in- 
stances of this sort one wishes that the authors 
“had not adhered so uniformly to their objective 
- of noncritical abstracting. The general format 
of the book is good. * Three good indices should 
aid the user of this book in finding his way 
around. * This book meets a need of workers in 
personnel psychology, particularly those in busi- 
ness and industrial concerns who do not have 
constant and immediate access to a well-stocked 
library. Also, as the authors point out in the 
Preface, "literature in this field is so widely 
scattered that even in the libraries of large uni- 
versities it is impossible to find many of the 
references.” A real service has been done in 
bringing these source materials together. 


wo! 


[B122] 
*DresseL, Paur L., Anp SCHMID, Јонм. Ап Evalua- 
tion of the Tests of General Educational Devel- 
opment. Foreword by George F. Zook. Washington, 
D.C.: American Council on Education, 1951. Pp. x, 57. 
Paper, $1.00. * 
[8123] 

DvBors, Pair H., Eprron. The Classification Pro- 
gram. Army Air Forces Aviation Psychology Pro- 
be Research Reports, Report No. 2. Washington, 
.C.: U.S. Government Printing Office, 1947. Pp. xiv, 
394. Out of print. * (PA 22: 4143) 
sat Consult Psychol 12:60 Ja-F ’48. Laurance 

: Shaffer. The major task of the AAF psy- 
chologists, from 1941 to 1945, was the selection 
and classification of personnel for air crew du- 
ties, This basic report describes the history and 
organization of the program, the batteries of 
tests used, and the results of four years of re- 
search on the validity of the procedures. Of 
Special interest is the complete account of “the 
experimental group” of 1300 men placed in pilot 
anne in 1943 without psychological selec- 
ion, to test the validity of the procedures, free 
m the influence of the curtailment of range 
á talent. There are also sections on special ac- 
Wities, including the setting up of classification 
ee for the Free French and Philippine Air 
© orces, The volume is richly illustrated by charts 
nd tables, Appendices give tables of testing sta- 


tistics and a list of the more than one thousand 
officers and enlisted men who worked in the 
classification program. № 

Psychol B 46:501-3 № '49. Roger M. Bellows. 
* an important historical document * It is im- 
portant because some of the mistakes made in 
experimental design can, if viewed critically and 
constructively, point the way to more adequate 
research development of differential selection 
tools. (The book being reviewed does not, how- 
ever, look critically upon the experimental de- 
signs employed in the studies.) Furthermore, 
the report is important for the implications 
it has for organization and implementation of 
needed personnel research programs of large 
scope in the area of industrial as well as mili- 
tary personnel. * During most of the period the 
Air-Crew Classification Test Battery included 
approximately 14 objective-type printed tests 
and six apparatus tests. Of greatest interest to 
many personnel technicians is the manner of 
weighting the many tests comprising the classi- 
fication battery * The systems of weighting for 
differential prediction were, for the most part, 
quite arbitrary. It is here that a new look at 
experimental design can perhaps be of greatest 
value. * One of the crucial indices of effective- 
ness of a differential classification battery (as 
contrasted to one designed for mere selection) 
would seem to be validity of a battery weighted 
for one specialty for another specialty. For ex- 
ample, how well does the pilot battery work for 
bombardier, the navigator battery for pilot, the 
bombardier battery for navigator, etc. Judging 
from the results presented the answer appears 
to be that the battery has higher validity for 
the specialty for which it was designed. How- 
ever the critical observer may ask for: (1) an 
experimental design specifically aimed at evi- 
dence for or against the efficacy of a battery in 
differential prediction as contrasted to selec- 
tion; and (2) safeguards against artificial con- 
tamination of the criterion. Did the trainee 
know his stanine? Did the instructor know the 
trainee's stanine? If so, did such knowledge in- 
fluence the criteria of elimination for flying 
deficiency, for fear, or for elimination at the 
trainee's own request? One table of information 
is presented in chapter four that is particularly 
damaging to the belief that stanines have con- 
siderable power in differential prediction as con~ 
trasted to mere selection. This information per- 
tains to the validity of various stanines in pre- 
dicting scores of trainees in pistol and carbine 
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fire! For 350 cases the various air crew stanines 
were found to correlate from .25 to .40 when 
correlated with performance on the ground with 
the pistol, and from .21 to .31 with target per- 
formance with the carbine. The author refers to 
these results as evidence of "the versatility of 
the stanines" (p. 179). Based on statistical and 
clinical approaches, Ch. 5 tells what happened to 
1,311 air crew trainees who had been given tests 
but who were not screened by tests. For this ex- 
perimental group the pilot stanine correlated .66 
with the criterion. For the preliminary qualify- 
ing examination “only 45 out of 520 men who 
failed the examination were graduated from 
training, whereas the yield among those passing 
the examination was 211 out of 751” (p. 201). 
The study reported in this chapter is one of the 
most interesting to come out of wartime per- 
sonnel research. 


For a review of the series, see Вг. 


[8124] 
*DunaAN, June С. MacQuarrie Test for Mechani- 
cal Ability. Summary of Investigations, No. 2. Los 
Angeles, Calif.: California Test Bureau, 1950. Pp. 16. 
Paper. Gratis, * 

[ B125] 
*Dwyer, M. THEOPHANE. А Sociometric Study 
Among a Selected Group of Students in Nursing. 
Washington, D.C.: Catholic University of America 
Press, Inc., 1947. Pp. ix, 46. Paper. $1.00. * (РА 22: 
4670) 

[ B126] 
*EpucATIoNAL Recorps Bureau. 1948 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Foreword by Ben D. Wood. 
Educational Records Bulletin, No. 5o. New York: the 
Bureau, July 1948. Pp. xiii, 86. Paper, lithotyped. $2.00. 
* (PA 23: 300, 312-3, 314) 


[ B127] 
*EpucaTionaL Recorps Bureau. 1949 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Foreword by Ben D. Wood. 
Educational Records Bulletin, No. 52. New York: the 
Bureau, July 1949. Pp. xiii} 87. Paper, lithotyped. $2.00. 
* (PA 24:748, 750, 756, 759) 


[ B128] 
*EpucationaL Recorps Bureau. 1950 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Foreword by Ben D. Wood. 
Educational Records Bulletin, No. 54. New York: the 
Bureau, July 1950. Pp. xiii, 119. Paper, lithotyped. 
$2.00. * (РА 25: 567, 569, 590-2) 


[ B129] 
*EpucATIoNAL Recorps Bureau. 1951 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Foreword by Ben D. Wood. 
Educational Records Bulletin, No. 57. New York: the 
ae July 1951. Pp. xiii, 85. Paper, lithotyped. 
2.00. 
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[ B130] 
X*EpucATIONAL Recorps Bureau. 1947 Fall i 
Program in Independent Schools de senting 
mentary Studies. Foreword by Ben D. Wood. Edu. 
cational Records Bulletin, No. 49. New York: the Bu. 
reau, February 1948. Pp. xii, 69. Paper, lithotyped, 
Out of print. * (РА 22: 3281, 3644, 3648-9) 

[ B131] 
*EpucationaL Recorps Bureau, 1948 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Foreword by Ben D. Wood. Edu- 
cational Records Bulletin, No. 51. New York: the Bu- 
reau, January 1949. Pp. xiii, 72. Paper, lithotyped. 
$1.50. * (PA 23: 3422-3, 3446-7, 3452-3) 


[ B132] 
*EpucationaL Recorps Bureau. 1949 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Foreword by Ben D. Wood. Edu- 
cational Records Bulletin, No. 53. New York: the Bu- 
reau, January 1950. Pp. xiii, 70. Paper, lithotyped. 
$1.50. * (PA 24: 3894-5, 3897-8) 


[ B133] 
*EpucATIONAL Recorps Bureau. 1950 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Foreword by Ben D. Wood. Edu- 
cational Records Bulletin, No. 56. New York: the Bu- 
reau, January 1951. Pp. xiii, 89. Paper, lithotyped. 
$1.50. * (PA 25: 6416, 6442, 6477, 6479, 6482) 


[8134] 
*EnpucATIONAL Recorps Bureau. 1951 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Foreword by Ben D. Wood. Edu- 
cational Records Bulletin No. 58. New York: the Ви- 
A February 1952. Pp. xi, 86. Paper, lithotyped. 
1.50. 


[ B135] 
*EpucaTIONAL TESTING SERVICE, HENRY CHAUNCEY, 
Рвеѕремт. Annual Report to the Board of Trus- 
tees, 1948-49. Princeton, N.J.: the Service, [1950]: 
Pp. 40. Paper. Gratis. * (PA 24: 4364) 


[ B136] 
XEpucATIONAL Trstinc Service, Henry CHAUNCEY, 
Presipent. Annual Report to the Board of ei 
tees, 1949-50. Princeton, N.J.: the Service, [1051]. 
Pp. 79. Paper. Gratis. * (РА 25:7105) 


[ B137] A 
KEDUCATIONAL TEsrING SERVICE, HENRY CAUTE 
Present. Annual Report to the Board о (1952). 
tees, 1950-51. Princeton, N.J.: the Service, 
Pp. 105. Paper. Gratis. * 


views. 


[ B139] for Sti- 
*Epwarps, ALLEN L. Statistical Analysis ook: 
dents in Psychology and Education. New J ( 
Rinehart & Co., Inc., 1946. Pp. xviii, 360. S49 c jj. 
20:2965) For reviews, see 102 in Statistica 

ology Reviews, 1041—1050. 
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[B140] 
kEroneN, ANNA S. A Comparison of Two Tests 
of Intelligence Administered to Adults. American 
Psychological Association, Psychological Monographs : 
General and Applied, Vol. 63, No. 11, Whole No. 306. 
Washington, D.C.: the Association, Inc., 1949. Pp. iii, 
35. Paper. $1.00. * (РА 25:2452) 
[ B141] 

хЕвмкзт, Јонм W. Tests and Testing in Distribu- 
tive Education. Preface by Roy E. Simpson. Fore- 
word by Ira W. Kibby. California State Department of 
Education, Commission for Vocational Education, Bu- 
reau of Business Education, Business Education Publi- 
cation No. 38. Sacramento, Calif.: the Bureau, July 
1948. Pp. v, 87. Paper, mimeographed. Out of print. * 

J Bus Ed 24:49 D '48. * written to assist the 
instructor of distributive education in adult 
classes to develop personal evaluation by his 
students for the purpose of motivation. Realiz- 
ing that tests at the adult level smack too much 
of high school and college technique, these tests 
were planned to assist the student in discovering 
for himself how he is progressing. Tests are not 
graded by the teacher nor used as a means of 
grading. Illustrations of various type tests are 
provided, but more important the manual tells 
the general principles for testing, how to con- 
struct a test, how to give a test, interpretation of 
test results, and preparation of case problems to 
enliven the course. According to the author, the 
use of this material in his classes promoted in- 
terest, attention, and better attendance. 


[ B142] 
XEmow, Leonarp D. A Normative Study of the 
Thematic Apperception Test, American Psycho- 
logical Association, Psychological’ Monographs: Gen- 
eral and Applied, Vol. 64, No. 9, Whole No. 315. 
Washington, D.C.: the Association, Inc., 1950. Pp. v, 
48. Paper. $1.50. * (PA 25:6211) 
[ B143] 

Eysencx, H. J. Dimensions of Personality. А rec- 
ord of research carried out in collaboration with HT: 
Himmelweit and W. Linford Rees with the help of 

‚ Desai, W. D. Furneaux, H. Halstead, O. Marum, 
và McKinlay, A. Petrie, and P. M. Yap. Foreword by 
gorey Lewis. London: Kegan Paul, Trench, Trubner 

Со. Ltd., 1947. Pp. xi, 308. 255. * (PA 22:210) 
t Austral J Philos 26:141-4 S 48. C. A. Gibb. 

Many of the studies have been previously re- 
Ported, but these are now well integrated with 
new studies and contribute to an overall inter- 
Pretation which is new and very good. * an ex- 
tremely valuable book. It reports a great deal of 
Careful experimental work and no criticism can 
detract from that work and its contribution to 
the understanding of personality. Every student 
of personality will need to read it. Tt will not 


be popular in the undergraduate classrooms, but 
it will be required reading for more serious stu- 
dents as a companion to Cattell’s Measurement 
and Description of Personality. It contains 
many ideas which are out of the usual rut of 
studies in this area and consequently it is a 
valuable source of research suggestions, opening 
up many avenues which may fruitfully be pur- 
sued either by Eysenck himself or by other 
workers. Some criticisms, however, must be 
offered. Personality is defined (p. 25) as “the 
sum-total of the actual or potential behaviour- 
patterns of the organism, as determined by he- 
redity and environment; it originates and de- 
velops through the functional interaction of the 
four main sectors into which these behaviour pat- 
terns are organised : the cognitive sector (intel- 
ligence), the conative sector (character), the 
affective sector (temperament) and the somatic 
sector (constitution)." This definition is un- 
doubtedly neat and it is arrived at after some 
consideration of the literature; but it does have 
its difficulties. It is difficult enough to know just 
what a sum-total of behaviour patterns can be 
but it seems impossible to conceive potential be- 
haviour patterns. And it is difficult to know how 
else such "patterns" could be determined than 
by heredity and environment. It is not quite 
clear why Eysenck has preferred this definition 
to (say) that offered by Burt in 1945 by which 
personality is said to be “the entire system of 
relatively permanent tendencies, both physical 
and mental, that are distinctive of a given indi- 
vidual, and determine his characteristic adjust- 
ments to his material and social surroundings.” 
There is a difference between the definition 
Eysenck offers and this one which, two pages 
earlier, he has implicitly accepted. It would 
seem that Eysenck has wanted to use, but has 
failed to make clear, the distinction made by 
Burt when he said in a footnote that if per- 
sonality were defined as he defined it then what 
the psychologist really studies are interactions 
between personality and environment. This 
whole question of what is being studied in work 
of this kind is too large to be tackled adequately 
in a brief review, but one might point out that 
in so far as Eysenck has studied dimensions or 
factors at all these are not factors of personality 
as Warren or Burt define it, but of actual be- 
haviour. It seems likely that Eysenck is aware 
of this distinction, but he has not brought it 
clearly to light. The situation here is analogous 
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to the everyday treatment of the concept of 
weight. It is convenient to ascribe weight to 
the object rather than to an object-system rela- 
tion, but discussions of personality could be con- 
siderably clarified if this relativity were made 
clear. In defining character Eysenck has been 
rather more specific. He points to the two cur- 
rent uses of the term, one evaluative, the other 
conative. Deliberately he chooses the latter be- 
cause evaluative concepts are of little use in 
psychology. This is almost certainly true but it 
is doubtful whether this typically European 
choice will affect the already well established 
American usage of character for the moral and 
ethical evaluative aspect of personality, and un- 
til it does some confusion is bound to occur. It 
would seem desirable to define character evalu- 
atively and so let the term slip from psychology 
altogether except in the special cases when con- 
formity to moral and ethical codes is itself the 
object of study. The theory.of personality advo- 
cated here is very like that of G. W. Allport. 
Eysenck's statement of the position is clear and 
concise and the implications are directly drawn. 
Eysenck writes always well and clearly. This 
first chapter is a good one. It orients the reader 
immediately. It causes him to ponder deeply the 
nature of the phenomena to be examined and 
the methods employed and their appropriate- 
ness. It is a critical chapter though the reader 
may feel that its decisions are not always well 
taken in the light of recent American work of 
which frequently the author makes no mention 
and which, of course, may not have been avail- 
able to him during the war years. The factorial 
analysis described in Chapter 2 yielded two 
main factors: (1) a general factor of "neuroti- 
cism"...and (2) a bipolar factor (introversion- 
extraversion) opposing the “dysthymic” group 
of symptoms (anxiety, depression, obsessional 
tendencies, apathy, irritability, etc.) to the hys- 
terical group of symptoms (conversion symp- 
toms, sex anomalies, low intelligence, etc.) This 
interpretation has been made without recourse 
to rotation of axes. Cattell has already shown 
how greater, though not markedly different, 
meaningfulness may be obtained if rotation is 
used. The material of Chapter 3 is of some in- 
terest but it is generally not well validated and 
is certainly not vital. An index of body build is 
derived (which rather misleadingly is compared 
with І.О.) and it is shown that “hysterics are 
distinguished from dysthymics not only on the 
intellectual and temperamental level, but also 
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with respect to body build” (p. 91) ; but no esti- 
mate of the accuracy of this relation is offered 
until on page 94 it is said correlations between 
personality traits and body type are not large 
(about 0.30) and that indices of body type are 
thus useless as diagnostic criteria. In the fourth 
chapter the basic comparisons are for intelli- 
gence, level of aspiration, personal tempo, and 
perseveration as between normals and neurotics 
and between dysthymics and hysterics. Neu- 
rotics tend to be slightly below normal in intelli- 
gence, while hysterics are inferior to dysthymics. 
Level of aspiration is higher for dysthymics than 
for hysterics but the other comparisons yield 
nothing of significance. Minor statistical points 
may be queried: he quotes a tetrachoric r of 
0.66 as evidence of things “agreeing extremely 
well” (p. 63) and accepts a C.R. of 2 as indica- 
tive of significance (p. 136). On the other hand 
the discussion (p. 176) of the relation between 
a theoretical normal curve and an obtained U- 
curve reveals a lot of critical insight into sta- 
tistical and measurement concepts. Chapter 5 
presents two significant findings: (1) sixteen 
different tests of suggestibility define two dit 
ferent types of suggestibility, one only of which 
is related to hypnosis; (2) contrary to common 
psychiatric assumptions hysterics do not appear 
to be more suggestible than other types of neu- 
rotics. * Eysenck’s method has been to obtain 
his differential groups by factorial analysis of 
ratings and then to search for tests which will 
differentiate between them. One sometimes gets 
the impression that the object of this search is 
to prove that the factorial isolation was justi- 
fied, which, of course, it cannot do. In some re- 
spects this might be thought of as a companion 
volume to Rapaport’s Diagnostic Psychological 
Testing, but where Rapaport is interested pri- 
marily in the tests as diagnostic instruments 
Eysenck is interested primarily in a classifica- 
tion to which he seeks to show certain tests an 
measurements are relevant. Thus it 1s doubly 
unfortunate that he has not chosen to investigate 
the rotation of his factorial axes with а Wide 
obtaining a more fruitful interpretation 0 д 
material. In brief, one may say this is an gue 
ing book which will be welcomed especially pi 
those psychologists interested in theoretical 27 
: are prac 
pects of personality and by those who in 
tising clinicians. It submits to experiment Men 
statistical investigation a number of prO only 
usually dealt with müch more loosely, o. way 
providing information but also pointing ШЕ 
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to quantitative research in clinical psychology. 
Brit J Psychol, Stat Sect 1:131—2 J1'48. E. К. 

* Psychiatrists have long been urged by psycho- 
logical writers to check their methods and con- 
clusions by adopting modern statistical devices; 
and here is a book which will show that the 
suggestion is actually practicable. A special fea- 
ture of the work has been the close collaboration 
between non-medical psychologists and medi- 
cally qualified psychiatrists. The majority of the 
persons tested were patients drawn from the 
Services. It is emphasized that the term “neu- 
rotic" as applied to such groups does not possess 
the same meaning of civilian patients. "The 
‘neurosis’ exhibited was not so much an illness 
as a simple failure to adapt to Army routine and 
discipline." This naturally limits the generaliza- 
tions that can be drawn. * Some readers may 
feel a little disappointed that the investigators 
were not also able to include in their scheme of 
inquiry the method of "observation in stand- 
ardized situations"—a method which with chil- 
dren has given reliable results, and which, ac- 
cording to those who have adopted it in per- 
sonnel selection for the War Office and Civil 
Service, is said to be at once more reliable and 
more valid with adults than any set of personal- 
ity-tests used by itself. The sections of the book 
Which are most likely to arouse questions in the 
minds of the readers experienced in this field 
are those discussing the diagnostic value of the 
methods proposed. Several critics have pointed 
out that, in their earlier publications, the in- 
vestigators seemed a little too ready to infer that, 
ifthe mean scores obtained from the normal and 
heurotic groups with any given test were sta- 
tistically significant, then that test might be used 
to discriminate between the two groups. * To 
assess discriminative value, two different formu- 
lae were first employed. “In the first place," we 
are told, "an Index of Screening Efficiency was 
elaborated” : this seems to be simply what Yule 
calls the “product-moment correlation for a 
2X 2-fold table,” which assumes the frequen- 
cies to be concentrated at single points (a some- 
what doubtful assumption with traits whose dis- 
tribution is admittedly continuous and presum- 
ably unimodal). The second formula was Hunt's 
Selection Index.” By these or other criteria a 

attery of то tests was finally selected and ad- 

ministered to тоз neurotic patients and 93 surgi- 
cal cases, The correlation between the total 
Scores, reduced to an 8-point scale, and the psy- 
chiatric classification into neurotic and non- 


neurotic, proved to be 0.73. This figure is far 
higher than those usually reported with tests of 
the type employed ; and it would therefore be a 
little more satisfying to know how the correla- 
tion was calculated: if it is a biserial correla- 
tion, were the requisite conditions fulfilled? A 
further method of assessing the correlation was 
also adopted. The correlation between two sub- 
batteries, consisting of 5 tests each, was calcu- 
lated, and found to be 0.60, This is "corrected 
by the Spearman-Brown prophecy formula”; 
and thus raised to 0.75. From this it is inferred 
that the correlation of the test battery with a 
perfect criterion would be У 0.75 = 0.87. This 
is modestly described as “а comparatively high 
value” for such tests; and it is suggested that 
the actual validity of the tests probably lies mid- 
way between the two figures thus reached. The 
final conclusion is that “tests of temperament 
and personality may have reached a stage where 
they can take their place with the usual tests of 
cognitive function and special abilities.” This 
conclusion, like the tone of the whole volume, 
is perhaps a little sanguine. If, therefore, any 
practical psychologist is thinking of using such 
tests for purposes of personnel selection or psy- 
chiatric diagnosis, he would be wiser to wait 
for a repetition of the experiments by wholly 
independent investigators. Time after time we 
have heard of equally promising results claimed 
for similar test-procedures only to be sadly dis- 
illusioned when the first hopeful enquiry was 
repeated. This is merely a warning to the prac- 
tical worker. The theoretical psychologist will 
form his own conclusions; and will assuredly 
find much in this book that is at once stimulating 
and suggestive. The industry and enthusiasm of 
the whole team, and the success with which they 
have organized a co-operative scheme of work, 
deserve unhesitating praise. f 

Can J Psychol 2:225-6 D '48. Louise M. 
Thompson. * should be read by all psychologists 
interested in personality either from the the- 
oretical or clinical point of view. Although the 
title gives promise of a broad treatment, the 
book was not designed as a general text in the 
subject and is not recommended for beginners 
in the field of personality. It is a technical re- 
search study, or rather a series of researches * 
As the author himself points out, most of the 
data were gathered from neurotic service per- 
sonnel; thus, in general, the findings are ap- 
plicable only to the neurotic population from 
which they were derived. There were also, how- 
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ever, normal subjects used in several researches. 
* Eysenck's results throw light on several con- 
troversial questions in the field of personality. 
To give one illustration, his findings are op- 
posed to Janet's theory of the positive relation- 
ship between hysteria and suggestibility. The 
conclusions drawn from the present work were: 
(1) suggestibility is closely related to neuroti- 
cism; (2) suggestibility is not related more 
closely to hysteria than it is to other neurotic 
disorders (the data even indicate that dys- 
thymics are more suggestible than hysterics). 
The author stresses the fact that in using terms 
such as “suggestibility,” "persistence," and 
"rigidity" to summarize his results, he does not 
intend them to be interpreted in a popular sense. 
They refer to exact quantitative variables, sta- 
tistically derived. (In spite of this, however, he 
introduces his early discussion of these variables 
with a brief survey of previous views and ex- 
periments, which for the most part were non- 
statistical and representative of the "popular" 
concepts). Even using the most careful statis- 
tical methods, one is eventually faced with the 
problem of giving psychological meaning to 
the derived factors, and it would appear that 
Eysenck's general organization of the book im- 
plicitly recognizes this fact—even though he 
emphasizes the purely statistical reality of his 
variables. In an interesting final chapter the au- 
thor gives a synthesis of his findings and out- 
lines the conclusions that can be safely drawn. 
This chapter would have been better, in the 
reviewer’s opinion, if the author had confined 
himself to pulling together the experiments al- 
ready presented, and had omitted the introduc- 
tion of new data at this point. * there is an ex- 
cellent bibliography. Eysenck has set a high 
standard for this type of research, and his suc- 
cessful use of objective and statistical tech- 
niques augurs well for the future of experi- 
mental work in personality. 

Eug R 39:69-71 Jl '47. C. P. Blacker. This 
important book could be reviewed from sev- 
eral standpoints. Its many interesting features, 
among them its bearing on eugenics, spring 
from its novel treatment of data belonging to one 
province of science by methods pertaining to 
another. Dr. Eysenck is a psychologist; but the 
materials on which his thesis is based are mainly 
psychiatric. Indeed, the book might be described 
in one of its aspects as an application of psycho- 
logical methods, some of them ingenious and 
. original, to psychiatric material. Something is 


thus done to bring closer together two domains 
which bear to each other much the same relation 
as does physiology to pathology. Though for. 
mally and practically distinct, psychology and 
psychiatry are obviously complementary: each 
stimulates and fertilizes the other. * the main 
conclusions were suggested by an analysis of 
the records made by psychiatrists of some 7,300 
male and female service patients, mostly neu. 
rotic, admitted during the war to Mill Hill 
Emergency Hospital. The human material was 
thus mainly abnormal to the extent of needing 
the attentions of psychiatrists; and it was in 
their abnormal features that Dr. Eysenck was 
especially interested. Particulars of these 7,300 
Service patients—some 5,000 of them males and 
the remainder females—were entered on a docu- 
ment called an “item sheet," which provided an- 
swers to over 200 questions or items : these bore 
on the social, family, personal and clinical his- 
tories of each subject, his personality, the symp- 
toms and findings, and the causes, diagnosis 
and treatment of his illness. Dr. Eysenck's cen- 
tral thesis emerged from an intensive study of 
a part only of this material—i.e. from thirty- 
nine items relating to 700 of the aggregate of 
7,300 patients. Correlations were calculated be- 
tween these thirty-nine items and the resulting 
table was factor analysed. There emerged four 
general factors, of which Dr. Eysenck's book is 
concerned with the first two. Both express them- 
selves, as one would expect from the selected 
nature of the human material, in somewhat 
pathological terms. The first is a general factor 
of neuroticism. * The opposite pole of Dr. 
Eysenck's general factor for neuroticism 1$ dis- 
cerned in Webbs's “w” factor as described by 
this author in 1915. * Another factor designated 
by J. B. Maller as “с” is also apposite * Eysenck 
denotes this factor as “Р” * The second factor 
„does not receive an alphabetical baptism. 4 
covers the familiar dichotomy between intr» 
version and extraversion. I have foun р 
Eysenck's exposition of the nature of this 18 
somewhat difficult to follow because the samp. ; 
of 700 people from whom this factor ei 
tracted was pre-selected on the grounds of t a 
neurosis. Normal people no less than mp К 
сап be rated on the scale of which introve 
and extraversion are the extremes; and 1 in 
bi-pofar factor is, as Dr. Eysenck mois о ; 
dependent of the “Р” factor aboye meni a 
then its independence would surely € nin 
more readily demonstrable in a normal tha 
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a neurotic population. In Dr. Eysenck’s sam- 
ple, the antithesis between introverts and extra- 
verts was revealed by contrasting hysterical with 
“dysthymic” forms of neurosis. (The term 
“dysthymia” is used to denote the combination 


of anxiety, reactive depression and obsessional - 


tendencies.) The most general conclusion which 
emerges from Dr. Eysenck’s argument might be 
put in this way. Spearman's “g” factor expresses 
intelligence and operates in the cognitive ac- 
tivities of the personality, “g+” people being 
intelligent and “g—” people the reverse. Here 
is a “dimension” of the cognitive functions of 
the mind. The introversion-extraversion polar- 
ity expresses a comparable factor or dimension 
of the affective functions; and “P” plays a 
similar part in the conative. These three “di- 
mensions” are also factors of the most inclusively 
generalized or "type" level. Strictly they are to 
be regarded as principles of classification. But 
if they are good and not bad principles, they will 
do more than serve a taxonomic purpose: they 
will give a picture of the actual structure of the 
mind, That the picture of the mind they give is 
in fact accurate Dr. Eysenck sets out to prove 
in later chapters. By numerous measurements, 
tests, and factor studies, and by the use when 
appropriate of questionnaire inquiries, he shows 
that at what he calls the “trait” level (a lower 
level than the “type” in the factorial hierarchy), 
his basically antithetical configurations can be 
discerned. They are distinguished in measure- 
ments of physique and physiology, of suggesti- 
bility, of intelligence, of perseveration, of per- 
Sistence, of aesthetic appreciation, of sense of 
humour and even perhaps of handwriting. Dr. 
Eysenck displays an encyclopaedic familiarity 
With the enormous and growing literature of 
these diverse subjects, and not the least valuable 
part of his book is a compendious list, covering 
Some twenty-four closely printed pages, of ac- 
Curately documented references. From the eu- 
aie s standpoint, the most important feature 
of this book has already been indicated. Few 
кене can doubt that Dr. Eysenck's “Р” factor 
Hina) something of indisputable social sig- 
of ance. * Hitherto, the quantitative sciences 
реу and psychometry have been 
"M y concerned with measuring physical fea- 
ies, and cognitive faculties. Dr. Eysenck’s 
ia that orectic qualities (that is to say ex- 
EN of personality in the spheres of af- 
d оп and conation) are also measurable is 

arly of great interest to the eugenist who is 


concerned with the complex but generally recog- 
nisable qualities which make for useful citizen- 
ship. * Author and collaborators have broken 
new ground and opened new vistas in a valuable 
0 somewhat difficult book. 

n & Social Psychol 43:247-8 Ap ’48. 
H. J. A. Rimoldi. * As described by the ino 
the subjects on which the experiments were per- 
formed were for the most part patients showing 
“a mainly neurotic symptomatology of a rather 
monotonous character.” Although the main 
stress is laid on the study of the behavior and 
characteristics of these neurotics, the author 
compares his findings with those of normal sub- 
jects. He concludes, however, that "definite 
proof that we are justified in extending our re- 
sults in this way (to normals) is lacking." This 
presentation tends to confuse the reader. Some 
of the results are interesting, give matter for 
careful consideration and further research, and 
reveal a good deal of scientific ingenuity and 
considerable work. Others, however, cannot 
withstand critical examination. In this sense 
some of the opinions require further study, while 
some of the hypotheses put forward look prom- 
ising and worth while testing. Dr. Eysenck 
makes most of the comparisons in relation to 
the results of a factorial analysis. The intercor- 
relations of the scores obtained on 39 items given 
to a population of 700 patients form the basis 
for this study. These items were selected “оп 
the basis of their psychological interest: these 
items covered the social history of the patients, 
his personality and his symptoms fairly ade- 
quately." We do not know up to which point 
these 39 items can be considered fairly ade- 
quate. А glance at Table 1 of the book, or at 
the item sheet in pages 3 to 8, will indicate how 
very difficult it must have been to evaluate pre- 
cisely the adequacy of the items for the purposes 
of statistical analysis. Of the four factors ob- 
tained, accounting respectively for 14 per cent, 
12 per cent, 8 per cent, and 6 per cent of the 
variance, the author interprets with certainty 
the first two. The first one is considered as the 


general factor of neuroticism. The loadings of 


the 39 items reveal thàt several of them—ob- 


sessional, somatic anxiety, depression, among 
others—have loadings of: .00, .05, and .04 re- 
spectively. Out of 39 items, seven have loadings 
less than or equal to .10. If by general factor we 

as significant loadings in all 


mean one which hi n | 1 
of the tests, can We, then, consider this factor as 


a general one? Without going into the details of A 


à 
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the author's discussion about the general factor, 
his problem, as factorial problems usually are, 
is to find the axis of the configuration that will 
best explain the results. Any factorial matrix, as 
obtained by direct factorization of the correla- 
tion matrix, is only one of the possible solutions, 
where the orthogonal axes have been placed 
arbitrarily according to the special technique 
employed. Therefore; and although eventually 
this factor matrix may be accepted directly, a 
simple plotting of the values will convince the 
reader that many a time these first results are 
not clear enough. A change in reference axis 
will often add not only clarity but new leads for 
a more fruitful interpretation. Unfortunately, 
the author has not used this technique. Sum- 
marizing, this is an interesting book in many 
senses, not only in terms of the effort of the 
author for delimiting and submitting to experi- 
mental and statistical control problems usually 
dealt with in a much looser fashion, but in terms 
of the amount of information and scientific in- 
genuity shown through certain parts of the text. 
The title of the book suggests, at least to us, a 
different content than the one given. Some of 
the opinions are logically and clearly presented, 
while others are somewhat disconcerting in view 
of existing theories of personality and of neu- 
rosis. We are certain that the book will be wel- 
comed by many psychologists and psychiatrists 
and that Dr. Eysenck provides here a wealth of 
valuable suggestions for further research, apart 
from the corroboration of some facts already well 
established by clinicians and others. 

J Appl Psychol 33:614 D’49. Arthur Weider. 
* Eysenck and his collaborators have turned to 
fruitful purpose the opportunities such a place 
affords for studying some of the main factors of 
personality and have shown how profitable may 
be the collaboration of psychologists and psychi- 
atrists in pursuit of this aim. * While this book 
may presently contribute little that is useful to 
harassed personnel psychologists, these special- 
ists should nevertheless be familiar with the 
methods of investigation utilized. Much that is 
presented may influence our concepts of psycho- 
pathology in the future. However, it is recom- 
mended to all applied psychologists working in 
clinical fields and is “must” reading for the 
small corps of experimental clinical psycholo- 
gists. 

J General Psychol 38:271—4 Ap 48. Raymond 
Cattell. This book will become a landmark in 
the advance of psychiatric knowledge. For psy- 
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chiatry, like any other art based on science, comes 
of age when it begins to be able to make true 
measurements. * In this pioneer study the au. 
thor...sets out boldly first to discover what di- 
mensions, vectors or factors most need to be 
measured in defining any given abnormal per- 
sonality. Apart from a few sporadic studies, such 
as that of F. W. Moore (not listed in the other- 
wise very adequate bibliography) no previous 
serious attempts have been made to factorize 
psychiatric data, though the present reviewer's 
12 factors obtained with normal personality data 
existed as a possible point of comparison and 
generalization, Eysenck proceeded as far as four 
factors, but the bulk of the data concerns the 
two most important, on which it was happily 
decided to concentrate, namely, general neu- 
roticism and hysteria-dysthymia. These can al- 
most certainly be recognized as equivalents of 
the obverse of the C factor of Emotional Sta- 
bility-Integration v. Genetal Emotionality and 
the F factor of Surgency v. Desurgency, respec- 
tively, discovered in the normal personality. The 
weakness in the definition of these factors is, in 
the writer's opinion, that they were not rotated 
for simple structure. Fortunately, after rotating 
them, we still find the same patterns, with slight 
differences of emphasis, in those two of the four 
factors on which Eysenck decided to concen- 
trate his experimental work. The strong points 
of this factorial study, on the other hand, are 
the very extensive population of neurotics on 
which it is based—unrivalled in any other study 
and statistically adequate for the experimental 
design—and the wide variety of personality 
manifestations—behavior rating, self evaluation 
in questionnaires and objective test data— 
brought together in the correlations. * Such T 
these tests as repeat previous investigations con 
firm the direction of association previously мА 
and it is gratifying to see that on this АШ, 
population the correlation coefficients for А 
objective tests are of a magnitude like % y 
be clinically useful. Indeed the practicing Р ul 
chometrist in mental hospitals will find La 
valid objective tests for two important are 
sions—the degree of dynamic disintegra pe 
and the temperamental continuum from sd. 
version to anxiety hysteria—than и 
found in many tests, e.g., the Кг gi 
popularly in use. The tests which bre? ей 
ground, such as the humor test, the d 
test, the aesthetic preferences, the се, 
of inhibition of salivary secretion and € 


BOOKS & REVIEWS: EYSENCK : 


93 | 


esterase concentration, on the other hand, need 
confirmation of the significant correlations dis- 
covered here before they can be incorporated in 
the routine clinical batteries of applied psychol- 
ogy. After determining the unitary personality 
traits or personality dimensions which are avail- 
able for measurement among neurotics, and 
after demonstrating that certain tests give valid 
measurements of these dimensions, the experi- 
menters set out to relate high or low endowment 
in these factors to a variety of constitutional 
and background conditions. Here we encounter 
special studies by Desai, Furneaux, Halstead, 
Marum, McKinlay, Petrie, and Yap. Indeed 
most of the data in this book has been published 
recently in article form by various co-workers 
and Dr. Eysenck's task has been to integrate the 
presentation of the various ancillary studies in- 
volved in his main research plan. Not least 
among these co-workers has been his wife, whose 
work on the organization of intelligence in de- 
teriorating conditions, with its brilliant logical 
and statistical analysis, is invoked to throw light 
in the chapter on “Ability and Efficiency.” In 
reading the chapters on these special aspects and 
branching studies one tends to get a little lost, 
particularly as to the populations involved (e.g., 
whether they were the main population of the 
primary research), whether their classification 
as high or low integrates is based on ratings or 
tests, and so on. Nevertheless these chapters аге 
presented systematically and inter-related as far 
as integration is possible at the present stage of 
knowledge. Radical departure from previous re- 
Search findings and conceptions occurs at two 
points which will doubtless provoke contro- 
versy. First, Eysenck finds suggestibility, which 
itself factors into two distinct varieties, related 
to the C factor of Emotional Integration vs. 
Neuroticism, and not to hysteria vs. dysthymia 
as most clinicians, in the wake of Janet, have 
Previously believed. The hysteric in short is 
found to have no significantly greater suggesti- 
bility than the anxiety neurotic. The field of 
hypnosis and suggestibility has for long re- 
mained in a confused state due to the almost 
exclusively qualitative clinical treatment. The 
Present discovery of two, and possibly three, dis- 
tinct varieties of suggestibility in the tests used 
| E Hull and others, may clear the way to а reso- 
th lon of this confusion. Meanwhile the finding 
ue the suggestibility involved in the sway test 
re in hypnotizability is related to neuroticism 

er than the hysteria dimension awaits con- 
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firmation. Secondly, Eysenck’s chapter on 
Physique and Constitution” disagrees radically 
with Sheldon’s theories. He finds no significant 
relationship between the breadth-girth (viscero- 
tonic) component and cyclothyme-schizothyme 
traits. Instead he obtains—in a comparison of 
120 eurymorphs, 150 leptomorphs, and 730 
mesomorphs—significant relation between body 
breadth and conversion hysteria (surgency) 
component. It is necessary to remember that 
Eysenck is dealing with well established person- 
ality factors or source traits whereas Sheldon 
dealt with surface traits, e.g., somatotonia, es- 
tablished on relatively small groups. On the 
other hand, where physical types are concerned, 
Sheldon’s partly qualitative and more subtle 
components may be more fundamental than 
those obtained by the few tentative factoriza- 
tions of physique yet available. Eysenck obtains 
a factor of overall size, another contrasting 
narrow (leptomorph) with broad (eurymorph) 
build and possibly a third expressed in chest 
depth. He brings evidence that this depth com- 
ponent is related to the cyclothyme (4 factor) 
component in temperament. But breadth is pri- 
marily related to conversion hysteria, i.e, the 
surgency component. Over and above this rela- 
tion there is a tendency for neurotics as а whole 
*to be pronouncedly leptomorph as compared 
with normals.” Although this book presents a 
turning point in psychiatric methodology it will 
probably be read with more understanding by 
psychologists than psychiatrists, for few psy- 
chiatrists have been trained to think statistically. 
Because it points the way to methods of meas- 
urement that are going to be increasingly im- 
portant in the near future it is absolutely essen- 
tial reading for the clinical psychologist. Yet its 
greatest interest will be for the research worker, 
who will find in this great mine of facts from the 
workers at the Mill Hill Emergency Hospital an 
opportunity to test many hypotheses, and who 
will enjoy the breadth of scholarly knowledge 
which Dr. Eysenck has brought to bear. Tech- 
nically, one could have wished that the factoriza- 
tions had been carried further, for in rotation 
the definition of the earlier, larger factors 1s af- 
fected by contributions from the small factors 
extracted later, and it is possible that some of 
the slighter correlations which Dr. Eysenck gets 
between dysthymia and character-like traits is 
due to the contamination of his dysthymia factor 
estimates with items which cleaner rotation 
would eliminate. To anyone familiar with the 
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technical complexity and arduous labor involved 
in factorial studies, however, this may seem a 
counsel of perfection. Finally it is a pleasure to 
see boldly planned original work which is also 
well rooted in historical awareness and does not 
repeat mistakes through lack of reading of psy- 
chological literature. The bibliography shows a 
familiarity with significant trends in both Amer- 
ican and British psychological research which is 
quite rare in books now appearing in Britain and 
America. 

Lancet 252:713 My 24°47. * A useful varia- 
Чоп of the complicated Rorschach test is 
described, and showed significant differences be- 
tween normals and neurotics, and between séri- 
ous and mild neurotics. Dr. Eysenck and his col- 
laborators have been most searching in checking 
their results mathematically, ensuring that only 
significant differences are recorded, and usually 
studying sufficiently large control groups. Their 
mass of evidence is formidable but welcome: it 
brings into psychiatry the method of mathe- 
matics and gives hope that we can be more ac- 
curate in our assessments, more certain in our 
pronouncements. But it would be a pity if Dr. 
Eysenck’s modest claims led to facile statements 
by the less well trained—e.g., that “long persons 
are essentially neurotic” or “dull persons are po- 
tentially hysteric.” Nor has he quietened all 
doubts. He seems himself to be too dogmatic in 
what he says of graphology, and those who are 
not allied too closely to the theory of type- 
identity will wish to know what happens in Dr. 
Eysenck’s future assessment of the man who, at 
Mill Hill, was a conversion-hysteric and has 
now become suicidal and depressed, or of the 
apparent depressive whom electrical convulsive 
treatment has apparently (as it does sometimes ) 
converted into a querulous somatic pain-ridden 
“hysteric.” The apparent rigidity of some of the 
diagnoses—which he admits—points to the crit- 
ical spirit in which this book must be received, 
but such a spirit should also appreciate the fresh 
channels of thought which it opens, and which 
Dr. Eysenck’s team will clearly continue to 
provide. 

Nature 160:887 D 27 ’47. Clifford Allen. 
* This research is interesting, and the frequent 
quotations show a wide knowledge of the litera- 
ture. A few show an unnecessary desire to quote 
from other Maudsley colleagues, as, for example, 
that on page 51: “A person’s constitution may 
render him liable to breakdown at the slightest 
provocation ; on the other hand, a person of first- 
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class constitution may finally break down when 
the stress is too great.” This is such a psychia- 
tric commonplace that surely it has been stated 
many times before Slater in 1944. These, how- 
ever, are minor blemishes and the book as а 
whole is valuable. The format is neat, the paper 
good and the printing clear. It is written as easily 
as such a subject allows. There is a useful sum- 
mary of the findings at the end of each chapter 
and many diagrams clarify the text. There is a 
long bibliography of 24 pages and an excellent 
index. 

Psychiatric Q 23:783 О '49. * Eysenck han- 
dles well the various issues of theories of tem- 
perament, personality structure, neuroticism, 
oscillation, perseveration, ideo-motor action, 
graphology, narcosis and suggestibility ; and the 
author also comments intelligently on the Ror- 
schach test. The scope of analysis is wide; the 
techniques are scientific and objective. * Chapter 
VII, “Synthesis and Conclusions,” sums up bril- 
liantly the significant findings concerning neu- 
rotic introverts and neurotic extraverts, hyster- 
ics and dysthymics. The statistical tables offer 
interesting data; especially since the author 
widely realizes that statistics make an invaluable 
servant, but a bad master. 

Psychiatry 12:312-3 Ag ’49. Jurgen Ruesch. 
* the whole book conveys a sense of modesty and 
moderation for which the authors are to be con- 
gratulated. This reviewer was particularly in- 
terested in the theoretical framework and the 
value premises which formed the base for this 
gigantic undertaking. The theoretical frame de- 
rives from descriptive psychiatry as it exists 01 
the Continent and in England. The senior at- 
thor, being a psychologist, clings to the descrip- 
tive approach to psychiatry because his statis- 
tical methods warrant collection of Le 
gained information based upon public data. Бу 
using statistical methods he concentrates upon 
the study of convergent events. However the 
present-day psychiatrist, at least in аа 
seems to be primarily interested іп pate 
gained information and in unique events W je 
constitute the basis for private data. It is 1! à 
obvious that a statistical approach to a 
events is not feasible. The work of the аш a 
thus confines itself to viewing the neurotic Lah 
son from without, and though this aoe y 
strictly scientific, the question arises 1n W N brey 
such a view is useful to psychiatry. A" rd: 
Lewis answers this question in the xt " 
“Psychiatrists, concerned to understan 
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patient rather than to measure him, are disposed 
to look askance at methods which could seem 
to them atomistic, aridly statistical, and untrue 
to the dynamic influences which mould and de- 
termine human individuality. It is, however, 
precisely in its methods that the research de- 
scribed in this book may be found to contribute 
most to the psychology of personality, and conse- 
quently to the theory and ultimately to the prac- 
tice of psychiatry.” * The book is eminently 
suited for the research man who is interested in 
mental measurements, especially the clinical psy- 
chologists; it is one of the best documented 
pieces of research which has been written on the 
experimental and psychometric approach to psy- 
chiatry. To the therapeutically oriented psychi- 
atrist this volume will appeal less because it 
doesn’t contribute towards new therapeutic ap- 
proaches or a better understanding of the pa- 
tient. The theoretically-minded psychiatrist will 
be interested to see how, when complicated sta- 
tistics are applied, the choice of variables and 
the theoretical premise have to remain simple. 
This piece of research provocatively raises the 
question whether simple theoretical premises 
which are essentially based on causal, linear, 
dichotomous, or converging systems will solve 
the riddle of mental health ; or whether the exact- 
ness of the method has to be sacrificed in order 
to allow the use of more complex theoretical 
schemes. 

Psychol В 46:319—20 Jl’ 49. Leona E. Tyler. 
In our present state of knowledge, any large- 
scale investigation of personality is an important 
event. * Since most of the specific studies have 
already been reported in journals, Eysenck does 
not attempt to describe them all in detail but 
tries rather to bring them together into a со- 
herent organization and fit them into the context 
of past research on personality. He does this 
well, analyzing probable reasons for discrepan- 
cies from study to study. His definitions of terms 
and concepts, clarifying what he and his co- 
Workers mean by trait, factor, introversion, etc., 
аге very helpful in this connection. * А very 
Catholic attitude seems to have characterized 
their activity. Physical and physiological meas- 
urements, ability tests and personality question- 
naires, laboratory performances of several kinds, 
and projective or expressive techniques were all 
tried. A few strikingly significant relationships 
eed General neuroticism seems to be re- 
td to a significant extent to poor scores on 

tk vision tests and to body-sway measures of 


primary suggestibility. Dysthymics are clearly 
differentiated from hysterics in some aspects of 
their level of aspiration scores. There are many 
other suggestive findings and promising tech- 
niques, such as the Ranking Rorschach Test and 
the analysis of responses to humor of various 
sorts. In the last chapter, the author brings to- 
gether the bits of information into a composite 
picture, but there are really not enough fairly 
certain facts to make this very convincing. The 
weaknesses in the work are those which would 
be likely to show up in a situation where many 
individual investigators branch out into separate 
paths without much central control over formu- 
lation of problems and methods. Although it is 
stated that 10,000 subjects were involved in the 
whole undertaking, the number of cases in many 
individual studies is entirely inadequate to set- 
tle the issues being investigated. Ways of show- 
ing relationship between the basic dimensions 
and other traits vary from one study to another, 
and many of them present data in an awkward 
form which makes interpretation difficult and 
comparison with other studies impossible. One 
wonders why the weighted scores for neuroti- 
cisin and introversion which were used in plot- 
ting Figs. 4 and 5 could not have been used in 
all the separate studies and correlated with meas- 
urements of the traits being investigated. The 
most serious limitation of the work arises from 
the non-representativeness of.the population, 
especially that upon which the initial factor 
analysis is based. Patients at this hospital were 
not even representative of neurotics in general, 
and men and women appear to have been dif- 
ferently selected. The author, of course, recog- 
nizes this difficulty, and it is to his credit that 
he describes the experimental group in some de- 
tail and brings in comparison data from normal 
populations wherever possible. These compari- 
son groups, however, are very diverse and often 
very small (e.g. twenty normal controls were 
used in the exercise response study, p. 103). 
Sometimes no information is given as to how 
they were chosen (e.g. Ranking Rorschach 
study, p. 212). In a largely uncharted area like 
personality research it would be unfair to judge 
a preliminary voyage of exploration by ideal 
standards. Limited data are certainly of a great 
deal more value than no data at all. The main 
task accomplished, as the author sums it up, is 
that of "isolating two personality dimensions 
and discovering a series of tests which enable 
us to perform quantitative investigations along 
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those two dimensions.” This alone is a consid- 
erable accomplishment. It is as a framework for 
further quantitative personality study and as a 
rich mine of ideas for laboratory and test meth- 
ods by which one might proceed that the book 
will be most useful. It is in no sense the last word 
on the subjects it discusses. 

Psychosom Med 12:213 My-Je ’50, Josef 
Brožek. It is an eloquent testimony of the dis- 
tressingly primitive state of the science of per- 
sonality that all serious investigators tend to 
start laying the foundation ab initio. This is 
fully understandable as agreement is lacking on 
such basic issues as the dimension of personality 
along which measurements could be undertaken. 
Quantitative description of personality is im- 
possible without such a knowledge. * How suc- 
cessful was this “concentrated and cooperative 
effort to discover the main dimensions of per- 
sonality and to define them operationally, i.e., 
by means of strictly experimental, quantitative 
procedures”? The authors themselves claim to 
have progressed but a small distance toward 
their goal, although at times they wax enthusi- 
astic (“tests of temperament and personality may 
have reached a stage where they can take their 
place with the tests of cognitive function and 
special abilities,” p. 259). About the soundness 
of the fundamental approach—combining psy- 
chiatric evaluation, personality questionnaires 
(self-ratings), and an objective quantitative 
study of the human organism—there is little 
doubt. Only time and experience of other in- 
vestigators (and clinicians) can provide the test 
of the validity and value of the specific conclu- 
sions. Some limitations become readily apparent. 
Factor analysis is a useful tool for a parsimonious 
account of the interrelationships between a set 
of variables. It is obvious that the outcome, the 
“output,” depends heavily on the “input,” on 
the information contained in the original data 
which form the basis for calculation of the cor- 
relations. The fundamental data were the psy- 
chiatric ratings of 39 “personality traits.” The 
items covered the social history of the patient 
(unemployment, married, unsatisfactory home, 
domestic problems), personality characteristics 
(dependent, cyclothymic, schizoid, hypochon- 
driacal personality), and symptoms (effort in- 
tolerance, dyspepsia, fainting fits, headaches). 
The reviewer questions the adequacy of this in- 
ventory, on several accounts (items not homo- 
geneous enough, incomplete coverage, inclusion 
of items lacking general applicability, such as 
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“wartime separation,” “bomb and exposure,” 
“boarded out of the army”). It may be noted 
that R. B. Cattell began his analysis, in many 
respects similar to that of Eysenck, with 171 
basic traits, condensed into 35 clusters, with 
“neuroticism” as one of the clusters, and ended 
with 12 primary personality traits. The discrep- 
ancy between the results of these two major 
recent efforts at a quantitative analysis and 
definition of personality is striking and should 
lead to a serious re-thinking of the basic issues, 

Q R Psychiatry & Neurol 5:248-9 Jl '50. 
Isabelle V. Kendig. * Some....findings, and some 
of the special technics used, are more challeng- 
ing than convincing. While a very large number 
of subjects was involved (10,000 in all, it is 
stated), in many of the individual studies the 
numbers employed were too few to warrant 
the conclusions drawn. There is marked varia- 
bility in the use of the technics described and in 
the method of presentation from study to study. 
The representative character of the experimental 
and control groups may also be seriously ques- 
tioned. Despite such criticisms, however, Ey- 
senck and his co-workers have done a brilliant 
job of pioneering in a relatively uncharted field 
and, perhaps best of all, they have demonstrated 
the value of the team-work approach in research. 


For an additional review, see 3:673. 


[B144] 

*FarnswortH, PauL Ranpotpx. Musical Taste: 
Its Measurement and Cultural Nature. Stanford 
University Publications, University Series, Education, 
Psychology, Vol. 2, No. т. Stanford, Calif.: Stanfor 
University Press, 1950. Pp. 94. Paper. $1.50. * (Lon- 
don: Oxford University Press. 12s. (PA 24: 5733) 

Am J Psychol 64:455-7 J1’51. R. М. Ogden. 
* Farnsworth goes, not to the laboratory, but 
to a “pooling of the ballots of musicologists, the 
study of space allocations in scholarly ren 
the counting of the compositions which pd 
been recorded on phonograph disks, and а 
analysis of programs" in order to Е : 
rather good picture....of the men regarde No 
most eminent in any one time and place. А 
clear distinction is made between the d 
of a composer and the popularity of his m os 
No reference is made to the quality of ne 
tions, light or heavy, joyful or sad, nor 0 ins 
tion in the appreciation of any one Cod 
in comparison with another. Indeed, the ai ic 
of the inquiry only confirm а suspicion ue 
one might have had to begin with, that «1 "o 
composers are better known than others. 
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a psychological study of musical taste by “lab- 
oratory methods," Farnsworth decries it as 
a search for a "set of absolutes." “No music,” 
he writes, “сап be inherently good or bad, for 
goodness is only an evaluation by a group of 
men trained to accept a particular set of stand- 
ards.” The implication seems to be that “the 
particular set of standards,” if not quite arbi- 
trary, is at best “relative, culturally derived and 
stable for only limited areas and periods of 
time.” Admitting that “in some degree our tools 
have been found to be unreliable and invalid,” 
Farnsworth still believes that “they are good 
enough to have uncovered ‘facts’ which are worth 
the consideration of musicologists, musical edu- 
cators and social scientists in general.” If we 
examine the author's “concluding remarks" to 
find out what these “facts” are, we shall be at a 
loss, excepting, perhaps, in the sentence: “If 
this reasoning is sound, it can be predicted that 
Beethoven, Bach, Mozart and Wagner (the com- 
posers given the top four ranks by the musi- 
cologists) will maintain their positions in our 
American texts at least for some time to come.” 
This reassurance may be helpful to textbook 
writers and also to program and disk makers. 
As regards the nature of musical taste, it is not 
revealing. Whether or not a science of art is 
possible in terms of its fundamental perceptual 
patterns, it does not appear from this study that 
a sociological method is calculated to tell why we 
appreciate anything. As to what we appreciate, 
à distinction must surely be made between the 
tminence of a composer and the enjoyment of 
his music, A comparison of the number of disks 
recording particular compositions sold in a 
given period of time, might furnish a better 
indication of the musical taste of that time than 
any of the statistics which Farnsworth has so 
Painfully collected. 
en Л Psychol, Gen Sect 42:199-200 Mr & 
oh 51. * a closely packed survey of American 
Xd British studies of musical taste and its as- 
Mic * Three...chapters deal usefully and 
‘Neally with methods which have been and 
pee be used to assess taste. * 
me B 30:167 S 12°51. Joseph A. Leeder. 
di 15 much confusion in the minds of musi- 
dm те educators, and psychologists in те- 
"high?" үшүн taste, whether one is "low" or 
through row. Mr. Farnsworth persuades us 
nb Scientific and statistical approaches that 
is nota matter of whim but follows a type of 
: Such studies as this are needed to guide us 
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in the choice of music for classrooms and for the 
concert stage wherever the layman or the mu- 
sically élite are considered, This book gives the 
reader a sane outlook concernitig our musical 
future as well as an evaluation of our musical 
heritage. 

J Appl Psychol 35:301 Ag ' 51. Kate Hevner 
Mueller. * The book...begins where the usual 
history and exposition of aesthetics ends, al- 
though in carrying the story and the problem for- 
ward into the realm of empirical data, it may turn 
a corner which many older workers and writers 
will find a little disconcerting. The style and 
organization are simple and informal and will 
therefore be especially welcome to students of 
psychology as well as to the many readers in 
the field of music where statistical complexities 
and methodological intricacies are inappropri- 
ate. It should serve as an indispensable hand- 
book for all workers and experimenters in the 
field of musical taste. This enlightening and 
stimulating review of music research by one of 
its most sophisticated scholars has two most es- 
sential virtues; it is clear and complete, and it is 
never tedious. These virtues may be due in part 
to the fact that research in musical taste is still 
in its infancy, characterized more by breadth and 
boldness than by subtle semantic differentiations 
and esoteric techniques which are inevitable with 
the rich development of materials in other fields. 
They are also due in part to the fact that the 
author has dealt exclusively with experimental 
work, not with speculation and argument un- 
supported by empirical data. And they are cer- 
tainly due also in large part to the author's skill 
in organizing the available studies around sig- 
nificant problems, stripping the problems clear 
of metaphysical nonessentials and resetting them 
in a new experimental frame of reference no 
matter how spare and unglamorous the new set- 
ting may be. To those who may be interested in 
another dimension of musical taste, especially 
if they approach it from a more subjective and 
intuitive point of view, these chapters may seem 
unconvincing, for the methods, the vocabulary, 
the concepts, are those of the social scientist and 
the author makes no attempt to meet other dis- 
ciplines on their own grounds. He merely gives 
a clear and full picture of the ground on his 
own side of the pasture. " Psychologists who 
prefer to remain unconvinced had better not 


read this book. 


Sch R 50:124-: 
Books dealing wit 


4-5 Е ’51. V. Howard Talley. 
h musical taste and other 
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aesthetic problems are not often brought to the 
attention of workers in music in the secondary- 
school and junior-college areas. Yet there is no 
good reason why they should not be. Secondary- 
school teachers of music are too often exclusively 
preoccupied with the practical problems of their 
calling, too little aware of what has already been 
done for music in the fields of psychology and 
aesthetics, A casual glance at the footnote cita- 
tions of studies in these two fields found on al- 
most every page of Farnsworth’s new publica- 
tion, Musical Taste: Its Measurement and Cul- 
tural Nature, will disclose many interesting and 
provocative titles that should lure the curious 
reader forward to explore their contents. * It 
is unfortunate that Farnsworth does not begin 
by telling us what the elements of musical taste 
are; he somewhat arbitrarily selects one—the 
determination by rank of eminence among a 
selected group of composers—and presents the 
results of polling procedures done by him in an 
earlier project and by others. This is the element 
which is stressed by the author throughout the 
rest of the work as reflecting current conditions 
of musical taste. * Limitations of space will not 
allow discussion of some of the findings pre- 
sented. A few of them offer wide latitude in 
interpretation. Take, for instance, the statement 
(not Farnsworth’s) that, over a considerable 
period of time, Beethoven’s Ninth is the lowest 
in popular favor among his nine symphonies 
(p. 27). Here the statistical method is not a 
reliable guide. “Popular favor” is measured by 
the number of times a work was listed on sym- 
phony programs over a given period of time. 
When it is recalled that the Ninth is the longest 
and the most difficult of the nine symphonies and 
that it requires four eminent (and expensive) 
soloists and a large chorus for the taxing music 
of its final movement, it is easy to understand 
why it is “lowest in favor.” It can be given in 
most cases only under festival conditions. Its in- 
frequent performance, therefore, is no indica- 
tion of a decline in musical taste but is rather a 
reflection of the control that economic conditions 
exercise over the “popularity” of a musical work. 
One has the uncomfortable feeling that many 
other of the book’s statements based on statisti- 
cal findings would be susceptible to ambiguous 
interpretation—in other words, would have lit- 
tle bearing on the matter of taste. With this res- 
ervation, one can accept the results of old and 
new researches as reflecting the prevailing state 
of musical taste in the domain of “serious” music 


(“popular” music is only tangentially consid- 
ered). Even Farnsworth admits the tentatiye 
nature of this study. * 
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ЖЕлтто, NicHoLAs A., AND Fox, WiLLIAM Н. Scores 
on the Interpretation of Data Test: Their Rela- 
tionship to Measures of Achievement, Personal- 
ity, and Interest. Bulletin of the School of Education, 
Indiana University, Vol. 25, No. 3. Bloomington, Ind.: 
Indiana University Bookstore, May 1949. Pp. 54. Pa- 
per. $0.75. * (PA 24: 1715) 

[8146] 
*Fisuer, SEYMOUR. Patterns of Personality Rigid- 
ity and Some of Their Determinants. American 
Psychological Association, Psychological Monographs: 
General and Applied, Vol. 64, No. т, Whole No. 307. 
Washington, D.C.: the Association, Inc., 1950. Pp. v, 
48. Paper. $1.00. * (PA 24:6255) i 
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X*FLANAGAN, Joun C. Eprtor. The Aviation Psy- 
chology Program in the Army Air Forces, Army 
Air Forces Aviation Psychology Program Research 
Reports, Report No. 1. Washington, D.C.: U.S, Gov- 
ernment Printing Office, 1948. Pp. xii, 316. Out of print, 
* (PA 22: 4144) 

Am J Psychol 61:602-3 O 748. Madison 
Bentley. * This volume offers a good illustra- 
tion, on a large scale, of the way in which uni- 
versity psychologists, with certain specific prep- 
aration and of a certain bent, may enter into, 
and offer solutions upon, a variety of problems 
brought to them from non-psychological sources 
—in this instance, from The Army. * Among the 
psychologists who will read with interest and 
profit the volume which Flanagan has put to- 
gether are those who sought to perform a like 
service in 1917 for the primitive and ae 
otic air-group within the Signal Service, ун 
called upon a certain psychologist to assem 
a number of his colleagues and to institute pro 
cedures of selection, classification and ind 
later adding problems of training and ашин 
tion of personnel. Although these men were s a 
siderably restricted by want of specific E 
tion, more by the want of governmental p ia pe 
and equipment, and still more by high aut vi 
granted to an imported school of otolca 
to other medical agencies, they may now ; jm 
that practically all of the added чай eri 
enumerated beyond "selection and c wor T 
tion" (i.e. as regards most of the у 
corded in Chapters 5-8) were broug kn in 
their own attention before Armistice 
1918. These men stand, therefore, 1 eo 
position to weigh and to appreciate tically all 
advancement made since 1940 In prac Sut 
aspects of personnel service in the air, а 
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many improvements touching equipment and 
operation. The “general contributions” of Part 
iii also gain in significance when we reflect that 
practically no organized effort was made by the 
War Services to salvage for use in a second crisis 
the results—well formulated in official records 
and in the annals of professional research—of 
the earlier task. For the psychologist whose first 
interest and first obligation lie in the progress of 
his own subject, the last section of the book— 
which wisely goes beyond military accomplish- 
ment and toward psychological advancement— 
will be of primary concern. While the topics and 
conclusions there reviewed must await candid 
reflection, а single provisional remark upon the 
four main contributions (psychology of similar- 
ities and differences, psychology of "successful" 
learning, equipment design, and experimenta- 
tion and prediction) may be indulged. While 
new capital (or at the least new incremental re- 
turns) may in time be realized from the first 
two, and high advantage to the arts of flight from 
the third, it is primarily the new gains from the 
procedures of testing and statistical interpreta- 
tion which prodigiously add, both directly and 
indirectly, to aviation in war and in commercial 
transport and communication. 

J Consult Psychol 12:282 Jl-Ag '48. Lauw- 
rance F. Shaffer, The nineteen volumes of Avi- 
ation Psychology Reports constitute the most 
complete record of any coordinated effort in 
applied psychology. Report No. 1 is an intro- 
duction to the series, and a summary of the prin- 
cipal findings. * Readers who lack time to study 
the entire series of reports will find these sum- 
mary chapters a valuable condensation of the 
entire program’s work. The unique contribution 
of the report is contained in the four chapters of 
Part III which evaluate the general contribu- 
tions to aviation psychology and, more broadly, 
to applied psychology. 

J Ed Psychol 39:441-4 N °48. Lee J. Cron- 
bach. Most of the work of psychologists during 
the war took the form of skillful application of 
Previously-established principles. Even the war- 
time research projects rarely attacked frontier 
problems, The two major exceptions, and there- 
fore the most significant for psychologists gen- 
erally, are the program of the OSS described 
in Assessment of Men (Rinehart, 1948) and 
that of the Army Air Forces * The AAF pro- 
gram provided a unique opportunity for the ap- 
plication and development of personal tech- 
hiques, Because psychologists of vision directed 


the program, problems were studied imagina- 
tively ; the latest methods were given a thorough 
trial, often on vast numbers of cases, and a large 
number of new ideas were conceived and tried, 
The AAF and OSS projects are in interesting 
contrast. The OSS implemented the most ad- 
vanced clinical methods and had some success 
in predicting performance on the basis of a per- 
sonality diagnosis. The AAF group tried the 
last word in psychometric and statistical method ; 
they had great success in predicting from abili- 
ties, but almost none in using personality data 
(p. 265). Perhaps that is a reflection of the im- 
potence of present psychometric approaches to 
personality. Flanagan has provided a satisfying 
one-volume survey of the work in aviation psy- 
chology. Fifty pages are devoted to a history of 
the program—pages which the reader looking 
for ideas will probably begrudge. But this sec- 
tion once past, the reader embarks on a veritable 
merry-go-round of ideas and useful data. The 
breathless quality of Flanagan’s book, which 
sometimes leaves a major idea after only the 
briefest presentation, is understandable, since 
his volume is but the “show window” telling 
what nineteen other volumes have to offer. * 
Flanagan’s is no perfunctory and formal sum- 
mary. Instead, he writes as a psychologist ap- 
praising the program. He does not hesitate to 
give his professional judgment on the signifi- 
cance and implications of the studies of his co- 
workers. The result is stimulating and challeng- 
ing, and has implications for everyone using psy- 
chological tests, training men, or doing research 
in applied psychology. A great advantage in the 
hands of the AAF investigators was the large 
numbers of cases available, and the possibility 
of repeating studies on successive classes. From 
the work, we learn that multiple regression 
weights vary widely from sample to sample ; that 
negative weights are suspect and usually van- 
ish; and that combination of separate tests which 
have low ‘validity coefficients’ can yield a highly 
valid estimate of probable success in learning a 
complex skill. Such findings must be considered 
in any future processing or prediction. Certainly 
the fact that as many as twenty tests were needed 
to select for only three standardized air-crew 
jobs is a devastating reflection on the assiduity 
of peacetime studies of aptitude. The schools 
can scarcely take pride in the thoroughness of 
programs for educational and vocational guid- 
ance based on only one or two general tests. The 
improvement of civilian predictions will, how- 
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ever, be harder than in the Air Force, where 
duties are highly uniform. Another of the find- 
ings crucial for education is that various highly 
responsible activities had quite different apti- 
tude requirements (pp. 244-5). Academic apti- 
tudes were by no means the key to the good 
pilot. Men who failed in one gunnery job proved 
excellent on another. Flanagan’s comment on 
the aptitude test intercorrelations is striking : 
“The principal point gained from studying these 
intercorrelations is not so much the hypothesis 
for explaining the small amounts of common 
variance as it is the obviously large amount of 
unique variance in each of the tests in the bat- 
tery” (p. 244). This is in direct contradiction to 
the impression given by Guilford in Volume III 
of the series. Guilford made a factor analysis of 
the AAF tests, and has stated repeatedly that 
suitable tests of the common factors isolated 
wotld be the best basis for attacking all predic- 
tion problems. On the contrary, the number of 
practically useful aptitudes is apparently almost 
limitless. Flanagan (p. 241) speaks of "fifty, or 
one hundred, or even more" important inde- 
pendent traits. Considering that twenty-seven 
common and dozens of unique factors were found 
just in the limited sphere of human performance 
involved in air-crew skills, even this seems an 
underestimate. Of the hundreds of topics dis- 
cussed, Flanagan stresses most the idea of job 
analysis. He considers determining the “criti- 
cal requirements" of tasks a significant way to 
improve school curricula, ratings of success, and 
selection (pp. 251-253, 272-276). The critical 
requirements of a job are those aptitudes, traits, 
and skills that make the difference between suc- 
cess and failure. In the AAF, follow-up studies 
of graduates identified causes of mission fail- 
ure, which in turn implied the ways in which 
training had been inadequate. From these stud- 
ies, together with conventional "judgment of ex- 
perts" procedures. Flanagan considers that the 
nature of air-crew jobs was well understood. He 
then takes off into a strong recommendation that 
all education should be reconsidered in terms of 
“a comprehensive, scientific analysis of the crit- 
ical requirements of working and living in our 
modern civilization." While Flanagan's pro- 
posals are not explicit and not discussed in de- 
tail, it appears that he urges a Charters-type 
analysis of adult activities (commercial, civic, 
cultural, vocational, and avocational). If a fac- 
tor can be found which separates the successful 
from the unsuccessful, this would become an ob- 


jective of education. This proposal seems to 
the reviewer to be unsound, although it may be 
developed in Flanagan's mind better than in hiis 
book. Aside from. the specific goals of vocational 
training, success is undefinable, and probably an 
inappropriate concept. Is the school to turn out 
successful trout fishermen? Is being cultured 
something one “does successfully ?" Are the crit- 
ical requirements of civic success the same for 
everyone—or is the intelligent voter as success- 
ful in his way as is, in a different way, the po- 
litical leader? Second, one can question the 
philosophical basis of the proposal. Flanagan, it 
seems, would equip citizens for life as it is lived, 
rather than as it should be lived. If there are 
values which our present culture does not re- 
ward, should the schools abandon those values? 
And, in looking at success, may not education 
overlook happiness as a goal? The third diffi- 
culty is practical, Will methods useful in in- 
vestigating a specific skill, such as piloting, be 
useful when applied to life as a whole? May not 
the “critical requirements” of business success 
be in part the opposite of the requirements of 
avocational success—and if so, what does the 
school do? Can we train for thrift and enter- 
prise, suspended judgment and decisiveness, 
perfectionism and adaptability—all at once? 
Certainly Flanagan’s proposal deserves consid- 
eration and a fuller exposition, but the present 
statement suggests that he has a dangerous plan 
in mind. It is rare that the interpretations 
the volume go beyond data as in this instance 
More often, the data are left without drawing 
their full implications. A few of the findings 
educators must ponder for themselves are these: 
Tachistoscopic training of perception had no ap- 
parent value (p. 178). Morale in combat appears 
not to be related to understanding “What we 
were fighting for," according to reports of re- 
turnees (p. 250). Films have their greatest ac 
vantage over other methods in teaching dynamic 
relations—changing events or co-variation (P. 
187). Time in training courses is often pim 
because of inadequate practice, delays por 
practice and use of skill, or inadequate ke 
tion of training content (pp. 166, 169). 50 E p 
a sampling of ideas does not reflect fully A 
much the Air Force discovered about me it 
how they learn. The volume, and the palet 
represents, can surely be recommended to 

ous students of educational psychology. dut 
only adverse comment one can suggest tempts 
packing so much material into one volume 
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the reader to overlook some ideas which merit 
more amplification. 

Occupations 27:286-7 Ja "49. Bertram К. 
Forer. The project described in this book and 
its 18 companion volumes is especially signifi- 
cant in that it represents an approach to social 
maturity of scientific method in the integration 
of men, machines, and activities on a mass scale. 
Abandonment by military leaders of traditional 
stereotypes regarding personnel practices and 
delegation of authority to civilian scientists has 
been a major contribution of the war challenge 
to both increased military efficiency and the 
status of applied psychological science. This was 
a comprehensive and well-coordinated program 
of research and practical application of research 
findings. * Since this book is largely a preview of 
things to come, and so admittedly treats many 
problems superficially, one feels some hesita- 
tion in criticizing points which may be clarified 
in other volumes. However, there is one bias 
which seems to have existed in the programs 
that demands attention. This is an "anti-per- 
sonality" conviction in the selection of aviation 
personnel. Officials of the program believe mo- 
tivational or interest factors to have been ac- 
counted for in the objective evidence of acquired 
knowledge. While this appears to be the case 
for some specific occupational interests, there 
аге other motivational factors which may relate 
to military success. Interest in the job at hand 
's not a measure of total personality, and be- 
havior on a job is partially dependent on other 
than job motivation. Those who have inter- 
Viewed, tested diagnostically, and treated psy- 
cho-therapeutically returned combat men are 
aware of the frequent dependence of job motiva- 
tion upon neurotic need. One feels that in this 
Program the men were lost in their test profiles. 

ome personality measures were used experi- 
mentally and failed to prove their value. Yet, in 
follow-up studies it was found that personality 
and emotional factors were major determinants 
of the failure of high-scoring men on the selec- 
tion battery and in the success of low-scoring 
men, In a study of bombardier instructors an 
Opinion questionnaire proved to be one of the 
est predictors of success, correlating higher 
with supervisor ratings than did any other meas- 
ше, While quantification and “objectivity” are 
Noble scientific aspirations, much superficiality 
can be couched under their guise and much that 
8 of Predictive value can be brushed aside. It 
s evident from their own data that the Air Force 
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group has avoided coming to grips with the 
problem of measurement of highly important 
emotional factors. This is not a comfortable 
book to read. It tends to be repetitious and not 
smoothly integrated, a frequent fault of this type 
of compendium. Some readers will regret the - 
lack of clarification of many unfamiliar aviation 
terms and procedures which are no doubt basic 
vocabulary for members of the project. The level 
of discussion is variable and the summary of 
theoretical contributions is brief and superficial. 
Here, again, one feels that little consideration 
has been given to the Air Corps member as a 
person. Most provocative are the glimpses into 
some of the companion volumes which deal with 
aspects of research in a wide variety of areas: 
biomechanics, the use of motion pictures in 
training and examining, clinical programs, sta- 
tistical and research methodology, and test meth- 
ods. 

Personnel Psychol 1:489-93 sw '48. Dewey 
В. Stuit. The Aviation Psychology Program of 
the Army Air Forces is doubtless one of the 
most intensive and extensive programs of re- 
search in the history of applied psychology. * 
While not as spectacular as the proximity fuse, 
radar or the atomic bomb, there can be little 
doubt that this program had a tremendous im- 
pact upon the prosecution of the war, and that 
it assisted in no small measure in bringing hos- 
tilities to a successful conclusion. * the first of 
a series of nineteen reports dealing with various 
aspects of the total Aviation Psychology Pro- 
gram * constitutes an overview of the „total 
program * Of special interest [Chap. 2]....is the 
section dealing with initial plans and decisions 
which set the pattern for what was to follow. * 
The statement of the nature of the classification 
problem in general on pages 58-61 is excellent. 
* One of the most interesting and significant re- 
search projects reported in the book is that of 
the group of applicants who were sent through 
pilot training without reference to their Quali- 
fying or Classification Test scores. This study 
gave those in charge of the research program a 
very unusual opportunity to test the validity of 
their procedures—an almost impossible task in 
an on-going organization where the need for us- 
ing the best available procedures continuously is 
very great. The results of the study were im- 
pressive. * Studies of individual reactions to 
combat are presented in Chapter 8. * The ex- 
perience with group counseling procedures 
should have important implications for civilian 
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counseling programs. * There seems little doubt 
that the material in this chapter provides mili- 
tary psychologists with a better understanding 
of the behavior of individuals in combat and of 
the adjustment problems growing out of break- 
downs under extreme stress. Chapters 9, IO, 
II, and 12, composing Part ПІ, should be recom- 
mended reading for every applied psychologist. 
While no discoveries of fundamental impor- 
tance to psychology were made, there are many 
interesting and valuable findings concerning the 
nature and measurement of psychological traits, 
the engineering of the learning process, the con- 
tent, construction, administration and scoring of 
tests, procedures for developing criterion meas- 
ures, mathematical methods of prediction, and 
techniques of experimentation. As this reviewer 
reached the last paragraph of the volume, he 
had the feeling that the studies presented and 
discussed represented a most important mile- 
stone in applied psychology. That they had con- 
tributed immensely to the effective prosecution 
of the war seemed beyond question. If there was 
any disappointment in the reading of the book, 
it was the failure on the part of the investiga- 
tors to find anything really revolutionary which 
would contribute markedly to the development 
of psychology as a science. Perhaps that would 
be too much to expect in a program which was 
conceived in a time of dire need and carried 
through with the sole objective of helping in the 
winning of the war. The academic laboratory 
where more time for pause and reflection is pos- 
sible is probably a better spawning ground for 
new ideas in science. Those responsible for the 
Aviation Psychology Program in the Army Air 
Forces deserve the praise and commendation of 
their colleagues as well as the Air Forces itself. 
They saw the problems clearly and applied in 
a very effective manner those techniques which 
the field of psychology provided. The amount of 
work which they did is prodigious and they are 
to be congratulated for having presented ac- 
counts of that work for the benefit of present 
and future generations of psychologists. 
Psychol B 46:499-501 N '49. Lyle H. Lanier. 
On July 15, 1941 John C. Flanagan was com- 
missioned as a major in the Army's Officers' 
Reserve Corps and was at once assigned to duty 
in the Medical Division, Office, Chief of Air 
Corps. This was the beginning of the extensive 
research enterprise which was to become the 
Aviation Psychology Program of the Army Air 
Forces. Four years later the program included 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 932 


some 200 officers, 750 enlisted men, and soo сі. 
vilians. At that time, Colonel Flanagan could 
look back with pardonable pride upon a dis. 
tinguished record of leadership in research plan- 
ning, organization, and application. The com. 
prehensive account of this work in the nineteen 
volumes of research reports under review in this 
issue of the Psychological Bulletin symbolizes 
the effective manner in which the program de- 
veloped under the direction of Colonel Flana- 
gan and his many able associates. In historical 
perspective, this collective research effort will 
stand out as an important landmark in the evo- 
lution of applied psychology, as well as a very 
substantial contribution towards the effective- 
ness of the Army Air Forces in World War II. 
* 'The official account of the expansion of the 
program is most impressive. * Like all other war- 
time research efforts, the gains were far greater 
in the technological than in the scientific dimen- 
sion. One looks in vain for really significant 
scientific progress in any of the areas discussed; 
immediate military necessity would not permit 
long-range, basic research. The classification re- 
search made possible extensive studies of the 
interrelations of traits, which seemed to support 
an atomistic theory of psychological organiza- 
tion. The factor analyses and the low correla- 
tions among apparently quite similar activities 
both indicated that “the number of basic uncor- 
related aptitude factors....(is) probably much 
larger than most experts had supposed.” In gun 
nery training, for example, men who did wel 
on one type of trainer might do very poorly is 
another which seemed quite similar. Because 0 
such results, generalization and transfer of d 
ing find little place in Flanagan's psychologic? 
scheme of things. (Except in the emphasis upon 
correlations between test scores and а hdi] 
fairly dissimilar criteria!) In the field of ua 4 
ing the most outstanding contribution was б 
determined effort to secure adequate Lip 
proficiency against which both classification 
educational procedures could be checked. 
nature and the determinants of йы e 
havior on the job became the central vices 
the program in its later stages. These e pe 
were for the most part models of skil AT 
persistent attack upon these most difficu p de- 
problems in applied psychology. A 

finitive solutions to these problems were gains 
a rule forthcoming, the methodologic in this 
were very substantial. Future workers 

field will know better how to approac 


The 


933 |} BOOKS & REVIEWS: FLANAGAN 


tasks, as well as what hypotheses and procedures 
to reject. 

Psychometrika 13:283-4 D '48. Andrew L. 
Comey. * a valuable guide and reference work 
for psychologists in every field, especially for 
those who do not have sufficient time or reason 
for reading the entire nineteen reports. It seems 
worthy of comment that in spite of the mass of 
heterogeneous data, directives, and findings with 
which the author has been forced to cope in writ- 
ing this report, he has succeeded in making his 


book interesting to read as well as authoritative. 


For a review of the series, see B2. 
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XFLANAGAN, Јонх C.; ADKINS, DOROTHY; AND CAD- 
WELL, DOROTHY H. B. Major Developments in Ex- 
amining Methods. Foreword by Kenneth O. Warner. 
Introduction by Albert H. Aronson. Personnel Admin- 
istration Reports, PAR No. 504. Chicago, Ill.: Civil 
Service Assembly, November 1950. Pp. iv, 24. Paper, 
mimeographed. $1.50. * (PA 25:7710) 
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FORD, MEREDITH P.; NAGAY, JOHN A.; PRESTON, Har- 
uty O.; SPAULDING, GERALDINE; WAGNER, ВКлрн F.; 
AND West, ELMER D. Critical Requirements for 
Research Personnel: А Study of Observed Be- 
haviors of Personnel in Research Laboratories. 
Conducted by the American Institute for Research un- 
der the sponsorship of the Scientific Personnel Ad- 
visory Committee of the American Council on Educa- 
Eus with funds provided by the Scientific Personnel 
prion of the Office of Naval Research. Pittsburgh, 
a.: American Institute for Research, March 1949. Pp. 
viii, 66. Paper, lithotyped. Gratis. * 
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„Ат Ј Orthopsychiatry 19:557-8 1 '49. Bes- 
sie Sperry. * It is in....[the] chapters on the 
projective methods themselves that one wishes 
the author had expanded his work and indicated 
in more detail the connections between his theo- 
retical approach and the instruments he is de- 
scribing. The title of the book may be a little 
misleading, and those who expect an exposition 
on the clinical use of any particular projective 
methods will be disappointed. But the book 
considers some of the fundamental theoretical 
problems of psychological techniques and em- 
phasizes an important trend in psychology—the 
willingness to take responsibility for the theo- 
retical foundations of its useful techniques. 

Am J Psychiatry 106:638 F ’50. Arthur L. 
Benton. In 1939, Frank, in an extremely well- 
written paper (J. Psychol., 8: 389-413), coined 
the term, “projective technique,” to describe 
such psychological evaluative methods as the 
Rorschach and Thematic Apperception Tests 
and offered a provocative and insightful discus- 
sion of the significance of these newer methods 
for disclosing personality trends. The paper 
ranks as one of the most important of recent 
years. Providing both a name for this series of 
psychological test methods and a convenient 
conceptual frame of reference for their utiliza- 
tion, it had the effect of accelerating significantly 
the growth of interest on the part of psychia- 
trists and clinical psychologists in these tech- 
niques. * The present monograph is essentially 
an extension and elaboration of the 1939 paper 
with, however, some changes in emphasis and 
tone. In respect to emphasis, the idea of the 
uniqueness of the individual's "private world" 
and basis personality structure is more strongly 
developed in the monograph. As a corollary, the 
idea of applying normative standards or frames 
of reference to projective technique data is de- 
cisively rejected as being applicable to the meas- 
urement of the characteristics of the “anony- 
mous units" of a group but not to the interpre- 
tation of the productions of an individual who 
by definition presents à unique personality con- 
figuration which cannot be meaningfully com- 
pared with other unique personality configura- 
tions. In support of this thesis, Frank makes 
numerous references to specific procedures and 
concepts of modern physics as contrasted with 
those of classical physics. This reviewer 15 not 
able to assess the validity of this type of analogi- 
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cal support for a psychological thesis and sus- 
pects that many another reader will have the 
same difficulty. One wishes that Frank would 
cite data closer to hand. Since his own “Partial 
Bibliography” lists some 325 papers on pro- 
jective methods and J. E. Bell's new book lists 
not less than 798 references to the Rorschach 
test alone, this should not be too difficult to do. 
As mentioned, there is also a shift in tone be- 
tween the earlier paper and the present work. 
The paper was expository in nature ; the mono- 
graph is frankly polemical. Frank vigorously 
crosses swords with those psychologists (here 
called “psychometrists” or “statistically inclined 
psychologists” ) who insist that the same criteria 
of credibility be applied to communications deal- 
ing with the utilization of projective techniques 
as are applied to any other scientific communi- 
cation. As is usually the case with polemics, 
there is a liberal use of value terms and more heat 
than light is generated. With respect to Frank’s 
strictures concerning the supposedly inappro- 
priate standards and points of view of “tradi- 
tional” psychologists, it may be said that what 
is asked essentially of reports dealing with pro- 
jective techniques or any other techniques is not 
a “statistical” treatment of the data or that the 
generalizations which are made meet a particu- 
lar technical criterion of credibility, but simply 
that: (1) the statements made be operationally 
meaningful, i.e., that the procedures employed, 
the observations made, and the conclusions 
drawn be stated specifically and concretely ; and 
(2), the assertions made be verifiable, i.e., that 
they be stated in such a form that an interested, 
competent person can undertake appropriate in- 
vestigation to check on their truth or falsity. 
With respect to the point of view that normative 
and group studies are alien to the spirit of pro- 
jective techniques and that nothing worthwhile 
can be expected to come from this type of pro- 
cedure, it may be said that empirical, normative 
group studies, rather than lengthy, unverifiable 
analyses of "unique" individuals, have formed 
the basis for the clinical application of these tech- 
niques. Rorschach's Psychodiagnostik was an 
extensive comparative study of 405 normal and 
pathological subjects (to the reviewer's knowl- 
edge, still the largest single group of subjects 
studied) which detailed the characteristic per- 
formances of "anonymous units" of a number of 
subgroups. One wonders how much attention 
would have been paid to this volume if, instead, 
it contained lengthy, unverifiable analyses of, 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 934 


say, 10 "unique" personalities, Frank's position 
is justified to the extent that it refers to an un- 
critical use of norms, i.e., the application of em. 
pirically established norms to an individual (or 
a population) significantly different from the 
population from which the norms were derived. 
The necessary correctives for this error would 
seem to be: (1) more sets of norms rather than 
fewer, and (2) training in the sophisticated use 
of normative standards. All in all, this reviewer 
feels that the monograph is a weaker exposition 
of Frank’s ideas than is his 1939 paper, 

J Am Med Assn 141:1106 D 10°49. * excel- 
lent little monograph * АП the material pre- 
sented is exceptionally well organized and clearly 
presented, and the book has an excellent bibli- 
ography. 

J Consult Psychol 12:361 5-0 '48. Laurance 
F. Shaffer. * a brilliant and valuable explora- 
tion of the significance and underlying theory of 
projective methods, which will be read with 
profit both by experienced clinical psychologists 
and by advanced students. 

J Nerv & Mental Dis 111:263 Mr'50. (230 
munt A.) Piotrowski. * a well thought-out ra- 
tionale of the projective experimental methods 
of personality investigation * This volume does 
not paraphrase the basic texts in the field (as 
the textbooks do it) but offers descriptions of 
the similarities and differences of the new per- 
sonality techniques which will clarify anybody's 
thinking about the projective methods. Of the 
five chapters, the last one on reliability and va- 
lidity, is the shortest one and the one most likely 
to provoke doubts. The author reasons as if the 
projective methods were incapable of preci 
quantitative validation. This belief is justified 
when we speak of the present time. However, the 
methods can and will be improved methodolog 
ically, Eventually we will know which compo 
nents of the methods can be considered homoge 
neous (as far as the purpose of the investit 
is concerned) and we will then be able to calc! 
late sensibly. It is true that some impatient m^ 
think that they are already in possession of ut 
knowledge and have started to quantify pre m 
turely. Agreement or disagreement with mud 
thor on validating procedures does not de 
from the author's main and ins 

Mental Н. 4:480-2 Л '5o. Mi Й 
Siegel. * The clinician will find Projective MS 
ods extremely valuable in crystallizing the o 
cepts that underlie his work. It provides d: E 
working analysis of various techniques 8 
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the Rorschach tests, Stern’s cloud pictures, and 
yarious constructive and cathartic methods as 
exemplified in Levy’s therapeutic dolls. * 

Nero Child 7:453 О '48. The author is anxious 
to reserve for himself the right to have invented 
and to be considered first to have used the verbal 
expression “projective method.” As proof he re- 
prints (page 86) a few sentences from a lecture 
of his of 1939. Unfortunately, as representatives 
not of superficial journalistic meddling with sci- 
ences, but of well grounded international scien- 
tific affairs, we cannot “copy-right” this claim 
for Dr. Frank. The concept of projection and 
projective method is one of the basic concepts of 
Carl Gustav Jung’s psychology. He applied it 
in his Symbole und Wandlungen der Libido of 
1912 and we find it fully developed in his vol- 
ume on Psychological Types, 1920. Since our 
author quotes Jung's writings of 1910 and 1919 
he must have stumbled over this term. For the 
remainder we would like to say about this book 
that it surveys in not too thorough a manner art 
testing, Rorschach methods and the Thematic 
Apperception test, whose creator Henry A. 
Murray had been one of the earliest scientific 
students of С. С. Jung in this country. 

Philos Sci 16:87 Ja’49. R. L. Ackoff. * De- 
spite its brevity, this work is a valuable and in- 
tensive study of a critical methodological prob- 
lem in psychology. Had the author more clearly 
delineated between “method” and “technique” 
the problems might have come into better focus. 
There is no mention made of testing procedures 
which measure in terms of scale units rather 
than statistical intervals based on social norms. 
If psychological tests made explicit what they 
measure, i.e., introduce real units, then they 
would grasp individuality, and clinical judgment 
would involve less guessing. The control of clin- 
ical judgment is essential, and Professor Frank’s 
discussion, though only suggestive, represents 
a first step toward replacing experience by ex- 
periment, His book should serve to stimulate 
others to make further steps. 

Psychoanalytic Q 18:247-9 Ap '49- Roy 
Schafer. This book is written on a consistently 
abstract level. It does not present specific prin- 
ciples of interpretation for specific projective 
techniques ; its purpose is to formulate the theo- 
retical position regarding personality that is re- 
quired by contemporary scientific theory and is 
necessary for the effective use of projective tech- 
niques, The emphasis throughout is on process, 
Organization, and the individual; the attack is 
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and isolated traits by ма, и "tein 
. А y statistical, 
group- or norm-oriented research, * The gen- 
eral rationale advanced in this volume seal 
the projective tester's interest on strivings and 
feelings, and particularly on defensive and ad- 
justive mechanisms ; this focus is a prerequisite 
not only of meaningful test reports but also 
of meaningful research with psychological tests. 
Psychologists have too often mechanically ap- 
plied essentially inadequate statistical techniques 
in the study of test results of various clinical 
groups; they have relied heavily on averages of 
single scores, thereby ignoring the configura- 
tional aspect of personality and its expression in 
test results; as a result they have often ended 
up with contradictory positive findings, untrust- 
worthy negative findings, and underemphasis on 
individual variability. Frank performs a service 
in implicitly calling attention to the need for 
clinical research in which the material to be 
summarized and compared consists of individual 
personality analyses, in preference to the atom- 
istic averaging and comparison of single scores, 
as in much current Rorschach Test and Wech- 
sler-Bellevue Scale research. 

Psychol B 47:89-91 Ja’50, Arthur Weider. * 
Chapters one to three, approximately 41 pages, 
are a plea for newer types of psychological test- 
ing in light of changing concepts of person- 
ality theory. To bolster his position, the author 
borrows heavily, perhaps too frequently, from 
events in physics in particular and science in 
general. In subsequent chapters the author 
makes many statements without supplying the 
reference to the literature, which detracts from 
the book’s usefulness to the advanced student. 
This limitation occurs rather frequently in the 
discussion of reliability and validity of projective 
methods. Frank raises a tremendously important 
point which research psychologists as well as 
clinicians must some day attempt to rectify. 

While psychologists customarily rely upon the judg- 


ment of lay, naive individuals to validate their findings, 
it is difficult to find any scientific justification for suc 
a procedure. If a scientific method or test is used, one 
cannot assume that the agreement or disagreement with 
its findings by а group of untrained, naive judges (as 
in matching) has any significance. Yet, this reliance 
upon such judges is frequently cited as validating or in- 
validating a method (p. 66), 


The appendix contains a partial bibliography of 
projective methods taken in part from Helen 
Sargent (pp. 68-86). This is a useful addition 
but has already appeared in the literature. This 
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little volume is not prepared for the undergradu- 
ate major in psychology, nor will it be useful 
to the graduate student without preliminary 
training and experience in clinical psychodiag- 
nostic methods. It has merit for theoretical ex- 
position and for indicating the need for more 
research on projective tests before they earn 
their place in a scientific psychology. * 

О R Psychiatry & Neurol 3:495-6 О ' 48. Isa- 
belle V. Kendig. * a brilliant presentation of the 
rationale for projective technics. In a very few 
pages he outlines the psychocultural conception 
of the emergence of personality and its dynamic 
operation on which their use is predicated; re- 
lates them to current procedures in the physical 
sciencés and in medicine; contrasts the new 
methodology with the old which, guided by the 
evident regularities of human behavior, evalu- 
ates the individual in terms of group norms; 
describes five different types of projective tests ; 
and discusses the development of new criteria 
to establish their reliability and validity. * Ap- 
pended to the monograph is an excellent bibli- 
ography. А 

[B154] 
*Fraser, Jonn Munro. A Handbook of Employ- 
ment Interviewing, Second Edition. London: Mac- 
donald & Evans, Ltd., 1951. Pp. vii, 214. 9s. 6d. * 

Brit J Psychol, Gen Sect 41:92-3 S '50. F. E. 
Macneill. * a most readable and comprehensive 
volume which has long been needed * the author 
builds a clear picture of how information can be 
collected about an individual * one cannot help 
being struck by the way in which the author uses 
extreme (although from his point of view prob- 
ably justifiable) examples to bring his points 
home. This is not so much a criticism of the 
author, as the result of an uneasy feeling one 
has, that it illustrates the difficulty in presenting 
selection methods to the layman. It is easier, for 
example, with students of psychology, since 
one can lead up from the fairly exact general 
framework, to a study of the real complexity to 
which any attempt at understanding the indi- 
vidual personality brings one. Although the au- 
thor devotes some space to "The Unique In- 
dividual" it might have been a good idea to 
risk expanding his discussion somewhat further. 
He has obviously tried to be as straightforward 
and clear as possible, but the danger lies in the 
possible belief of the layman that this scheme of 
assessment really is as straightforward as it 
looks, and that he will cheerfully learn about 
the methods, become identified with the very 
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straightforwardness, and resist coming to grips 
with the complexity. Occasional brief but mis. 
leading statements are made by the author 
which overdo the simplicity, such as his com. 
ment on Freud's theories, and his inaccurate 
statement that “follow-up figures indicate that 
methods of assessment outlined in previous chap- 
ters put the right person in the right job in about 
nine cases out of ten.” As the book is designed 
to be essentially practical the author might have 
indicated how exactly information about edu- 
cation standards might be obtained, and he might 
also have included a short bibliography of ele- 
mentary references. But Mr. Munro Fraser has 
dealt extremely well with a difficult task, and his 
book will certainly be very valuable also to uni- 
versity students of occupational psychology, 
however unacademic he intended it to be. 

J Appl Psychol 35:214-5 Je ’51. Robert N. 
McMurry. In spite of an apparent unfamiliarity 
with the work on the patterned interview and 
allied procedures as selection techniques which 
has been done in this country (at least no ref- 
erence is made to it), Mr. Fraser has developed 
an approach which parallels it very closely. He 
recognizes the need for having comprehensive 
job descriptions and specifications as a basis for 
selection. Likewise, he is aware of the contribu- 
tion which a detailed application form can make 
and recommends a suitable one. He is aware of 
the parts played by intelligence and aptitude tests 
but does not elaborate their contributions. He is 
less aware of the importance of preliminary 
screening standards and makes no allowance for 
reference checks of any kind either written or by 
telephone. Neither does he suggest the recording 
of the interview findings in any formal and un 
ganized manner. (Apparently his interview 15 
based principally on a somewhat informal revien 
of the data on the application form.) The ш 
viewer guides the general course of the S old 
sation, but gives the applicant a great dea dis 
latitude in determining the content of the Б 
cussion. * He does not place as great en 
on “health” as distinct from “physique n " 
common here. The greatest difference (an i it 
tation) between his procedure and those Pi 
commonly used in this country is that eee 
sents no systematic and organized set o he ар 
pretive concepts to be used in analyzing y f 
plicant and' determining his suitability ing 
particular opening. In common with ps "e 
here, he postulates that an individual ud cus ons 
constitutes the best guide to his quali 
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and that modes of action, once established tend 
to repeat themselves and can, therefore, be used 
asa basis for predicting what he may be expected 
to do. Likewise, he recognizes the influence of 
the family environment in childhood as a factor. 
He also makes use of the concept of "interests" 
as a measure of the applicant's "motivations." 
However, he has no concepts comparable to those 
of "character" or specific "habit patterns" or of 
"emotional immaturity." Furthermore, he makes 
no systematic effort to evaluate the influence of 
the applicant's domestic situation. He makes a 
rather naive reference to Freud "as concerned 
wholly with sex" which indicates his unfamil- 
iarity with the latter's work. Beyond this, he 
appears to be largely unfamiliar with the psychi- 
atric concepts which apply in this field. To sum- 
marize, the book is simply and interestingly writ- 
ten, The principles it enunciates are sound. For 
a beginner in the field it will have much value 
because its approach is strictly “common sense.” 
Its greatest limitation is that it offers little that 
is new, i.e., it contributes no strikingly great ad- 
vances in concept or technology. Probably its 
greatest contribution is the picture which it pro- 
vides of the current state of personnel thinking 
in Great Britain where it is apparent that there 
is still considerable resistance to the assembly 
of facts about an applicant prior to his employ- 
ment. This conviction that such inquiries are an 
invasion of the individual's personal privacy is 
neatly illustrated by the author's quotation from 
à recent Spectator article: "The Post-Master 
General is curious. Before he will let you help 
deliver the Christmas mail, he requires full de- 
tails of your past, your health and your mother's 
birthplace," 

Occupational Psychol 24:265 О '50. A. Rod- 
ger. This well-produced book is exactly what its 
title indicates: it is a practical guide for people 
Who, as part of their regular task, have to inter- 
View candidates for employment. It is thought- 
fully planned and simply written. It tells the 
reader what it is going to tell him; then it tells 
him; and after that it tells him what it has told 
him. For many personnel officers and others who 
have attended the National Institute's courses in 
Selection methods, it will serve as an excellent 
reminder, To the many more who have not, it 
k provide a taste of what they have missed 

© perhaps stimulate some of them to appro- 
Priate action, The weaknesses of the book, such 
» they are, are mostly on the theoretical side. 
n his firm resolve to make his advice usable, 


Mr. Munro Fraser has taken a substantial num- 
ber of intellectual risks, His interpretation of the 
Seven-Point Plan, on which the book is based, 
seems from this standpoint rather far removed 
from the original, in its apparent indifference 
to some fairly important psychological and sta- 
tistical considerations. But defects in the author’s 
basic concepts are not likely to worry anyone 
except psychologists and statisticians, and Mr. 
Munro Fraser implies in his preface that the 
book is not really meant for them, It is perhaps 
a pity that there is no bibliography. Other guides 
.. Should, in the reviewer's opinion, have re- 
ceived at least a mention * There are many vol- 
umes of which it has been said, “This is a ‘must’ 
book for the readers for whom it is intended.” 
Mr. Munro Fraser’s is probably a good deal 
better than most books so described. 

Personnel Psychol 3:467-71 w ’50. John P. 
Foley, Jr. * written in a lucid and interesting 
style. Its effectiveness а$ a training manual or 
guide is further enhanced by the summary out- 
line which follows each chapter. The emphasis 
upon operational definitions of specific job ac- 
tivities rather than upon vague and general job 
specifications is commendable. The author also 
cautions against the dangers of over-simplifica- 
tion and of generalizing from a single bit of evi- 
dence. Stress is placed upon the need for secur- 
ing supporting data from a variety of sources 
and for evaluating the over-all pattern. Intelli- 
gence and aptitude tests are pictured in proper 
perspective, with due recognition of their prac- 
tical possibilities as well as of their limitations. 
Much of the discussion of interviewing tech- 
niques is likewise quite sound. There can be 
little doubt that most employment interviewers 
can profit from a careful reading of this volume. 
Despite its virtues, however, the book shows a 
number of shortcomings. At the outset, the title 
is a bit misleading. The term “handbook” will 
cause the reader to expect a somewhat more in- 
tensive treatment of employment interviewing 
techniques than is actually provided. There is, 
moreover, a complete absence of references to 
other publications—either to primary or sec- 
ondary sources. In certain places the terminology 
is a bit confusing, as in the loose use of the term 
“circumstances” in the description of the indi- 
vidual’s psychological characteristics. The au- 
thor also betrays an inherent mentalism in his 
concepts of “aptitude” and “intelligence, which 
he regards as “innate predispositions. It is like- 
wise unfortunate that he delays intensive treat- 
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ment of “job specifications” until Chapter VII. 
Other criticism can be leveled against the claim 
that it is possible to conduct an adequate inter- 
view within a ten-minute period. In his treat- 
ment of the longer and more intensive interview, 
moreover, the author suggests no organized plan 
for securing the appropriate information. This 
is one of the book’s most serious limitations. Al- 
though much of the material presented is quite 
appropriate, many of the fine points of interview- 
ing are completely missing. The author not only 
fails to discuss some of the most recent tech- 
niques, but he overlooks numerous implications 
in the two hypothetical interviews which are pre- 
sented for demonstration purposes. The inter- 
viewers’ comments and questions in these inter- 
views are also open to serious criticism, since 
they illustrate some of the very things the author 
is cautioning against. It is interesting to note 
that the two sample cases lead to inferences 
which are exclusively positive: no liabilities are 
brought to light by the author. This fact alone 
would lead one to suspect that the interviewer 
had missed important negative clues—and a 
careful examination of the interview transcript 
will verify this supposition. Although the author 
is on solid ground when he states that the inter- 
viewer must strive to evaluate the applicant’s 
make-up on the basis of “the kinds of róles the 
individual has played in the past,” the author 
fails to provide the reader with many techniques 
for uncovering these róles. One almost gets the 
impression that the employment interview con- 
sists simply of engaging in a bit of initial “small 
talk,” which will suddenly cause the applicant 
to "unburden his soul.” Although such coopera- 
tion can be expected to a large degree in the case 
of the vocational guidance interview, it certainly 
cannot be anticipated in the interview for selec- 
tion and appraisal purposes. Certainly, we must 
assume that the applicant who desires a job will 
be motivated to show himself at his best. The 
skillful interviewer must thus be provided with 
techniques which will enable him to get rid of 
some of the wool which the job-oriented appli- 
cant continually tries to pull over his eyes. 


[ B155] 

XFREEMAN, FRANK S. Theory and Practice of Psy- 
chological Testing. New York: Henry Holt & Co., 
Inc. 1950. Pp. xxiii, 518. $3.50. * (London: Sir Isaac 
Pitman & Sons, Ltd., 1951. 25s.) (PA 24: 4344) 

Am J Psychol 63:645 О ’50. M. Bentley. А 
clear and straightforward account of many scales 
„апа tests * For detailed reviews of topics, pro- 
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cedures and points of view, the reader will look 
among the journals devoted to tests and Statis- 
tics. Freeman's volume possesses many virtues 
of the acceptable teaching textbook. 

Ed Outlook 25:222-3 My'sr. Miles Murphy. 
* a valuable contribution to the growing, but still 
by no means large, number of textbooks available 
in the field of psychological testing * The author 
succeeds in giving an adequate amount of in- 
formation on tests of intelligence, of special apti- 
tudes, of achievement, and also on tests and tech- 
niques for the assessment of personality both of 
the inventory and the projective types. In deal- 
ing with the latter, he presents a balanced point 
of view, avoiding the enthusiasm of the “cultist” 
as well as the excessive skepticism of the reac- 
tionary. Approximately one-half of the book is 
devoted to intelligence—an analysis of the abil- 
ity, a survey of methods of measuring intelli- 
gence, and a critical discussion of the clinical 
use of these methods. This is not unreasonable 
considering the historical importance of intel- 
ligence testing and the fact that many aspects of 
the theory of testing can best be treated with 
reference to that type of test which has been the 
subject of the greatest amount of careful study. * 
contains a useful chapter on statistics * 

J Consult Psychol 14:330 Ag '50. Laurance 
F. Shaffer. * 'The coverage is encyclopedic, in- 
cluding descriptions of many little-used tech- 
niques as well as of the common ones. The clar- 
ity and accuracy of the book fall somewhat be- 
low its scope. Some descriptions of tests are i 
greatly abbreviated as to be incomprehensible 
to a student who does not have access to the ma- 
terials. Even worse, there are a number of errors 
of fact. Wechsler’s method of deriving 10s for 
the Bellevue scale, for example, is described its 
correctly. As a whole, the book does not ү 
pare favorably with other texts m the hr 
course in psychological measurement. 4 

Occupations 29:232-3 D '50. S. eth di 
* the presentation is not comprehensive, bon 
cially in the areas of the measurement of p 
interest, attitude, and school achievement. ui 
half of the book, including eight of the pa 
chapters, is devoted to description and discu ea 
of intelligence testing. The single ШЫ. Par 
to aptitude tests includes discussion pori, 
urement of reading readiness and of inf sam 
In the latter category only two inven n 
mentioned, namely, those of Strong ч d to the 
The same niggardly treatment is afforded | 
discussion of “tests of attitudes and values 
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to "opinion polling," each of which occupies 
only three pages in the book. It is perhaps un- 
realistic to expect a single volume to treat com- 
prehensively all areas in which psychological 
testing is done. The material in this field is so 
varied and extensive that two or even three 
yolumes might be necessary to do the job ade- 
quately. In the reviewer’s opinion, therefore, 
this volume is not comprehensive as it is claimed 
to be, and if it is used as a text for a year-long 
course in psychological testing, it must be sup- 
plemented by other texts or materials, The sec- 
ond.chapter of this text is devoted to statistics, 
in the writing of which T. A. Ryan has collab- 
orated. This is one of the clearest short treat- 
ments of statistics seen anywhere by the re- 
viewer. Despite this fact, however, one must 
question the usefulness of a single chapter of 
statistics—no matter how well written—in any 
textbook of psychological testing. For those stu- 
dents in the course who have not previously had 
statistics, this chapter is bound to be too difficult 
unless a great deal of class time is given to it. 
For students who have had statistics this chapter 
will be repetitious. It is high time that students 
entering a course in psychological testing, which 
it is to be hoped will be offered on graduate 
level only and with other sustaining courses, be 
required to have completed or to carry concur- 
tently a full course in statistics. Let us now 
enumerate the many positive accomplishments 
of this book. With a few minor omissions it is 
an up-to-date coverage of the field in psycho- 
logical testing. The experiences with and con- 
tributions to psychological testing during World 
War II are summarized and presented in а 
sensible manner. The author's scientific orien- 
tation and his objectivity in his treatment of 
the subjects throughout the book are admirable. 
His constant inquiry for demonstrated reliabil- 
ty and validity of the different tests he presents 
is worthy of emulation. He is cautious in his 
interpretations, but no fence sitter. When the 
evidence is in, he does not shrink from drawing 
the logically necessary conclusions. The theoreti- 
a and clinical framework in which the material 
denn and evaluated demonstrates а posi- 
fuer tee in a difficult field where a sys- 
lace 1с апі rigorous treatment has often been 

«Ing, Finally, it must be said to the credit of 
€ author that his writing is lucid and concise. 
P Sychol В 48:279-80 My '51. Frederick B. 
di * well written * should be a very useful 

Xtbook for courses in educational and psycho- 
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logical testing. The material included seemed to 
the reviewer to be satisfactorily accurate, al- 
though the tone or emphasis sometimes struck 
him as inappropriate. However, this is to be 
expected as a result of differences in points of 
view and is not a criticism of the book. The au- 
thor is considerably more lenient with respect 
to the tests he discusses than the reviewer would 
be. It is entirely possible that in an elementary 
text materials that deserve severe criticism 
should simply be ignored; otherwise, students 
may become confused, The reviewer would like 
to have seen somewhat greater emphasis given 
to the flaws and limitations of tests such as 
the Bellevue Scale and the Chicago Tests of 
Primary Mental Abilities, As it is, the misin- 
formation and speculation in Wechsler’s Meas- 
urement of Adult Intelligence is never called to 
the student’s attention. And the spuriously high 
reliability coefficients reported in the manual for 
the Chicago Tests of Primary Mental Abilities 
are accepted at face value, Unfortunately, a com- 
peting set of tests, the Differential Aptitude 


` Tests (published by the Psychological Corpora- 


tion), for which parallel-forms reliability co- 
efficients are provided, is not mentioned in the 
book. It is gratifying to note that the author is 
cautious in accepting the claims made for tests 
based on the results of factor analyses. Although 
the reviewer is firmly convinced of the value of 
factorial techniques, he is worried by the uses to 
which these are sometimes put. A couple of what 
seem to the reviewer to be non sequiturs were 
noticed. Both are of minor importance. (1) In 
the footnote on page 314, We find the statement : 
“But so far as AQ's above 100 are concerned, 
they are illogical and psychologically unsound, 
for actually no one can achieve above his ca- 
pacity." It is no doubt desirable to point out the 
dangers of the AQ, but it seems a bit strong to 
indicate that all AQ's above 100 are illogical and 
psychologically unsound. Though it may be im- 


ssible, by definition, for anyone to achieve 


above his ultimate capacity in а given field, this 


t preclude our obtaining meaningful AQ's 
Pla pi For example, a child may show as 
much capacity on à certain mental test as the 
average child does at age 10-0 but score as high 


on a test of general science as the average chil 


does at age 12-0 because of intense interest in 


iti follow 
he field and favorable opportunities to 1070 
his interest. An AQ above 100 for this ios 
would be quite meaningful. (2) On page 173, the 
author mentions a correlation coefficient between 
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scores derived from the Pintner-Paterson Scale 
and the Stanford-Binet Scale in a group of gifted 
children, and concludes : *Furthermore, the very 
low coefficient of .23 suggests, also, that the per- 
formance scale is particularly inadequate for dif- 
ferentiating between performance levels of gifted 
children.” To the reviewer the coefficient sug- 
gests that the distribution of scores for each test 
was probably greatly restricted. In any event, it 
would be possible to have two tests yielding un- 
correlated scores that would each make excel- 
lent discrimination among gifted children. 


[B156] 
*FnENcH, Joun W. The Description of Aptitude 
and Achievement Tests in Terms of Rotated Fac- 
tors. Psychometric Monograph No. s. Chicago, Ill.: 
University of Chicago Press, 1951. Pp. x, 278. Paper. 
$4.00. * 

[B157] 
FRIEDMAN, Bertua B. Foundations of the Measure- 
ment of Values: The Methodology of Location 
and Quantification. Columbia University, Teachers 
College, Contributions to Education, No. 914. New 
York: Bureau of Publications, the College, 1946. Pp. 
ix, 227. Out of print. * (PA 21:1550) 

Am J Sociol 53:233 N °47. Louis Guttman. * 
The title of the volume is not lived up to. The 
review of the literature, which constitutes the 
bulk of the work, is meandering and not well di- 
gested, and the original experiments, after such 
an extensive preliminary discussion, are frag- 
mentary. 

Am Sociol R 12:23ў-8 Ap '47. Ernest J. 
Chave. * a significant study of some of the basic 
principles which undergird developments in the 
field of measurement of values. There is not 
much new material from experimental research 
but there is a penetrating analysis of the theory 
and concepts involved, and an evaluation of dif- 
ferent types of value-measurement. * The book 
may be disappointing to those who are searching 
for dependable measuring techniques but it is 
valuable to those who are ready to face the limi- 
tations of present analyses and indices. * In 
general the reviewer finds the study more of the 
type of “Excavations for Foundations” rather 
than “Foundations,” but nevertheless distinctly 
valuable in an important field for both psychol- 
ogy and education. 

J Ed Psychol 38:313 My ’47. J. B. Stroud. 
This book recounts some original research in 
value measurement in a portion of Chapter 15. 
Obviously the presentation of this experimental 
work was not the chief object of the book. This 
purpose could have been served very well by an 
article in one of the psychological journals, 
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Chiefly, the book isa critique of value behavior 
and its locations, naming (classification of spe- 
cific values under the proper heads), and quan- 
tification. 

Nerv Child 6:476-7 O '47. Sometimes "in. 
between" books carry the torch of progress of 
two or more sciences better than those on a 
strictly specific subject. This attempts to do 
such a service for psychology and philosophy but 
unfortunately not on the positive side. It is neces- 
sary to study and understand the profound and 
extensive literature on the philosophy of values 
before beginning to evaluate the problems of 
measurements pertaining to psychology and edu- 
cation. Of course certain academic groups be- 
lieve that a traditional authority justifies the 
publication of inferior students' publications and 
certain critics will automatically approve any- 
thing from Teachers College of Columbia. We 
regret that our sense of responsibility obliges us 
to describe this book as very poor. 

Psychometrika 12:237 S °47. Andrew L. 
Comrey. * The greatest value of this book per- 
haps lies in its description of the various tech- 
niques which are available for measuring values. 
It is also valuable as a review of the literature in 
this field and for its account of the difficulties 
involved in this type of measurement. It is prob- 
ably least valuable as a discussion of the theory 
of measurement applied to this particular prob- 
lem. The definition of value-behavior is phrased 
and expounded in such a way as to obviate the 
possibility of obtaining very accurate measure 
ment in this field. She states that estimation by 
the measurer will always be necessary 1 value 
measurement due to the dynamic quality of p 
personality. The book will be of most vM 
those persons who are actively engaged in. Ш 
task of measuring values, attitudes, and age 
aspects of the personality. For those who т 
interested in the problem of developing ШИН 
logical measurement on a rational basis, ! 
be of less interest. 


For additional reviews, see 3:587. 
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KFRIZZLE, ARNOLD LUTHER. A Study of Some б 
Influences of Regents Requirements / Teachers 
inations in French. gounis уи ў Net 
Coll Contributions to Education, 7 . P» 
York: Bureau of pde the College, 1950 
xi, 154. $2.75. * (PA 25: 6410 

54. $2.75 p. Thart 


Ed Res B 30:163-4 S 12°51. James ath 
* Frizzle finds that practically all d n of 
which offer French (and that is 7° pe 
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the secondary schools in New York) use the 
Regents Examinations. Although the Depart- 
ment of Modern Languages has advised against 
using them as criteria for graduation, the ma- 
jority of the schools still use them this way be- 
cause of a general demand for state credentials. 
Frizzle concludes : “Regents examinations have 
shown themselves to be fairly reliable predictors 
of college language success and are widely ac- 
cepted by eastern colleges as satisfactory require- 
ments for entrance to the freshman year” (page 
49). The book makes interesting reading in parts 
tothe student of modern-language methodology. 
The author has naturally been influenced by the 
points of view of New York educators, but he 
has reported fairly the divergent opinions and 
avoids bias in his evaluations. His facts are 
amply documented and the book contains little 
error. It is hoped that this study will stimulate 
other inquiries and research into examination 
systems and tests in general. The results should 
improve instruction and operate favorably on 
the place of the subject in the school curriculum. 


XFROEHLICH, TN ШУ Benson, ARTHUR L 
scd саше Foreword by Raymond W. Greg- 
1948. Pp. m MET Research Associates, nc., 
, 104. Paper. $1.25. * (PA 22: 3452) 

A ANM J 31:144 My 48. W. D. Perry. * 
а ook is well planned. The introductory chap- 
F concerning the place of testing in the guid- 
ce program, and the planning of a testing pro- 
gram are clear and concise. * There are helpful 
шее on how to compute local norms, and 
iua as to the books needed for a basic 
"e 2 testing. * a book which is timely and 
ha ү of help to counselors who have not 
litis uch work in the field of tests and measure- 
ache Psychol 33:96-7 F '49. Milton E. 
UE The book is definitely posited on the 
ане philosophy that naive personnel 
Dd Sere will use tests and that it is sensible 
jor E them to avoid errors in practice. A ma- 
ria ength is the frank facing of the fact that 
А ds use of tests requires statistical knowledge 
ни include descriptions of simple statistical 
ERG 5 and interpretations. * Although agree- 
КОШЕ the best instruments 18 probably im- 
s be some evaluation of excellence might 
asked een helpful. One of the most frequently 
Sa ee legitimate questions of the new 
pur UR What are the best tests for my 
poses?" Perhaps the answer is, “Consult the 
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nearest competent person.” * a hel ful and use- 
ful book. This is particularly ERY the audi- 
ence for which it is intended follows through 
with graduate training which will make the book 
no longer needed. 

J Consult Psychol 12:282 Jl-Ag '48. Lau- 
rance F. Shaffer. * a clear and explicit small 
handbook for guidance workers who have only 
limited training in measurement * While it is 
no substitute for adequate training, the volume 
is decidedly superior to many elementary pam- 
phlets in its area, and can be of real help to the 
school counselor. 

Occupations 26:543-6 My 248. Stewart Mur- 
ray. * For purposes of this review, educational 
and vocational counselors may be classed in three 
groups in so far as their attitudes toward tests 
and measurements in guidance are concerned. 
Tn the first and unfortunately the smallest group 
are counselors who have made a reasonably 
thorough and continuing study of tests and who 
use them frequently and appropriately. In the 
second group are counselors who have not un- 
dertaken a similar careful study of tests but 
who use them anyway. In the third group are 
those who admit their lack of knowledge con- 
cerning tests and are more or less reluctant to 
use them. With regard to the first group, the 
text, Guidance Testing, will be of little help, as 


“its members are already in possession of the 


information which the text has to offer. For the 
more self-confident second group the book will 
be somewhat more valuable. It would not, how- 
ever, improve their techniques a great deal be- 
cause of its brevity and might even perpetuate 
in an undesirable manner their present over- 
simplified conception of the use of tests in guid- 
ance. The third group should find this inter- 


estingly written book much to their liking. It 


should serve them well as an introduction to 
ber that 


the testing field so long as they remem! 
it is only an introduction. The possibilities of the 
use of tests and measurements seem to be wel 
covered. It is a book on how tests may be us 

rather than a book on how to use them. When 
one considers that in the short space of less than 
ninety pages, eighteen of which are devoted to a 
catalogue of current tests, the authors have 
written chapters on (1) The Place of Testing 
in the Guidance Program, (2) Planning a Test- 
ing Program, 3) Deciding What to Measure 
with Tests, (4) ‘Administering, Scoring, ап 

Recording Results of Tests, (5) Using Test Re- 
sults, and (6) Improving our Counseling Skill, 
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the suggestion that the text is merely an intro- 
duction to the field seems almost too obvious for 
mention. The authors admit that their treatment 
is brief and they suggest certain other supple- 
mentary references such as Buros’ Mental Meas- 
urements Yearbook, and Bingham’s Aptitudes 
and Aptitude Testing, which texts are equally 
sketchy in their accounts of any given test, and 
their supplementary value is therefore limited. 
What is needed at the present time is not another 
book as brief as its predecessors in the field, but 
one which really gets down to the business of 
analyzing the various tests in current use by 
drawing upon the numerous field validation 
studies which have been carried out with such 
tests during the last decade. There is also a 
danger that the simple contiguity of the two 
words Guidance and Testing in the title may 
serve in a subtle and unintended manner to re- 
inforce the all too prevalent conception among 
students and the public generally that guidance 
and testing are synonymous. A title such as The 
Use of Tests as an Aid in Guidance, while 
longer and more clumsy, would, in the opinion 
of this reviewer, be more accurate and appro- 
priate. Considering this little book within the 
limits of its own scope, however, we find it care- 
fully written and its use of concrete examples 
makes the reading of it clear and interesting. Its 
content seems to have been planned for schools 
and it assembles and organizes for a teacher or 
school counselor a number of important points 
which should be kept in mind in planning and 
administering a testing program. 

Personnel Psychol 1:497-9 10 '48. Harold B. 
Pepinsky. * When one keeps in mind the non- 
technically trained reader to whom....[the bro- 
chure is] addressed, one is impressed by.... [its] 
excellence in comparison with other books about 
tests. * contains a "Basic Library on Testing" 
(pp. 88-90) which should be very useful to the 
beginning counselor, and a section on “How to 
Compute Local Norms” (pp. 91-101) which 
should also be very helpful, although somewhat 
technical for the untrained counselor. * Chap- 
ter 6, on "Improving Our Counseling Skill" 
(pp. 80-87), is a valuable antidote for those 
guidance workers who have accepted uncritically 
the writings of certain "non-directivists," mini- 
mizing the usefulness of test results in counsel- 
ing. The discussion of kinds of tests (pp. 20-46) 
is less useful to guidance workers than it might 
have been had the authors been free to make crit- 
ical summary statements about each test. The 
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authors" governmental connections * obviously 
made this impossible, yet one doubts whether the 
unsophisticated reader can make adequate com. 
parative judgments from the available test de. 
scriptions. Commendable caution has been ex. 
ercised in discussing reliability and validity (pp. 
12-16), though there is some tendency to pre- 
scribe “rule of thumb" interpretation of correla- 
tion coefficients which may not be valid in all 
applications. Again, the authors might have 
pointed up the desirability of “spot-checking” 
machine-scored test results (p. 49). Another 
minor omission is a caution against interpreting 
norms when the test format has been changed, * 
In conclusion, Guidance Testing....will make... 
[a valuable addition] to the libraries of lay 
workers in guidance and personnel work, The 
U.S. Office of Education... [is] to be commended 
for....[its] sponsorship of....[this] important 
[book] * 

Sch R 56:428-9 S ’48. Thomas E. Christen- 
sen. * This brochure should be of some aid to 
guidance workers in improving the administra 
tion of tests. However, it is questionable whether 
it will be of much assistance in dispelling the fog 
of misinterpretation which hovers over 50 much 
of “guidance testing.” * “Place of Testing m the 
Guidance Program,” “Planning a Testing Pro- 
gram,” “Deciding What To Measure with 
Tests,” and “Administering, Scoring, and Re 
cording Results of Tests"..are the chapters 
which are most likely to be of value to the neo- 
phyte in testing. They furnish him with criteria 
for the selection of tests, caution him about the 
limitations of these instruments, and provide 
him with tips on the administration of ҮК 
On the other hand, chapter v, which is desig" А 
to aid the reader in interpreting test iso 
not likely to accomplish its purpose; for it n 
to provide a factual basis for test interpre m 
or to furnish references to published researc det 
vocational testing. It tends to give the que 
the illusion of understanding without the ya "i 
For illustration, the assumption is made d 
test of mechanical comprehension mne : 
of the aptitudes necessary for success vie am 
making" (p. 76), but no study i$ cite 2 

; $ х he stateme 
stantiate this point. Furthermore, the 


5, exterity 
is made that a low score on a tweezer d he 


test is a strong warning against еме, 
watchmaking trade. At first sight, this wi of the 
would appear to be logical, since so muc air 0 
work in a watch factory is done with ар 


E + nent prac 
tweezers. However, in one experimen; 
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tically rio relationship was found between the 
results of a tweezer dexterity test and foremen's 
ratings. The point is that the counselor ought to 
be told the unvarnished truth. If he is to learn 
how to interpret tests, he must be willing to do 
two things: (1) acquire the necessary statisti- 
cal concepts and (2) secure a factual basis for 
test interpretation through a study of published 
research on vocational tests. An attempt is made 
in Appendix A, “A Basic Library on Testing," 
of Guidance Testing to assist the student in ac- 
quiring this necessary background for test inter- 
pretation. Six books for that purpose are sug- 
gested, with the assurance that they “will pro- 
vide a well-rounded basic library on testing" 
(p. 88). It is of passing interest to note that the 
books of three members of the committee which 
reviewed this book are included in the basic 
library. This reviewer finds it rather difficult to 
understand why the committee included the book 
Personnel W ork in High School in its approved 
list since the authors of this volume present no 
statistical data about their recommended tests. 
Instead, they blandly assert that “the validity 
and reliability of these tests are sufficient and 
satisfactory for their use and purpose.” Appen- 
dix B, “How to Compute Local Norms,” de- 
scribes graphic methods of computing percentile 
norms and standardized score norms. While the 
purpose in providing this informatjon is laud- 
able, it seems likely that this brief discussion will 
serve only to give the reader a false sense of 
security with regard to his statistical knowledge. 
In order to obtain the necessary statistical judg- 
ment for the effective use of such tables, the 
student should have experience in working with 
measures of central tendency, dispersion, and 
the probability distribution. Guidance Testing 
does not provide material whereby this experi- 
ence may be secured, The book is easy to read. 
Infact, it is too easy to read ; that is, the language 
Which is employed may invite misinterpreta- 
tion. The following quotations may serve to il- 
lustrate this point: “It [a test] is not a 100 per 
cent valid [test] [p. 59]”; “Perry is a typical 
overachiever [p. 61]" ; “The counselor has other 
tricks up his sleeve [p. 82]”; and “This true 
Score is the one he would make on a similar 
teading-readiness test having perfect reliability 
PP. 99-100].” The critical reader is prone to 
ask such questions as the following: Since it 
18 impossible to obtain 100 per cent perfect meas- 
ures of any mental trait for any group of sub- 
Jects, how is it possible to have a “100 per cent 
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valid test”? Is not the term “typical over- 
achiever” an artificial classification? Is the 
counselor in the same classification as the ma- 
gician who has tricks up his sleeve? Is not the 
"true" score the hypothetical average of an in- 
finite number of obtained scores rather than the 
score which an individual would make on a test 
of perfect reliability ? * This reviewer would add 
that it is highly dubious whether the book under 
review helps counselors to do that better job, 
However, as the authors of Guidance Testing 
state, “Life will go on.” 


[B160] 
XFuzss, Сглоре M. The College Board: Its First 
Fifty Years. New York: Columbia University Press, 
1950. Pp. vii, 222. $275. * (London: Oxford Univer- 
sity Press. 185.) (PA 25:3418) 

High Sch J 34:26 Ja '51. M. J. White. The 
history of the College Entrance Examination 
Board is told chiefly in terms of the personalities 
who down through the years have formulated 
and shaped the policies of the Board. The ac- 
count of the evolution in test emphasis and de- 
sign is completely non-technical and within the 
grasp of the reader as well as interesting reading. 
À reasonably careful consideration of the book 
leaves the reader with the impression that the 
Board has been more sensitive to new and pro- 
gressive testing devices than to new and pro- 
gressive educational philosophies. The book will 
be of chief interest to the educational philoso- 
pher because it high-lights the tremendous in- 
direct influence exerted by a relatively small 
group on the curricula of American schools at 
the high school as well as the college level. 

J Ed (London) 83:420-1 Л1°51. J. А, Petch, 
* The narrowly defined regional body founded 
at the turn of the century by Eliot of Harvard 
and Butler of Columbia has in fifty years grown 
to be a force of major importance in American 
education from the Atlantic to the Pacific. Many 
men and women eminent for their scholarship 
have shared in fostering this growth. What the 
second half-century of the Board’s work will 
bring forth is perhaps as obscure as is the future 
of our own developing system of education ; the 
first half-century, of which Dr. Fuess has given 
so clear and so candidly loyal an account, has 
certainly produced material which we in Eng- 
land who are engaged in somewhat similar work 
would do well to ponder on, even if in the end 
we should conclude that servile imitation 1s not 
the most sincere compliment which can be paid 


to pioneers. 
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*Gacr, N. L. Scaling and Factorial Design іп 
Opinion Poll Analysis. Purdue University, Division 
of Educational Reference, Studies in Higher Education 
LXI; Further Studies in Attitudes, Series X. Lafay- 
ette, Ind.: the Division, December 1947. Pp. vi, 84. Pa- 
per. $1.25. * (PA 23:2184) 

Int J Opin & Attitude Res 3:600-І w ’49. 
Lessing A. Kahn. The primary aim....is method- 
ological in character, that is “....to formulate and 
demonstrate a method for determining the rela- 
tive importance of personal-data correlates of 
opinion.” Basically, the formulation of the 
“method of attack” may be conveniently divided 
into three phases, namely, (1 & 2) the combined 
use of factor analysis and Guttman Scale Analy- 
sis for the development of uni-dimensional 
scales, and (3) the rather novel use of analysis 
of variance for attitude and opinion determina- 
tion. To my own knowledge, this is a first at- 
tempt in the combined use of these powerful sta- 
tistical techniques ; the results of this investiga- 
tion as well as others by Eysenck, Preston and 
Kahn, etc., are very promising for significant 
advances in scale construction and attitude 
theory. * The final meaningful chapter treats of 
the application of analysis of variance as a de- 
vice for getting at the correlates or determiners 
of opinion. These correlates or personal-data 
factors are designated as high school grade, 
rural-urban residence, political party preference, 
religious affiliation, home environment, sex, and 
North-South residence. In a sense, the results 
which are generated from this portion of the 
investigation constitute a secondary purpose. 
Unfortunately, however, this reviewer feels that 
there is little from the point of view of content 
analysis that is contributed to our knowledge 
about the structure of attitudes (except, per- 
haps, should we be interested in adolescent 
opinion). The basis for this verdict may be sub- 
sumed under the general problem of sampling 
in psychological research. Thus, we may ask 
ourselves such questions as, what meaning shall 
we attach to the political party preferences of a 
high school freshman in terms of opinion-atti- 
tude correlates ?, ог what meaning shall we at- 
tach to the grade level of a high school student ?, 
etc. In sum, the problem of determining the 
correlates of opinions and attitudes by means of 
the analysis of variance technique is a novel 
approach and it is clearly evident from this study 
that there is much promise for significant work 
in this area of social psychology. In this instance, 
however, this reviewer was somewhat disap- 
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pointed to note that rather generalized conclu- 
sions were drawn from a somewhat selective 
populational sample, namely, high school sty. 
dents. Such selective sampling is in the tradition 
of most psychological research, and constitutes 
another example of the kind of sampling tech- 
nique about which McNemar, for example, has 
so unflatteringly but justifiably written. None- 
theless, our objection should not be considered 
too seriously in view of the methodological char- 
acter of the investigation. The monograph has 
been well written and the various statistical 
techniques employed are adequately described 
in detail. As an example of the application of 
some of the more recent advances in statistical 
method to problems of attitude and opinion 
measurement, the monograph is an excellent 
contribution. The conclusions generated with re- 
gard to determinants of attitudes toward the 
Negro seem secondary in importance to the 
statistical means employed. 


[ B162] 
*GaLviN, James P. A Comparative Study of Per- 
sonality Patterns in Boys of Different Ages. 
Washington, D.C.: Catholic University of America 
Press, Inc. 1950. Рр. v, 47. Paper. $1.00. * (PA 25: 
6422) 

[ B163] 
*Garrett, Henry E. Statistics in Psychology and 
Education, Third Edition. New York and London: 
Longmans, Green & Co., Inc., 1947. Pp. xiii, 465. ond 
25s. * (PA 21:980). For reviews, see 132 in Deos 
cal Methodology Reviews, 1941-1950; for reviews 
the second edition, see 38: B621. 


[ B164] kao 
*Gasxine, D. A. T. Examinations and the rer Es 
Education, Second Edition. Melbourne, AT 
Melbourne University Press, 1048. Рр. 45. 25. Press 
(London and New York: Cambridge University ' 
1950. 2s. 6d.; $0.50.) (PA 24: 2088) 


Гре £f Reading: 
Gates, Автнов І. The Improvement 0 à 
A Program of Diagnostic and Remedial Мейо» 
Third Edition. New York: Macmillan Co. HAS E 
xxi, 657. $4.75. * (РА 22: 3105) For reviews, 


[ B166] es 
XGronce, У/пллам У. Rating Employee, Perfor a 
ance, Second Edition. University of Mississippi d 
reau of Business Research, Business Aids За "paper 
University, Miss.: the Bureau, 1950. Pp. 1, 1» 
Gratis. * (PA 25: 3990) 


[B167] | 
жСитвкшл, Eowin E. The Validity of Common 
Employed Occupational Tests. DONE No. 9 
fornia, Publications in Psychology, V.O "press, 1949 
Berkeley, Calif.: University of Califor, 
Pp. 253-87. Paper. $0.75. * (PA 24:204 
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i icture 
Ствѕох, James J., Eprror. Motion Pictu 
and Research. Army Air Forces Aviation 


Testing 
Psychol 
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Program Research Reports, Report No. 7. Wash- 
oon, С: DS; Government Printing Ов, 1947. 
Рр. хі, 267. Out of print. * 


J Consult Psychol 11:338 N-D ' 47. Laurance 
F. Shaffer. * reports fundamental research of 


permanent value to psychology, and applications. 


to the production and use of a variety of aptitude 
and achievement tests. 

Psychol В 46:511—2 N *49. W. C. Н. Pren- 
tice. * Approximately half the report is con- 
сегпей with the motion picture as a technique 
and is thus of rather specialized interest. Some 
of these studies are impressive in the care with 
which they have been carried out, and some of 
the results bearing on seating, illumination, and 
their like are clearly of use to certain readers. 
It is, however, the other half of the text that 
will interest most of the readers of the Psycho- 
logical Bulletin. * A number of ingenious mo- 
tion picture tests for Flying Orientation, Ability 
to Judge Distance, Estimation of Velocity, Mul- 
tiple Perception (dial reading) and other abili- 
ties were devised and are reported in some de- 
tail. The general reader will be struck by the 
very low correlations between these tests and 
other tests, between these tests and validating 
criteria, and even between these tests and them- 
selves (various reliability criteria). Those who 
were engaged in similar tasks with the armed 
services will recognize, however, that 75 of 
25-55 may often represent genuine achieve- 
ments. Nevertheless, the low level of predicta- 
bility makes the permanent usefulness of these 
studies doubtful, and they must be accepted 
primarily as guideposts for others who will face 
such problems in the future. * The investigations 
which Gibson’s group carried out in....[airplane 
recognition research] are interesting for two 
reasons. First, they illustrate the danger of bas- 
ing large scale activities on inadequate research. 
The “flash technique” of teaching aircraft rec- 
ognition was accepted by both Army and Navy 
for reasons that were supported only by gen- 
eral theoretical considerations. The Santa Ana 
group has subjected the underlying assumptions 
of the method to a series of careful tests, only 
to discover that none of those assumptions could 
be verified in practice. Second, the findings have 
an intrinsic interest. We now know that train- 
ing in making shape discriminations at very 

igh speeds does not in itself increase the ac- 
curacy of such judgments when longer times 
аге available. We also know that frank analy- 
ses of the airplane gestalt into certain compo- 
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nent parts that distinguish it from others will aid, 
at least in the early stages of learning, in making 
the necessary discrimination. * The discussion 
of investigations related to the perception of 


aerial space and distance is undoubtedly the most 


important part of the book from the point of 
view of the general psychologist. Gibson and 
Glaser have presented in Chapter 9 a newly or- 
ganized “system” of explanation for depth per- 
ception. It differs in three main ways from tra- 
ditional systems. First, more emphasis is placed 
on monocular stimuli. Second, these stimuli are 
all treated as matters of gradients of retinal 
stimulation. Third, the treatment of "gradients 
of velocity" for a moving observer is (so far as 
the reviewer knows) entirely original. All three 
of these departures from classical textbook treat- 
ments are suggestive and warrant the use of this 
chapter as supplementary reading for students 
of visual perception. At least two other contri- 
butions deserve mention. One is Gibson's em- 
phasis on position on the vertical meridian as a 
factor in depth localization. Vertical position 
(the so-called "stage effect") is an important 
influence under many conditions of viewing but 
is often completely omitted from discussions of 
depth. The other is the quantitative study of size 
constancy under natural conditions of viewing. 
The presence of practically complete constancy 
up to nearly half a mile when only the texture 
of the terrain gives any distance cues may force 
a revision of some notions about depth per- 
ception based only on laboratory findings and 
theoretical considerations. 


For a review of the series, see B2. 
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, WiuLIAM, AND WEEKS, Epwarp J. The 

карсаны Development of Children: The 

Teacher’s Guide to the Keeping of School Rec- 

ords. Foreword by Cyril Burt. London: University o 
London Press, Ltd., 1950. Pp. xiii, 248. 8s. 6d. 


B Int Bur Ed 24:75 9 2 50. * will be of great 
assistance to the primary class teacher who 
wishes to keep proper records of the progress of 
his pupils, so as to adapt education to the needs 
of each individual child, but is baffled by the tech- 
nical vocabulary of the psychological research- 
ers and has little knowledge of the science of 

atistics * 

я Brit J Ed Psychol 20:205-6 N’50. L. B. B. 
* This book, which deals with all aspects of 
record making, should help very much in the 
training of teachers and also aid those practis- 
ing teachers whose own training left them ill- 
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equipped in the knowledge and techniques re- 
quired to form a judgment of a child's personal- 
ity. Addressed, not to experts but to "the 
ordinary class teacher," it begins, after an in- 
troduction, with a chapter on elementary statis- 
tics. Here are dealt with such things as standard 
deviation, correlation and the properties of the 
curve of normal distribution. It is simply and 
lucidly written and is probably the minimum 
without which the full use of the numerical data 
of school records is impossible. Such is the 
aversion to statistics, however, that there are 
probably many otherwise able teachers who will 
fail to get beyond this point. This is no criticism 
of the book's lay-out, for it is difficult to imagine 
any other place for this chapter. Following the 
statistics are chapters on physical qualities, gen- 
eral ability, attainments, special aptitudes, in- 
terests, disposition, and the child's environment. 
There are appendices giving sample record 
cards, filled in and explained by case histories; 
there is a useful list of tests, an extensive bibli- 
ography and some sets of mathematical tables. 
А great deal of ground is covered and this very 
adequately. It is a book which was very much 
needed and it is to be hoped that Sir Cyril Burt, 
who contributes a foreword, is not being too 
optimistic when he says, "teachers themselves 
will eagerly welcome this contribution to their 
everyday work." 

Brit J Psychol 41:204 D ' 5o. M. A. Mellone. 
The teacher in every type of school should find 
valuable guidance in this book for the keeping 
of his records. It is perhaps a pity that so much 
statistical information should be given as early 
as the second chapter, for it may discourage the 
non-mathematically minded from further study 
of the book. It would have seemed that a few 
simple definitions might have sufficed at this 
stage, with further information in a final chapter 
for those who were minded to acquire it. Apart 
from this one defect, the whole book is well bal- 
anced. The suggested form of record card seems 
to cover all necessary information concerning 
the child’s own general mental ability, special 
aptitudes and interests and his attainments in 
various fields along with relevant information 
about his environment, without the extreme 
complications of some forms of record at pres- 
ent in use. In particular, the book should help 
to clarify the teacher's mind on the object and 
usefulness of record keeping and show him just 
what can and cannot be expected of standardized 
tests at the present time. The chapter on General 
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Mental Ability seemed to me especially valuable 
in this respect. 
J Ed (London) 83: 106 F'sr. C. M. Fleming, 
very simply written. It is packed with details 
and it ranges widely over the physical and 
mental attributes, the educational attainment, 
the interests, aptitudes, and home background 
of pupils in infant, primary, and secondary 
schools. It appears to have been planned as a 
teachers' guide to the keeping of school records, 
and it is to be regretted that the advice it in- 
corporates should not have been published sepa- 
rately in compact form ready for use. As it 
stands, considerable effort is required to dis- 
entangle the statements relevant to record- 
keeping from the somewhat general discussion 
of topics reminiscent of Jung, McDougall, Bal- 
lard, and other distinguished writers of the 
1920's and 1930's. The authors appear hopeful 
of attaining predictive certainty through im- 
proved recording. They suggest the desirability 
of classification in terms of intelligence quo- 
tients, but they also mention (somewhat sur- 
prisingly) a recent comment by Dr. Watts that 
a child's performance in an intelligence test is 
influenced by his cultural background. They are 
well aware of the significance and the responsi- 
bility of the decisions which a teacher is called 
upon to make. They are sensitive to the fact that 
it is hard to forecast at the primary or early 
secondary stage what level of performance a 
child will ultimately reach. It is therefore the 
more to be regretted that they make no mention 
of the findings of long-term studies reported in 
book form since 1940, and that they make no 
use of the conclusions which can be drawn from 
the study of personnel selection in the arme 
forces during the last ten years. 

New Era 32:16 Ja '51. I. E. Campbell. The 
authors....state....that their aim is “to answer the 
questions of the ordinary teacher and to enabe 
him to master the necessary techniques "Th 
ing in, interpreting and using record cards. T 
do this, not only by explaining the теси 
measurement and assessment required, but : 
giving in admirably simple and compact fant A 
theoretical background of relevant psycholog! 
data. It is obvious that they ante a 
thority of experience of the classroo Я 
thorough knowledge of the field of child pue 
ogy and of research in progress. * There 7 S 
ful appendices, listing standard tests ап jd 
for further reading, etc. Appendix I Tro 
especially noted since it rounds off the 


and à 


he at- , 
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treatise in a masterly way by relating three case 
histories side by side with their records. One 
minor criticism might be offered. The sub-title 
which defines more particularly the aim of the 
volume appears only on the title page. Scanning 
arow of new books, the teacher might easily miss 
a book of immediate value to him. 


[B170] 
#Gouren, HowARD W., AND KAVRUCK, SAMUEL. Se- 
lected References on Test Construction, Mental 
Test Theory, and Statistics, 1929—1949. Foreword 
ty John F. Scott. United States Civil Service Commis- 
sion, Washington, D.C.: U.S. Government Printing 
Office, 1950. Pp. xii, 209. $1.50. * (PA 25: 5823) 

J Consult Psychol 15:355 Ag '51. Laurance 
F. Shaffer. * Although the original purpose of 
the list was oriented to tests for personnel selec- 
tion, the wide coverage makes the references 
useful to all who are concerned with human ap- 
praisal. 

Psychiatric Q 25:720 O ’51. * The design is 
such that references are easily found, and the 
manual is exhaustive enough to save both time 
and labor in research within the areas covered. 
It would be expected, however, that only a lim- 
ited number, concerned with devising tests un- 
der pressure of time, would find the work useful. 
‚ Pub Personnel R 12:167 JL’ 51. * will....be of 
value to....public merit-system jurisdictions and 
private organizations engaged in research and 
test construction * 

И [В171] 

History pein ЕговЕхСЕ 1. Mental Testing: Its 

Fes p and Applications. New sien 

don: Staples си nc., 1949. Pp. xxi, 609. $5.00. (Lon. 
ress Ltd., 1950. 42s.) (PA 24:848) 

тен ыманы 20:848-9 О '50. Adolf 

. mann. * simple and nontechnical lan- 
guage * Part II discusses the underlying statis- 
Ev principles and methods of mental testing. 
in d part is by no means a substitute for a course 
m cna but Dr. Goodenough's clear style 
ji : orms this dreaded subject matter into con- 
: ра CREAN are easily grasped and understood. 
fis omen gives a thorough, clear and basic 
itg ps of the general field of mental test- 
м her book certainly merits the close at- 

wx of the groups [for which it was written] 
ih E Sie merit of.... [this book is that it re- 
«ын psychologist that his real and un- 
ЖШ € strength lies in a sound scientific 
ogyis E The present trend in clinical psychol- 

. з coward mastery of projective methods and 

Practice of psychotherapy. This new devel- 


0j . 
Pment is certainly welcomed and desirable, but , 
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pi a Pe to failure if the psychologist 
s rigorous training in the field of 
mental testing. The scientific basis of mental 
testing serves as a check and prevents intuition 
from getting lost in hypothetical speculations. 

‚ Ат 1 Psychiatry 108:69 Л°т. Leola Е. Neal. 

Goodenough has from time to time made out- 
standing contributions to the field of mental 
testing * [This book] is no exception * “Mental 
Testing" is designed to serve as an orientation 
text for students planning to enter the field of 
testing. Clinical psychologists, however, will find 
that it contains material of interest to them. 
Other professional workers who make use of 
the results of tests that they do not themselves 
administer will find that it gives them a theoreti- 
cal background as well as considerable practical 
help. 

Am J Psychol 63:155-6 Ja '50. Madison 
Bentley. An exceedingly able and reliable ac- 
count of more than a half-century of professional 
testing (APA had a Committee on Tests early 
in the gos), now presented for the instruction of 
students and professional teachers and as a guide 
and sanction to a great movement which pri- 
marily arose (as the author shows in Chapters 
i and ii) from social and educational needs. The 
present wide application of the testing arts is at 
an exceedingly favorable time for the expert 
organization of history (part i), principles and 
methods (part ii), description of tests and scales 
(part iii) and applications (part iv) into one 
consolidated writing. Few possess the wide 
knowledge, critical acuteness, diligence and con: 
tinuity to put in hand the integrated record that 
Dr. Goodenough has written for use and direc- 
tion throughout the second half of our century. 
Of especial interest to psychologists at large will 
be the relations set between the tests, on the one 
hand, and individual and class differences, the 
development of "cn epar and experi- 
mental psychology, on the other. 

Ed cman "sr. G. E. К. B. * Goode- 
nough attempts to survey in less than 600 pages 
the whole field of testing, cognitive and affective, 

sonal and social, from the point of view first 
of historical development, secondly of principles 
of test construction, thirdly of types of tests and 
test methods and fourthly from the point of view 
of their uses and applications. Such a survey, it 
is felt, is too all-embracing and results, in this 
case, ina general lack of depth. If the book was 
intended as а general introduction and guide to 
the newcomer, then it can rightly claim that 
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depth can be sacrificed somewhat to reasonable 
comprehensiveness. Yet it is difficult to see 
what worthwhile information such a reader will 
get from the fifteen page chapter on "Analysis 
of Variance," with its page and a quarter of 
pepped up information on the design of experi- 
ments and the Latin Square, with a table of F 
thrown in for good measure, or the even more 
concentrated five pages of factor analysis where 
tetrad differences, factor residuals, rotation of 
axes are tossed around as if they were the com- 
mon knowledge of every schoolboy. Most of the 
statistical section suffers in this way, being either 
too compressed or too superficial and thus serv- 
ing very little good purpose. Some of the sec- 
tions, too, seem to have a somewhat tenuous 
connexion with mental testing, as, for example, 
the brief section on the psychology of the un- 
married mother. One cannot avoid the feeling 
that Dr Goodenough's real interests lie more 
with the application of tests in clinical practice 
than with test construction and evaluation and 
the psychometrics thus involved, for the best 
sections of the book come when the author is 
discussing the actual practical details of test ap- 
plication to children and others. There is one 
particularly good section (Chapters 7 and 8) 
dealing with the "naming" fallacy and the fu- 
tility of deciding what a test is going to measure 
from a priori considerations and, again, with 
the variations in reliability and validity to be 
found in different groups from different areas 
and of different socio-economic standing. Forty- 
two shillings seems a great price for a book of 
such variable quality. * 

J Abn & Social Psychol 46:263-5 Ap 'sr. 
Ann Magaret. This volume represents a many- 
sided contribution to the field of psychological 
testing. It is in one sense a critical evaluation of 
the theories and assumptions, the advantages 
and shortcomings of the test as an investigative 
method in psychological science. In another 
sense it provides a philosophy for the judicious 
application of test procedures in clinics, schools, 
industries and social agencies. And in still an- 
other sense the book is an insistent reminder 
to those who use tests that future progress 
in this field, however greatly needed, can be 
achieved only through slow, painstaking, sys- 
tematic research. Because it is written by one 
who has for many years identified herself with 
the construction and application of psychologi- 
cal tests, the volume can hardly fail to influence 


its readers. In general coverage, Mental Testing 
is somewhat more ambitious than most con. 
temporary texts on testing. The first six chap. 
ters present a detailed history of the testing 
movement. The account is precise, scholarly and 
well documented ; but it may prove difficult for 
the reader who is not reasonably familiar with 
the history of psychology and of its related fields, 
* Procedural errors in the construction of tests, 
unwarranted inferences and hasty overgenerali- 
zations are pointed out with candor; and their 
significance, not only for the research worker 
but also for the practicing clinician, is stated 
clearly and openly. Thirteen chapters follow 
(Part II) which describe at length the theory of 
test construction, a variety of techniques for 
reporting test results, and the problems involved 
in selecting and applying criteria for validating 
tests. Included also are discussions of a num- 
ber of basic statistical concepts and methods... 
directed toward practical workers who may have 
had no previous training in statistics. Neverthe- 
less, the discussion is at points highly sophisti- 
cated, and will almost certainly present some 
difficulties to the statistically naive reader. The 
distinguishing feature of this section is its it- 
terlocking of theory and practice. The chapters 
never lose sight of the practical significance of 
tests, scores, validities and experimental pro- 
cedures for those who use them in the study of 
the single individual. Part III (“Tests and 
Scales”) comprises nine chapters which describe 
current devices for measuring intelligence, edu- 
catiońal aptitude and achievement, special abil- 
ities, motor skill, interests, attitudes and pt 
sonal-social characteristics. * Two chapters D 
.... [Part IIT] are particularly noteworthy. Chap 
ter 20 ("The Conduct of an Examination") В 
a succinct, interesting and thoroughly gom 
account of both the techniques and the attitu т 
of the competent psychometrician. Chapter к 
(“Projective Methods for the Study of ue 
ality") provides a brief, pointed and telling © 4 
tique of the projective approach to pers? 
ity. Goodenough’s well-supported уап dd 
“projection, as thus conceived, is a tool thai that 
both ways” (p. 440) and her insistence i 
caution temper enthusiasm in this area, LAE 
strong antidotes for the prevailing o scit 
and often careless—attitudes toward pro) made 
methods. The final section of the volume n ap 
up of seven chapters outlining the heres * 
plications of test methods in schools, © ш 
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social agencies. This section is relatively brief, 
but it presents the major applications of tests 
in realistic fashion, and reflects again the au- 
thor’s concern over integrating research and 
practice. A fairly comprehensive glossary of 
technical terms, a bibliography of some 400 
items, and author- and subject-indices conclude 
the volume. Perhaps largely because of its am- 
bitious scope, this book will strike some readers 
as uneven, probably insufficiently detailed at 
points. Instructors in testing courses who prefer 
a text which presents the properties of dozens 
of published scales in capsule form, will cer- 
tainly be disappointed. Those who are seeking 
a practical handbook on applied statistics may 
be dismayed to discover only a paragraph or 
two on the Latin square and other experimental 
designs. And readers to whom the “applica- 
tion” of psychological tests means lists of aver- 
age scores for varied groups, or tables of inter- 
correlations among instruments which purport 
to measure the same characteristic, will not be 
satisfied. For the reviewer, however, these ap- 
parent shortcomings are actually the greatest 
strengths of Goodenough's unique presentation. 
The student in the testing course who grasps 
the implications of Goodenough’s philosophy 
may not know the procedures, items, reliabili- 
ties and validities of many current scales, but 
he will know how to evaluate them when he 
meets them later. The inexperienced research 
worker in test construction may have to look 
elsewhere for his formulas, but by this volume 
he will be defended against that cookbook ap- 
proach to statistical problems which leads to 
inappropriate methods, and ends in irrelevant or 
ambiguous results. The practical worker in clinic, 
school system or industry who absorbs Goode- 
nough's viewpoint will never again scrutinize 
a test profile in the hope of finding "the delin- 
quent," "the retarded child," "the genius" or 
"the mechanic." There is no separation—spatial 
or logical—of “риге” from “applied” investiga- 
tion in the discussion; there is instead a con- 
Sistent and effective fusion of theory and prac- 
tice. Detailed factual information found in most 
contemporary texts on testing is often missing 
from this book, but in its place Goodenough has 
Presented a philosophy of the construction, ap- 
plication and evaluation of tests which is in- 
finitely more important. One final characteristic 
of this volume should be particularly noted. 
Writers on psychometric and clinical topics are 
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today increasingly emphasizing the need for 
long-time, carefully designed research on prob- 
lems of practical significance. Ordinarily, how- 
ever, the writer considers his duty to research 
discharged when he has pointed out the need, 
or at most mentioned a gap or two in our knowl- 
edge. In Goodenough's book, the emphasis on 
research attitudes is no idle generalization, Re- 
peatedly she calls attention to errors in pub- 
lished research, suggests alternative procedures, 
and outlines in some detail investigations which 
might be expected to yield information which 
is badly needed. It would be an unimaginative 
reader indeed who did not find here a great 
many ideas for significant research. It is Goode- 
nough's purpose, in this volume, to bring the 
theoretical principles underlying test procedures 
to two groups: to students of testing and to pro- 
fessional workers who do not administer tests 
but who depend heavily upon test results, It is 
the reviewer's opinion that this book should be 
required reading for both groups. If students 
and professional workers alike could acquire 
something of the wise, critical, dispassionate at- 
titude toward psychological testing which per- 
meates this volume, then the early promise of 
the test as a psychological method might yet be 
fulfilled. 

J Appl Psychol 35:219-20 Je "51. Harold D. 
Carter. * professional workers....who are not 
primarily psychologists, will find here a. read- 
able introduction and explanation of psycho- 
logical testing techniques, including such newer 
devices as projective techniques. Students of 
psychology, even relatively advanced ones, will 
find the reading refreshing as well as instruc- 
tive. Goodenough's Mental Testing is a. scien- 
tific book. It bristles with footnotes which give 
further information on technical points, and the 
text throughout is packed with information es- 
sential to a well-organized discussion of psy- 
chological testing. Rarely does one find a book 
which offers so much instructive material for 
both elementary and advanced students. Few 
persons could have written such a text. Not only 
does it show the results of years of careful study, 
but it has an individual tone and flavor, being 
written by a well-trained psychologist, with a 
special interest in technical and theoretical prob- 

d a wealth of experi- 


lems of measurement, an ealth of exper 
ence with children. The result is originality o 


treatment of topics, abundance ‘of. illustrative 
material, and correctness of viewpoint on tech- 
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nical and theoretical matters. The book is the 
scholarly general treatment of mental testing 
which has been needed for a long time. It makes 
a rare and valuable contribution to psychologi- 
cal literature. 

J Consult Psychol 14:159 Ap ’50. Edgar A. 
Doll. The most refreshing book yet published in 
its field, and easily the best. Long-needed broad 
coverage of the history, principles and methods, 
instruments and applications. Sound and schol- 
arly treatment of basic issues in theory and prac- 
tice. Judicious and authoritative exposition of 
statistical desirabilities. Excellent formulation, 
exemplary style, straightforward content, imag- 
inative outlook, and (our compliments to the 
publisher) beautiful book-manufacture design. 
A worthy capstone to the brilliant career of a 
meticulously conscientious pioneer in scientific 
and professional psychology. (And this reviewer 
is seldom intemperate !) 

J Ed (London) 83:104+ Е '51. C. M. Flem- 
ing. Of making many books on testing and re- 
cording there seems no end. Another such book 
from the pen of Professor Goodenough is nev- 
ertheless always welcome ; her latest will be read 
with interest by her many admirers. As was to 
be expected it is full of suggestive comments and 
replete with scholarly references. Mental test- 
ing has suffered in the past from the indiscrimi- 
nating enthusiasm of its supporters. Tests have 
been commended as instruments through which 
innate ability could be discovered, aptitudes 
could be predicted, and classification into stable 
groups could be effected. Prof. Goodenough’s 
book is a useful corrective and a reminder of 
the complicated theories of sampling and proba- 
bility on which the proper interpretation of test 
results must be based. It offers also an illumi- 
nating statement of the inadequacies of the I.Q. 
as a descriptive measure and it outlines the com- 
plicated procedures which must be followed if 
group tests and individual tests are to be satis- 
factorily constructed and administered. It is 
inevitably selective and places no emphasis on 
recent studies reported from Harvard and Chi- 
cago, while attaching perhaps undue significance 
to the symptoms of mental functioning offered 
by projective tests. As a whole, however, it is 
well worth study both as a reminder of the dis- 
tance travelled in the last forty years and as a 
pointer to possible lines of future research. 

J Ed Psychol 41:122-3 F *50. Lee J. Cron- 
bach. Those who have admired and profited 
- from the works of Dr. Goodenough throughout 


her career will be delighted that she has pro- 
duced this volume. In it she looks at four decades 
of mental testing, lays bare the issues of most 
importance, and evaluates the progress that has 
been made. Her comments acquire impact both 
from her wisdom and experience, and from her 
no-nonsense style. Because this volume tries to 
cover all her viewpoints, the contents are ex- 
tremely varied. There are sections on all types 
of testing, including motor development, ratings 
of social behavior, and projective tests as well 
as tests of the Binet genre. Because she equates 
critical thinking with statistical thinking, there 
is a series of chapters on sampling and statisti- 
cal methods through analysis of variance. Sev- 
eral concluding chapters, much less meaty, sur- 
vey the use of tests in schools, clinical practice, 
industry, social work, etc. Goodenough covers 
knotty problems of mental-test theory which 
have been adequately restated in no modern 
book. There is, for example, a careful analysis 
of the scaling procedure of Heinis, Woodrow 
and Arthur, van Wagenen, and others. She 
makes considerable use of a provocative distinc- 
tion between tests as samples of behavior, and 
test performance as a sign of traits, Another 
asset is a very satisfying history of early devel- 
opments in testing. Comparatively little space 15 
given to describing tests or to citing evidence. 
Instead, Goodenough presents a personalized 
evaluation of each area. Often these comments 
are on the devastating side; scarcely a chapter 
dealing with work after 1916 is without such 
phrases as “statistical illiterates,” “insufficient 
controls,” or “blunders and false conclusions. 
Her position is conservative, and so will please 
some readers more than others. While she 1s 
open-minded on projective tests, for example, 
she chooses to approach them through a detailed 
consideration of Binet’s case studies of his 
daughters, and then is quite explicit that the 
moderns have lacked the insight and the tech- 
nique to do as well as he in the same area. Some 
of her treatment is open to criticism. She nd 
mends using vocational aptitude tests of the 
Minnesota variety in a way the reviewer eh 
siders insufficiently critical. Her unqualil x 
stress on the desirability of a normal distribu 
tion of test scores disregards recent ИШЕ 
about maximum discrimination. Her sugg iin 
that reading readiness tests are to be us ot 
deciding whom to admit to first-grade до a 
consider their use in planning a suitable sc ed 
program for every six-year-old. While ev 
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reader will find something to disagree with in 
so uncompromising a book, no psychologist or 
educator can come away from the book without 
fresh insights. It is particularly valuable to have 
attention drawn to some of the underlying prob- 
lems and assumptions which are ignored when 
testing becomes a business or a technology. Be- 
cause of Goodenough's perspective and tough- 
mindedness, her book will make a distinct con- 
tribution to the education of the advanced stu- 
dent. 

J Genetic Psychol 78:123-5 Mr "51. Gladys 
C. Schwesinger. * This book is intended chiefly 
for students planning to take up mental testing 
professionally and for workers in clinical psy- 
chology. No less is it aimed to reach professional 
workers, themselves not doing actual testing, but 
who nevertheless are concerned with {һе in- 
tellectual and social competence of others, and 
who traditionally require a better understanding 
of how mental abilities are appraised. If this sec- 
ond group of workers would read Part II seri- 
ously, they would not become statistical techni- 
cians, it is true, but they would emerge from the 
experience with a better grasp of the possibili- 
ties and limitations of mental tests, not to men- 
tion human beings themselves, and the proper 
use of psychological findings and reports. Dr. 
Goodenough has most assiduously pointed ott 
erroneous conclusions which have been drawn 
in the past and which continue to be made by 
Workers whose enthusiasm and naiveté exceed 
their scientific wisdom and insight. * This book 
is well illustrated with line drawings, sketches, 
ànd photographic reproductions. It will become 
something of an album to the elder statesmen in 
the profession. It contains 26 pages of glossary 
and 21 pages of bibliography. It is the first book 
which has surveyed the field of mental testing 
ш all its aspects and ramifications. In content 
it is almost encyclopedic. It reflects incredible 
Industry, as well as keen analytical ability, on 
the author's рагі. To the psychologist of limited 
library facilities, the book will become a “must” 
Purchase, For soundness and scientific caution, 
as well as for scope in a field which she knows 
80 well and to which she herself has already 
contributed so heavily both academically and 
Practically, Dr. Goodenough needs bow to no 
one. Organization of materials is well planned 
and their presentation is well executed. Dr. 

Codenough writes not only authoritatively but 
kin and with a vitality which does not let down 
ven with diminishing interest in content here 
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or there for various readers. Mental Testing is 
timely, coming as it does at a milestone within 
the four decades of evolutionary growth of the 
testing movement, and on the eve of change, per- 
haps revolutionary, in scientific human appraisal. 
The book looks to the future as well as the past, 
but it will serve well in the present for many 
years ahead. 

„Мот R 39:561-3 S50. John C. Flanagan. * 
differs in a number of ways from most of the 
recently published books in this field * it places 
an emphasis on history, principles, and applica- 
tions to social problems which is somewhat 
unique. It represents an attempt to indicate not 
only the present status of the field of mental 
testing, but the lines of development which got 
us there and the principal theoretical issue on 
which there is still controversy and need for 
further research. * The book is divided into 
four parts. Part L..provides an excellent ac- 
count of the development of mental tests * Part 
II....contains much wisdom based on the author's 
extensive experience in the testing field. * Fairly 
typical of the material of this part is the thorough 
treatment of the Heinis Personal Constant. * 
this book is well designed to meet the purposes 
of the author * It emphasizes the testing of in- 
dividuals rather than groups, though the latter 
field is not ignored. It stresses basic theory and 
fundamental principles rather than descriptions 
of currently available tests. It should be of real 
value to both beginners in the field and to pro- 
fessional workers who wish to attain a more 
thorough understanding of the problems of men- 
tal testing. i 

Occupations 28:336-7 F '50. Morris Krug- 
man. Florence L. Goodenough wrote Mental 
Testing for two distinct sets of readers: first, 
* as an orientation text for students planning 
to enter the field of testing," and second, “....the 
large body of professional workers who make 
use of the results of tests which they do not 
themselves administer. School principals, teach- 
ers, social workers, psychiatrists, and pediatri- 
cians, juvenile court judges, and many others 
are daily faced with problems in which the ques- 
tion of the intellectual and social competence of 
children or adults is involved." The result is an 
encyclopedic work unintentionally divided into 
two volumes, one of which is suitable for the 
first group of readers, and the other for the 
second. Mental Testing actually consists of id 
parts. Part I, "Historical Orientation, i 
Part II, "Principles and Methods,” constitute 
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an extremely technical treatment requiring a rich 
background in advanced statistics. In the preface 
Dr. Goodenough says, “In the use of the book 
some previous training in statistical methods is 
desirable, but not essential." It is not essential 
only if one will omit reading most of the first 
293 pages. The 27-page glossary at the end of 
the volume will not compensate for lack of tech- 
nical information. Psychologists and psycho- 
metricians, however, will find this part of the 
volume a well-organized, logical presentation 
of the development of tests; principles of test 
construction ; units of measurement ; devices for 
interpreting test scores, and reliability of test 
scores. One wonders how far the average 
teacher, social worker, pediatrician, or judge 
will get with such items as “Factor analysis and 
the ‘purification’ of tests”; “the null hypothe- 
sis” ; “Chi-square” ; “differences between means 
of continuous variates”; and “analysis of vari- 
ance,” to mention some of the items treated in 
considerable detail. Part III, “Tests and Scales,” 
and Part IV, “Applications,” make up what the 
reviewer calls the second part of the book, ap- 
proached so differently from the first half of 
the volume, that it could have been compiled by 
a different author. These sections are non- 
technical, concise, and cover the description and 
use of numerous psychological measurement de- 
vices briefly and clearly. The method of treat- 
ment in these parts reminds one of the reviews 
of large psychological areas in such publications 
as the Psychological Bulletin or the Review of 
Educational Research. The non-professional in 
the field of psychological testing will find these 
parts readable, instructive, and intelligible. The 
professional psychologist or the graduate stu- 
dent in the field will consider them too thin. The 
guidance worker will probably find the many 
discussions of the application of tests repetitive 
and incomplete, and the guidance uses of tests 
too brief. In the section on “Applications” much 
of the material of “Tests and Scales” appears 
again, sometimes more than once. This stems 
from the attempt to consider tests in different 
media such as schools, clinics, industry, social 
agencies, the armed forces, and in research, 
among others, This approach may have merit, 
but it requires frequent rereading of material 
previously met. Mental Testing is a very useful 
volume for the reference shelf. In attempting to 
cater to too many types of readers, it does not 
completely satisfy all. It does, however, have a 
thorough treatment of the theory and principles 
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of test construction and use, and an extensive 
if not intensive, treatment of available measur. 
ing devices and their application. 

Personnel Psychol 3:471-3 w ’50, Wesley 
Osterberg. * an admirable critical survey, al- 
though it does not, of course, include all of the 
material available on industrial and military 
testing. Part I especially is excellent....for any 
reader. The author raises, and discusses at some 
length, the following question: “Is the score 
obtained to be regarded as a sign, a sample, or 
a measurement of the characteristic named?” If 
this discussion were understood by test users 
it would reduce many basic misunderstandings 
of what test scores are and how they should be 
used. Almost any one, from a beginner to the 
most advanced student, will find this section a 
profitable one to read. The same cannot be said 
for the rest of the book. Parts III and IV contain 
relatively little material new to the advanced 
reader, although they probably are adequate for 
the beginner. Part II, which discusses theory, 
sampling and statistical methods, does not seem 
especially suited for any audience. Although 
many elementary concepts are adequately dis- 
cussed, the average beginner frequently will find 
himself, perplexed. * In the space of a little over 
100 pages he will find an abbreviated course in 
statistics, including analysis of variance and fac- 
tor analysis. The audience for which the book is 
intended will probably have difficulty deciding 
what must be reviewed and learned in this sec- 
tion, and what can be safely ignored. For ex- 
ample, there is a whole chapter (although à 
short one) on interpretative devices such as the 
Heinis Personal Constant, Per Cent Placement, 
Coefficient of Intelligence, and other meth м 
very seldom used. Yet the explanation of why 
World War I soldiers had an average men Я 
age of about thirteen years is carried in à foo! 
note, as is the explanation of the meaning M 
use of critical scores. The whole book, ee 
dentally, is reminiscent of John rane 
Principles of Economics, in which much 0 id 
“meat” of the book is contained in the foot? 
By saying that certain parts of the book are is 
suitable, and that emphasis is occasionally me 
placed, this reviewer means, of course, tha iv 
book is not a reflection of his own prin js i$ 
ideas of what such a book should be. + : 
not to say, however, that the book is not v tL get 
while. Non-psychologists who read it M get 
very little from parts of it, but what t ШУ 
from the book as a whole will more than J 
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the reading. For beginning students it is suit- 
able as а text, provided considerable help on 
Part II is available. Even the most advanced 
reader will profit from the author's careful think- 
ing and from her insightful and cogent com- 
ments. 

Psychosom Med: 12:204 My-Je '50. Josef 
Brožek. * Readers of this journal might find 
particularly interesting the chapter on mental 
tests as an interdisciplinary tool of research (pp. 
504-509). The volume is directed to students 
in psychometrics and clinical psychology and to 
the numerous individuals who make use of the 
test results but do not administer the tests. The 
ground covered is large and the'treatment of 
single topics, although never. superficial, is of 
necessity brief. In a way it is a pity that the 
author did not concentrate on the mental test- 
ing of children, a field of her specialization. 

Times Ed Sup 1855:878 N 1750. * intended 
for teachers, social workers and others who are 
interested in tests, or who make use of their 
results, but is probably too advanced for most of 
this class of reader and would be more suitable 
for professional students. Admirable as are its 
aims and much of its content, it is disappointing 
in many respects. It lacks the notable clarity 
and succinctness of some of Dr. Goodenough's 
earlier books, such as Experimental Child Study 
and Developmental Psychology. It is unneces- 
sarily discursive in some parts, and too brief to 
be of much value in others. For example, five 
pages are devoted to Binet’s early observations 
on his daughters with “projection” tests, and no 
more than this to listing or describing educa- 
tional attainment tests for use in the primary 
school. Again much of the historical Part Iis 
duplicated in the descriptive Part Ш. Reading 
is made unnecessarily difficult by the inclusion 
of over 300 footnotes (apart from bibliographi- 
cal references), many of them very long. Yet at 
the same time both text and footnotes so fre- 
quently contain flashes of insight or wisdom that 
even the most experienced tester can study them 
with profit. A good example is the author's 
Criticism of a conception which has become 
rather popular among teachers in this country 
Who know a little about tests, namely the A.Q. 
от Achievement Quotient. Although Dr. Goode- 
nough brings out the importance of the range 
of variation, and of the effects of age back- 
ground, and other factors in interpreting the 
LQ, she glosses over two serious controversies. 
First, while admitting the influence of coaching 
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on intelligence tests, she refuses to tackle the 
question of І.О). variability and cultural effects. 
Neither Dearborn and Rothney's nor the Iowa 
investigations are mentioned. Secondly, Thur- 
stone's factorial researches are stated to reveal 
“the basic elements of which abstract intelli- 
gence is made up," but she does not try to ex- 
plain how the American factorist’s denial of the 
existence of any general intellectual factor (g) 
can be reconciled with the widespread use of 
general tests, whether Stanford-Binet or group. 
And although she stresses the need for valida- 
tion of tests, she gives no indication of how in- 
telligence tests can be validated, though this is 
a problem which factorial analysis has solved. 
In discussing validity she makes much of a 
distinction between tests based on “samples” 
and those based on “signs” of significant be- 
haviour. Apparently the latter are much more 
objective and scientific, and yet the main ex- 
ample that she cites is projection tests! Perhaps 
the reader would do best not to bother about 
this section. In contrast, her critical account of 
projection tests is excellent, though her de- 
scription of other personality tests is scrappy 
and unpractical. One of the best chapters is that 
giving advice on how to apply individual tests 
like the Stanford-Binet or Merrill-Palmer. 
There is also a useful glossary of technical terms. 
The price of the book seems excessive consider- 
ing that the paper is rather mediocre and the 
illustrations often bad. 
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integrate and supplement the diverse reading 
assignments and to provide laboratory exer- 
cises" for an advanced undergraduate course in 
personnel psychology. * The assigned readings 
have been selected from a number of sources in 
industrial psychology, but the authors have 
drawn most heavily upon Lawshe's Principles 
of Personnel Testing. Indeed, one is led to be- 
lieve that the course is based primarily upon this 
textbook. The introductions to, and summaries 
of the topics are, in general, commendable in 
terms of their purpose. The authors have per- 
formed well the difficult task of presenting con- 
cisely many of the basic theories, research find- 
ings and techniques of personnel research while 
at the same time cautioning the student of the 
pitfalls, practical problems, and limitations to 
be encountered. The work projects themselves 
are concerned almost wholly with selection prob- 
lems and provide the student with opportunities 
for practice in only two basic research tech- 
` niques—the method of correlation and the criti- 
cal score method. The function of the discussion 
questions, presumably, is to give the student 
practice in interpreting the results of his statisti- 
cal analyses. As such, they would seem to fulfill 
their purpose adequately. The fact that the con- 
tent of the syllabus is almost entirely concerned 
with selection techniques and instruments is by 
design. The reasons given for this emphasis are 
that knowledge of selection procedures is par- 
ticularly demanded by employers of personnel 
psychologists, that more intensive work has been 
done in this area, and that many techniques used 
in selection are applicable to other personnel 
problems. This reviewer believes, however, that 
this is the truth but not the whole truth. It 
would seem that projects based on problems and 
techniques of job evaluation, the measurement 
of employee attitudes, problems peculiar to in- 
spection, and similar topics might properly find 
a place in a course bearing the rather compre- 
hensive title "Personnel Psychology." The bi- 
ased selection of content is perhaps the major 
weakness of the work. One minor point may be 
mentioned. The number of pages is misleading, 
since much space is wasted. In some instances, 
as little as one-fourth of the page is utilized. This 
may be a function of the way in which the stu- 
dents are to be required to turn in their ma- 
terial. This syllabus makes no pretext at being 
anything other than what it is—a fairly ele- 
mentary workbook. In an area where so much 
has to be garnered from a number Of sources, a 


compilation such as this does the student a sery. 
ice without being extensive enough to warrant 
the possible criticism that the advanced student 
should be required to do his own bibliographic 
ploughing. This contribution to the field pro. 
vides an opportunity for the student to work ош 
some of the statistical aspects of typical research 
projects in personnel selection and stimulates 
his thinking about the theoretical problems in- 
volved in such projects. It provides the in- 
structor with a means of systematically eval- 
uating the ability of the student to grasp the 
fundamentals of this area of personnel psy- 
chology. 
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print. * 


955 ] BOOKS & REVIEWS: GUILFORD 

J Consult Psychol 12:125 Mr-Ap '48, Lau- 
rance F. Shaffer. The development of pencil- 
and-paper aptitude tests for the selection and 
classification of air crew candidates was the larg- 
est single research activity of the wartime AAF 
psychological program. This huge report gives 
detailed and uniform data on each of the several 
hundred tests experimented upon. Its major 
contribution, however, is to the methodology of 
aptitude testing. It sets a pattern that may well 
shape the course of the applied psychology of the 
future: the extensive factor analysis of tests, 
the determination of the validities of factors 
rather than merely of tests, and the selection of 
tests that most efficiently represent the factors. 
The influence of this volume will be widespread 
and enduring. 

Psychol В 46:506-8 N ’49. Thomas W. Har- 
rell. * gives a detailed account of methods, re- 
sults, and conclusions concerning paper and 
pencil tests used and considered for use in the 
selection of Army Air Force pilots, navigators, 
and bombardiers. The volume will be of interest 
not only to aviation psychologists but to all psy- 
chometricians because of the important factor 
analyses reported. It presents impressive evi- 
dence of the effectiveness of the organization, 
direction and continuity of purpose of the group 
of psychologists who worked in the AAF re- 
search program. The methods followed in the 
development of the AAF tests were in the ap- 
proved text-book style of test development that 
is almost never pursued so thoroughly in actu- 
ality: job analyses were made ; hypotheses were 
clearly formulated for factors that would appear 
to be needed ; tests were given before the groups 
were selected; factor analyses were made of 
tests and criteria; new tests were based on the 
validity and factor studies; the process was re- 
peated. This is the first successful attempt to 
use factor analysis in а selection problem to 
show the factorial composition of the criterion 
às well as of the tests. The criterion unfortu- 
nately had to be only training school records 
rather than combat success. Factors extracted 
by the centroid method were rotated orthog- 
onally to an approximate simple structure. * 
the editor remarks that a complete classification 
battery should cover at least 25 factors. This 
probably overstates the case since the inclusion 
of those factors having lower validities does not 
contribute substantially to the validity of the 
composite. In addition, no more than 15 of the 
27 factors are defined adequately enough for 
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inclusion in a comprehensive classification bat- 
tery. The work that is reported is gigantic, 
There will be enough leads to explore for dozens 
of doctoral dissertations based on the factors 
found and suggested, Perhaps the most impor- 
tant new factor found is Judgment. Also of im- 
portance is that the nature of mechanical tests 
is clarified somewhat, Mechanical Experience 
comes out as a clear factor, Splitting what has 
been called Spatial Relations into а Visualiza- 
tion and three Space factors needs additional 
clarification. Little comfort will be found in this 
volume for enthusiasts for the Rorschach, TAT, 
other clinical procedures for diagnosis, and per- 
sonality inventories (including Bernreuter, 
Minnesota Multiphasic, and Humm-Wads- 
worth). Clinical procedures referred to are a 
clinical interview, observations of behavior dur- 
ing psychomotor testing, observations during a 
stress situation, observation during a short so- 
cial situation. None of these procedures was 
found to be useful: Significant at the one per 
cent level were only the Thinking Introversion 
score of Guilford’s Inventory with a validity 
coefficient of —.09 (judged not to be promising), 
Cooperativeness and Agreeableness of the Guil- 
ford-Martin Personnel Inventory, with validity 
coefficients of .14 and .12. Two categories on the 
Humm-Wadsworth, Epileptoid and Hystefoid, 
gave coefficients higher than those in the inven- 
tories mentioned above, —.21 and —.18, but they 
were significant only at the five per cent level 
since only 202 cases were used. Humm supplied 

not differentiate 


passes successfully from failures; however, hís 


svchologists. Had combat criteria been avail- 
Pin it is possible that some of these personality 
measures would have been more useful. The vol- 
ume has shortcomings. The mass of material has 
not been smoothly presented. Clarity is exc 
by bulk. It appears to have been done hastily for 
the magnitude of the undertaking. The index is 
incomplete. Research ps s will not 
be seriously bothered by the shortcomings since 
the book will probably —— bible whose 
failings will probably come to take оп 
uius ia the same way that philatelists treasure 
a rare stamp with some slight imperfection. 


For a review of the series, see B2. 
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Am J Psychiatry 108:319—20 О ^51. Robert 
M. Allen. For the neophyte in the field of tests 
and measurements the title of this volume may 
prove misleading. The sophisticated test-oriented 
psychologist will find that this book affords ex- 
cellent insights into those considerations that in- 
fluence the construction of psychological tests. 
* For example, in the second chapter....there is 
a skillful translation of the qualitative aspects of 
test theory into statistical language and a sub- 
sequent retranslation of these numeration con- 
cepts into English that adds considerably to the 
understanding of the ideas in this chapter. * An 
interestingly valuable feature of the book is the 
subject matter for Chapter 21, “Item Analysis." 
This section is especially important for test con- 
structors and teachers of courses in test and 
measurement theory. * fine addenda * Sample 
examination questions in test theory for use at 
the beginning, during, and:*upon completion of 
the course indicate the care with which this vol- 
ume was written for practical application. * This 
volume is recommended for an advanced and 
graduate course in test theory and/or construc- 
tion. 

Am J Psychol 64:639—40 О ' 51. Paul Horst. 
As tlistinguished from the great output of books 
on statistics and tests in recent years, this vol- 
ume represents a distinct contribution to psy- 
chology. * The....chapter....“Correction for mul- 
tivariate selection in the general case,” is the 
most thorough and comprehensive treatment of 
the subject that this reviewer has seen. * An 
excellent chapter on problems of weighting and 
differential prediction brings together the ma- 
jor procedures elaborated both in earlier and in 
recent years. * The text is extremely well or- 
ganized, both by chapters and in presentation 
within chapters. The typical chapter-plan con- 
sists of an algebraical development of the ap- 
propriate formulae, together with clear verbal 
expositions of the meanings of these formulae. 
The earlier, more systematic, chapters in the text 
also include graphs to aid in the interpretation 
of formulae developed. Each chapter concludes 
with a summary which repeats and interprets 
concisely its essential formulae. * The book in- 
cludes an excellent 23-page bibliography. * The 
book demonstrates unusual competence both in 
the selection and the presentation of subject- 
matter. Although the chapter on the statistical 
criterion for parallel tests makes an important 
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contribution, there are those who will question 
whether this is really an appropriate criterion for 
these tests. The criterion seems to ignore the 
fact that tests may have the same intercorrela- 
tions without being parallel. If the tests are suf- 
ficiently alike in intercorrelation, no matter how 
low, and the means and variances are sufficiently 
alike, the criterion will indicate that they are 
parallel. A tribute to the excellence of the au- 
thor’s achievement is the fact that only one 
instance of incorrect logic was discovered. This 
concerned the discussion of the principle-axis 
solution as a method of weighting to maximize 
the dispersion of weighted scores. The author 
implies (p. 344) that the difficulty with this 
procedure is an ambiguity concerning what val- 
ues to put in the diagonal of the correlation or 
covariance matrix. Actually no ambiguity is 
present, since it can be mathematically demon- 
strated that, in order to maximize the dispersion 
of these weighted scores, one must use the vari- 
ance rather than the reliability or communality 
in the diagonal. The Theory of Mental Tests 
is an important and long-needed contribution to 
psychology. 

B Ind Psychol & Personnel Prac 7:41 S51. 
G. D. Bradshaw. This is undoubtedly the most 
important technical work on test construction 
yet published * It is theoretical rather than prac- 
tical in approach * Not only does Gulliksen de- 
rive his formulae most lucidly, but he is careful 
throughout to state the assumptions which are 
implicit in each formula. This is most valuable 
since the simplest derivation, which 1s the one 
he usually presents, is not always the one im- 
volving the least assumptions. * For University 
teachers and honours students in peychümews 
and for psychologists engaged in test ien 
the book is essential reading. Field psychologic 
and personnel officers are likely, however, | 
find the book too detailed and too шашын 
for convenient use. It is not possible to rea е 
at any stage without becoming involved in T a 
ematical formulations. The book has an exce m 
bibliography, and as appendices has a miri 
of the mathematics required for an unders ial 
ing of the book and objective questions on 
theory. 

ry ee Clinic 15:195 5 '51. 7 aei 
Chotlos. * should serve well as а rele 
for psychologists who have research pu p 

Calif J Sec Ed 26:188 Mr '51. John E. ct 
This scholarly publication for the ае 
brings together into one volume recent 
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cal developments in test theory with particular 
reference to aptitude and achievement testing. 
The book is designed primarily for persons 
working in test development. However, it does 
present techniques applicable to many fields 
other than test construction as for example in 
the measurement of attitudes or opinions, the 
appraisal of personality, and clinical diagnosis. 
Some interesting aspects of test construction 
emphasized in this book are: (т) derivation of 
basic formulas of test theory from the definition 
of random error and also from the definition of 
true score; (2) the effects of test length and of 
group heterogeneity in terms of invariants; (3) 
facilitating the proper use of corrections for re- 
striction of range; (4) the effects of test time 
limits on estimates of reliability; (5) a syste- 
matic mathematical theory of item analysis ; and 
(6) various weighting methods. The author 
states that the book is designed for readers who 
possess a knowledge of elementary algebra, ana- 
lytical geometry, and elementary statistics. In 
the opinion of this reviewer the reader in order 
to follow Dr. Gulliksen’s presentations should be 
well versed in the above mentioned areas as well 
as experienced in dealing with the many sym- 
bols and terms which come only from advanced 
study in these fields. * This challenging book is 
worthy of the attention of teachers, supervisors, 
research specialists, curriculum coordinators, 
principals, superintendents, graduate students, 
college professors, and others whose duties in- 
volve detailed and advanced work in testing, test 
construction, evaluation of educational out- 
comes, and educational research. This book 
Tepresents a distinct contribution to the field of 
testing in education. 

El Sch J 51:534-5 My '5r. Kenneth W. 
Vaughn. * presents a unified quantitative theory 
of mental testing in terms of its basic mathemati- 
cal rationale, The book is particularly useful for 
Specialists in the construction and appraisal of 
Mental tests, * In each chapter, the author pro- 
ceeds from basic assumptions expressed in 


quantitative terms and explains and demon-. 


strates, in highly detailed stages, the mathe- 
matical steps involved in developing the basic 
Concepts and formulas. The student is led, by the 
and as it were, through the successive mathe- 
matical developments, To the mathematically 
Sophisticated this procedure will prove to be 
needless detail; to the typical student of mental 
testing it will be very valuable. While the no- 
lation js clearly defined and conforms with ac- 
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cepted practice, it will probably appear formid- 
able to many students. The expository style 
often verges on the pedantic, but the wording is 
carefully designed to carry the exact meaning 
intended, Although much of the book is essen- 
tially a compilation of the work of many test 
theorists, it is carefully integrated into a compre- 
hensive whole. The author has made a particu- 
larly significant contribution in chapter xx, 
“Weighting and Differential Prediction,” by 
bringing together and interpreting the most sig- 
nificant work on the combination of test scores 
to produce a single composite score or multiple- 
composite scores. His treatment of speed versus 
power tests in chapter xvii, which he admittedly 
found stimulating, is of more academic than 
practical concern. Rarely is one concerned, in 
practice, with a “pure speed” test or a “pure 
power” test. The problem of highly speeded tests 
is an eminently practical one. Until more ex- 
perimental work can be done on the author’s 
equations 25 and 28, it may be more desirable 
to obtain reliability coefficients experimentally 
than to attempt to estimate them on the basis 
of theoretical assumptions. In general, the book 
represents a thorough coverage of the literature 
of test theory. * The author has drawn on a 
wide variety of sources in presenting his mate- 
rials and has faithfully and painstakingly ren- 
dered acknowledgments to the many workers 
whose studies are presented. L. L. Thurstone’s 
Reliability and Validity of Tests, now out of 
print, is essentially preserved and expanded in 
the early chapters. Incorporated also are the 
pertinent publications of the Psychometric So- 
ciety, the Institute of Mathematical Statistics, 
and other more diffuse sources of test theory. 
By bringing these materials together and by ex- 
pressing them in a uniform and consistent nota- 
tion, Gulliksen presents a unified quantitative 
theory that is a significant contribution to edu- 
cational and psychological measurements, 

J Consult Psychol 15:168 Ap '51. F (red) 
McK (inney). * The style is simple and the pres- 
entation is clear. This Men a contribution to 

surement in psychology. 
ier Teach Ed 2:66-7 Mr '51. Max D. Engel- 
hart. * an extremely important contribution in 
the field of testing. It isa comprehensive and 
scholarly treatise on the mathematical and sta- 
tistical methods used in interpreting test data. 
Tt is not concerned with the non-quantitive as- 
pects of test construction such as the art of writ- 
ing test exercises, nor is it concerned with de- 
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scription of available educational and psycholog- 
ical tests, or with their uses in programs of 
testing. The detailed derivations of numerous 
formulas may make the book seem formidable to 
a student who has no more than an elementary 
knowledge of mathematics and statistics; but 
the author's clarity of expression, his practice of 
giving all of the steps in derivations with de- 
tailed explanations of each step, and his coher- 
ent organization of the material make the treat- 
ment much less difficult than it first appears. 
Furthermore, a few sections requiring knowl- 
edge of advanced mathematics may be omitted 
without disturbing the general continuity of the 
book. * In teacher-education institutions this 
book can be recommended as the basic text for 
an advanced course in measurements, as required 
reading for the teacher of more elementary 
courses, and as an indispensable source of in- 
formation for the graduate student whose thesis 
problem involves the handling and interpreta- 
tion of test data. 

Nature 169:49 Ja 12°52. W. E. Hick. If that 
old friend and inveterate sightseer, the visitor 
from Mars, were informed that a large, learned 
and modern book on the theory of mental tests 
contained not one word about the nature of in- 
telligence, aptitude, intellectual processes or in- 
deed any other attribute of the human mind, he 
might be surprised. The present reviewer, who 
does not come from Mars, is not so much sur- 
prised as resigned. These things are signs of the 
times. After the innumerable and fruitless discus- 
sions, arguments and effusions occasioned by the 
problems of intelligence and aptitude, there has 
been some reaction towards the firmer ground 
of statistical methodology. Statistics has always 
played a large part in mental testing; but one 
might be inclined to wonder whether the servant 
is not in a fair way to becoming the master. That 
is perhaps an exaggeration; it would be fairer 
to say that Prof. H. Gulliksen has made a special 
study of this aspect—what may be called the 
mathematical theory of testing the test. A book 
on the theory of structures will tell you how to 
calculate the strength of any given structure, but 
it will not, except in a few simple cases, tell you 
how to design a structure for any given job; that 
is for your ingenuity. There is some justifica- 
tion for taking the same attitude to mental test 
theory; at any rate, it is clearly the attitude 
that Prof. Gulliksen does take. As a text-book on 
statistics applied to a special field, the book cer- 
tainly covers a great deal of ground. The student 


may not find much help in improv 
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will find that he needs some initial help in 
ing the basic ideas of probability distributions, 
sampling error, and the jargon of correlations 
covariances, etc., and that then the earlier chap. 
ters are rather long-winded. The author seems 
to be straining (commendably) after a sim. 
plicity of presentation which he never quite at- 
tains. A number of "computing diagrams’— 
families of curves of various commonly-occur- 
ring functions—are given, with the suggestion 
that the student should reproduce them to a 
large scale. Some of them will undoubtedly be 
useful, but it seems unlikely that many people 
will take the trouble to use a chart when they 
have to add two variances together. * It is not 
immediately obvious why the author adds that 
parallel tests, in addition to satisfying statistical 
criteria, should contain items dealing with the 
same subject-matter and having the same for- 
mat—in other words, that they "should be par- 
allel as far as psychological judgment is con- 
cerned.” The explanation that springs to mind 
—if one rejects the obvious “sop to Cerberus’ 
and “insurance policy” hypotheses—is that the 
statistical criteria are not adequate. But it is hard 
to see any fundamental reason why they could 
not be made so. * Oddly enough, there is no 
section dealing explicitly with factor analysis, 
although the procedure is mentioned here and 
there in various connexions. This is doubtless 
no accident, but probably the result of a general 
veering towards a more empirical and perhaps 
more fundamental approach. * Taking it as 2 
whole, this is a book for the specialist, in all his 
metamorphoses from egg to imago. ^. 
Occupations 29:480-1 Mr ?5т. Clifford P. 
Froehlich. * the book would be accurately de- 
scribed by the title, “The Theory and Mathe- 
matics of Test Development.” * The volume x 
perform a service because it approaches m 
sen's goal of bringing "the technical deve ud 
ments in test theory of the last 50 years n d 
in one readily available source." In the lig "s 
the foregoing, it is clear that a typical counse 
ing his к | 
tests with counselees by reading this кее dint 
though this book could be improved by inc! e 
more examples of the application of theory P i 
sented and by a more lucid style of set 
does present material in a straightforwar Ше, 
ner. As a reference volume on test uae 
it is without equal, and its value for this ee j 
should win a place for it in the professio 
braries of test technicians. 
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Psychol B 48:456—60 S ' 51. Quinn McNemar. 
This is an important book—a must for a large 
number of psychologists. It is not just another 
book. It is a book that fills a void by bringing 
together in a unified fashion the scattered litera- 
ture and new material on test theory. Its author 
is not only qualified to do an expert job—he did 
an expert job. * Reliability is defined in terms 
of parallel tests in such a way as to avoid circu- 
lar reasoning. The exposition on reliability is 
straightforward, and the reader with only a 
modicum of elementary statistics will have no 
trouble following the argument until he en- 
counters, in chapter 5, the explanation of error 
asinteraction. Here a minor slip occurs: whereas 
the s? values prior to page 50 are simple sample 
variances, the s? values in the development on 
pages 50-57 are unbiased estimates of variance; 
yet the two types are used interchangeably. Cor- 
rection for this will lead to minor changes in 
equations 16, 26, and 39, and in the verbal state- 
ments based thereon. The next three chapters 
are devoted to the effect of test length on relia- 
bility and validity. These chapters are clearly 
written and include some instructive diagrams. 
The Thomases who doubt “prophecy” formulas, 
such as the Spearman-Brown, should read pages 
64-67. * Chapter 13 will be omitted by the 
reader unfamiliar with matrix algebra, and it is 
possible that those who can read this chapter, 
though admiring the neat presentation, may raise 
the question as to how many ever encounter 
situations where correction for multivariate se- 
lection is feasible. About half of chapter 10 in- 
volves an analysis (following Mollenkopf) 
which purports to show that whether or not 
Measurement errors are correlated with test 
score depends upon the skewness and kurtosis 
of the distribution of test scores. The reviewer 
is highly skeptical of certain parts of this deriva- 
tion and equally skeptical of the outcome. For 
instance, the end result simply does not con- 
sistently predict the known direction of the re- 
lationship between measurement errors and test 
Scores on the 1937 Stanford-Binet. * The ques- 
tion of the experimental methods of obtaining 
test reliability is...considered (chap. 15). This 
55 an excellent discussion of the nonstatistical 
aspect of reliability determination. * a fifty-page 
Chapter on methods of standardizing and equat- 
Mg scores, Here one finds a first-rate discussion 
of the various scores derived by transformations. 

he author points out the difficulties encount- 
ered in age scales, particularly those difficulties 
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which arise from choice of the regression line 
used in defining mental age. He does not how- 
ever, contribute anything as to which regression 
line is the worse for this purpose. The author 
has nothing favorable to say for the MA-IQ 
scheme of scoring. A more objective account 
would have listed both the advantages and dis- 
advantages of such scoring. * The dangers in- 
volved in using so-called differential tests, lead- 
ing to patterns or profiles, are pointedly set 
forth; it is to be hoped that our clinical friends 
note his discussion. The final chapter, which is 
concerned with item analysis, was somewhat of 
a- disappointment to the reviewer, who had 
hoped to find therein a critique of the many and 
sundry schemes which are currently advocated 
and used in item work. Instead, one finds a note- 
worthy consideration of the important topic of 
item parameters as related to the mean, vari- 
ance, reliability, and validity of a test. This sets 
the stage for the type of item analysis advocated 
by Gulliksen, and which the reviewer heartily 
supports. Simply determine for each item its dif- 
ficulty, its standard deviation, its "reliability in- 
dex" (defined as its point-biserial correlation 
with total score multiplied by its SD), and its 
“validity index” (its point-biserial with the cri- 
terion multiplied by its SD). Each chapter ends 
with a neat summary, followed by a set of exer- 
cises. * The general typographic style is pleas- 
ing, and the text is practically free of the nui- 
sance errors which are so difficult to detect in 
mathematical material. Gulliksen has done an 
excellent, highly commendable job. The few crit- 
ical points raised by the reviewer are indeed 
minor in contrast with the innumerable instances 
deserving praise. This volume represents a valu- 
able contribution; it is a volume of distinction. 

Pub Personnel R 12:104-5 Ap ' 51. Thomas 
L. Bransford and Joseph Lev. * Although there 
are occasional references to practical problems 
in developing and scoring tests for personnel 
selection (among more frequent references to 
educational testing), the bridge between theory 
and practice for the most part must be developed 
by the reader who is concerned with the practi- 
cal problems. * The reviewers would have ap- 
preciated more specific help from the author in 
demonstrating the relation between item diffi- 
culties or mean raw per cent score on a test 
and precision both in separating test passers 
and test failers and in ranking test passers for 
Civil service officials 


need further convincing that it is not practicable 
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or desirable to develop tests on which per cent 
scoring without transformation will provide the 
desired number, quality, and ranking of eligi- 
bles. The reviewers were pleased to note that 
the author strongly supports their objection to 
the common practice of allowing candidates a 
choice of alternative questions in examinations. 
* This book fulfills the need for a comprehensive 
presentation of mental test theory as it had de- 
veloped to the date of the publication of the book. 
It makes conveniently available in one place a 
great mass of material scattered in many books 
and periodicals. * The value of a mathematical 
treatment is that it reveals true relationships 
between test characteristics and points out the 
falsity of relationships which appear to be true 
but are actually false. The author discusses the 
familiar topics of test reliability, test validity, 
and item analysis, and a number of additional 
topics which are not usually found in textbooks. 
The treatment of familiar topics takes into ac- 
count new research. * The use of weights to 
combine tests is discussed in a novel manner. 
* The book contains numerous aids for the 
reader and the teacher. It abounds with graphic 
aids to computation. These graphs are also valu- 
able as aids to comprehension of relationships. * 
A comprehensive bibliography is included. A 
valuable aspect of the book is that it contains 
derivations of most of the formulas with explicit 
statements of assumptions, definitions, inter- 
relationships, and limitations. * This book should 
be valuable to all persons interested in test con- 
struction. It is useful for the variety of new 
methods which it provides. It is also useful for 
its clarification of existing methods. While its 
bibliography and references open many ‘doors 
to readers interested in pursuing specific sub- 
jects in more detail, it will be an even greater 
timesaver to many readers who will acquire a 
substantial background in test theory without 
wading through the mass of material to which 
it refers and on which it is based. 

Sch Sci & Math 51:508-9 Je’51. Kenneth E. 
Anderson. * Students of statistics and measure- 
ment will find the book an excellent addition to 
their library for the author's treatment of many 


concepts is the best available. * The concepts of . 


validity and reliability and the factors which in- 
fluence these concepts are given a most modern 
treatment. The chapter on “Methods of Stand- 
ardizing and Equating Test Scores" should 
prove to be extremely useful to test constructors. 
All in all, this is one of the best books to appear 


in the field of measurement and evaluation, and 
students of education and psychology will find 
it extremely useful both as a text and reference 
The book should help to consolidate the Tecent 
gains in test construction and should do much 
to clarify the thinking of both test users and test 
producers. 
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*Gurvirz, Mitton S. The Dynamics of Psycho- 
logical Testing: A Formulation and Guide to In- 
dependent Clinical Practice. Foreword by Joseph 
S. A. Miller. New York: Grune & Stratton, Inc, 1951, 
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Merit Rating Programs for Employees and Man- 
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General Clinical Counseling in Educational In- 
stitutions. New York: McGraw-Hill Book Со, Inc, 
1950. Pp. xi, 375. $4.50. * (London: McGraw-Hill 
Publishing Co., Ltd. 38s. 6d.) (РА 25:2051) 


Brit J Psychol, Gen Sect 42:303 Ag 31. 
M. D. V. * the specimen procedure described 
appears excessively elaborate and time-consum- 
ing, and also distinctly vague and lacking in 
definite conclusions. The methods adopted in 
"counseling" may seem to us at first sight some- 
what repellent in the amount of interference they 
may cause, both in the individual's personal af- 
fairs, and also in the teacher’s planning of his 
work and his relation with his pupils. We hear, 
for instance, that “practically all pupils are € 
posed to classes in occupational information : 
the eighth and ninth grades.” But this pen 
may Ье а mere prejudice surviving from the ved 
when it was thought that parent and teacher ; 
tween them could guide the child eiam 
Perhaps a good careers master and a good ge » 
could .do all that is done by the “counselor 
But do they? i 

Ј Appl Psychol 35:299-300 Ag 57. Вт 
Fagin. This book suggests a job title for л \ 
of practice, and defines that area systema д 
It attempts to clarify the semantic 
such terms as counseling, guidance, 


5 ri 
apy, and “personnel work” are ad Я 
s А ional, educt 


the 
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accurate specification of training, qualifications, 
and ethical practices. Hahn and MacLean have 
not only attempted to set down these specifica- 
tions, but to clarify the skills, knowledge, and 
competencies required of practitioners. In so 
doing they have distilled much thinking and 
reading and clinical experience into a book 
which provides excellent professional orienta- 
tion, brief but adequate instruction in many tools 
and techniques, but somewhat less than a com- 
prehensive understanding of the psychology of 
counseling. As the authors see it, general clini- 
cal counseling in educational institutions com- 
prises such a breadth of diagnostic and thera- 
peutic skills that training for the work must 
blend that of the psychotherapist, the vocational 
counselor, the educational adviser. * In dis- 
cussing the competencies involved the authors 
mark out no new ground, but emphasize the 
common ground of the various specialties. Oc- 
cupational information, case study techniques, 
and projective techniques are thus given equiva- 
lent space. The discussion of problems and tech- 
niques contributes to the clarification of many 
concepts, notably the term "prediction" in re- 
lation to counseling. The authors put their 
finger on some misleading current catch phrases, 
eg, "blind alley jobs," ''social intelligence," 
group counseling," and they vigorously dispose 
of some fallacies which counselor-trainers know 
tobe more durable than straw. A brief considera- 
tion of the application of semantics to counseling 
is a helpful feature, although the reference to 
Semantics as a “tool” is confusing. It is striking, 
however, that a good part of the text is devoted 
to diagnostic skills, while therapeutic skills are 
given honorable mention but practically no de- 
velopment. The picture of counseling which 
emerges is the eclectic-diagnostic one of the Wil- 
lamson tradition, in which diagnosis is taken 
as the keynote of counseling skill. The manner of 
emphasis tends to infer that counselor insight is 
Considered to be somehow therapeutic for the 
Client, an inference which many counselors (in- 
cluding perhaps the authors) would reject. In 
any case, the reader's impression is that thera- 
Peutic skills are neither tangible nor teachable, 
While diagnostic skills are. The picture of the 
counselor as therapist simply does not emerge. 
| he book would convince the beginner that there 
5 а lot to learn, For the advanced student, it 
Fe Short in basic theoretical orientation to 
ү unseling. For example, the Rogerian approach 
‘ characterized as a set of special methods 
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(without describing them) and the more funda- 
mental contribution of non-directive theory is 
not discussed. This would seem to be a major 
shortcoming in view of the inclusive title of the 
book. Similarly, the psychological treatment of 
interviewing seems sketchy by comparison with 
the "rules for orientation" proposed by Roeth- 
lisberger and Dickson years ago. The chapter on 
interests, although expertly calling attention to 
the practical difficulties of interpreting interest 
inventories, seems less theoretically oriented 
than it might have been. In a sense, the major 
contribution of the book is its title. This is not 
faint praise. The term general clinical counsel- 
ing represents an area of training and practice 
which already has wide recognition, but for 
which a widely accepted name has been lacking. 
It is to be hoped that the job title will enjoy 
the vogue it deserves, that its use will further 
the destruction of artificial compartments of 
training, and result in the effective cross-fertili- 
zation of clinical psychology, guidance, and 
“Rogerianism.” 

Occupations 29:317-9 Ja ’51. Joseph V. 
Hanna. This book on general clinical counseling 
impresses the reviewer as being the most im- 
portant contribution in its field that has ap- 
peared in the last decade. The book is well bal- 
anced, well documented, and well written. The 
well-selected but not exhaustive bibliography 
at the close of each chapter enables the reader 
to become familiar with the most recent im- 
portant experimental and other contributions 
to the field of counseling. * The distribution of 
contents suggests the skilful balance the authors 
employed in dealing with the essential topics 
treated. Several over-all features of the book 
might well be mentioned before reviewing its 
detailed contributions. Many students of coun- 
seling will be gratified with the restoration of 
the term “clinical” to its rightful place in gen- 
eral counseling. Before World War II the clini- 
cal method had been established as the sine qua 
non of counseling. Following the War the term 
came to be so universally identified with psycho- 
therapy that many in the field of general counsel- 
ing, including the reviewer, all but despaired 
of using the term without being misunderstood. 


БӨРҮ | 
ts reapplication to the field of normal counsel 
ur aa the principle that the clinical 
method applies to all responsible case work and 


counseling. The authors are to be congratuated 
for rendering this service to the counseling move- 
ment at this time. * The authors express the 


В186 


intention to be panoramic rather than encyclo- 
pedic in their treatment, and this intention is 
reflected throughout the book. The panoramas, 
however, are not presented in hazy outline but 
are quite definitive in terms of theories, general 
principles, and methods. While the volume is 
perhaps less a handbook of methods than a 
treatise on basic principles and theories, several 
solid chapters are devoted to a discussion of tools 
and techniques. Nothing is regarded as fixed. 
The point of view is expressed that even before 
the manuscript would come off the press cer- 
tain changes and additions would probably be 
needed. While the non-directive school has ob- 
viously had its impact on the book the funda- 
mental relationship between counselor and coun- 
selee has not changed materially from the point 
of view expressed previously by Hahn, William- 
son, and their colleagues. The counselor is pri- 
marily responsible for the conduct of the inter- 
view, yet the counselee is not to be dominated 
by the counselor. The authors appropriately 
point out the unnecessary confusion resulting 
from the use of the term "'client-centered." All 
counseling worthy of the name is client-centered, 
whatever the school using the term. * In dis- 
cussing tools and techniques of counseling the 
authors add sociometric and semantics devices 
to those usually dealt with in most earlier books. 
Appropriate emphasis is given to records, the 
interview, and systematic case study methods as 
indispensable over-all tools, in addition to stand- 
ardized measuring instruments and other de- 
vices. It would seem to the reviewer that the dis- 
cussion of the rating scale might have been 
enlarged somewhat in the light of data on buddy 
and forced choice ratings—recent contribu- 
tions from the armed services and industry. The 
authors wisely stress the importance of voca- 
tional information in counseling, a subject which 
has been too greatly subordinated to the analysis 
of the individual in a great deal of the counseling 
literature. Much of the material from detailed 
job analysis is regarded as largely irrelevant to 
vocational counseling, a point of view which 
many counselors will welcome. The large pic- 
ture of the occupation as a whole must not be 
lost through a too early emphasis on a minutia 
of details. The counselor must be familiar with 
vocational, industrial and social trends, and 
must be familiar with sources of more detailed 
information. In adopting a working definition 
of aptitude the authors face up to a multitude of 
semantic difficulties. They reject the narrow or 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ % 


“риге” definition as inadequate in appraisi 
those complex patterns of abilities required by 
learning and work situations to which youn, 
people must adjust. A wide synthesis therefore 
is required in the analysis of aptitude, They fol. 
low rather closely Bingham's concept that apti. 
tude is “a pattern of traits deemed to be indica- 
tive of potentialities.” In spite of the recognized 
semantic limitations and confusions the authors 
choose to use the familiar designations such as 
“clerical” and “mechanical” in their discussion, 
In doing so they are in step with practices found 
in most counseling agencies. Throughout the 
book much emphasis is given to field and level 
concepts. In considering the appropriateness of 
the field the counselee's potential level within 
the field is too frequently neglected. The coun- 
selee may consider a field which is appropriate 
but at the same time aspire to an inappropriate 
level within the field. However, the authors warn 
against attempts to predict level on the basis 
of a single index of aptitude. The determination 
of level is a complex process and must be based 
on a wide synthesis of aptitude, interest, and 
other factors. This warning is timely. It should 
help prevent the too easy use of single test scores, 
prediction tables, and other single devices m 
educational and vocational counseling. One 
chapter of the book is devoted to methods of 
collecting data in preparation for counseling à 
student. In connection with the collection of 
data, 55 questions raised by the hypothetical 
counseling intern are offered. * This teaching de- 
vice is unquestionably good. Most of the ques- 
tions are excellent, but some of them seem rather 
elementary for the graduate intern who is n 
grounded in the principles and methods of S 
ing. In view of the crowded schedules of a 
counselors and counselor-trainers it would se 
that the list of questions might well be nnn : 
However, the reviewer finds little in the us 
which requires negative criticism. If the Na 
of the book had been widened to ia die 
social agencies, many of which have ma fin 
able gains in recent years in gaien ae 
counseling, its usefulness would have bee m 
creased for a considerable number of wen iis 
Most of the content of the book is am i 
about as applicable to social as to Ө ip 
agencies. While the authors give the H pn 
a dignified and active role in the ve 1 ei 
tionship, it seems to the reviewer that e adiit 
have gone even further in playing up fcing tH 
role of the counselee, without sacr! c 
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rightfully authoritative role of the counselor, 
There will, of course, be differences of opinion 
among counselors in regard to these respective 
roles. The common use of the term “diagnosis” 
jn counseling would seem in part responsible for 
the over-formalization of the counseling rela- 
tionship. Borrowed as the term is from medical 
practice, it implies a too dominant role by the 
counselor. Since the term is so widely accepted 
and used in counseling, writers face the obliga- 
tion of so qualifying its traditional meaning as 
to preserve and safeguard the client's active role 
in the counseling relationship. No counselor, or 
teacher of counseling courses, can afford to be 
without this excellent book. For it to run into 
several editions would guarantee the more rapid 
advancement of counseling as a profession. 
Psychiatric О 25:166 Ja '51. * The material 
is well selected * The text covers its area well 
and should be of tremendous help to the student 
of counseling, in understanding his field and his 
role as a counselor. 
Psychol В 48:28 5-6 M y'51. Ralph F. Berdie. 
* The authors' approach is problem-centered or, 
perhaps better, centered about students with 
problems, and the general role of counseling in 
education for personality development is given 
little attention, Accepting this limitation, Gen- 
eral Clinical Counseling is a commendable text. 
The first part of the book concerns itself with a 
definition of counselors and counseling. * In- 
cluded in this section is a discussion of the 
Preparation desirable for counseling. Conspicu- 
ous through its absence is the statement issued 
by the National Vocational Guidance Associa- 
tion in 1949. * undue attention was given to the 
Minnesota Occupational Rating Scale. Although 
this scale has been found to be extremely useful 
in the training of counselors, its actual use in 
counseling interviews is relatively limited. The 
final Section of the book is devoted to a syste- 
matic case study. The case study would be of 
greater instructional value if it included actual 
excerpts from the interviews. Before interviews 
could be recorded and accurate typescripts ob- 
шеф, а textual description of case work was 
€ best that could be used for instructional pur- 
Poses, but now that recording equipment is so 
easily available, samples of the actual interviews 
“an be used most effectively. The chief criticism 
that can be made of this text is that it looks back- 
i rather than forward. Most of the discus- 
d Centers about what counseling has been, 
ather than what it could be. Much of the rele- 


vant research done in counseling, and certainly 
there has been relatively little, is omitted, and 
the authors apparently do not base their presen- 
tation on that research. Such pertinent research 
as that recently coming from the University of 
Chicago is not mentioned. The recent emphasis 
on the social context in which counseling takes 
place also fails to appear in this book, although 
the recent contributions of semantics and soci- 
ometry are recognized. The social significance 
of counseling is not explored. Worth particular 
mention is the book's fine and lucid style of 
expression. It provides both interesting and 
pleasurable reading. 


[B187] 
X*HarsEv, Grorce D. Selecting and Inducting Em- 
ployees: A Handbook of Tested Procedures. 
New Y: Harper & Brothers, Hur E xi, 361. 
t 


ork: 
$4.50. * (London: Hamish Hamilton . 36s.) 


Mgmt R 40:314-5 My'5r. Martin M. Bruce. 
One's first reactions to Mr. Halsey's latest work 
is that the title is misleading. The author offers 
21 chapters on the subject of selection but only 
one on induction. Throughout the book his 
approach is atomistic, He deals largely with spe- 
cific factors, doing little tying together. In gen- 
eral, this book is more a conglomeration of quo- 
tations and excerpts from authorities and from 
people considered to be authorities than a mean- 
ingful unit. Those looking for a profound or 
theoretical treatment of selection methods and 
practices will be disappointed. Mr. Halsey has 
been rather cursory instead of intense in his 
treatment of the subject matter, However, his 
presentation is, for the most part, fairly lucid ; 
business men will not find it a chore to read a 
chapter at a time. Because too many subjects 
are treated, some which cannot be dealt with in 
a cursory manner suffer. Care must, therefore, 
be taken not to view the author's words as final, 3 
particularly in his discussion of such technical 
subjects as "The Meaning of Test Scores, and 
in his presentation of just one "system 1 of 
selecting salesmen, despite his statement, In 
few, if any, fields has there been so much study 
and research as the use of tests in the selection 
of salesmen.” One very common shortcoming 
in other volumes on personnel appears here too. 
Nowhere is the individual looked upon as a 
dynamic whole. Instead, he is viewed from dif- 
ferent angles only, depending on the subject 
matter. Greater stress on personal cleiagoi 

roaches rather than mass impe n- 
зү. methods would probably result in 


, 
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greater progress. The author’s choppy approach 
to his subject suggests that readers would do 
well to limit their reading to the better sections 
—for example, those treating the legality of 
questions in application forms, physical exami- 
nations (particularly the section on vision), and 
company benefit programs. Good use of the tech- 
nique of borrowing from the experiences of 
others is made in his discussion of benefit pro- 
grams and again in the section entitled “Five 
Successful Selection Programs.” The same tech- 
nique makes some of the material on recruiting 
of value to most readers. One topic that receives 
broader coverage than others is interviewing. 
Most employment interviewers would do well 
to study the author’s “Aids in Keeping the Ap- 
plicant Talking,” his “Twenty Suggestions for 
Making the Directive or Controlled Interview 
More Effective,” and his admonition to “Ex- 
amine and Discount Your Own Prejudices and 
Unfounded Beliefs.” ( All of these, however, are 
tall orders—as is his treatment of the concept 
of halo, where, like so many others, he says “Do 
it!” without explaining how.) Mr. Halsey takes 
on a Herculean task when he tries to deal with 
tests, particularly personality tests, in a brief 
space. He covers paper-and-pencil inventories in 
a single page, though O. K. Buros reviews some 
30 instruments in this category in his Third 
Mental Measurements Y ear Book. The author's 
fairly long quotations from this volume prob- 
ably constitute the best parts of the chapters on 
tests. (One quotation is particularly worth re- 
peating: "It seems probable that, unless the 
services of a well-trained and experienced psy- 
chologist are constantly available, the interview, 
properly conducted, will be more effective as a 
means of determining personality traits than all 
the tests now available") In the discussion of 
the theoretical aspects of mental ability, the 
thinking seems rather loose, some of the com- 
ments representing Mr. Halsey's own opinions 
rather than customarily accepted theory. How- 
ever, the reader will find better and more mean- 
ingful statements when practical aspects are dis- 
cussed. There is an occasional remark that is 
almost aphoristic. One such is hidden away in 
his discussion of mental ability tests: “Job titles 
mean entirely different things in different or- 
ganizations.” Among the additional valuable 
items in the 11 chapters on testing is a very com- 
prehensive summary of a mimeographed report 
of the Employee Relations Department of the 
Standard Oil Company (New Jersey) entitled 
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Notes: Conference on the Use of Employee 
Tests and Measurements. In general, though 
material worth reading can occasionally be 
found, the book as a whole can hardly be cop. 
sidered a contribution to personnel literature, 
Personnel J 30:71—2 Je’51. E. Fassberg, Mr, 
Halsey...has contributed many helpful ideas in 
his previous books. * The present book, however, 
is very disappointing. It contains an astounding 
number of quotations from the work of other 
authors. There is relatively little material writ- 
ten by Mr. Halsey, and hardly any integration 
of the quoted material. In many instances a ma- 
jor section or chapter is made up chiefly of quo- 
tations, sometimes even from only one author on 
the subject, with merely a few brief explanatory 
remarks by Mr. Halsey. The quotations are 
often very lengthy. Most of Chapter III is quoted 
from the "Training and Reference Manual for 
Job Analysis," published by the Division of 
Occupational Analysis of the United States Em- 
ployment Service. Chapter V, on the Applica- 
tion Form, and Chapter VIII, on the Employ- 
ment Interview, are also largely made up of 
quotations that run to several pages. This 
method is typical of the whole book. The neo- 
phyte is bound to find much here that is o 
interest, but anyone at all familiar with the lit- 
erature will not. Much of the material is quoted 
from company publications, reports, etc. and 
this, of course, is less readily available to the 
average reader. Mr. Halsey visited and qd 
sponded with a great number of firms to 8 
up-to-the-minute information on how е 
and induction are being carried on. It is regret 
table that he did not present his ire 
more selective manner. In his Preface, Mr. 
sey makes clear that this method of па 
the book was chosen after careful ci 
He says, “....while I have, of course, drawn 2 
ily on my experience of over thirty years auf 
employment manager and personnel pre 3 
my aim has been to report as accurately an М 
lly accepte 


: s Шу 
best practice, rather than merely to expres i 
ferences Cl 
k do not 


; ive 
indicate that Mr. Halsey has been very intensi” 
in his research. On many subjects, he, ad 
from only one or two sources when, in that the 
literature offers a much richer vues gel 
reader should be apprised of if he Á ү ы 
fair picture. On the other hand, if t dets the 
tions represent what Mr. Halsey ons 
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best and most representative material on these 
subjects, it is regrettable that he does not com- 
ment more fully on the selections, instead of 
merely quoting them. The most helpful feature 
of the book is the approximately 120-page sec- 
tion reproducing forms, tests, etc. This consti- 
tutes almost one-third of the whole book. There 
isalso an Appendix listing several publications 
recommended by Mr. Halsey, and another in 
which *Sources of Information and Help" are 
listed. (How could the National Industrial Con- 
ference Board have been omitted from this list?) 
The Index is carelessly prepared. АП in all, this 
book falls far below what we have come to ex- 
pect of Harper publications in this field. 

Pub Personnel R 12:105-6 Ap 51. Norman 
C. Ecklund. Mr. Halsey significantly subtitles 
this reference as “A Handbook of Tested Pro- 
cedures.” It is at once apparent that the author 
has drawn from a wide variety of personal ex- 
periences and associates in garnering together 
many of the best personnel selection practices 
known to industry, government, or education. 
The book is set forth as a handbook for execu- 
tives in factories, stores, and offices who are 
responsible for the selection of new employees. 
Its use as a text or reference by groups studying 
personnel management is suggested with a plea 
for its evaluation in terms of its practicality. The 
claim for practicality is well founded. * Readers 
will be especially interested in the kinds of in- 
formation held discriminatory by the New York 
State Commission Against Discrimination in 
its 1948 Report of Progress. * an excellent treat- 
meni of the industrially all-important interview 

an interesting description of the Interaction 
Chronograph. The heart of this work is its au- 
thoritative reporting of the steps to follow in 
introducing and developing a testing program. 
4 Employee induction as distinct from selection 
's given too brief treatment to satisfy the re- 
Viewer that even the highlights of the subject 
lave been given an accounting. А suggested 
library on selecting and inducting employees 
contains a cross-section of the basic works on 
the subject. Further sources of information and 
һер are listed, but they are by no means ex- 
austive, Samples of forms which illustrate the 
Subjects of successive chapters of the volume 
die up some 123 pages. These are representa- 
ive of the field of the text itself and exemplify 
Some of the best in the profession. The Job 
nalysis Schedule drawn from the War Man- 
Power Commission is a first-class model for the 
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technician doing classification or recruitment 
work. Throughout the book there is stress on 
the practical aspects of personnel operations. 
Every effort has been made to lead the reader to 
an understanding of the technical if not the 
mathematical basis of modern test usage. But 
the application of this fund of experience and 
information will be easier for the personnel man 
who brings to its reading more than a course in 
psychology and a host of good intentions. It is 
significant reading for anyone who has a part in 
selecting, placing, or supervising people. 
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*Hanp, Harop C. What People Think About 
Their Schools: Values and Methods of Public- 
Opinion Polling as Applied to School Systems. 
Yonkers, N.Y.: World Book Co., 1948. Pp. iv, 219. 
$2.60. * For reviews, see 53. 
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*Hanrrn, Artur Epwin, Jn. Differential Patterns 
in Schizophrenia on the Wechsler-Bellevue In- 
telligence Test. Allahabad, U.P., India: the Author, 
Ewing Christian College, 1950. Pp. ix, 71. dv Rs. 
4-6-0 ($0.95) ; cloth, Rs. s-14-0 ($125). * (РА 26: 
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Harris, Еѕтнев Кїтє. The Responsiveness of 
Kindergarten Children to the Behavior of Their 
Fellows. Monographs of the Society for Research in 
Child Development, Vol. 11, No. 2, Serial No. 43. 
Evanston, Ш.: the Society, Northwestern University, 
1948. Pp. vii, 184. Paper, lithotyped. $2.50. * (PA 23: 
5357) : 

Q R Biol 25:420-1 D ' 50. Helen Arthur, 'This 
is another psychological monograph turned out 
with typical, if monotonous, competence. It fol- 
lows the standard form, so there are no sur- 
prises in the presentation or in the conclusions 
summed up so efficiently in the last pages. Using 
a pictorial Guess Who technique on 24 kinder- 
garten children, plus their teacher's evaluations 
and parents' reports, Mrs. Harris is able to state 
that social responsiveness is a: personality factor 
which is not related to age, LQ., or sex. Even 
in kindergarten children, there is a wide range 
of social responsiveness—some preferring а 
small group of congenial friends and others a 
larger social orbit. The children gave evidence 
that their judgments on each other were largely 
determined by their own social indoctrination, 
whence follows the inevitable hint that children 
should be “rightfully guided and directed” so 
that their inherent social responsiveness will be 
harnessed to the best interests. 
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ry. The Group Approach to Leader- 
E ETI Foreword by Ronald Adam and Pref- 


1 


В1д1 


асе by A. Torrie. London: Routledge & Kegan Paul 
Ltd., 1949. Pp. x, 288. 215. * 

В Ind Psychol & Personnel Practice 6:48—9 
D'50. R. S. Horne. * The author addresses him- 
self to people with some psychological back- 
ground who are concerned with selecting lead- 
ers in any field. Such people will find the main 
part of the book which describes the methods of 
the Boards, complete enough to be used as a 
starting point for developing methods suitable 
to their own field. In the book, however, the 
problems of adapting the methods to non-mili- 
tary purposes are summarily treated, and the 
author appears simply to have jotted down a 
few points that occurred to him. It would per- 
haps have been better to discuss more fully the 
broad principles on which these methods are 
based, rather than to concentrate on the pro- 
cedures involved. The theoretical discussion 
which precedes the description of the methods of 
the Board is inclined to be superficial. In an 
attempt to present theory simply and with a 
minimum of technical terms, the author has 
fallen into over-simplification, sweeping gen- 
eralisation, and psychological cliché. For ade- 
quate treatment of theoretical aspects of these 
methods the psychologist must await a more 
complete work, but the book may be regarded 
as a useful interim account of the methods of 
the War Office Selection Boards. 

Brit J Med Psychol 23:122 pts 1-2 "50. B. S. 
* 'The author and his sponsors point out that 
the book is in no sense an official account of the 
work of War Office Selection Boards....but is 
to be regarded as “an individual reaction to a 
collective effort." * many of the details of tech- 
nique described by Dr. Harris were not in gen- 
eral use at W.O.S.B.'s during the period under 
discussion, but were manifestations of the “local 
idiom" which individual Boards were at one 
time encouraged to develop. In particular the 
"Human Problems Session," a form of drama- 
tized stress interview on Moreno lines, which 
Dr. Harris singles out as the most useful single 
component of W.O.S.B. procedure, was dropped 
quite early from general use in normal Board 
practice, and was not even included for study 
in the important 1945 reliability experiment, an 
experiment of which, incidentally, Dr. Harris 
makes no explicit mention. When it is realized 
that Dr. Harris uses "one" in the sense of the 
first personal pronoun it becomes a great deal 
easier to disentangle his personal opinions, ad 
hoc neologisms and favourite working methods 
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from normal Board procedure. As it i 
ae of discussion are interspersed a 
purely descriptive material, an 
which tends to obscure both the thesia, 
the practical issues. Rather disconcerting ds 
are the author's frequent references to Zen ii 
losophy and illustrations from the orti й 
Judo, most of which appear to this reviewer to 
have not more than borderline relevance to 
selection for leadership. As Dr. Harris himself 
points out, a reader who has not himself wit. 
nessed a Board in action, will find it difficult to 
visualize the proceedings and to sense their at- 
mosphere, A clearer division between descrip. 
tion and rationalization would have been of as- 
sistance to such a reader. On the basis of his 
experience as a W.O.S.B. psychiatrist, Dr. Har- 
ris considers the services of an experienced med- 
ical psychologist to be a sine qua non for effec- 
tive functioning of a Selection Board. To one 
familiar with the work of a Board thus equipped, 
a Board lacking professional psychological help 
is indeed difficult to envisage, and it is perhaps 
early yet to say what will be the result of the 
withdrawal (since 1946) of such personnel. Con- 
sequently, too, the future development of certain 
techniques specific to the military field, which 
Dr. Harris treats as a live issue, remains prob- 
lematical. In a short closing section, however, 
entitled “Trends and Implications,” the author 
discusses the application of Leaderless Group 
methods to nonmilitary fields, and this is per- 
haps the most valuable portion of the book. 
Brit J Psychol, Gen Sect 40:227-8 Je '%. 
John Munro Fraser. * Harris has given an ac 
count of his experiences and reflexions while 
serving as a psychiatrist on several War Offic 
Selection Boards. From it the determined reader 
can piece together the procedure and its 18 
tionale, though his task is made more difficult by 
Dr. Harris's predilection for referring to things 
by their initial letters. In some cases this 5 
misleading, as when he uses IQ to mean 7" 
terests Questionnaire; in some cases it 18 m 
accurate, for he refers throughout to Р50% 
when he means what, until 1945 at any js 
were МТО” ; and in all cases it is irritating, "i 
if he expects a reader to unravel а sentence 
the following he should provide a min в 
MATCOP whose СЕ is high, but whose б ui 
low may be quite unable to function ОП s йе 
ship level. An IMDEP or IMAG may, de? 
a personality handicap, have a sufficiently н 
GE to so function without serious strain Of 
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stability ; though his GE will of course be cor- 
respondingly diminished.” The reader might 
then be able to deal with the lingering suspicion 
that the last GE is really a misprint for СЕ. Dr, 
Harris makes no secret of his belief that without 
the clinical experience of the psychiatrist such 
Boards would have been gravely handicapped. 
This view has apparently not been widely shared 
by the authorities concerned, who dispensed 
with the psychiatrists’ services shortly after the 
end of the war. Psychologists seem to have ren- 
dered certain limited services in looking after 
some cognitive testing, checking educational 
standards and vocational requirements, and gen- 
erally taking care of some of the paper-work. In 
terms of their usefulness to the psychiatrist, 
which appears to be the essential criterion of 
value throughout the book, they seem to have 
ranked rather below the sergeant testers (p. 
228). Paper-and-pencil testing, of course, has its 
limitations in this type of selection work, but 
these are not quite so serious as Dr. Harris's 
somewhat naive conception of cognitive functions 
would lead one to believe (p. 55). In pro- 
cédures where a more developed theory of psy- 
chometrics is accepted, the role of the psycholo- 
gist has been rather different, and possibly more 
significant. The author does well to stress the 
importance of the psychiatrist's clinical experi- 
ence in the assessment of personality, otherwise 
the reader might be encouraged to attempt an 
ideational comprehension of the categories with 
which he works. In a chapter entitled “The Pro- 
jective Psychiatric Interview” he puts forward a 
series of concepts such as the individual’s level 
of functioning, actual and potential, his atti- 
tudes to other people, objective or immature, and 
his affective balance as shown by the warmth, 
spontaneity and effectiveness of his behaviour. 
These could be put together into an adequate, 
though perhaps incomplete, theory of personal- 
ity, on which reasonably objective assessments 
9n à commonly accepted series of categories 
might become possible. There is, however, a 
joker in the pack. This is entitled "stability," 
and on it there can be no common ground be- 
tween those with “clinical experience" and those 
outside the magic circle. *Ratings on Stability 
аге a matter for the medical psychologist." The 
book depends on theories of leadership which 
derive largely from Lewinian concepts, and the 
leader is seen as one who can promote co-opera- 
tion as required by the Law of the Total Situa- 
tion. It is curious that this conception, with its 


low valuation of competition and dominance by 
an individual, should have emerged in such a 
military setting. War is in its essence competi- 
tive, the aims of each side being to create a Total 
Situation to which the other cannot adjust with- 
out losing its integrity and effectiveness as a 
fighting force. Military commanders must, on 
occasion, resist the demands of the Total Situa- 
tion, and those who are tempted to adapt them- 
selves too readily to certain group pressures 
may find themselves in danger of being consid- 
ered bad officers. Provided that enough time is 
taken to observe the candidates, however, and 
so long as the assessment of their potentialities 
is made the subject of rational discussion, it 
seems that satisfactory results can usually be 
expected from Selection Boards, no matter what 
the underlying theories may be on which they 
are conducted. The years since the end of the 
war have seen an extended use of group pro- 
cedures and other protracted methods in selec- 
tion for higher appointments. Many interesting 
problems are posing themselves, such as the 
reality-level to be aimed at in group sessions, 
the most economical and effective means of ac- 
cumulating projective material from candidates, 
and the relation of the short-term intensive ob- 
servation of them in the interview and group 
sessions to the longer-term extensive view 
gained in the biographical case-history. Dr. Har- 
ris rather tantalizingly shows himself aware of 
these in incidental passages, but in general he 
neglects them in favour of what appear to be 
less essential and at times even trivial issues. It 
is a matter for regret that from the vast mass of 
case-material which these appointments put at his 
disposal he makes use of only one real or imag- 
inary case as an illustration. And he reduces the 
value of this drastically by talking of a Scottish 
student gaining Matriculation. | 

J Mental Sci 97:217 Ja '51. 5. W. Gillman. 
* a difficult book to read because of the language 
used by the author, and because the technique of 
selection is given in great detail, It will be of 
great interest to those who are working in the 
psychological field of selection, and to psycholo- 
gists and psychiatrists now working on projec- 
tion test techniques. It is an excellent account of 
officer selection methods during the past war. 

Lancet 257:1224 D 31 '49. * The author was, 
by general repute, highly adept in the techniques 
he describes. Unfortunately skill in the prac- 
tice of a technique does not always make for 
clarity in its exposition, and Dr. Harris has not 
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simplified his task by combining psychiatric jar- 
gon with a military system of abbreviations. 
We encounter in his pages such monsters as 
MATCOP (mature codperative or objective 
` personality) and IMAG (immature-aggressive 
personality) and we make our way through the 
"progressive revelation of personality from the 
UGD through the IS to the HPS." Neverthe- 
less, those who are concerned to discover “what 
went on at WOSB" will find the procedures 
fully described here. Dr. Harris is at his best 
when what he is describing is most concrete, 
whether it be the various group situations which 
were contrived for officer selection, or the meth- 
ods of Japanese wrestling, which analogy 
prompts him to consider ; he is least at home in 
his introductory treatment of theory. The for- 
midable task of validation is not seriously at- 
tempted (we learn from Sir Ronald Adam's fore- 
word that it will be undertaken in a later, and 
official, account) ; and the chapter, by another 
author, on the validation of boards, observers, 
and selection procedures could probably have 
been omitted from the present work without 
detriment. - 

Nature 166:166 Jl 29 50. Е. C. Bartlett. * It 
is interesting and possibly significant that the 
account begins by a discussion of "theories." 
Maybe this is merely a matter of convenience in 
exposition; but it is hard to escape from the 
impression that, first, a general and very specu- 
lative view of personality and group functions 
and relations was accepted (mainly in terms of 
the “field theory" of Lewin and others, with a 
strong mixture of general psychiatry, perhaps 
largely of a Freudian type) and then all sorts of 
ingenious ways of illustrating this view in con- 
crete situations were organised and applied. If 
this is what happened, nobody has any right to 
complain, for the conditions were urgent and 
the time short. But the basic problem of how to 
find sectional leaders remains as important in 
peace as in war, and, if it is implied that the 
procedure adopted should remain a model for all 
subsequent approach, difficulties are bound to 
arise. Obviously the justification of any selec- 
tion procedure must rest mainly upon valida- 
tion. A chapter on validation of the War Office 
Selection Board methods, written by Major 
Gavin Reeve, is included in this book ; but it is 
not satisfactory. It does show that the verdicts 
reached were, by the accepted statistical criteria, 
reasonably reliable; but that is not enough, es- 
pecially in view of the very uniform training 
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given to the essential officers of the selection 
boards. It shows also that training wastage was 
substantially reduced, and this, in the circum. 
stances, is a great and important point in favour 
of the general scheme. But it does not, and in- 
deed cannot, show whether the leaders selected 
were in later trial significantly better than those 
picked out in any other way. That this is a criti- 
cally difficult task anybody with experience, even 
in much more defined fields of personnel selec- 
tion, will at once agree ; but it is a task that must 
be attempted when there is time, for it will cer- 
tainly never be accomplished in an atmosphere 
of crisis. On a descriptive level, the book is ex- 
tremely interesting and well written. Tt reveals 
a carefully planned and conducted effort which 
deserves all the praise—and perhaps some of the 
blame—which it has received. Very appreciative 
comments by Sir Ronald Adam and Brigadier 
Torrie are included, and these two are, beyond 
doubt, among those who should know. 
Occupational Psychol 24:186-7 Jl '50. N. 4. 
B. Wilson. It is now realized that the institution 
of the famous War Office Selection Boards 
(WOSBs) to select British Army officers in- 
volved no striking innovations in psychological 
theory but that the movement was remarkable 
in two other ways. In the first place, the pioneers 
of the system showed great boldness in deciding 
to test candidates in action with their fellows as 
well as individually in conversation with one 
or more interviewers. In the second place, they 
were remarkably skilful and successful in elab- 
orating their central idea into a comprehensive 
selection procedure, which quickly became the 
standard method of officer selection throughout 
the Army and eventually spread to many 0 к 
institutions. While the curiosity of the gene = 
public about tlie WOSBs has long since abated, 
many psychologists continue to feel the need e 
a full dress account of both technical and 4 : 
ministrative aspects of the system; and all thos 
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concerned with training and selection for pos 
d the professions 


und and i 
y in other 

m : ; an 
spheres than the military. It is to this ag em 


especially to the latter part of it, that Dr. 
addresses himself. The publishers Й 
book is designed primarily for tho E 
be concerned with selection for leaders 
a managerial level who have some pase 
background. The result, unfortunate y M 
tremely disappointing. The writing is 50 
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sistently bad (except in the short section on vali- 
dation, by Major Gavin Reeve) that the reader 
must suspect that the content cannot be sensible. 
There is a lush growth of jargon, abbreviations 
and other pseudo-technicalities, and the think- 
ing seems wooly. When all this has been 
discounted, there remain some sensible observa- 
tions, e.g. that selectors should, if possible, sat- 
isfy themselves that they have obtained all the 
relevant evidence about a candidate before they 
dispense with his presence. But such observa- 
tions, though useful, are neither novel nor es- 
pecially related to a discussion of WOSB meth- 
ods. The reviewer’s main quarrel with the book, 
however, is that it does not seem well suited to 
its purpose of informing and guiding selectors 
in industry and the professions. Instead of ab- 
stracting the main features of WOSB practice 
and discussing their applicability outside, the 
author occupies most of his space with detailed 
descriptions of all the WOSB “tests” and with 
speculative judgments as to “what one gets out 
of each.” By contrast, the question of applica- 
bility to non-military selection receives exceed- 
ingly superficial attention in a chapter of less 
than five pages. Despite the emphasis upon 
“tests” there is very little that is helpful about 
the interpretation and significance of candidates’ 
performance in them. Nor are such obvious dif- 
ficulties as differences between candidates in 
age, maturity, education and social environment 
adequately treated. Dr. Harris realizes that 
something constructive must be done about those 
not chosen for leadership, but in this matter, as 
in others, his prescription is vague and porten- 
tous. The “able disruptors,” for example, are 
to be "watched for and....given special place- 
ment, therapy if required or—in the last issue— 
controlled and supervised.” In general, the 
book’s method of dealing with any serious diffi- 
culties that arise is to suggest that they be re- 
ferred to the medical psychologist. The possi- 
bility that this may not be feasible, even if 
desired, is not faced. The Group Approach to 
Leadership Testing could give rise to three dan- 
gers, in the reviewer's opinion. In the first place, 
its over-emphasis upon WOSB “tests” might 
Ps to a procedure which had the WOSB form 
but was without the spirit and knowledge which 
informed at least the better WOSBs. In the 
Second place, it might lead in some cases to an 
е dependence upon the psychologist in selec- 

n, and particularly upon the medical psychol- 
gist. In the third place, it might lead some of 


its more critical readers to conclude that their 
suspicions about psychology and psychologists 
were confirmed, and decide henceforth to have 
nothing more to do with either. 

Personnel Psychol 4:306-11 au 51. Bernard 
M. Bass. * Spontaneous leadership behavior is 
thought to be an indicator of sound mental and 
social health and represents the antithesis of 
neurotic behavior, which is considered a func- 
tion of maladjustment of interpersonal rela- 
tions. The reviewer agrees that leadership be- 
havior and neurotic behavior for a particular 
situation are continuously distributed, but it is 
probable that each is a separate variable which 
is far-from-perfectly negatively correlated from 
the other. Certain neurotic tendencies (with- 
drawing, shyness, negativism) may correlate 
with lack of tendency to play leadership roles 
but what about those who assume positions of 
leadership in order to compensate for insecurity 
feelings through dominant, aggressive behavior ? 
* The War Office Selection Board (WOSB) 
Observer Team consisted of six Army officers 
* For one sample of thirty-three candidates, 
pooled ratings by all assessors of the WOSB 
yielded a validity of .58 against a criterion of 
final grades in officer candidate school. (A single 
assessor correlated .59 with the criterion for 
this same sample.) Since the range of the cri- 
terion was reduced to one-third the original by 
allowing only the top one-third who passed 
WOSB, the validities are impressive. Unfortu- 
nately, except for a table of successes and fail- 
ures in assessment to be discussed below, no 
validity coefficients are reported of any samples 
of a respectable size although undoubtedly they 
are available. Gavin’ indicates that validity 
coefficients varied from one type of officer candi- 
date school to another, but does not cite sup- 
porting evidence. He also indicates that the con- 
tribution of the separate parts of the testing 
procedure towards the validity of the whole was 
not measured, but that work in this direction is 
now possible. * The reviewer was somewhat 
disappointed in the failure to present more de- 
tailed statistical analyses. If such analyses had 
been given here, it is quite possible that the 
following hypotheses might have been verified 
with a resultant increase in the efficiency of the 
program of assessment. (1) The results of only 
a few tests when optimally combined will yield 
predicted success scores in officer candidate 
school more valid than those obtained by means 
of the more lengthy test battery. (2) To pre- 


В192 


dict future behavior in specified situations, the 
best predictor of that behavior is behavioral rat- 
ings in a test situation, similar to the real situa- 
tion to be predicted, rather than ratings of a 
complex of intervening variables "personality," 
whose nature is inferred from behavior in the 
test situation. (3) Independent ratings by as- 
sessors combined statistically in optimum fashion 
yield more valid assessments than pooled rat- 
ings based on a conference of assessors. It is 
probable that pooled ratings are biased in the 
direction of the rater of highest status. This book 
ought to have great appeal to those favoring a 
clinical approach to assessment. However, the 
reviewer believes that the most progress beyond 
the pioneering stage represented by this book 
will be made by reducing the subjectivity of 
evaluations and reducing the need for observers 
to infer status on various personality dimen- 
sions from the behavior of the examinee. Until 
we have anchored the dimensions of personality 
down to reality, until the constructs of our the- 
oretical systems have been found both empiri- 
cally and rationally sound, it is probable that 
greater forecasting efficiency will be obtained 
from the observations of examinees' behavior in 
situational tests, using these kinds of observer 
assessments as predictors of desired criteria, 
keeping evaluations on the level of the exhibited 
behavior rather than by attempting to make as- 
sessments of personality inferred from the ex- 
hibited behavior. The book should be regarded 
as the history of another important episode in 
the field of situational tests. However, the 
hypotheses presented in the main must be con- 
sidered as unverified. In fact, some of the hy- 
potheses have already been refuted by quanti- 
tative analyses made since 1945 by other 
investigators. The book has made an important 
contribution to the field of leadership assess- 
ment and despite disagreement with its theoreti- 
cal orientation, the reviewer feels the report can 
serve a useful purpose as a source book of tech- 
niques and hypotheses for personnel psycholo- 
gists in this area. 
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*Harris, Rosert E.; MILLER, James G.; Мокхсн, 
GEORGE A.; Stone, L. JOSEPH ; TEUsER, HANs-LuKas; 
AND ZUBIN, JosepH. Recent Advances in Diagnos- 
tic Psychological Testing: A Critical Summary. 
Springfield, Ill.: Charles C Thomas, Publisher, 1950. 
Pp. x, 120. $275. * (Oxíord, England: Blackwell 
Scientific Publications, Ltd. 20s.) (PA 25:1788-9, 
1797, 1823, 1855, 2006) 
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Brit J Psychol, Gen Sect 42:302-3 Ag "gi 
R. W. Pickford. * written by six separate ay. 
thors. The first contribution is by James G. Mil. 
ler and deals with contemporary trends in 
psycho-diagnosis. In this he discusses, in a valų- 
able way, changes in outlook which have taken 
place in recent years. * The reader will find this 
book a useful review of present-day tests, points 
of view and methods, and it will serve to bring 
his ideas up to date on the problems of diagnostic 
testing with which it deals. There is no index, 
but there is a useful bibliography attached to 
each chapter. 

J Consult Psychol 15:86—7 Е ' 51. Laurance F, 
Shaffer. In its six critical reviews, each by an 
exceptionally qualified author, this small mono- 
graph offers more meat than does many a more 
pretentious book. A number of important central 
ideas for the future development of psychodiag- 
nosis seem to emerge independently from its sev- 
eral chapters. For example, research originating 
in varied settings points more and more clearly 
to a measurable human characteristic of general 
goodness of adjustment, or ego strength, or 
therapy-readiness. Such a concept has sweeping 
implications both for practical clinical diagnosis 
and for personality theory. АП clinical psycholo- 
gists will find the monograph entirely deserving 
of their careful attention. 

Psychol В 48:363-4 JL’ 51. William А. Hunt, 
* The book is even more brief than its 120 pags 
might indicate since 15 of them are taken up ty 
bibliography, the pages themselves are em) 
and the type face is reasonably large. Tt would, 
perhaps, have been better presented as a mono 
graph. The contributions are excellent, pei 
ever, and they are well integrated, with Led 
sequent absence of annoying overlap. Mi ан 
introduction is competent and timely ; Dus 
contribution deals interestingly with а not m 
often discussed facet of testing ; and the Teu Чч 
and Zubin articles are able and ETE vt 
formances in special technical fields. In ш; di 
viewer's opinion, special praise should B wh 
papers by Harris and Stone, where | vid d 
strated clinical sensitivity goes hand p b 
with theoretical competence and thoroug н 
preciation of experimental methodology: 
are a pleasing confirmation 
chologist can function clinica 
ficing his scientific heritage and t 
ing of these two influences can 
and possible reality, not a mere 1 
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*HARROWER, M. R., AND STEINER, M. E. Large Scale 
Rorschach Techniques: A Manual for the Group 
Rorschach and Multiple Choice Tests, Second 
Edition. With contributions by Floyd O. Due, Bea- 
trice A. Wright, and M. Erik Wright. Springfield, 
Ш.: Charles C Thomas, Publisher, 1951. Pp. xx, 353. 
$8.50. * For a review, see 122. For reviews of the first 
edition, see 3: 82. 
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*HanrMAN, А. A. An Experimental Examination 
of the Thematic Apperception Technique in Clin- 
ical Diagnosis. American Psychological Association, 
Psychological Monographs : General and Applied, Vol. 
63, No. 8, Whole No. 303. Washington, D.C.: the As- 
sociation, Inc., 1949. Pp. v, 48. Paper. $1.00. * (РА 24: 
6354) 

[B195] 
*HarHAWAY, STARKE R., AND MEEHL, PauL E. An 
Atlas for the Clinical Use of the MMPI. Minne- 
apolis, Minn. : University of Minnesota Press, 1951. Рр. 
xliv, 700. $9.75. * (London: Oxford University Press. 
78s.) (PA 25:7468) For reviews, see 72. 

[B196] 
XHzLLERSBERG, ELızaBeETH F. The Individual's Re- 
lation to Reality in Our Culture: An Experi- 
mental Approach by Means of the Horn-Hel- 
lersberg Test. Springfield, Ill.: Charles C Thomas, 
Publisher, 1950. Pp. x, 128. $3.25. * (Oxford, England : 
Blackwell Scientific Publications, Ltd. 25s.) (PA 25: 
2131) For reviews, see 109. 

[ B197] 
Hoses, NrcHoLAs, Eprror, Psychological Research 
on Flexible Gunnery Training. Army Air Forces 
Aviation Psychology Program Research Reports, Re- 
port No. rr. Washington, D.C.: U.S. Government 
Printing Office, 1947. Pp. viii, 508. $1.75. * 

J Consult Psychol 11:338 N-D ’47. Laurance 
F. Shaffer. * reports achievements and tech- 
niques that can be applied to other complex 
military, industrial, and educational problems. 

Psychol B 46:516-7 N "49. W. E. Kappauf, 
Jr. * The most important systematic problem 
discussed by Hobbs and his collaborating au- 
thors is that of establishing a satisfactory cri- 
terion of gunnery performance. No completely 
Satisfactory criterion was established during the 
four years of research described in the volume. 
Combat data, which one might expect to be the 

est possible kind of criterion, were of limited 
Validity, since records for a particular gunner 
depended not on his skill alone, but on that of 
other persons as well, both friend and foe. * 


For a review of the series, see B2. 
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Носн, PAuL H., AND Zustn, Joserm, Eprrons. Re- 
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lation of Psychological Tests to Psychiatry, The 
Proceedings of the Fortieth Annual Meeting of the 
American Psychopathological Association, Баа іп 
New York City, June 1950. New York: Grune & Strat- 
ton, Inc., 1951. Pp. viii, 301. $5.50. * 


[ B199] 
ЖНонме, Н, Н, Hints on Interviewing. Melbourne, 
Australia: Australian Council for Educational Re- 
search, July 1947. Pp. 40. Paper. 1s. 6d, * 


[8200] 
XHorziNGER, Kart J., AND Harman, Harry Н. Fac- 
tor Analysis: A Synthesis of Factorial Methods. 
Chicago, Ш.: University of Chicago Press, 1941. Pp. 
xii, 417. $6.00. (London: Cambridge University Press, 
1943. 305.) * d 16:411) For reviews, see 168 in 
Statistical Methodology Reviews, 1941-1950. 
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ЖНомрнввү, Сүре W., AND Lams, Marion M. 
Evaluating and Reporting Student Progress in 
Business Education. Foreword by R. W. Gregory. 
Federal Security Agency, Office of Education, Voca- 
tional Division Bulletin No. 238, Business Education 
Series No. 17. Washington, D.C.: U.S. Government 
Printing Office, 1949. Pp. iv, 16. Paper. $0.10. * 


J Bus Ed 26:182 D '50. * should be interest- 
ing as a basis for securing an overview of the 
trend toward evaluation as a concept in which 
testing is only a phase of the total program. 


[ B202] 
*Hunt, E. P. ALLEN, AND SMITH, Percivat. A Guide 
to Intelligence and Other Psychological Test- 
ing, Revised Edition. Foreword by Peter D. Innes. 
London: Evans Brothers Ltd., 1947. Pp. 108. 45. * M 
22:3454) For reviews of the first edition, see 38: B392 
and 36: B141. 

[ B203] 
*Hourp, Arcuer W. Evaluating Student Success 
in Medical Education. Richmond, Va.: Bureau of 
Educational Research and Service, Medical College of 
Virginia, October 1951. Pp. vii, 69. Paper, mimeo- 
graphed. $1.75. * 

[ B204] 
*Hustn, TORSTEN, AND HENRICSON 
Some Principles of Construction of Grou Intel- 
ligence Tests for Adults: A Report on the Con- 
struction and Standardization of the Swedish In- 
duction Test (The I-Test). Preface by Gósta Bratt. 
Stockholm, Sweden: Almqvist & Wiksell, 1951. Pp. 
100. Paper. 5 Sw. Cr. * (PA 26:3134) 
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CowrERENCE ON TESTING PROBLEMS, 
Ti onal. Projects in Educational Measurement: 
A Report of the 1946 Invitational Conference on 
Testing Problems, New York City, Мону, a 
1946. Herschel Т. Manuel, с Еу py b E 

ican Council on Edu п 
Мааа 28, Washington, D.C.: the Council, 


August 1947. Pp. vii, 8o. Paper. $1.00. 
Psychometrika 14:75-8 Mr "49. Dorothy C. 

Adkins. * many interesting ideas and....diverse 

points of view are represented * Traxler dis- 


Sven-Eric. 
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cussed a project on development and use of 
testing procedures in the field of accounting. 
This project, initiated by the American Insti- 
tute of Accountants, is being worked on by a 
committee appointed by this business group, 
representatives of Columbia University, and the 
Educational Records Bureau. It thus offers an 
interesting example of cooperation between busi- 
ness and educational groups. * Two other papers 
which transcend specific applications of testing 
procedures will probably be of more general in- 
terest to readers of this journal. In a challenging 
article on "Validity of Educational Tests," Phil- 
lip J. Rulon gives a clear statement of his point 
of view that the basic criterion of the validity of 
a new test springs from the “direct operations- 
upon-material" approach. Consequences of this 
point of departure as it affects the practice of 
educational achievement testing are carefully 
explored. The test that calls for processes and 
materials directly related to teaching objectives 
needs no further establishment of validity. A 
test that uses different processes and materials 
requires evidence that it is related to the abilities 
constituting the aim. Robert L. Thorndike, 
speaking on "Logical Dilemmas in the Estima- 
tion of Reliability," asks us to take a fresh look 
at the basic postulates of reliability estimation. * 


[ B206] 

INVITATIONAL CONFERENCE ON TESTING PROBLEMS. 
Exploring Individual Differences: A Report of 
the 1947 Invitational Conference on Testing 
Problems, New York City, November 1, 1947. 
Henry Chauncey, Chairman. American Council on 
Education Studies, Vol. 12, Series 1, No. 32. Washing- 
ton, D.C.: the Council, 1948. Pp. 110, Paper. $1.50. 

Psychometrika 14:75-8 Mr ’49. Dorothy С. 
Adkins. * many interesting ideas and....diverse 
points of view are represented * The 1947 Con- 
ference was divided into two parts with four 
papers each on quite different themes, “Pro- 
jective Techniques and Their Validity” and 
“Differential Prediction.” In many respects the 
most stimulating of these papers is Percival M. 
Symonds’ “Survey of Projective Techniques.” 
* On the basis of the limited evidence available, 
Symonds concludes that expressive techniques 
have some diagnostic value, as partially evi- 
denced by correspondence between blind inter- 
pretations and independent diagnoses, but little 
predictive value. On the other hand, he cites his 
own studies as well as others in support of the 
generalization that projective techniques have 
little if any value either for diagnosis or predic- 
tion. Abandoning the psychometric approach to 
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both expressive and projective techniques, how. 
ever, Symonds finds their real significance ina 
“descriptive and analytical approach.” This posi- 
tion may leave some readers wondering, first 
what one does with a diagnosis from which Нан 
type of prediction cannot be made and, Second, 
what value attaches to insight gained through a 
descriptive and analytical approach if it cannot 
in some way be used for either diagnosis or pre- 
diction. * A different type of application of a 
projective technique is described in Ruth L, 
Munroe's paper on the relation of academic suc- 
cess at Sarah Lawrence to ACE scores and to 
a measure of personal adjustment called the 
"Inspection Rorschach." The score is based on 
a prescribed check list of items, which represent 
a sort of distillation of reported experience with 
the Rorschach and the speaker’s personal train- 
ing and experience. This measure showed posi- 
tive association with academic standing and, 
combined with ACE scores, improved the pre- 
diction of scholarship. In view of the results 
obtained, some of the cautions with which the 
speaker surrounds her interpretations, such as 
the defense of the position that an “over-all em- 
pirical criterion of success in a given situation 
defeats its own end," seems unnecessary. Аза 
matter of fact, the measure which was shown to 
be positively related to the personal adjustment 
score, academic standing, is an over-all empiri- 
cal criterion of success! To be sure, the adjust 
ment score does not predict it perfectly any 
more than does an aptitude test. The most strik- 
ing point in Thomas M. Harris’ paper on the 
use of projective techniques in industrial selec- 
tion relates to the success of a single projective 
test, an ink-blot series called the ITP, m E 
dicting success or failure in jobs that yo 
specialized talents. We are told little "a s 
test itself or its interpretation except gon be? 
quite far removed from the Rorschach. a ; 
cites a dramatic instance of the success ° ix 
ITP test in selecting creative chemists, н 
terion being an unusually objective меаи 
on patent records. Somewhat to our Kis + the 
ment, we are indirectly led to believe gin ys 
same test, presumably scored in differen ШШ 
is useful for selecting engineers, productor it 
agers, salesmen, research workers, supe ur 
and executives. The second half of the Tons 3 
ference had differential prediction as ane Ei 
Ledyard R. Tucker presented some the of dif 
considerations bearing on the problems 


оп of the 
ferential criteria. To answer the questio" 
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feld in which a person is most likely to 
succeed, Tucker proposes analyses of related cri- 
teria and concentration on those elements dis- 
tinctive of the several fields. He outlines a pro- 
cedure calling for collection of data on all 
established aptitudes (possibly along with per- 
sonality and interest traits), a job analysis of 
criterion fields, obtaining measures of profi- 
ciency in the various tasks for each field, per- 
forming a factor analysis for each field, and then 
applying the results. It will be a long time before 
this job is done. * Dyer....describes a recently 
begun project at Harvard....which is being fo- 
cused on attempts to discover which of many 
particular observations and their combinations 
are most predictive of specific types of college 
achievement. This study should go a long way 
toward telling us whether differential prediction 
of college success is feasible. * No attempt will 
be made to review the discussion....except to 
mention that it is very discursive. Perhaps more 
liberal editing and deletion would not have been 
amiss. 


[ B207 ] 

INVITATIONAL CONFERENCE ON TESTING PROBLEMS. 
Proceedings of the 1948 Invitational Conference 
on Testing Problems, October 30, 1948. John C. 
Flanagan, Chairman. Foreword by Henry Chauncey. 
Princeton, N.J.: Educational Testing Service, 1949. 
Pp. 117. Paper. Gratis. * (PA 24:2228, 2231-2, 2237, 
2395-2400, 2826-9, 2831-2, 2835-6) 


[ B208] 
ŽIĪNVITATIONAL CONFERENCE ON TESTING PROBLEMS. 
roceedings of the 1949 Invitational Conference 
g Testing Problems, October 29, 1949. Oscar K. 
Pee Chairman. Foreword by Henry Chauncey. 
pe eton; N.J.: Educational Testing Service, 1950. 

р. 94 Paper. Gratis. * (PA 25: 4970, 4972, 4975, 4977, 
4981, 4989-90, 4992, 4996, 5000) 

" [ B209] 

INVITATIONAL CONFERENCE ON TESTING PROBLEMS. 
on осеейїпев of the 1950 Invitational Conference 
on Testing Problems, October 28, 1950. Robert L 

orndike, Chairman. Foreword by, Henry Chauncey. 
APEP N.J. $ аан тоса Service, 1951. 

. 117. Paper. $1.00. : 164, 1, 546, 592, 
595, 599, бот) ( 4, 539-41, 546, 5 


J Consult Psychol 15:356-7 Ag '51. Law- 
rance F. Shaffer. * Clinical psychologists will 
be ‘especially interested in the papers on the 
Validation of professional aptitude batteries, in- 
cluding those for law, dentistry, and medicine. 
L. L. Thurstone's address on creative talent is 
a striking contribution to that elusive topic. 


áü [ В210] 

in VES, Orve Lor. A Critique of Teachers’ Rat. 

Lee of High School Boys as an Indication of 
ater Neuropsychiatric Rejection for the Arme 


Services. Columbia University, Teach 
Бордоп e ШАА, Мо, келс е 
of Publications, n . Pp. i 1 
v PA 36 UT e College, 1949. Pp. ix, 68. $2.10. 
[ B211] 

KJACKSON, ROBERT W. B., AND PHILLIPS, ALEXANDER \ 
J. Prediction Efficiencies Ьу Deciles for Various 
Degrees of Relationship. University of Toronto, 
Ontario College of Education, Department of Educa- 
tional Research, Educational Research Series, No. 11. 
Toronto, Canada: the Department, 1945. Pp. 18. Pa- 

per, mimeographed. $0.25. * (РА 19:3241) 

[ B212] 
*Janopa, Marre; Юеотѕсн, Morton; AND Соок, 
Stuart W. Research Methods in Social Rela- 
tions: With Especial Reference to Prejudice, 
Volumes I and II. New York: Dryden Press, Inc., 
1951. Pp. x, 421; X, 423-759. Two volumes, $6.50; 
single volume, $4.00. * 

[ B213] 
XJarora, S. Scientific Personnel Selection Pro- 
cedure: A Study. Banaras 5, India: the Author, 
Hindu University, 1950. Pp. vi, 80. Rs. 10. * (PA 25: 
5722) 

[8214] 
*]лүко, Јоѕерн. Air Corps Practice Tests, Avia- 
tion Cadet Edition. Adams, Mass.: Cramwell Pub- 
lications, 1950. Pp. ii, 41. Paper, mimeographed. $3.00. * 

[8215] 
Ж*]лүко, JOSEPH. Preparation for General Classi- 
fication Tests: Get the Rating You Want: Qual- 
ify for Officer or Technician Training. Adams, 
Mass.: Cramwell Publications, 1951. Pp. iii, 30. Paper, 
mimeographed. $3.00. * 

[8216] 


Heren HarL. Leadership and Isola- 


*TENNINGS, 
in Inter-Personal 


tion: A Study of Personalit 
Relations, Second Edition. oreword by Gardner 
Murphy. New York: Longmans, Green & Co, Inc, 
1950. Pp. xvii, 349. $4.00. * (PA 24: 5772) [“This edi- 
tion is not a revision—it is an expansion: Part IV pre- 
sents new data..."—Preface to Second Edition.] 


Am J Psychother 5:470-1 Jl 51. Joseph I. 
Meiers. The appearance of the second edition of 
this important contribution to sociometric “рег- 
sonality" and interpersonal psychology is not 
only a matter of just pride for the well-known 
author: for sociometry, a kind of disputed bor- 
derland between sociology and psychology— 
heretofore often overlooked, if not disregarded, 
by pundits of either discipline—has until shortly 
been a thorny field. * This, it seems to this re- 
viewer, is also a suitable occasion to draw the 
attention of all those psychotherapists to Jen- 
nings’ book who want to keep abreast of the new 
and useful results of sociometric personality re- 
search. The author rightly states that this sec- 
utes not so much a revised 


ond edition constit с vis 
as a greatly enlarged version of the first edition 


i i d in 
of 1943. While based on and indeed rooted in, 
the sociometric method devised by J. L. Moreno 
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and on the ample experimental work in group 
psychotherapy done by him and Jennings since 
1931, her new book gives the reader an excel- 
lent succinct introduction into the entire scope 
' of this method and its application to casework, 
to family, school and community problems. Her 
own investigations are supported by a wealth 
of references to sociometric and kindred litera- 
ture, both in the text and in the carefully ar- 
ranged bibliography. The special value of this 
book for the psychotherapist lies in the fact that 
Dr. Jennings presents a large number of prob- 
lems which are close to, and partly identical with 
those encountered in psychotherapeutic “every- 
day" practice. * It seems pertinent to add that, 
far from being a “mere” theoretician, Helen 
Jennings has given ample proof of the practical 
applicability of sociometry by describing her 
work with school children, teachers, and inter- 
racial groups, and at New York University, 
where she teaches. Even the reader who is not 
familiar with the sociometric method is able to 
derive useful information for his psychothera- 
peutic work and, in addition, to enjoy the au- 
thor’s lucid and careful description of her work 
in the field of research and education. 
Am J Sociol 57:84—5 Jl’51. Morris Janowitz. 
* an ingenious empirical study which helped 
fashion sociometry as a research tool * The 
added careful analysis of the data on choices 
still suffers from the lack of relevant subjective 
indexes to the psychological meanings con- 
nected with these choices. A new concluding 
chapter, entitled “New Directions,” reviews 
some of the main empirical studies and applica- 
tions of the sociometric technique since the pub- 
lication of the first edition. The new directions, 
as reported here, still reveal a preoccupation 
with the exclusive application of the sociometric 
technique rather than with its broader incorpo- 
ration into research. In this connection it is 
striking to note that although social scientists 
are particularly concerned with the analysis of 
the integration of primary groups into larger 
social institutions—factory, government agency, 
etc.—practically no progress has been made in 
the analysis of choice patterns among the various 
levels of leaders on whom the task of integra- 
tion and co-ordination falls with particular 
weight. As yet, sociometry has not found a solu- 
tion for the complex of value judgments in- 
volved in its research. Generally, it is implied 
that more cohesiveness in group structure is de- 
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sirable and that the expression of latent prefer. 
ences and the arrangement of the group in terms 
of them will contribute to bringing it about, 
Certainly, preoccupation with the building of 
group cohesion is a fundamental task, But if this 
standpoint were made explicit at all points, so. 
ciometric research could free itself from the as- 
sumption that all choices, if only made manifest, 
make possible a restructuring of the group in 
such a way as to increase its cohesiveness, The 
converse is an equally relevant assumption ; cer- 
tain types of individual choices, if given full 
expression, may weaken or disintegrate the 
group. The process by which the group prevents 
the expression of disruptive choices ought to 
become a central concern of research. 

B Menninger Clinic 16:35 Ja ’52, Bernard 
Steingor. * this book is a classic in the area 
indicated by its subtitle * 

J Abn & Social Psychol 46:274-6 Ap 'sr. 
Renato Tagiuri. * The first edition (1943) of 
this book made several original contributions. 
It was the first sociometric study to use an шь 
limited number of choices, to examine the choice 
and rejection process at intervals, and, therefore, 
to discover the trait-like stability of "emotional 
expansiveness" and “desirability.” It was also 
the first to present behavioral data and case 
studies in relation to choice-status, especially of 
“leaders” and isolates, and to investigate the 
motivational background of choosing and г 
jecting. The main contribution of the secom 
edition is Part IV, where the distinction [4 
tween “psychegroup” and “sociogroup” is made, 
the literature is surveyed, and the data are A 
viewed in relation to a number of importan 
social-science problems. The “psychegroup б. 
"sociogroup" distinction is an important h 

e SEA. nding of the 
pirical contribution to the understa al 
bases of the difference between function? 


n 
distinct groups. This difference has y v 
recognized by sociologists, as 1n Code 


Gemeinschaft and Gesellschaft or in ^ tif 
primary and secondary groups. J eu by 
inal findings have been utilized [ШИЛ 
researchers working in the field of socio ge 
The material, however, contains many i у 
tions for other investigators as well, "E iud 
for the student of personality and deve Hg p 
There are aspects of Jennings’ findings," be of 
ficiently emphasized by her, that mig tion d$ 
great interest to workers in social рее нел 
for example, the very small relationship 
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the subject's choice behavior and the nurturance 
of his social environment. The individual's in- 
terpretation of the environment seems to give 
more weight to personal needs than to the re- 
ality situation. Jennings makes some suggestions 
regarding the relevance of her material for sey- 
eral problems, but this part of the book is not 
sufficiently expanded. This work could have 
profitably been made more readable, and in quite 
a few places, less repetitious. The analysis of the 
data could have been sharpened further by con- 
trolling the factor of religious differences, which 
is known by now to be an important variable 
in sociometric choice. No mention of it is made. 
Although the contributions of this revision can- 
not be as numerous as those of the first edition, 
the reviewer believes that the book was highly 
worth printing since research workers can still 
utilize the many important findings of Jennings 
as well as discover in her material many points 
relevant to their special interests. 

Personnel Psychol 4:415-7 w °51. Donald Р. 
Roberts. * The findings, which are seemingly of 
most importance to those who must deal in 
problems of isolation and of leadership, are lost 
in the abundance of other material. The author 
could advantageously develop another book to 
serve those who are solely interested in the re- 
sults and the implications of such a study as this. 


For reviews of first edition, see 3:956. 


[B217] 
XJeNNINGS, HELEN HALL; IN ASSOCIATION WITH THE 
TAFF OF INTERGROUP EDUCATION IN COOPERATING 
ScHoo.s, HitpA Tapa, Drrecror, Sociometry in 
Group Relations: A Work Guide for Teachers. 
Intergroup Education in Cooperating Schools, Work 
G Progress Series. Washington, D.C.: American 
ouncil on Education, 1948. Pp. ix, 85. Paper. $1.25. * 
A 23: 4197) 
B Menninger Clinic 14:148-9 Jl * 50. Louisa 
Holt. While considerable scepticism must 
Breet many of the esoteric and extravagant pro- 
Touncements of J. L. Moreno and his disciples, 
there is nevertheless something of value in what 
е have {о say. In S; ociometry and Group Re- 
ations a clear distillation of valuable elements is 
fo be found. Although this pamphlet is written 
Primarily for teachers, I believe few hospital 
ychiatrists could read it without being tempted 
ону out these excellently described techniques 
dealing with patient groups. If we agree that 
| Y classroom should provide conditions for 
Сау emotional development, perhaps it is 
50 true that the psychiatric ward should pro- 
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vide Such conditions; this pamphlet indicates 
E steps which may be taken in that direc- 
on. 

Clearing House 24:119-20 О '49. Stanton B. 
Langworthy. Don't be dismayed by the pallid 
title of this book. It provides a flesh-and-blood 
answer to part of the problem of diagnosing re- 
lationships in school groups. It has been de- 
veloped out of the experience of many teachers, 
and is written in a manner calculated to interest 
all. In simple, measured words it shows the im- 
portance of understanding group relationships 
for education in democratic living. It shows, 
step-by-step, how the teacher can find and chart 
the personal cross currents and interlocking re- 
lations within a group as a whole. With caution 
and careful reservation it helps the teacher de- 
velop ways of analyzing and interpreting these 
data, and offers suggestions for improvirig the 
relationships revealed. Those teachers who ac- 
cept responsibility for improving group rela- 
tions but who worry about subjectivity will find 
the method outlined in this book of marked value 
as a supplement to insight. They will appreciate 
the explanation of the psychological principles 
involved, and the illustration of the simple forms 
used in the process, Those who care to investi- 
gate further will find a good springboard in the 
selected references to sociometric studies. It will 
be gratifying to most teachers to find a tightly 
written work guide which does not belabor their 
intelligence with “inspiring challenges,” but 
which offers a blueprint of what can be done 
and how to do it. 

J Consult Psychol 13:146 Ap ’49. Laurance 
F. Shaffer. This clearly written booklet de- 
scribes methods for constructing sociograms of 
classroom groups, and indicates many practical 
applications of the results toward providing chil- 
dren with learning experiences in group co- 
operation. à 

P Social & Social Res 35:307 Mr-Ap ’5r. 
E(mory) S. B(ogardus). * A large propor- 
tion of teachers who read this document will be 
stimulated to use it in their classrooms and in 
so doing will gain nev' light on why some pupils 
do better, or poorer, than do others. * A great 
deal of illustrative material is to be found in this 
brochure. The importance of the sociometric 
structure of a group is made evident. * an out- 
standing contribution to the work of all persons 
interested in the social growth and development 


of children. 
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[ B218] 
xJounson, Parmer O. Statistical Methods in Re- 
search. New York: Prentice-Hall, Inc., 1949. Pp. xvii, 
377. Text edition, $5.00 ; trade edition, $6.65. * (PA 23: 
5874) For reviews, see 182 in Statistical Methodology 
Reviews, 1941—1050. 

[ B219]. 
*]ош„Ез, Isaac. Handbook and Manual for the 
Qualified Psychological Examiner in the Illinois 
Plan for Special Education of the Educable Men- 
tally Handicapped. Foreword by Ray Graham. Sup- 
plement to Circular Series “B,” No. 12. Springfield, 
Ill.: Department of Public Instruction, State of Illi- 
nois, [1949]. Pp. 64. Paper. Gratis. * (PA 24: 5247) 

[ B220] 
Ж*]охЕз, Frank Н. Guide to Examination Success. 
Stanmore, England: Barkeley Book Co. Ltd., 1047. Pp. 
60. 3s. * [The second edition (1949) is a reprinting of 
this edition.] 

Librarian 37:220 Ag '48. A. M. M. If it is 
possible to give hot tips to examinees that will 
really ensure their success, this little manual 
might do it. It is a print of a stimulating talk 
given personally to examination candidates, 
recommending guaranteed tactics for getting 
through their examinations. The advice offered 
is sound, and receiving it may give nervous or 
inexperienced examinees a confidence they 
would not otherwise feel. But surely the real 
secret is the one expressed by one of my own 
students recently: "I used to think getting 
through exams was a matter of luck. It is a 
matter of work, isn't it!" And when all is said 
and done, that is what it is. Isn't it? 


[ B221] 

*Jones, Harop E. Motor Performance and 
Growth: A Developmental Study of Static Dy- 
namometric Strength. University of California, Pub- 
lications in Child Development, Vol. т, No. т. Berkeley, 
Calif.: University of California Press, 1949. Pp. x, 181. 
Cloth, $3.00; paper, $2.00. * (London: Cambridge Uni- 
versity Press. 24s.) (PA 24: 3647) 

J Abn:& Social Psychol 45:568—9 ЛІ 50. Al- 
fred L. Baldwin. * presents material gathered 
from a longitudinal study of the development of 
the physical strength of children between the 
ages of 11 and 18 * Within the ranks of research 
workers in child development there can be dis- 
tinguished a group primarily concerned with 
growth curves. These people plot growth curves, 
study their variations, and relate the variations 
in the developmental course of one measurement 
to other growth curves or to external factors. 
This book represents a very careful study along 
these traditional lines. The comparison of growth 
curves is tricky. Jones presents in this study 
four different methods of plotting growth curves. 


In the first method, type I, the averages of each 
age level are plotted in absolute units, ie, in 
terms of pounds or inches or the unit in which 
the original measurement was made, Somewhat 
greater comparability of different curves сап 
be obtained if the standard deviation is used as 
the unit of measurement (type II). In this study 
the average of the 11 year old sample was arbi- 
trarily called zero and the standard deviation of 
the тї year old sample was made the unit of 
measurement. A type III plot directs the atten- 
tion to the growth rate rather than to the at- 
tainment of the individual by plotting the incre- 
ment over a six month interval. A type IV plot 
measures how closely the performance of each 
individual approaches his final terminal status; 
each performance is plotted as a percentage of 
the terminal performance. The author presents 
many of the comparisons in the book in several 
different ways. For example, sex differences are 
shown in terms of all four kinds of plots. The 
point of the four different plots is to draw atten- 
tion to different aspects of the growth curves. 
Boys, for example, are stronger than girls; boys 
show an acceleration of growth at age 13 years 
while girls show a deceleration; boys approach 
their terminal strength much later than do girls. 
This summary of sex differences gives the gen- 
eral flavor of the the first three quarters of the 
book. It is carefully worked out, full of detailed 
factual statements about the data, almost com- 
pletely disorganized in terms of a general theory 
of growth, and makes but a few distinctions be- 
tween what is significant and which is ршщ. 
tant in the development of motor strength. : i 
first three quarters of the book sticks very ш 
to statistical data, treating each individual 
growth curve rather than as a реи! s 
chapter VII, however, the whole tenor 0! iif 
book changes with the introduction of peste 
case studies of ten boys at the upper ire a 
of strength and ten boys of the low extr "m 
These cases are very striking in the jer 
strong boys appear to be happy, рор n 
adjusted and the weak boys appear 2: pellig: 
unhappy and either shy or unnecessary m 
erent. Even within the case history of an vid 
vidual there are sometimes changes 1n o in 
ity and adjustment corresponding to wm 
strength. While this chapter presents we М 
interesting data, there is but little e con 
fit it together into some sort of a theory: is 
saying that motor development 1s comp es 
and that the various characteristics of a P 
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reciprocally interact, the author makes but little 
of the findings from individual cases. The reader 
is likely to come away from this book with an 
appreciation of the fact that motor development 
is a consequence of a complex interaction of 
numerous factors and that we don’t know very 
much about it in spite of the considerable amount 
of carefully collected descriptive information. 
This book adds its increment of information and 
the reviewer, at any rate, found it interesting 
information. It is interesting, however, in a 
fact-collecting sense rather than as a contribu- 
tion to our understanding of growth. This last 
sentence expresses a basic dissatisfaction with 
the book: There is a certain unwarranted note 
of complacency in the author’s presentation of 
a sequence of independent factual findings. The 
information contained in the book is, in its 
present form, of very little practical help to 
guidance workers and teachers except possibly 
to make them tolerant of individual differences. 
And it certainly does not build a theory of motor 
development. Such a study might be useful in 
providing the groundwork for further research. 
If so, the author is in the best position to outline 
fruitful lines of investigation. Yet the author of 
this volume devotes almost no space to the dis- 
cussion of future research efforts. Apparently 
the book presents facts for the sake of facts. In 
the opinion of the reviewer, this approach seri- 
ously limits its value. 

J Ed Psychol 41:373 О ’50. Miles A. Tinker. 
This investigation is concerned with the devel- 
opment of muscular strength as revealed by 
semiannual dynamometric measurements of 
eighty-nine boys and eighty-seven girls from age 
eleven to seventeen and one-half years. Although 
the author considers it of little significance, it 
Should be noted that the occupational distribu- 
tion of the sample is not representative of the 
general population. There was a marked excess 
of white-collar workers and skilled workmen, 
and almost none from the semiskilled and un- 
skilled groups. Interpretations, therefore, prob- 
ably should be confined to the present sample 
rather than extended to the general population. 
* It is probably best to receive many of these 
findings as tentative and suggestive rather than 
firmly established until additional studies are 
made with more representative samples. 
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ton, D.C.: U.S. Government Printing Office, 1947. Pp. 
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Psychol B 46:514-5 N ’49. Delos D. Wickens. 
* The contribution of the present study is one 
of emphasis, an emphasis upon the difficulty in 
obtaining a reliable criterion. Superficially it 
would seem that the problém of obtaining an ac- 
ceptable criterion for bombardier proficiency 
should not be difficult. The bombardier's mili- 
tary task is to drop a bomb on a particular target, 
and this task can be readily duplicated under 
non-combat conditions. By using a target with 
concentric rings the magnitude of error could 
be obtained either by estimations made by the 
pilot, a fellow bombardier, or by means of aerial 
photography. One could not quarrel with the 
validity of such a criterion, for it consisted of a 
fairly exact duplication of the combat job. The 
major difficulty with this criterion, called the 
circular error, lay in the fact that it had zero 
reliability and hence could give no precise indi- 
cation of any particular bombardier’s profi- 
ciency. Obviously such a criterion had no cor- 
relation with any potential predictive measures. 
A number of studies were conducted to deter- 
mine the sources of errors in the criterion meas- 
ure. In the first place the fault of the measure 
did not lie in the estimation of the magnitude 
of error for any single bomb burst. Photo- 
graphic records could be read with accuracy and 
the estimates of observers on the magnitude of 
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error were in agreement. A series of experi- 
ments indicated that the pilot, the degree of air 
turbulence, and the particular sight used were 
all factors contributing to the error. (If pre- 
cision-built sights differ from each other, what 
about some of our home-made laboratory equip- 
ment?) An extensive experiment which served 
to control these sources of error was terminated 
before completion by V-J Day. Unfortunately 
it has not been continued. However, the project 
was not so hampered by the criterion problem 
that it was unable to make any contribution to 
bombardier training, It developed tests for se- 
lecting bombardier instructors, proficiency tests 
to measure knowledge of bombing and naviga- 
tion, and was able to conduct a learning study 
which.gave information on the amount of train- 
ing necessary to reach a consistent level of 
performance in bombing accuracy. * 


For a review of the series, see Ba. 


[ B227 ] 
*KeEnnepy, Emery GILBERT, Eprrog. Problems in 
Individual Analysis: Emphasizing Techniques 
Other Than Standardized Tests and Measure- 
ments. Report of a Conference of City School Super- 
intendents, Principals, Directors of Guidance, and 
Counselors Held at Kansas State Teachers College, 
Pittsburg, Kan., July 13-14, 1950. Kansas State Teach- 
ers College Bulletin, Vol. 47, No. 7. Pittsburg, Kan. : 
the College, May 1951. Pp. 96. Paper. * 

© 

[ B228] 
*Kent, Grace H. Mental Tests in Clinics for 
Children. Foreword by J. P. Guilford. New York: 
D. Van Nostrand Co., Inc., 1950. Pp. xii, 180. $2.45. * 
(London: Macmillan & Co. Ltd. 185.) (PA 24: 5256) 

J Consult Psychol 14:417 О "50. Laurance F. 
Shaffer. This is a richly self-revealing book that 
no reader can complete without a feeling that he 
has become quite well acquainted with Grace Н. 
Kent. The author has attempted to distill her 
four decades of clinical experience in the ex- 
amination of children. She is at her best while 
describing what to observe, how to write a re- 
port, and how to use clinical resources to study 
people instead of test scores. Her greatest short- 
coming seems to be a confused attitude toward 
test standardization, which is rejected because 
of the rigid interpretations of test scores that 
untrained workers often make, without adequate 
recognition of the values that standardization 
has for even the most broadly trained psycholo- 
gists. Many of the clinical insights of the book 
are rewarding, and it contributes to a broaden- 
ing of one’s historical perspectives. 
Psychol B 48:284 Му ’51. Helmer Р. M yhle- 
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bust. The author states that this book is intended 
for advanced students in clinical Psychology, She 
states further that it was her wish “to present { 
point of view not generally recognized," This 
reviewer thinks that the author has achieved 
both goals with unusual success, This book will 
be of considerable help and interest to advanced 
students in clinical psychology, and the point 
of view, while not generally recognized, needs 
to be emphasized. * Chapter III, “Sources of 
Error in Use of Tests," might be considered as 
the author's statement of rationale. The point of 
view is that undue emphasis and importance have 
been given to the quantitative aspects of mental 
testing. Diagnostic errors may derive from tak- 
ing scores too literally. This emphasis is the ma- 
jor contribution of this book. * This reviewer 
finds the emphasis on the need to base a diag- 
nosis on the complete appraisal of the child, 
rather than primarily on his scores, to be re- 
freshing and stimulating. Some psychologists 
will not agree with this point of view, but Dr. 
Kent speaks authoritatively, and her discussion 
reveals a wealth of clinical experience from 
which her arguments derive. It is a practical and 
timely book. Clinical psychologists in serv- 
ice clinics and those responsible for training will 
find it helpful. 
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Am J Psychol 62:629-30 О '49. 7 I ji 
cific guide for nursing schools is ne | 
carefully written manual should be use Total 

Am J Pub Health 39:804-5 Je 49. ^7 pj 
Deming. * The title....suggests that nurse 
find useful material applicable to their Pd the 
in testing students and staff members, Б some 


first 85 pages of this book fulfil this hope 


ments Ap- 
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extent. The author covers the first steps in tabu- 
lating, classifying, and determining modes, 
means, medians, and percentile ranks of scores, 
On proceeding to more advanced calculations, 
however, the author’s presentation becomes con- 
fused. It lacks: (1) clarity, because the basic 
principles are not known to nurses and are not 
explained, the sentence structure is frequently 
ambiguous and the thought underlying the state- 
ments is not clear. (For example: the first sen- 
tence of the new section explaining Binomial 
Expansion (page 86) reads: “Where there is 
one factor involved, it may take one of two 
forms, i.e., either happening or not happen- 
ing.”) ; (2) accuracy, because the writer tries 
to cover too much ground in too little space; 
and (3) applicability, because the purpose of 
the calculations is not clearly defined and the 
nurse is not shown how to use the results after 
the calculations are made. In the main the subject 
matter on pages 87-138 is too advanced for 
those nurses who have not had special prepara- 
tion in tests and measurements, or who are not 
natural-born mathematicians and statisticians. 
Few nurses in any case would perform the cal- 
culations given in these chapters in their daily 
work, for example, positive and negative skew- 
ness, Chapter X attempts to cover the subject of 
test construction in ten pages. This is one of the 
much needed techniques among teachers, but 
is so superficially dealt with here as to be danger- 
ous. There is far more to test construction than 
a mere listing of the various forms of objective 
questions. (Incidentally, true-false questions are 
presented without any mention of the fact that 
this form of question is discredited for profes- 
sional tests and has been discontinued by the 
best test technicians.) The author fails to give 
Instructions that would aid in writing discrim- 
inating multiple-choice questions to sample 
knowledge and judgment and leaves out the 
usual warnings against the pitfalls in writing the 
distractors or wrong choices. “Reliability” and 
‘validity’—two really essential definitions in a 
hurse’s understanding of testing—are disposed 
of in a few brief sentences without illustration. 

here are simpler and more accurate texts for 
nurses who wish to become familiar with the 
elementary techniques of this subject. If more 
advanced procedures are needed, the best device 
is to seek the aid of a test technician and not 
attempt these skilled procedures—certainly not 
s a mere reading of a series of mathematical 
Steps, 
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Ed 70:131 O *49. Marguerite C. Holmes. * 
a practical and valuable combination text and 
workbook which may be used both by the be- 
ginner and the more advanced student. The lan- 
guage is simple, material gradual in progression 
and the chapters are arranged to give the stu- 
dent, teacher and the administrator in nursing 
education a thorough understanding of testing 
techniques and their application. * a meaningful 
contribution to nursing education. 
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Brit J Psychol, Gen Sect 39:119-20 D ^48. 
H. M. This book is an account of a comparative 
survey of intelligence and school attainment of 
children in the United Provinces, India. The 
tests used were drawn up by Mr. Lall himself. 
One gets the impression that the author has tried 
to introduce Western educational practices into 
Indian schools without at all understanding what 
those methods were intended to achieve. No- 
where in his book has he mentioned the most 
important function of the type of tests he used ; 
ie. as a means of gaining an understanding of 
the needs and difficulties of the individual child 
in relation to his learning, Instead, the author 
has used the tests as an instrument for deter- 
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mining which of the rival social and religious 
classes are intellectually superior. He has pre- 
sented the reader with a mass of statistics whose 
implications he has not attempted to explore. 
Not only are such figures irrelevant, but they 
lend themselves to abuse. The greatest value 
of the survey was probably as a means of stand- 
ardizing tests suitable for Indian children. Un- 
fortunately, the author has given no account of 
this aspect of his work, nor of how the English 
tests were adapted to fit the Indian cultural and 
educational patterns. Although the book is, as 
it stands, very disappointing, the work described 
there has great potentialities. If the author's tests 
are used less for comparing groups and more for 
assessing the needs of individuals they may help 
to establish a more human approach in schools. 
J Appl Psychol 33:415-6 Ag "49. Henry E. 
Garrett. * presents results obtained from the ad- 
ministration of three tests to approximately 2000 
Indian children in 58 government high schools. 
The children were 11+ years old and the tests 
were a group verbal intelligence test, an English 
language test, and an arithmetic test. * the most 
interesting part....is the presentation of compara- 
tive test scores for four Indian castes, for chil- 
dren from different geographical regions, and 
for children whose parents fell into various oc- 
cupational groupings. Most of the differences 
are quite small but some are statistically signifi- 
cant. It seems likely, however, that the apparent 
significance was, in many cases, a resultant of 
the small and probably unrepresentative sam- 
ples. How representative the samples were we 
have no way of judging. On the whole the mono- 
graph is somewhat amateurish and reminds one 
of publications in this country of some 25 years 
ago. This is a pioneer job, however, done under 
considerable difficulties, and the author deserves 
a great deal of credit. The references in the book 
are to Thomson, Spearman, and Burt, under 
whom the author apparently had his training. 
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Am Bus 18:66 My ?48. * packed with care. 
fully documented cases designed to prove that 
it pays to use tests in determining the value of + 
employees * sound common sense and experi- 
ence of Mr. Lawshe is indicated * sound ap- 
proach * Access to a large amount of factual 
material and wide experience to draw on gives 
the book a down-to-earth quality seldom found 
in books of this nature. 

В Ind Psychol & Personnel Prac 7:40 $51, 
A. J. Wyndham. * The book is mainly devoted 
to a description of the sorts of jobs for which 
selection tests have been used and the kinds of 
tests employed. Whilst the treatment of this 
aspect is not unusual, it is sufficiently compre- 
hensive to satisfy the curiosity of the student 
and the personnel officer who seeks to know the 
extent of the work being carried out in other 
firms. There is a most important last chapter for 
those who are seriously considering the intro- 
duction of a testing programme in their organi- 
sations. Professor Lawshe gives us the benefit 
of his own experiences, indicating the sorts of 
dangers that can exist for the unwary. In this 
connection he considers the steps that might be 
taken in gaining management and union sup- 
port. A useful appendix which lists commercially 
available tests, rounds off the contents of a very 
handy reference book on personnel testing. 

J Appl Psychol 33:92-3 F ’49. Edwin E. 
Ghiselli. * an elementary treatment of the prob- 
lems involved in the testing of employees T 
purposes of selection * The approach taken y 
Lawshe will meet the approval of industrial 7 
chologists. He is concerned with many 0 | 
problems in testing that сап be appreciated 027 
by опе who has worked directly іп nae 
However, in the reviewer's opinion m cem 
gives only a general overview of the field. P 
coverage of important problems, methods, $ w 
findings is spotty, and any reader, E iif j 
testing, may obtain an incorrect, and ps ating 
an incomplete, picture. The problems ie Е 
with test construction and validation qi f 
quate as far as they go—but they do not 8 de 

Н ME not m 
enough. The concept of reliability 15 - 
tioned either in connection with eur 59 
tests. There is no treatment of the com c states 
or weighting of tests in a battery. Laws 
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that the purpose of the book is to serve as an aid 
to management. But the level at which the book 
is cast suggests that the author is underestimat- 
ing the intellectual capacities of his potential 
readers. The point of view seems to be that 
concepts of second order difficulty, even though 
they be fundamental in nature, have no place in 
an introductory presentation. This simplified 
treatment is likely to convey to the wrong people 
the erroneous impression that anyone can de- 
velop and operate a testing program without 
getting into difficulties. The chapters concerned 
with reporting previous findings on the validity 
of tests will be disappointing to many. It is by 
no means an extensive review. Rather these 
chapters are intended to present illustrative find- 
ings concerning the usefulness of different types 
of tests for various kinds of jobs. Several com- 
ments seem pertinent in this connection. Not a 
single example of the work of the U.S. Employ- 
ment Service on aptitude testing is cited. Yet 
surely Stead and Shartle's now classic “Осси- 
pational Counseling Techniques” which sum- 
marizes so many of these excellent investigations 
deserves mention here. However, of consider- 
able value are the findings of a number of in- 
vestigations not published elsewhere. Of the 
validation studies cited, only a very few with 
negative findings are given. Thus tests are 
hardly ever put in an unfavorable light. It re- 
minds one of the optimistic descriptions of valid- 
ity given in the manuals of directions of so many 
published tests. Any members of management 
or unions who believe this to be the true picture 
are in for a sorry disappointment. Even a cur- 
sory review of published reports will indicate 
that there is considerable variation in the ef- 
fectiveness of any given test applied to different 
groups of workers on the same job. Yet in fair- 
ness to the author it should be pointed out that 
he does not subscribe to the rather extreme 
Views as presented in his publisher's advertis- 
ing. Thus while the publisher claims that “From 
now on you can place the right person in the 
Tight job every time!”, Lawshe emphasizes that 
tests are by no means a cure-all and can simply 
Increase the probability of selecting better em- 
Ployees. The third topic, establishment of test- 
ing programs, is likely to be dismissed as being 
another set of “practical” rules. This would be 
an error. While one might have hoped for a 
More expanded treatment, Lawshe here deals 
with most important problems. Today any book 
In this field purporting to be a “Principles” of 


B239 


employee testing would be woefully inadequate 
if it avoided such areas as supervisory support, 
budget, labor unions, personnel records and 
management reports. The day when the psy- 
chologist’s task begins with the writing of items 
and ends with the computation of the validity 
coefficient is past, if it ever existed. Lawshe rec- 
ognizes this and entertains for discussion cer- 
tain of the important implications of testing in 
its larger setting of personnel problems and 
labor relations. In sum, the reviewer's chief criti- 
cisms of this book are omissions of fundamental 
problems and concepts, and the elementary na- 
ture of the presentation. Those who use the book 
as a text either in college courses in testing or 
in similar courses for members of management 
or labor unions will undoubtedly find it necessary 
to provide supplementary material and discus- 
sion. 

Mgmt R 37:653 D’48. F. E. Hewitt. * a good 
introductory summary of these tests for per- 
sonnel people who are planning to establish a 
testing program. The simple step-by-step pro- 
cedure for inaugurating and operating such a 
program will appeal to the practical business 
executive whose beginning interest in the field 
of psychometrics requires a non-technical guide. 
In preparing such a guide, Dr. Lawshe has done 
well to emphasize procedure rather than theory 
and to provide an accurate and clearly stated 
analysis of the advantages and limitations of 
tests in each field. In addition, he has included 
a thorough analysis of certain tests developed 
at Purdue University. * One of the book’s warn- 
ings to those contemplating a test program bears 
repetition here: Do not depend upon the name 
of the test as a sure indication that it measures 
what its title suggests. A so-called clerical apti- 
tude test may be better used for selecting cer- 
tain factory employees. * In his chapter on 
temperament and personality tests, Lawshe con- 
fines the greater part of his analysis to valida- 
tion reports from authors of the particular tests 
discussed. This reviewer would have welcomed 
additional validation from other sources. Despite 
these and other minor points, however, Dr. 
Lawshe has throughout the text fulfilled his 
stated objective—that of providing а volume 
which should prove useful to personnel adminis- 
trators and for training those who eventually 
will administer testing programs. Ау 

Personnel Айт 11:37+ S "48. Luigi Pe- 
trullo. The aim of this book is a practical one. It 
is written primarily for "those now 1n or soon 
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to be in managerial positions as a statement of 
what can legitimately be expected of tests and as 
a guide to the establishment of the policy frame- 
work within which a testing program must func- 
tion." This objective is fairly well met. In sim- 
ple and straightforward language the author 
presents a summary of available information in 
the field of industrial and business testing. Busi- 
ness or industrial executives who have little or no 
technical knowledge of the testing field should 
be able to follow the exposition without much 
difficulty. The test research man will also find 
a great deal of information organized in a very 
creditable manner. To be sure, much of the ma- 
terial is undoubtedly already well known to such 
persons, but its organization and the addition of 
illustrations from the author's personal experi- 
ence succeed in making the book a compact 
package on tests and their use in industry and 
business. Perhaps one of the most important 
contributions of the book is the insistence of the 
author that tests must be validated before they 
are used in the industrial situation. What this 
means in actual practice is that there are very 
few available tests which can be used in indus- 
trial situations without some experimental work 
to show that they are applicable to the particular 
situation. * To the extent that Lawshe's book 
proves the need for "testing the tests," the vol- 
ume has made a real contribution to the field 
of industrial-psychological relations. * One of 
the best features....is the use of examples to illus- 
trate the usefulness of tests. Also of practical 
value is one of the appendices which gives in- 
formation about where to get professional test- 
ing assistance. There is very little theory pre- 
sented in this book. The student would look in 
vain for philosophical or theoretical justifica- 
tion for the procedures and methods described 
in the volume. This is not a criticism since the 
objective was honestly set out to exclude such 
considerations. Perhaps Dr. Lawshe could at a 
later date prepare a companion volume for the 
student's use. Such a volume would be very 
much worth-while and of real assistance to re- 
search people who need a theoretical background 
against which they may conduct their test re- 
search. * nothing is to be gained by perpetuat- 
ing in industrial literature such terms as “mental 
ability tests” and “temperament and personality 
tests” which are in the book. In the field of in- 
dustrial testing, the need for such terminology 
is gradually fading away ; indeed these terms are 
usually rather confusing to the layman. It is 
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much more desirable when discussing tests with 
industrial people to refer to occupational tests 
for a particular occupation or for a field of work, 
There is enough apprehension and confusion 
about so-called mental ability and personality 
tests in the public’s mind at the present time to 
make the continued use of these terms inadvis- 
able. All in all, Dr. Lawshe has made an im- 
portant contribution to a very practical field, 
Personnel J 27:320-1 Ja’ 49. Clifford E. Jur- 
gensen. Many companies now use personne 
tests, and many more companies are interested 
in learning what tests will—and will not—do, 
Executives, however, are frequently unable to 
obtain the information they need and want. Dis- 
cussions with other executives frequently abound 
with misconceptions because of absence of tech- 
nical information, and discussions with psychol- 
ogists are frequently not intelligible to the ex- 
ecutive because the psychologists are unable or 
unwilling to discuss testing in a manner which 
is intelligible to persons lacking a technical back- 
ground in psychology. Dr. Lawshe's book... 
should overcome much of this difficulty. * He has 
emphasized procedures and results rather than 
theory. The result is a highly. readable book 
which will be valuable to executives who do not 
wish to obtain a detailed background in testing 
but who do wish to be generally familiar with 
some of the basic principles and procedures of 
testing. The book also has considerable value 
for companies who employ test technicians. Fre 
quently the job training of such technicians is 
made difficult because their superior lacks um 
derstanding of test principles. Test technicians 
may thus be "left on their own" to the ultimate 
disadvantage of the company. At other times, 
incompetent test technicians (including ch 
tans) are employed, but their incompetence re 
mains concealed because their superiors op н; 
way to check ће adequacy of their work. | Р 
book is not without fault. A statistical table i 
determining the probability that test dis 
of two groups are due to chance coritains inco 


"nut nd тар S 
rect probabilities. Some of the curves а ae d 


arla- 


veloped by Purdue workers, but 
believes such emphasis justified Б 
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familiarity with such tests and access to ргеуі- 
ously unpublished data on those tests. These 
faults detract from the book, but should not 
deter anyone from obtaining and reading it. The 
faults are minor as compared with its advan- 
tages. In the opinion of the reviewer, this book 
is a “must” for any executive who has super- 
vision over a test technician, who is considering 
using tests, or who wishes to discuss testing in 
an intelligent way. The book should also be 
required reading for all personnel department 
employees whose company uses any tests. Psy- 
chologists who have an adequate theoretical 
background in testing but lack the ability to dis- 
cuss the subject adequately with laymen can 
also profit considerably from reading the book. 

Pub Personnel R 10:116-8 Ap ’49. Harold 
Fields. * a most readable and comprehensive 
book on the principles of personnel testing— 
covering a large number of fields in relatively 
short space. The purpose of the volume is to 
acquaint the average employer with test con- 
struction, test devices, and test values. * His ap- 
proach is couched in simple language, his tempo 
isa patient one, and the development of his topic 
is enriched by a generous number of descrip- 
tive graphs and scattergrams. * He succeeds 
definitely in achieving this objective. Dr. 
Lawshe is the true teacher * Every chapter has 
its full quota of varied references and graphs, 
with a clear explanation for each. The volume is 
excellent for the tyro * Technical terms are sim- 
ply and clearly defined * Lawshe devotes ade- 
quate space to such tests as interest and pref- 
erence tests, personality tests, mental ability and 
mechanical aptitude tests, tests for clerical and 
Secretarial positions, tests for salesmen and re- 
tail store employees, and tests for supervisory 
Positions. In his treatment of many of these— 
but not all—he indicates the dangers and short- 
comings that must be considered ; unfortunately, 
however, this approach is not always consistent. 

More suggested combinations of tests consti- 
tuting a more valid measuring instrument would 
ave increased the value of an already valuable 
book. Thus, mentioning the work that has been 
done in linking the Humm-Wadsworth Test, 
the Rorschach Test (which, surprisingly, the au- 
thor does not refer to), and Bernreuter's Per- 
sonality Inventory would have been helpful. * Tt 
15 refreshing to find adequate treatment given to 
differences of raters in this volume for аре 
Sonnel staff.’ Of equal value is the space given to 

fundamental problem of job analysis as à 


prerequisite to the worth of any testing instru- 
ment. * Too little attention is paid to personal- 


„ity tests which seek to measure those behavior 


patterns that can insure or defeat success in 
leadership. * it would have increased the value 
of the book had less spatial equality been given 
to different types of tests and more proportion- 
ate space given to tests in terms of their rele- 
vancy to a job. As an example of this, eleven 
pages are given to personality testing as against 
twenty-nine pages to visual tests. * One weak- 
ness is found in the chapter on test construction 
where the author emphasizes true-false tests but 
makes no reference to the guess factor. * For the 
person not trained in the field of tests and meas- 
urements, this book offers a rich source of in- 
formation that is clearly stated and comprehen- 
sive in treatment. For those who are trained, it 
makes a good “review” volume. 


[8240] 
*Lezeps, Carrot Н. A Scale for Measuring 
Teacher-Pupil Attitudes and Teacher-Pupil Rap- 
port. American Psychological Association, Psycho- 
logical Monographs; General and patie Vol. 64, No. 
6, Whole No. 312. Washington, D.C.: the Association, 
Inc., 1950. Pp. iv, 24. Paper. $1.00, * (PA 25: 3432) 

[ B241] 
XLruwaN, Ruru T. Appraising the College Pro- 
gram in Home Economics: A Study о Some 
Procedures. Washington, D.C.: American Home 
Economics Association, 1950. Pp. vii, 229. Paper, litho- 
typed. $2.00. * 

[ B242] 
*Lennon, Rocer T. A Comparison of Results of 
Three Intelligence Tests, Test Service Notebook, 
No. 11. Yonkers, N.Y.: World Book Co., [1951]. Pp. 
4. Paper. Gratis. * 
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ү, ҮлллАм M, EDITOR. Psycholo; ical Re- 
ors in the heaters of Wiss Amy ir d 
iati m Resea eports, Re- 
AP Perch on, D.C.: U.S. Government 


үн ine Office, 1947. Pp. vi, 202. Out of print. * (РА 
21: 3716) Ч 
J Am Med Assn 135:66 S 6 '47. Before ће 
United States entered World War II, psychol- 
ogists had begun to work out technics of selec- 
tion of combat crews. Their technics tested 
aptitude and intelligence and were considered 
highly successful, although validation was never 
scientifically achieved. These psychologists so 
completely sold their services to the air forces 
that the utilization of psychiatrists for the study 
and treatment of emotional disorders was greatly 
neglected. The cost in man power was probably 
greater than that saved through psychologic se- 
lection, The present volume represents the type 
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of psychologic research that completely excludes 
the dynamic emotional factors responsible for 
failure in combat. There are dozens of statistical 
tables concerned with scores, validated by the 
adage that this isa scientific or score keeping age, 
and if one doesn't keep score one never knows 
the score. The editor admits that the evidence 
presented is at once tenuous and massive and that 
few of the validity indexes considered by them- 
selves are impressive. This book represents the 
results of a worldwide investigation at tremen- 
dous cost in money and personnel. Its results 
seem hardly commensurate. 

Psychol B 46:525-6 N "49. N. Н. Pronko. * 
The primary missions were two-fold: to obtain 
first-hand information in the various theatres of 
war that would serve as combat criterion data 
against which to validate the original selection 
and classification tests; and to collect data on 
the requirements for the various tasks of air- 
men. Secondarily, the missions were concerned 
with the development of proficiency tests for 
specialized personnel; the evolution of aptitude 
tests not adequately covered by the original test 
battery, and such special problems as pilot fa- 
tigue in connection with long missions. * The 
investigators frankly state the various attenu- 
ating and biasing factors that affected the 
objectivity of their criteria. * Other inconven- 
iences and frustrations of a pioneer attempt 
to fit the best man into each particular job 
under combat conditions are objectively de- 
scribed. * 


For an additional review, see 3:984; for a re- 
view of the series, see Вә. 


[ B244] 
*Lewis, Don. Quantitative Methods in Psychol- 
ogy. Iowa City, Iowa: Gordon Bookshop, 1948. Pp. 
vii, 286. Paper, lithotyped. $3.25. * (РА 23:462) For 
reviews, see 204 in Statistical Methodology Reviews, 
1941-1950. 

[ B245] 
*Lizo, Lester M. Attitude Prediction in Labor 
Relations: A Test of "Understanding." Fore- 
word by Paul R. Farnsworth. Stanford University, 
Graduate School of Business, Division of Industrial 
Relations, Studies in Industrial Relations, No. то. Stan- 
ford, California: the Division, [1948]. Pp. ii, 17. Pa- 
per, lithotyped. $1.00. * 


Adult Ed B 13:221-2 О ’48. * a report of an 
interesting experiment in testing the accuracy 
with which industrial relations executives can 
predict the attitudes of labor union officials, and 
vice versa * 
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*Linvguist, E. F. A First Course in Statist 
Their Use and Interpretation in асанна, 


Psychology, Revised Edition. Bost З 
Houghton Mifin Со, 1042. Pp. xii, 242 $4 i 


(Study Manual for a First Course in Statist, 
vised Edition. 1941. Pp. v, 117. Papan, Reia 
structor's Manual and Key for Study Manual, Pp. v, 
49. Paper. $1.50. *) (London: George С. Harrap & 
Co., Ltd. 155.; 7s. 6d.) (PA 17:8-9, title ont ). For 
reviews, see 206 in Statistical Methodology Reviews 
1941-1950; for reviews of the first edition see 38: Вб) 
and 2:166. 
[ B247] 

*LixpQuisr, E. F., Eprron. Educational М E 
ment. With chapters by Gordon V. ‘Anderton’ eia 
Chauncey, Herbert S. Conrad, Walter W. Cook, Ed- 
ward E. Cureton, John G. Darley, Frederick B. Davis, 
Robert L. Ebel, John C. Flanagan, Norman Frederik- 
sen, E. Е. Lindquist, Irving Lorge, Charles I. Mosier, 
David С. Ryans, Geraldine Spaulding, John M. Stal- 
naker, Robert L. Thorndike, Arthur E. Traxler, Ralph 
W. Tyler, and K. W. Vaughn. Washington, D.C.: 
American Council on Education, 1951. Pp. xix, 819. 
$6.00. * (PA 25: 5836, 5851, 5856, 6199, 6458, 6460-2, 
6464, 6466, 6471-3, 6475-6, 6478, 6481, 6483, 6484 

Cath Ed R 49:490-1 S ?5т. F. J. Houlahan, 
* This reviewer has used the book as a basic 
reference in a graduate course in educational 
measurement and is of the opinion that it is а 
definitely teachable and comprehensive work, 
For fullest utilization of its wealth of materials, 
students should have considerable competence 
in statistics. However, even those who are less 
familiar with the statistics are able to follow the 
greater part of it with ease and profit. Consider- 
able adaptation of the materials provided is 
necessary to care for the needs of those whose 
primary interests are in the home-made дне 
room tests. The authors had іп mind ће i 
more formal standardized test, and the whole 
book is written from that viewpoint. Part 1, i 
bracing the first four chapters and one hundre t 
sixteen pages, can easily be omitted from | 
course. It deals with functions of megsureme 
in various aspects of education and displays, te 
enthusiasm for educational measurement du. 
out of proportion to our ability to construc a 
to implement the functions. Even were 69 
sible to have satisfactory tests, not all pon ai 
will agree to all of the uses indicated in x 
in this section. For instance, on pages at El 
we have chronological age and pe y 
opment listed as if they were the only ba we art 
which a child selects his "peers а ri 
advised that “Therefore, throughout t "d 
of maturation, which corresponds zopa ош 
the compulsory school age, these traits 
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constitute the fundamental basis for educational 
grouping, that is, when a child is five he enters 
kindergarten, when six he enters elementary 
school, when twelve he enters the junior high, 
and when fifteen the senior high school." While 
further discussion modifies this position some- 
what, the point itself is left intact. The diploma 
becomes a certificate of attendance and should 
be accompanied by some statement of level of 
achievement determined through measurement 
procedures. Some may be puzzled also by the 
fact that in presenting data to support this posi- 
tion it is the 2-98 range which is employed 
rather than the more common statistics for the 
description of groups. If it is so important to 
include all of the students, it would seem justifi- 
able to use the whole range. If one is to leave 
out of consideration four percent, then why not 
describe the distribution in the usual way ? How- 
ever, whether or not one agrees with all of this 
section, it is stimulating and worthy of con- 
sideration. The book is a masterpiece and de- 
serves a place in the library of any student in- 
terested in the field of educational measurement. 
Ed & Psychol Meas 11:524-9 au’ 51. William 
B. Michael. * admirably fulfills a longfelt need 
for an advanced and comprehensive reference 
volume in the techniques and general theory of 
educational measurement. It will render an in- 
valuable service not only to teachers who have 
Sought for years a suitable text for graduate 
Courses in educational measurement, but also 
to field workers of diversified interests and of 
varying degrees of experience in test evaluation. 
embodies a high degree of developmental unity 
despite the presence of numerous points of view 
among the contributors * Other ways in which 
Measurement may be useful include the follow- 
Ing points developed at length by Tyler: the 
realization of further education of in-service 
teachers within their own fields, the coordination 
of instructional efforts throughout the school, 
and the evaluation of the proficiency of instruc- 
tors for purposes of selection and promotion. 
Although this last point is suggested in con- 
Junction with the function that measurement 
um Play in school administration and perhaps 
‘ould not be considered out of context, there 
a clear implication without any stated reser- 
м that teaching performance can be evalu- 
Pa terms of differences in test scores among 
ud es. Such a policy is decidedly dangerous 
certainly open to serious challenge. If teach- 
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ers have reason to suspect that their success is 
being largely determined from test results of 
their classes (when actually differences between 
classes may be attributed to numerous other fac- 
tors besides relative quality of teaching) the 
morale of a.staff is likely to deteriorate to a 
marked degree. In such a situation teaching is 
likely to become test centered in a highly com- 
petitive atmosphere. Tyler might well have 
devoted considerable space to mentioning and de- 
scribing the steps that should be taken in over- 
coming difficulties that are likely to arise in the 
valuation of teaching by administrators, * Of 
considerable interest to college admissions of- 
ficers and administrators, as well as to college 
teachers, is the fourth....chapter, “The Functions 
of Measurement in Educational Placement," 
ably written by Henry Chauncey and Norman 
Frederiksen, * two excellent chapters concern- 
ing the use and evaluation of performance tests 
and of essay tests are included * Exceedingly 
well planned and written is Chapter 7, in which 
Robert Ebel systematically describes various 
types of objective test items and offers numerous 
helpful suggestions in the actual preparation of 
items. Although Ebel's illustrative materials em- 
phatically show the importance of high stand- 
ards of workmanship and precision in item writ- 
ing, the reader may feel quite discouraged after 
studying the chapter. He may seriously doubt 
whether he can begin to achieve the level of 
perfection set for him. Despite the possible criti- 
cism that Ebel's suggestions may seem somewhat 
dogmatic, inflexible, and at times almost selí- 
contradictory, the importance of his contribu- 
tion cannot be minimized. In Chapter 8, Conrad 
succinctly, but adequately, describes the pur- 
poses that experimental tryouts serve and the 
various steps that need to be taken, as well. as 
cautions that should be exercised, in the selection 
of samples in the administration of the tests, in 
the arrangement of materials within the pre- 
liminary forms, in the determination of time 
allowances, and in the use of data furnished by 
the preliminary forms. One of the most outstand- 
ing chapters of the volume from the standpoint 
of scholarship, thoroughness, and technical ex- 
cellence is Chapter 9. The presentation of the 
pro and con arguments regarding the correction 
for chance success in the computation of item- 
analysis data is the best discussion of a difficult 
problem that the reviewer has been able to find. 
Two other excellent sections include one re- 
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garding the relationship between item difficulty 
and item discriminating power and another con- 
cerning the selection of items for specific types 
of tests. At long last much of the wealth of ma- 
terial upon the techniques and theory of item 
analysis has been summarized and collated 
within one comprehensive chapter. * Another 
comprehensive and well-written chapter is the 
tenth one. * A welcome feature of the volume 
is an entire chapter limited to the very practical 
problem of how to produce multiple copies of 
a test. In Chapter 11, Geraldine Spaulding dis- 
cusses at length problems of format and prepara- 
tion of copy. Her points should prove helpful 
and at the same time interesting to workers in 
test instruction who may or may not have had 
to be concerned with the production side of tests. 
* A thoughtful discussion of the advantages and 
disadvantages of the essay type of examination 
constitutes most of Chapter 13. * After consider- 
ing reliability in terms of analysis of variance 
and suggesting sources of variance in test scores, 
Thorndike proceeds to discuss and to evaluate 
at length in Chapter 15 the principal sets of ex- 
perimental and statistical operations in estimat- 
ing test reliability. * Although numerous tech- 
nical details are covered, the reader may be 
expected to grasp without difficulty the major 
points discussed in view of the exceptional clar- 
ity of Thorndike's presentation. In treating the 
problem of validity in Chapter 16, Cureton first 
defines several terms, the meanings of which 
may seem to be somewhat unconventional to 
the reader. It is important that these terms be 
understood and remembered if the import of 
the chapter is to be grasped. Considerable atten- 
tion is focused upon the properties of criteria 
and upon the logical problems of validity. The 
statistical formulations in estimating various 
types of validity may cause some difficulties to 
the reader unless he has carefully considered the 
contents of the previous sections within the 
chapter. The important problems of interpreta- 
tion in educational measurement are excellently 
treated in Chapter 17. In the use of test meas- 
ures for evaluation and guidance Flanagan 
emphasizes the need for five main types of de- 
scriptive information : content, rank, level of de- 
velopment, growth, and profiles. One of the most 
interesting and comprehensive discussions in 
the entire literature of educational measurement 
regarding the uses, advantages, and disadvan- 
tages of various types of norms is to be found 
in this chapter. * Constituting a fitting climax 


to the book and representing one of the last 
projects completed by Charles Mosier before his 
untimely death, the closing chapter is an out- 
standing example of Dr. Mosier's many contri. 
butions to the field of educational and psycho. 
logical measurement. The tenor of the chapter 
is theoretical and somewhat abstract, but the 
principles discussed are fundamental to under- 
standing the problems involved in the combina- 
tion of multiple measurements. Concisely pre- 
sented and illustrated, the topic of profiles 
concludes the chapter. The importance of this 
comprehensive reference volume cannot be un- 
derestimated. It will doubtless remain as one 
of the classic works in the history of educational 
measurement. It is to be hoped that the fund 
which will be created from sales of this book 
will be great enough to warrant several future 
revisions. To Dr. Lindquist and to all his asso- 
ciates whose cooperative efforts have made pos- 
sible the production of this volume, professional 
educators owe a lasting debt of gratitude. The 
reviewer must be speaking for many others when 
he says, "Congratulations on a splendid achieve- 
ment, and thank you." 

J Consult Psychol 15:263-4 Je ' 51. Laurance 
F. Shaffer. A comprehensive reference work 
and graduate textbook * Clinical psychologists 
have much to gain from the study of sucha book, 
for no gulf exists between educational measure 
ment and psychological measurement, and the 
same techniques apply to both. Technical handi- 
caps from which many clinical tests suffer coul 
be lessened by improved methodological sophis- 
tication. There is no reason to believe that an 
instrument would lose any of its clinical power 
by also being a good test. 

dh Col 7 5629 D ’5r. Roy M. e^ 
In the last few years, much has been Mio 
about the purpose, theory, and principles d 
measurement, but in no other references | 
the reviewer found a more detailed and ird 
illustrated description of the step-by-step E i 
ess involved in sound test construction t ү 
found in Chapters 8 through 12. The nies 
is presented in such a forthright and illus dir 
manner that the most elementary tet ү it 
can use it as a guide, and, at the same pum 
shows in such an exact and thoroug? "imple 


how the theory of test construction 18 " that 
mented in practical and concrete situatio! je to 
advanced students of measurement ү A 
use these eight chapters as a frame of revere’ 
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compendium * Much of the discussion will ap- 
pear verbose and unconvincing to British edu- 
cationists, and their doubts will be reinforced 
if they persevere as far as Dr. J. C. Flanagan’s 
chapter, which makes clear the difficulties of 
expressing attainments in really meaningful and 
satisfactory units. But several sections are of 
great interest, and the book will be of value, 
not only to students for reference purposes, but 
also to teachers and examiners who try to set 
"new-type" or objective examinations. Every 
variety of "new-type" question is described and 
illustrated, and detailed hints are given on plan- 
ning a test, laying out the material conveniently, 
reproducing it, and on administration, prelimi- 
nary trials and revision. The most noteworthy 
chapter is that by Dr. J. M. Stalnaker. No one 
concerned with setting and marking ordinary 
"essay-type" examinations, or with their results 
(parents, for instance), could read it with com- 
plete equanimity. * 


[ B248] 
XLoowrs, CHARLES P., AND PEPINSKY, Harop B. 
Sociometry, 1937-1947: Theory and Methods. 
Sociometry Monographs, No. 20. New York: Beacon 
House, Inc., 1949. Pp. 27. Paper. $2.00. * (PA 24: 570) 
[Reprinted from Sociometry 11: 262-86 Ag '48. *] 


[ B249] 
*Lorp, Ертн. Experimentally Induced Varia- 
tions in Rorschach Performance. American Psy- 
chological Association, Psychological Monographs: 
General and Applied, Vol. 64, No. то, Whole No. 316. 
Washington, D.C.: the Association, Inc., 1950. Pp. v, 
34. Paper. $1.00. * (PA 25: 6217) 


[ B250 ] 
žLucas, DeWrrr B. Handwriting and Character. 
rexel Hill, Pa.: Bell Publishing Co., 1950. Pp. 368. 
3.50. [Reprint of edition originally published by David 
McKay Company in 1923.] 


à [ B251] 

ALUNDIN, Roserr W. The Development and Vali- 
ation of a Set of Musical Ability Tests. Ameri- 

can Psychological Association, Psychological Mono- 

graphs: General and Applied, Vol. 63, No. 10, Whole 

ne 305. Washington, D.C.: the Association, Inc., 1949. 
P. iii, 20. Paper. $1.00. * (PA 24: 5737) 
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PLUNDIN, Wikram Н. Projective Techniques and 
cay chotherapy : Examination of a Therapy Proc- 
me Through the Use of the Projective Move- 
je t Sequence and Rorschach Techniques. Pro- 
06 Technique Monographs, No. т. New York: 
ОУ for Projective Techniques and Rorschach In- 
tute, Inc., September 1951. Pp. i, 39. Paper. $1.50. * 


XM. [B253] Gla 
ACHover, Karen, Personality Projection in 
pe Drawing of the Human Figure: A Method of 
qronality Investigation. Springfield, Ill. : Charles 
homas, Publisher, 1949. Pp. ix, 183. $3.50. * (Ox- 


ford, England: Black ienti ications 
175. 6d.) (PA 23 ED KC SH i rq MM 
[ B254] 
ЖМсІхтоѕн, Douctas M. Promotion From Pri- 
mary to Secondary Education. Foreword by Wil- 
liam McClelland. Publications of the Scottish Council 
for Research in Education XXIX. London: University 
of London Press, Ltd., 1948. Pp. xiv, 152. Paper. 1s. * 
Eug R 40:220 Ja '49. К. Н. This is a short- 
ened version, in non-technical language, of the 
report of Dr. W. W. McClelland's investigation 
published in 1942 under the title Selection for 
Secondary Education, which has been described 
as "one of the most important of all recent con- 
tributions to educational psychology." Dr. D. M. 
McIntosh, the author of the abridged version, 
was one of Dr. McClelland's chief assistants. He 
is to be congratulated on this summary of the 
original report. 


[ B255] 
жМсІхтоѕн, Dovctas M.; WALKER Davin А.; AND 
Mackay, DowaLp The Scaling of Teachers’ 
Marks and Estimates. Edinburgh, Scotland: Oliver 
& Boyd Ltd., 1949. Pp. xiv, 114. 75. 6d. * (Toronto, 
Canada: Clarke, Irwin & Co. Ltd. $1.90) 

A.M.A. 44:216 Je-Jl "49. This volume is 
somewhat less complicated than others we have 
seen on this subject, and teachers who believe 
that the scaling of marks will have an established 
and permanent place not merely in educational 
research but in many departments of everyday 
practice would do well to consult it. The point is 
made that the serious decisions involved in 
selection for secondary education or the award 
of Leaving Certificates require marks to be 
scaled in such a way that the estimates of dif- 
ferent teachers are comparable. The explana- 
tions are simple, the instructions clear and defi- 
nite, and it is claimed with some justification 
that the statistics are “without tears,” 

J Ed (London) 82: 460-2 Ag’50. C. M. Flem- 
ing. * [The] authors are to be complimented on 
the clarity of their statement and the candour 
with which they remind their readers that 
neither tests nor scaled estimates have complete 
predictive power. This book is likely to become 
of increasing importance with reduction in em- 
phasis on external examinations at the stage of 


leaving school. 


[8256] ТАТЕ Я 

. H. Researches on the Measure- 
жс S da Performance. Medical Research 
Council, Special Report Series, No. rig еч v. 
H. M. Stationery Office, 1950. ER 156. pey s 
(New York: British Information егуїсез. $1.00. 


(PA 25:3484) 
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Am J Psychol 64:318 Ар?зт. Madison Bent- 
ley. * The results are noteworthy. They should 
be carefully studied also by American psycholo- 
gists, who have been slow to resolve the modes 
and means of anticipative performances of many 
kinds. 

Eug R 42:227-8 Ја ’5r. H. J. Eysenck. * di- 
vided into two parts, the first dealing with what 
are called “vigilance” tests, i.e. attempts to meas- 
ure the decrease in the performance of reactions 
to an intermittent stimulus over somewhat 
lengthy periods ; the second part deals with “en- 
vironmental stress" tests, such as the effects of 
non-lethal war gases and of high atmospheric 
temperatures. The experimental design in each 
case appears to be adequate, the statistics sound, 
and the apparatus ingenious. The conclusions 
are not over-stated and appear to follow nat- 
urally from the experiments. Within its compass 
the work is, therefore, sound, if of little funda- 
mental interest. The attempts of the author to 
integrate the findings of his vigilance tests with 
conditioning theories do not seem very convinc- 
ing to the reviewer and appear to be largely on 
a semantic level. It is a pity that the writer has 
failed to add to his investigations by devising 
experiments to account for the individual dif- 
ferences so obvious among the members of his 
group. His only effort in that direction, com- 
parison between fresh subjects and tired sub- 
jects, shows what might have been done in that 
direction. 

Occupational Psychol 24:264-5 О ’50, Ed- 
ward Elliott. * Although the conclusions reached 
have an obvious value in the specific fields of 
performance in which the investigations were 
set, it seems doubtful whether they will have any 
wide application outside the Services. This is 
obviously true of the work on exposure to war 
gases. The studies of high atmospheric tempera- 
tures establish an upper permissible limit of en- 
vironmental worth which exceeds the tempera- 
ture to be found in all but a few industries. But, 
as Mackworth suggests, there is some resem- 
blance between the inspection processes found 
in many industries and the tasks with which he 
has experimented. In the reviewer's opinion this 
resemblance is superficial * Turning to con- 
sider the value of Mackworth's investigations 
for the Service consumer, the reviewer does not 
quarrel with what has been done or with the 
conclusions drawn, but he would argue that 
some important points have been left unsettled, 
particularly in the watchkeeping studies. * Sev- 


eral devices for preventing deterioration are 
shown to be effective. The more important are: 
half-hour watches alternated with other duties; 
administration of amphetamine (“benzedrine”) 
giving the operator information about the ac. 
curacy of his work; interrupting the watch with 
telephone messages. Of these devices, only the 
first is both possible and anything like practi- 
cable outside the laboratory ; even this might be 
a great nuisance or impossible in emergencies, 
And while much attention has been given to 
these unsatisfactory devices for preventing de- 
terioration, only passing comments are made 
about a more effective one. Mackworth states 
that there are individual differences in deteriora- 
tion of performance, and he gives data (in Fig. 
13, page 39) from which we may make the 
following rough and ready deductions. If we 
were to select the best 15 per cent. of subjects, 
there would be no deterioration; if we were to 
select the best 55 per cent., efficiency would 
not fall below the 75 per cent. level, and these 
selected men could achieve a better average per- 
formance after working for thirty minutes than 
the fresh unselected group. The paucity of the 
information on this subject must discourage our 
taking these crude deductions too seriously, but 
they strongly suggest that a really promising 
way of preventing deterioration has been neg- 
lected. We are told that the observed individual 
differences and intelligence test results are ш- 
correlated ; no other attempt to account for in- 
dividual differences has been made; we do not 
know whether a special cognitive ability, mot 
vational factors, or simply differences in tram- 
ing and experience account for them, though it 
should not be difficult to make some progress 
towards finding out. The reviewer is tempted A 
quote the final sentence of the preface to the 
report : “However specific the practical prolem 
it is most likely to be solved by those who pi 
also concerned to discover the principles pw 
lying the causes of the observed effects. s 
title of this report and the anonymous ie 
suggest that it is concerned with investigat! : 
of very wide importance. For example, My 
in the preface the claim that “The investiga ids 
reported in these pages all consist of attemp 5% 
develop controlled methods for studyine 
measuring behaviour in sequence and e 4 
termining the relations of items of behave е 
they occur within the sequence. deri 
which really achieve this must surely sip ps 
any experimental psychologist. Is such а 
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justified ? What new "controlled methods for 
studying and measuring behaviour in sequence" 
has Mackworth developed? The reviewer can 
find none. Indeed, given the same problem to 
investigate and the facilities that have been en- 
joyed, it is difficult to envisage simpler or more 
obvious procedures than those described in this 
report. * Nothing here is new or surprising. 
From the point of view of the Service consumer 
this is good enough; the very simplicity of the 
experiments is a virtue, since the consumer can 
readily understand and have confidence in the 
technician's procedures; up to a point practical 
issues are settled; and the detailed and careful 
work, and especially the lucid exposition, are to 
be commended. On the other hand, to go as 
far as the writer of the preface and regard 
very good ad hoc investigations as worthy 
of the status of outstanding fundamental re- 
search must appear extravagant to any critical 
reader. 


[ B257 ] 
*McNemar, Quinn. Psychological Statistics. 
New York: John Wiley & Sons, Inc., 1949. Pp. vii, 
364. $4.50. (London: Chapman & Hall, Ltd. 36s.) * 
(PA 23:3035) For reviews, see 213 in Statistical 
Methodology Reviews, 1941-1950. 
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*Manvet, Herscuet T., Eprror. Testing and Test 
Results. Texas Commission on Coordination in Edu- 
cation, Research Bulletin No. 29. Austin, Texas: Ad- 
ministrative Board, the Commission, March 1949. Pp. 
27. Paper, lithotyped. * 
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MARSHALL, M. V. An Introduction to Tests and 
esting, Revised Edition. Toronto, Canada: Ryer- 

son Press, 1951. Pp. x, 134. $2.50. * 
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TMARTIN, А. Н. Psychometrics. Sydney, Australia : 
"ege s Arcade Ltd., 1947. Pp. vii, 165. 15s. * 
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Ex ARTIN, JEREMY, AND A STAFF OF APTITUDE TEST 
уон to Prepare for Your Draft Test. 
Paper. Crown Publishers, Inc., 1951. Pp. v, 93. 
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and Rew’ W. Encar. Selected Tests in Biology 

ton Related Areas. Circular No. 308-III. Washing- 
a Р.С.: 0.5. Office of Education, November 1949. 
"9. Paper, lithotyped. Gratis. * 
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man EY Paur, The Analysis and Control of Hu- 

Ren Experiences: The Individual Seen Through 

the Au ach, Volumes 1 and 2. Brooklyn 2, NY: 

x uthor, 16 Court St., 1947. Pp. iii, 233; iv, 19 . Pa- 

revi mimeographed, Out of print. * (PA 21: 2326) For 
Tews, see 125 and 3: 86. 
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ASLOW, у 

Rorschach: Жш P in pu m 
Brooklyn, N.Y.: the Author, 16 Court St., 1081. Pp. ‘i, 
Mos er, spiral binding, mimeographed. $3.50. * (РА 
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AURER, KATHARINE M. I 
turity as a Criterion for electing и Pie 
school Tests. University of Minnesota, Institute of 
Child Welfare Monograph Series, No. 21. Мїппеа 3 
lis, Minn. : University of Minnesota Press 1946. Р n 
166. deni $2.50. * (London: Oxford University 
Een 20s.) (PA 21:286) For reviews, see 352 and 

[ B266 ] 
*Mayne, F. Bram, Eprror. Evaluating Competence 
for Business Occupations. The American Business 
Education Yearbook, Vol. 7, 1950. Published jointly 
by The National Business Teachers Association and 
The Eastern Business Teachers Association. New 
York: New York University Bookstore, 1950. Р 
xiii, 302. $3.75. * A Tha 
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*Metton, Автнов W. Ертов. Apparatus Tests. 
Army Air Forces Aviation Psychology Program Re- 
search Reports, Report No. 4. Washington, D.C. : U.S. 
Government Printing Office, 1947. Pp. xxiv, 1056. Out 
of print. * (РА 23: 3942) 

J Consult Psychol 13:308-9 Ag '49. Lau- 
rance F. Shaffer. A distinctive feature of the Air 
Force’s classification testing program is the use 
of apparatus tests to measure such characteris- 
tics as coordination, dexterity, and speed of 
response. The report gives extensive data on 
the development, construction, and validation of 
the eight major tests used for standard classi- 
fication purposes, and of some fifty experimental 
tests. The mass testing of thousands of men by 
complex apparatus tests brought problems never 
before encountered by psychologists. The cali- 
bration procedures used to reduce variance due 
to apparatus and the test control mechanisms 
developed to minimize examiners’ errors, are 
therefore of special interest. The report 1s il- 
lustrated by many photographs, working draw- 
ings and wiring diagrams. This volume is the 
last of the nineteen Aviation Psychology Re- 
search Reports to appear, completing a series 
that is probably the most comprehensive ac- 
counting of a coordinated research project in 
applied psychology. 

Psychol B 46:504-6 N *49. Robert Н. Sea- 
shore. * describes the largest program of psy- 
chomotor selection research yet undertaken * 
The tests developed emphasized measures of 

recision at high speed rather than steadiness, 
equilibrium, or strength. * Major advances in 
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procedure included standardized instructions, 
automatic cycling of trials, and multiple unit 
controls for testing four persons under each ex- 
perimenter. Further improvements in the design 
of tests and the correlation of findings on specific 
tests with specific reasons for success or failure 
in flying skills would seem to call for more in- 
tensive empirical and theoretical job analyses. * 
Unfortunately, the most representative work 
sample tests were not employed for classifica- 
tion purposes until relatively late when there was 
no longer an opportunity for large scale valida- 
tion, but results were promising. * To the re- 
viewer this program has important implications 
as to needs in future research and training in 
psychology, e.g.: (1) The training of additional 
specialized personnel for instrument design and 
construction; (2) likewise, the relative neglect 
of intensive job analysis and motion study in 
psychomotor performances on military equip- 
ment calls for increased attention to this phase 
of training for future psychologists; (3) the 
further development of more complete factorial 
batteries of psychomotor tests would enable ap- 
plied psychologists to select tests having the 
greatest probable validities for various skills; 
(4) the development of procedures for setting 
up specific criteria for success in acquiring each 
major type of skill; (5) the rapid mechaniza- 
tion of military aircraft and equipment, e.g., 
remote control aircraft and missiles, may out- 
mode a great deal of present testing and require 
different aptitudes; (6) a standby research and 
selection program, revised periodically, is nec- 
essary to provide an adequate plan for emergen- 
cies. This in turn is likely to require an adminis- 
trative plan which will encourage the widespread 
participation in research of civilian as well as 
military psychologists. 


For a review of the series, see B2. 
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MENDEL, Atrrep O. Personality in Handwriting: 
A Handbook of American Graphology. Foreword 
by Rudolf Arnheim. New York: Stephen Daye Press, 
1947. Pp. 375. $3.50. * (PA 22:2156) 

Am J Psychiatry 105:238 S ’48. George E. 
Reed. This book has....three main divisions en- 
titled “At First Sight,” “At Second Sight,” and 
“At Third Sight" * In the section “At First 
Sight" there are chapters devoted to style evalu- 
ation, the margins, spaces, directions of lines, 
the slant, pressure, and writing zones. These 
are fairly orthodox. On page 50 the author de- 
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scribes his method of scoring the style Value, 
which is a long step in the right direction, “At 
Second Sight” includes chapters on symmetry, 
legibility, size, connections and spacing of let. 
ters, pace of writing, and concealing and coun- 
terstrokes. Without subscribing to many of the 
author’s conclusions this section may also be 
described as generally accepted material, “At 
Third Sight” the author emphasizes the im- 
portance of the down stroke which he describes 
as the “stable axis” and the significance of 
lateral strokes (mobile axis) as well as other 
special features such as circular, initial, and end 
strokes and simplification. In this section the 
writer frankly attempts to adapt Freudian psy- 
chology to handwriting interpretation. Although 
this should not be difficult, with so embryonic 
a study as graphology and so all-embracing a 
system as dynamic psychology, the examples 
given are not convincing. The references and 
handwriting examples that occur on almost 
every page are frequently somewhat vague. The 
informed will find many points of interest in 
the author’s views and emphasis but the novice 
will be misled by passages that are pure con- 
jecture, controversial, oversights or frank er- 
rors. * Unfortunately the book....does not always 
divide what we know from what we may believe 
or suspect at this stage of graphological develop- 
ment. Most workers would approve wholeheart- 
edly of Chapter 19 dealing with simplification, 
and much of Chapter 18 on circular strokes 1s 
accepted and all of it stimulating. However the 
chapter on psychopathology and references t0 
paranoid or epileptic characteristics of Es 
would better be omitted altogether. This | 
well be said of the chapter on physiology 25 Муч 
if only for the reason that the author attempts a 
divide handwriting into a dualistic act, 
physical and part psychological. If he eee 
separate behaviour there is no hope of a Hak i 
of graphology. Despite these criticisms t a 4 
much of value in the book which deals V А 
difficult and developing subject. In md 
graphology has suffered from too many " 
measuring techniques just as it may ™ fora 
treated in too impressionistic a manner 
time. LH 
B Menninger Clinic 14:148 Jl ^50. W iia . 
Brown. The author makes a serious ME (6 
raise the status of handwriting analy ys : yd 
depths of charlatanism to the level oi |, how- 
logical science. As with most grapholog? y and 
ever, his conclusions are based upon ana E 
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rather flimsy empiricism. A well-known per- 
sonality type, Hitler for example, shows certain 
peculiar characteristics in his handwriting. 
Therefore if your writing reveals the same signs, 
you are assigned character traits similar to Hit- 
ler's. Mendel has organized his book in a man- 
ner which permits investigation of graphologi- 
cal hypotheses without burdensome study of 
intricate techniques. He has adapted the methods 
of European graphologists to American hand- 
writing, particularly as it is learned from the 
Palmer method. 

J Am Med Assn 137:1096 Jl 17748. * a lucid, 
well written, convincing volume which takes 
graphology out of the realm of mysticism and 
presents it as a scientific approach to the under- 
standing of human personality. The whole book 
is so well written and the author’s style is so 
simple and convincing that the reader finds that 
this is not just a book to be read but one that 
may be studied with pleasure and profit. Noth- 
ing is left to the reader’s imagination and the 
deductions and conclusions reached appear log- 
ical and are reinforced with much concrete evi- 
dence for their validity. This book is highly 
recommended reading for any one seriously in- 
terested in personality study. 

J Nerv & Mental Dis 108:169-70 Ag '48. 
(7. A.) Piotrowski. * concise and lucid manual 
* It gives the critical reader ample opportunity 
to study the methodology of graphologists, and 
thus teaches a great deal in an entertaining man- 
ner, The details are many, but they do not blur 
the main outlines of this handbook. The author’s 
own contributions are found chiefly in the chap- 
ter on psychopathology in handwriting. Alfred 
Kanfer wrote the last, 20 page chapter on the 
relationship between handwriting and physio- 
logical disturbances. No claim is made that the 
conclusions relating to pathology are conclusive, 


but the two chapters illustrate how graphology - 


can be applied in the field of pathology. It would 
appear that graphology can be used as ап aid 
in detecting many significant personality traits. 
Its value as a diagnostic aid in neuropsychiatry 
or other fields of medicine is still very uncertain. 

Nerv Child 7:346 J1748. The author has some 
original ideas about graphology which may in- 
terest psychologists and those who are already 
Convinced that the study of handwriting can 
"hish clues to the discovery of specific traits 
m individuals, The sceptic who does not accept 
the value of personality evaluation from hand- 
Writing will hardly be converted by this attempt. 


Psychiatric Q 22:164-5 Ja '48. * The book is 
well written and it can be considered a text for 
those studying or wishing to study graphology. 

Psychoanalytic R 36:107-8 Ja "49. Kelley. * 
Apparently the author is widely read and this 
book undoubtedly is the first dealing primarily 
with American graphology. The discussions, for 
example, on the influence of various types of 
copy-book style are extremely interesting. The 
meat of the volume is, of course, the author's 
basic system of graphology which, while flavored 
with concepts of other experts, essentially is. 
original, particularly in its organization. The 
author has presented an excellent beginner's 
handbook but one wonders if even a comprehen- 
sive study of this volume alone would ever prove 
the short cut to total understanding of the per- 
sonality that the publisher promises. One also 
wonders at the scientific accuracy of the author, 
particularly his emphasis on certain illnesses 
with their actual organ location manifested by 
specific errors in the writing. As usual, with so 
many books of this type, there are too many un- 
supported statements and too few facts. There 
is also too much of the dream book pattern pres- 
ent and tables are appended which translate 
specific findings into equally specific personality 
manifestations. There is no doubt but that this 
"science" of graphology is progressing and in 
time may achieve adequate recognition in this 
country. It is doubtful if the present volume will 
add much in this achievement and it is equally 
doubtful if further advances will be made until 
they can be backed by adequate data founded in 
sound methodology. The appendix of the book 
contains a contribution by Alfred Kanfer dealing 
with physiology and pathology in handwriting 
and psychopathology in handwriting. Kanfer 
presents some interesting material and is cer- 
tainly more cautious in his claims. He falls into 
the pitfall of using psychiatric nosology without 
adequate definition but certainly does present 
some interesting ideas. The book is much too 
complicated for the average reader unless he is 
trained in psychology and has pertinent interest 
in graphology. For the scientifically trained, it is 
too superficial. For en superficially trained, it 
is too pseudo-scientific. 
is к В 46:324-5 П 40. Werner chen 
Rudolf Arnheim, introducing Mendel ^ ook, 
presents graphology а5 а projective tec ue 
while the author himself calls it the psychology . 
of man's finest expressive gestures. In the opin- 
ion of the present reviewer projective and ex- 
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pressive methods should be differentiated from 
each other. Projective methods (Rorschach, 
ТАТ, doll-play etc.) diagnose a subject's pro- 
jections of ideas upon a given, more or less 
structured material, while the analysis of ges- 
tures or expressive movements deals with a di- 
rect motor expression which is neither projected 
upon structured material nor transformed into 
any content. While many non-graphologists ex- 
plain graphic gestures by learning and by chance, 
Mendel starts with the assumption that graphic 
movements are patterned by personality. "Every 
gesture may fuse three currents in man: the 
emotional, the intellectual, and the physical. 
Likewise every stroke of his pen." However, 
with the admission, “I have speculated freely, 
drawing tones and shades from the slightest 
clues,” Mendel plays into the hands of those who 
condemn graphology and does not do enough 
justice to himself. He relies in many ways on 
previously tested relationships between certain 
graphic movements and certain personality 
trends and between certain movement configura- 
tions and personality patterns. The lack of exper- 
imental data, of methodology, and of a bibliog- 
raphy is partly offset by detailed tables of 
interpretation and by the use of graphology in 
diagnosing normal and abnormal aspects of per- 
sonality, supported by empirical observation. 
Especially interesting is the chapter done on 
“Physiology and Pathology in Handwriting” by 
Alfred Kanfer, who validated his graphological 
observations of patients by hospital diagnoses. 
Mendel follows the statement of Gardner Mur- 
phy, which he quotes in the beginning of his 
book, namely, “not to maintain that the interpre- 
tation of character by handwriting is a science; 
its most devoted adherents would make no such 
claims.” As far as graphological diagnoses can 
be validated by empirical experience, as far as 
successful predictions can be made far above 
chance expectancy, as far as graphic movements 
can be measured in their pattern and evaluated 
in their expressiveness, as far as the method- 
ology is consistent, graphology has to de- 
mand a scientific recognition. In the opinion 
of the present reviewer, the direct analy- 
sis of expressive movement without inter- 
mediary material for projection potentially al- 
lows a more precise and more defined applica- 
tion of scientific criteria than projective methods. 
Mendel’s illustrations and interpretations en- 
rich our dictionary about the language of per- 


sonality but they do not reveal the laws of its 
grammatical structure, those laws which govern 
graphic movements as one of the many expres- 
sions of personality. When Mendel states, “Į 
hope the experimental psychologist will take my 
hunches, examine them, then reject or verify 
and refine them,” the present reviewer is one 
who supports this appeal. 
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in the Human Relations Aspect of Work-Group 
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Psychological Monographs: General and Applied, 
Vol. 65, No. 3, Whole No. 320. Washington, D.C.: the 
Association, Inc., 1951. Pp. v, 29. Paper. $1.00. * (РА 
25:7132) 
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*MicHAEL, WiLLIAM Burton. Factor Analyses of 
Tests and Criteria: A Comparative Study of 
Two AAF Pilot Populations. American Psycho- 
logical Association, Psychological Monographs: Gen- 
eral and Applied, Vol. 63, No. 3, Whole No. 2%, 
Washington, D.C., the Association, Inc., 1949. Pp. v, 
55. Paper. $1.00. * (РА 24: 6080) 
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*MicHeets, WiLLIAM J., AND Karnes, M. Ray, 
Measuring Educational Achievement. New York: 
McGraw-Hill Book Co, Inc., 1950. Pp. vii, 496. $5.50. 
ж (London: McGraw-Hill Publishing Co., Ltd. 47s.) 
(PA 25: 5686) 

Sch R 59:244-5 Ap "51. J. R. Gerberich. * 
Micheels and....Karnes, faculty members in the 
industrial education departments of two large 
state universities, revised and expanded ma- 
terials which they developed during World War 
II in their work at the Armored School, Fort 
Knox, Kentucky, for publication as а general 
book on Measuring Educational Achievement. 
* Tests of achievement, general intelligence, an 
aptitude; interest inventories; and ple 
techniques—all receive attention. Several me 
esting, unusual, and probably not widely eni 
materials are illustrated. Of doubtful oni : 
cause of length and because of the level ап 


1 ; e 
stricted. usage of the test illustrated, are t 
mple directions 


Educational Aptitude Battery. 
hand, the Purdue Vocational Tests, a 
applicable in industrial education, are acc | 
only brief descriptions in this chapter, ze y 
primarily to acquaint students with the b * 
and general uses of evaluation instrume e 
Despite their awareness of the need for QE 
bility of scores, and the adequate attention £ | 
elsewhere in the book to derived measu" 


rg a0 
comparability of course marks, the autho 
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not include this concept, no less important for 
informal objective testing than for standardized 
tests implemented by adequate norms. * The 
four chapters on ob ject tests, manipulative-per- 
formance tests, observation and evaluation, and 
evaluating major projects make up an outstand- 
ing feature of the book. Principles of good test 
construction are embodied fully in the recom- 
mendations, and excellent illustrations are given 
for evaluating procedures and products in indus- 
trial education. For example, the analysis of 
operations for a performance test in woodwork- 
ing, the analysis and classification of measurable 
qualities, the directions to the pupils, the di- 
rections to the examiner, and the check lists of 
procedure and quality, all reproduced in detail 
and adequately implemented by discussion, set 
before the student an excellent specimen of 
workmanship as a guide in the evaluation of out- 
comes in his own classes. This section of the 
book also gives significant treatment to the evalu- 
ation of total performance in complex series of 
varied learning situations through the use of 
progress charts and other observational aids. 
Procedures for tabulating and analyzing test 
scores and for assigning course marks are well 
treated. Item-analysis procedures for use in the 
improvement of measuring instruments also re- 
ceive adequate treatment. However, the Hoyt 
analysis-of-variance technique may well look too 
involved to students, even though the procedures 
are presented by rule-of-thumb rather than at 
the higher level of real understanding. The use 
of analysis-of-variance symbolism and the termi- 
nology of this advanced statistical technique 
may also reduce the effectiveness of this section, 
although the able student can use the procedure 
in obtaining item-analysis data that he can un- 
derstand and use. * The authors present many 
helpful and, in some instances, original sugges- 
tions to the teacher who constructs tests for a 
Continuous program of measurement and evalu- 
ation, These have to do with some of the me- 
chanical aspects of recording and filing infor- 
mation as well as with such time-saving devices 
às а nomograph for determining item-discrimi- 
qon indices and with objective procedures 
yee the quality of work produced in bui: 
tions ae X кые AR » 
bod S book....are found in the chap 

uct and procedures measurement—an often 


neglected or casually conducted aspect of pupil 
‘valuation, 
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XMicHicAN SrATE CoLLEGE, TRE Boarn or EXAMIN- 
ERS (PAUL L. Dresser, CHAIRMAN, LAWRENCE BABB, 
Ernest M. Banzer, Jonn Н. DAUGHERTY, WALKER 
Н. Нш, Ricuarp H. Jorman, Marria E. LAYMAN, 
V. E. Leicury, Lewrs B. MavuEw, CLAnENCE Н. 
Netson, Osmonn E. PALMER, AND FREDERICK Н. 
Srutz). Comprehensive Examinations in a Pro- 
gram of General Education. An introductory chap- 
ter by Howard C. Rather. East Lansing, Mich. : Michi- 
gan State College Press, 1949. Pp. ix, 165. $4.00. * 

Ed Res В 30:167-8 S 12°51. John V. Mc- 
Quitty. * The authors have told their stories 
with frankness; they have stated their basic 
philosophy, their problems, their solutions, their | 
strong and weak points, their successes and 
shortcomings. If the reader makes due allow- 
ances for an understandable and pardonable en- 
thusiasm of the authors for their work, he is 
impressed by their desire to give both sides of 
the controversial issues. * I am glad that the 
folks at Michigan State have put their story in 
writing. We shall all profit by having ready 
access to their experience. 

J Appl Psychol 35:85 Е '51. Harold D. Car- 
ter. * Educational objectives as stated in any 
educational program need to be clarified in the 
light of evaluation, Comprehensive testing, 
rather than testing upon small units of assigned 
work, is desirable. The truth of both these prin- 
ciples has long been recognized, but in common 
practice both principles are ignored. The test- 
ing program at Michigan State College furnishes 
an exception, and this book which reports upon 
it can have great value. The attempt to indicate 
in some detail the form taken by new-type tests 
in one general education program provides a 
basis for critical reaction by scholars. One may 
hope that it will lead to further constructive 
activity. The book should be read by all college 
professors. 

J Higher Ed 22:396-7 О ’5r. Henry Borow. 
* the volume is almost as much а description of 
the curriculum of general education as an ana- 
lytical account of a distinctive testing program. 
If, as happens to be true of this reviewer, one 1s 
sympathetic to the spirit and aims of general 
education as here described, he will likely con- 
clude that the system of examinations falls neatly 
into place and provides the critical instrumenta- 
tion by which student attainment may be me- 
thodically appraised. * The introductory chapter, 
written by the late Dean Howard С. Rather, 

resents a succinct and well-formulated account 
of the development and objectives of the Basic 
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College. * The preparation of examinations 
for the seven departments of the Basic College 
is painstakingly described in a series of chapters 
" written by the examiners holding chief responsi- 
bility for these departmental tests. * A. marked 
degree of parallelism in the manner of presenta- 
tion, perhaps by editorial design, binds this series 
of chapters together in a logical unit. * Perhaps 
the chief contribution of the volume is that it 
is almost the only full-scale publication which 
concedes comprehensive achievement testing its 
deservedly crucial place in the structure of col- 
legiate education. It accomplishes this not by 
prescribing what should be done but rather by 
the all too rare pedagogical device of reporting 
what is being done at one forward-looking insti- 
tution. The book has its flaws. Not the least of 
these is the strong implication (page 9), log- 
ically indefensible, that to mark students on the 
basis of abilities and skills is consistent with 
democratic principles but to mark them on atti- 
tudes, appreciations, and the like, is not. In the 
large, however, educators who admit to the 
view that competent achievement testing is an 
indispensable cog in the academic machinery 
cannot fail to find much in this work to kindle 
their enthusiasm. 

Jun Col J 21:115-8 О '50. Herschel T. Man- 
uel. * With the opening of the Basic College, a 
Board of Examiners was established to provide 
a system of comprehensive examinations for 
measuring attainments in the seven basic areas. 
The Board is administratively independent of 
the instructional department but has the co- 
operation of the departments in preparing the 
examinations. Grades and credits in the basic 
courses are given upon the basis of the compre- 
hensive examinations alone. * [This book is] 
an account of the experiences of the examining 
staff during the first three years of the program. 
Following the introductory discussions, seven 
chapters are devoted to the seven basic areas, 
each written by an examiner in the area under 
consideration. These chapters give an interest- 
ing insight into the techniques of test construc- 
tion, with generous and helpful illustrations of 
items constructed. * will be of interest to junior- 
college teachers and administrators who wish 
to learn more about basic courses, comprehensive 
examinations, or techniques of measurement. 
Although the separate board of examiners is 
distinctly a large-college organization, the dis- 
cussion has value for the small college as well. 
The lone teacher of a course will find suggestions 
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for the improvement of his own examinat 
and the student of education will find (riii. 
topics for discussion. The book is written at à 
level to be understood by the non-technical by 
thoughtful reader. On the technical side the 
method of building an examination and the es. 
phasis upon understandings and critical thinks 
in the examination merit special study, The 
techniques of test construction are illustrated, 
and many examples of items are given, The dis. 
cussion will be helpful to teachers and examiners 
who are working with the test-preparation prob- 
lems that go beyond information and simpl 
skills. The idea of giving credit on the basis of 
comprehensive examinations, "in some case 
with no enrollment in the course at all" (an idea 
not new or peculiar to this program), intrigues 
the present reviewer. Apparently, it is assumed 
that this is good. Perhaps it is, certainly it cas 
be if it opens a better educational opportunity to 
the student than would be had by taking the 
course. The plan seems to imply (1) that a 
course has a certain “ceiling” which sets a limit 
to growth; or (2) that the instructor is unpre 
pared to stimulate and guide his more capable 
students; or (3) that the time saved will be 
better spent in another course, a course better 
adapted to the level of scholarship already st- 
tained. It would be interesting to discover Ju 
how in fact the time saved is spent. There # 
indeed the possibility that giving credit by ex 
amination will cut short the total time to be 
devoted to formal education. If the goal, f 
example, is a bachelor’s degree and the time my 
be cut from four years to three by examinati 
why use the fourth year? If the fourth year ® 
to be spent in a professional school, thea ч 
for economy of time might be compelling. 
means, on the other hand, that the student » 
be exposed to formal education three a^ A 
stead of four, the answer is not so € ear. 
supposed that college has just so much to 
and that the time for taking on this fixed " 
of culture is the important variable? І 
would seem а great waste of time : 
permit the gifted to spend four years 00 p 
that is for them only a three-year or eve? 
year job. Is it not possible, however, A 
third and fourth years can be made [Pu 
even for gifted students? Is it not a litt cot W 
to encourage a talented youngster e mad i 
education short simply because he car ble? 
a shorter time the goal set for the vere w 
reader may wish to know more 
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courses, "Effective Living" and "Social Sei- 
ence." In Chapter Ш we are told that "Effective 
Living may be called a course in the psychology 
and philosophy of human behavior," that its 

“is to help each person recognize, ana- 
lyze, and solve problems which he meets as an 
individual, as a member of a family, and as а 
member of other groups," and that the fields 
which have seemed most useful are philosophy, 
psychology, sociology, home economies, and 
education. The problems are "the core around 
which content is developed." The course in So- 
cial Science, according to Chapter VI, "tries to 
aid students in understanding fundamental and 
persistent problems of modern society," It is 
said that, "the particular function of Social Sci- 
ence in the program is training the understand- 
ing of the social, economic, and political prob- 
lems of group living.” The title, “Effective 
Living,” could well designate a course which cuts 
across other fields—such as biology, hygiene, 
and even the fine arts—as well as the social sci- 
ences, It seems to be an attempt to organize ma- 
terials on a new basis, which pays no attention 
to traditional lines of cleavage. Yet its material 
is restricted; it has to be. Not everything can 
be presented in a single course. The materials of 
education. must be presented serially for the 
simple reason that they cannot be presented or 
apprehended in a single exposure. Perhaps the 
two courses, Effective. Living and Social Sci- 
ence, will provide the basis for an evaluation of 
two contrasting methods of organizing materials, 
Both seem pointed toward the solution of prob- 
lems, and some of the problems at least seem to 
be in the same general field. * it should be re- 
called that the report deals with the period, 
1944-47, and like other printed accounts runs 
behind the current situation. Doubtless а sup- 
plement as of today would add important in- 
formation and conclusions. 
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Р m Reports, 
W ^. Government Printi 
e. Рез DAS Оа М print * (РА 22: 4153) 
J Consult Psychol 12:125-6 Mr-Ap '48. Lan- 
rance F, Shaffer. * The report includes ех- 
cellent descriptions of the job of the military 
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on the basis of further experimental evidence, 
it was possible to conclude that, under the con- 
ditions of the investigation, “the type of plane 
in which students receive their basic training 
does affect their performance in advanced train- 
ing." Beyond the unique value that such studies 
have in providing information critical to effi- 
cient training, they also demonstrate the feasi- 
bility of employing objective measures of flying 
skill for the quick evaluation of a large scale 
pilot-training program. To the reviewer the 
most important single lesson in the report is the 
repeated demonstration of the limited value of 
many “testing” instruments—even when fully 
exploited, for doing a job that requires basic ex- 
perimental research in order to obtain conclusive 
answers. As the authors point out, "While the 
results of laboratory experiments cannot pre- 
dict exactly what will happen in a complex situ- 
ation like that involved in flying training, they 
can tell one what to look for and how to go about 
setting up a series of training experiments to 
decide what kinds and amounts of training... 
will be most efficient in the long run." 


For a review of the series, see B2. 
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Am Sociol R 11:248 Ap 46. * Research work- 
ers will no doubt continue to disagree about the 


significance, validity, or usefulness of this meth. 
odology, but all will agree that it represents care- 
ful, thoughtful work. 

B Menninger Clinic 13:136 J1*49. Louisa p, 
Holt. The authors of this “monograph” attempt 
to borrow some of the prestige won by the studies 
of autocratic and democratic group atmospheres 
undertaken by Kurt Lewin, Ronald Lippitt, and 
R. K. White. They succeed in indicating the 
value of using sociometric choices to determine 
seating arrangements at meal time in an institu- 
tional setting. The assignment of the girls who 
were the subjects of study to live in cottages of 
their own choice also distinctly lessened the 
extent and severity of adjustment problems, it 
was found. A brief and somewhat unrelated ac- 
count is given of the procedures for training "re- 
jects" of the group to assume positions of leader- 
ship. 

J Nerv & Mental Dis 109:280 Mr '49. This 
monograph is a republication of a paper appear- 
ing in 1938 in Sociometry, by the authors, un- 
der the title “Sociometric statistics of Social 
Configurations.” It offers in detail a technique 
for the evaluation of social configurations and 
presents the various aspects of sociometric pro- 
cedure. The mathematical analysis of a popu- 
lation unit in action brings a number of patterns 
and relationships into the foreground for the- 
oretical consideration. It constitutes à useful 
study for those working in this highly specialized 
field. 
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tests. It combines a treatment of the pr? 0 
or theory of testing with some d 
manual. * The treatment is well-balant® ‘with 
exposition clear—though at times burden 


details—and the literature is well cover” 
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physical make-up the book is a refreshing re- 
turn to pre-war standards. 

Cath Ed R 48:140-1 F '50. Walter L. Wil- 
bins, * well recognized ability to write a readable 
college text is adequately exemplified in this 
book * А good deal of space is devoted to de- 
scriptions of tests, although the book does not 
attempt to duplicate Buros’s Third Mental 
Measurements Yearbook. Although Professor 
Mursell stresses the practicality of tests, he dis- 
misses some of the most widely used testing 
tools of the clinicians pretty cavalierly : the The- 
matic Apperception Test with a paragraph, and 
the Rorschach with not much more—represent- 


ing the author's judgment that “projective test- _ 


ing in the immediate future must steer a course 
between charlatanry and alien pseudoscientific 
rigidity.” * 

Ed Outlook 22:166-7 Mr’ 48. Miles Murphy. 
* The predominant position given to the meas- 
urement of intelligence is perhaps justifiable on 
the ground of its importance in the history of 
psychological testing, but the treatment of other 
types of testing seems as a result somewhat 
limited, * Mursell....provides a particularly valu- 
able chapter on wider issues in psychological 
testing. Throughout there is adequate emphasis 
on the theory of measurement and the relation 
of testing to psychology. This should prevent the 
student who uses this book as a text from re- 
garding tests as mere technical tools, as so often 
happens, 

J Abn & Social Psychol 43:406-7 Jl 46. 
Clare Wright Thompson. Apparently this book 
isdesigned as a text for the upper division course 
in mental testing, presumably for students plan- 
ning to continue with a practicum in this field. 
The choice of subject matter and references has 
about it the flavor of an education, rather than 
а psychology, course in mental tests. * The first 
two chapters are perhaps the most acceptable 
in the book, Even here, the instructor may well 
Wish to orient discussions of validity, reliability, 
standardization, etc., around specific tests rather 
than to present them in the abstract at the be- 
ginning of the course. The rest of the book, how- 
‘ver, suffers from several important weaknesses 
Which seem serious enough to make it a difficult 
text to use, First, the material included and the 
distribution of emphasis seem quite individual 
үш this author, The discussion of general in- 
à ligence, for example, both in the chapter 50 
abeled and throughout the rest of the book, 
Taws heavily on two sources not given SO much 


в 


importance elsewhere in the psychological liter- 
ature: Stoddard's Meaning of Intelligence and 
Boynton's Intelligence: Its Manifestations and 
Measurement, The omissions are as remarkable 
as the inclusions, with the references consisting 
largely of secondary sources. E. В, Greene's 
Measurement of Human Behavior is most fa- 
vored here. The discussion in the text of the 
Vocational Interest Blank gives only secondary 
references, without so much as mentioning 
Strong’s name, and his Vocational Interests of 
Men and Women is not found even in the bibli- 
ography ; the theory behind the Allport-Vernon 
Study of Values is explained with no mention of 
Springer; Pintner's discussion of the Terman 
Gifted Group is given and Terman's is not; Ter- 
man’s classification of intelligence quotients for 
the 1916, but not Merrill's for the 1937, Stan» 
ford-Binet is quoted, A similar criticism can 
be made of the material included in the chapters 
on testing. The presentation of individual intelli- 
gence testing is historical and consists of what 
is usually covered in such discussions. Other- 
wise, however, the instruments selected do not 
parallel those assuming importance either in cur- 
rent psychological literature or in clinical prac- 
tice. Moreover, the organization is not chrono- 
logical and the student is left with no idea of 
resent usage. Thus the Terman Group Test of 
Mental Ability is listed rather than the Terman- 
McNemar ; the Cattell Infant Tests are omitted ; 
the Zyve Stanford Scientific Aptitude Test is 
included; the Minnesota Multiphasic Personal- 
ity Inventory is not included. with personality 
tests, although the Logical Decision Test is; the 
Kuder Preference Record is omitted but not 
Miner's Analysis of Work Interests; and so on 
in every field. The second weakness centers 
about the role of statistics in such a course, Sta- 
tistical principles are omitted because of the au- 
thor's belief that "the extremely superficial 
treatments of these subjects not seldom found 
are likely to do more harm than good, because 
they produce the illusion of understanding with- 
out the reality.” For this reason, he recommends 
instead a collateral or prerequisite course in 
statistics, and “any serious student of mental 
testing should be told frankly that he ought to 
be willing to spend the time and effort to under- 
stand the statistical concepts and techniques in- 
volved." The asters case de pes rigor 
i lined by his disregard tor ! 
m jeu The yea will indeed have to е 
pend time and effort if he is to evaluate a table 
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average IQ's when no measures of dispersion 
are given (Table 43) ; to realize that a test said 
to correlate .98 with psychiatric diagnosis (р. 
251) is actually using the standardization group 
as the validating group when the author does not 
uncover this; to understand the effect of limited 
range on correlation coefficients (Table 23) 
where this is not mentioned in the text ; to weigh 
the results of controversial studies when these 
are presented as fact (cf. p. 81 discussion of 
Rapaport, Gill, and Schaefer) ; or to understand 
that although “reliability” has meant test relia- 
bility for 394 pages, on page 395 it suddenly 
means inter-scorer reliability. There is another 
gross discrepancy between the author’s state- 
ment of his position and what he does about it. 
The orientation of the book is said to be around 
the view that “psychological tests must frankly 
be regarded as practical instruments, and used, 
evaluated, and interpreted as such.” From this, 
one might expect to find, somewhere in this 
book, material dealing with the selection and in- 
terpretation of tests for psychiatric diagnosis, 
for vocational guidance, or for evaluating and 
planning an effective curriculum, with possibly 
an actual case or two to illustrate these. Any such 
emphasis is sought in vain. The “practical” is- 
sues referred to are the broad theoretical ques- 
tions of socio-economic factors, of heredity and 
environment, of the amount of education, and 
of race as related to the intelligence of groups of 
people. Altogether, one emerges with the feeling 
that this is a book on testing written by someone 
whose experience with psychological tests is 
neither extensive nor current. This impression 
is intensified when, occasionally, the author 
speaks with great authority and makes a real 
contribution. Thus the section on talent tests is 
the best in the book, with a particularly sophisti- 
cated examination of tests of musical talent. The 
author has published in the field of the psychol- 
ogy of music, and his treatment of this topic 
might well be required reading in every course 
on mental testing. However, the instructor who 
uses the entire book as a text will find that, un- 
less he himself does an heroic amount of work, 
his students will finish the course with a set of 
misconceptions about the testing field or will be 
unable to reconcile their lecture notes with the 
textbook. 

J Appl Psychol 32:679-80 D '48. W. Grant 
Dahlstrom. * Apparently organized as a text for 
education students with previous statistical 
training who will have little additional instruc- 


tion on testing but who will be called Upon to 
utilize test data frequently * Evaluation should 
be largely in terms of the author’s own aims, but 
there are some general criticisms which any book 
should meet. In a recent publication, recent ma- 
terial is expected. In terms of the tests selected, 
the forms described, the validation studies cited, 
the content is often wanting. The tests for dis. 
cussion must serve many purposes. As examples 
of the various types they must be either popu- 
lar, useful, or grossly bad. They must satisfy 
the student requirements of being conversant 
with tests he is likely to encounter, of being 
aware of serious drawbacks or unusual poten- 
tialities in some of them, of knowing how to 
interpret or when to ask for the tests available, 
Here again, the choice however large is poor. 
There are glaring omissions, like the Kuder 
Preference Record in the area of interests, the 
MMPI and the Guilford batteries in personality. 
The test criticisms are often perfunctory, occa- 
sionally erroneous. Although it is delightful to 
find how effectively he dispenses with Links 
Personality Quotient Test, it is strange to see 
the Humm-Wadsworth actually given consider- 
able acceptance. There is no one place where the 
beginner is systematically instructed in the pro- 
cedures and sources for evaluating tests not m- 
cluded in the text. In meeting his own aims, 
Mursell's performance is checkered. It seems he 
treats rather well methodology in theory and 
practice—the usual problems of reliability, vali- 
dation, standardization, etc. but often contra- 
dicts himself later in discussion of specific tests 
Definitions are important for beginners, but he 
seems to be troubled by them; often mere refer- 
ence to Warren’s Dictionary would settle kar 
Scant attention is given to the history of the я 
ing movement, although this would y ie 
excellent vehicle for the large emphasis he p he 
on controversies and problems. Gratifyine!y, i 
takes up many other issues—e.g., factor p 
sis, scoring technics, projective ancient 
tations and potentialities of psychometr! 
pressing opinions which can of course d А 
appraised clearly after more data are ava g 
Students should be aware of these i а 
be informed sufficiently to follow the d e 
ments as the movement grows. The Us 
of the ID's material is of questionable t ie 
Naturally anyone dealing with tests 5 timabil 
familiar with these findings, but of ines bo 
greater worth would be careful gulda testing 
to utilize specifically the data in actua 
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and interpreting. Mere quotation of researches 
does not seem to achieve this, and would per- 
haps be more meaningfully presented in a course 
devoted to differential psychology. Nonetheless, 
the author has recognized a real need to be 
served by а book of this kind ; the testing move- 
ment will gain real strength in proportion to 
the sophistication instilled in those actually 
called upon to utilize its results. 

J Consult Psychol 12:60-1 Ja-F *48. Lau- 
rance F. Shaffer. * an informative, well-organ- 
ized textbook for the introductory course in 
psychological measurement * almost wholly de- 
scriptive. Theory is given scant attention ; sta- 
tistical methods are left to other treatises. On 
the whole the choice of illustrative tests is sound, 
and the information about them is up to date 
and well documented. There seem to be a few 
slips—a highly commercialized questionnaire's 
restricted distribution "adds to one’s confidence 
in it —the Kuder is unaccountably missing 
among several less important interest measures. 
Projective techniques, perhaps for the best at 
this level, are given only brief mention in a final 
chapter on “evolution and improvement,” along 
with factor analysis and profile scoring. * 

1 Ed Psychol 39:253-4 АР ^48. Раш E. 
Eiserer. * One "practical" testing problem Mur- 
sell overlooked is faced whenever an attempt is 
made to interpret test results to the persons 
who take the tests, This is a very practical issue 
both for clinicians and for educational and vo- 
cational advisors, and there is some literature. * 
There is a special value for beginners in this 
book because Mursell continuously points up the 
limitations of the knowledge with which he is 
dealing. This practice may help create the kind 
of critical thinking about psychological testing 
issues which is more important than mere fa- 
miliarity with what has been done. Any reader 
who is familiar with Mursell’s works has an 
expectancy for lucid writing and level-headed 
thinking, He will be rewarded by an uncom- 
monly sane approach to the field of mental test- 
ing. The book should prove a useful text for in- 
troductory courses, for it is balanced, critical, 
and well-written. 

J Ed Res 42:313-6 D ^48. СН. Wedell, * 
In terms of coverage, the author has done about 
as well as could be expected. * In general the 
treatment is such as might be expected in a 
work designed for teachers colleges and courses 
in testing in schools of education. Industrial 
psychologists, personnel workers, clinicians, and 
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vocational guidance workers as well as those 
interested in theory and technique will find little 
here to please them. The author's contention 
that his approach is primarily practical is some- 
what misleading. The treatment is no more 
practical than that of many others, And if the 
reader is looking for a searching account of how 
tests have worked, he will be disappointed. The 
statement that psychological tests must be re- 
garded as practical instruments should be taken 
to mean simply that no systematic viewpoint is 
adopted concerning the nature of intelligence 
or the structure of personality. In organization, 
the book is fairly sound. * As a whole the treat- 
ment is fairly adequate, though by no means 
scholarly or novel. A chapter on the applications 
of mental tests and their implications for testing 
is long on "applications" and short on "impli- 
cations." Moreover, the applications which have 
been selected for evaluation, while unquestion- 
ably important, again serve to emphasize the 
interests of the educator at the expense of a large 
number of other applications which might have 
been included, The topics reviewed in this chap- 
ter are: mentality and socioeconomic status ; 
family relationships and mentality; schooling 
and mentality; and mentality and race, The 
treatment is good, but confused at times by à 
tendency to forget that the issue is psychological 
testing, rather than whether race differences in 
intelligence exist. The same comments apply to 
the other three topics. The problems of соп» 
stancy of mental traits, mental growth, the dis- 
tribution of mental traits (normal or not?), 
heredity and environment, and the psychologi- 
cal significance of tests scores are touched upon 
in a penultimate chapter entitled “Wider Psy- 
chological Issues in Mental Testing.” They are 
not wider issues. They are only different issues. 
Moreover, by "psychological significance of 
test scores" is really meant non-psychol 
significance of test scores since the im- 
rt of test scores is the main question which is 
attacked. * Many students will find the 
difficult and at times very dull. This is due not 


ewhat turgid and verbose style in which 
appositional ia are used too liberally, Ос- 
casionally the language js inept, and many con- 
cluding paragraphs will strike the student as 
vacillating when the author's intent was to be 


i i hor is 
fair and impartial. In one respect, the aut 
to be congratulated. He has assumed а knowl- 


of elementary statistics, and has made no 

pooper scene tm ar epa 
statistical data is often incautious, to say the 
least, and - is throughout a 
tendency to cite secondary sources with the con- 
sequent implication of lack of scholarship. On 
the whole, there is very little in this volume 
that is scholarly, brilliant or stimulating. There 
is some that is superficial and inaccurate. There 
is much that is sound and workmanlike. 

Psychol B 45:181-3 Mr '48. Bert R. Sappen- 
field. Had the author written this book in 1940, 
it might have been considered a reasonably 
sound, up-to-date treatment of theoretical and 
practical aspects of psychological testing. How- 
ever, the reviewer was disturbed to find no men- 
tion of many of the relatively new tests which are 
of widespread current usage. Notable omissions, 
for example, are the Kuder Preference Record, 
the Minnesota Multiphasic Personality Inven- 
tory, and the Bennett Mechanical Comprehen- 
sion Test. A similar impression is gained from 
an examination of the bibliography, which in- 
dicates negligible reference to the literature ap- 
pearing after 1939. The author presents with 
unusual adequacy certain introductory concepts, 
such as the nature of psychological tests and 
some of the requirements of instruments (relia- 
bility, validity, objectivity, and standardization) 
in the first two chapters. Other fundamental 
concepts having to do with test-score interpre- 
tation are, for reasons not well understood by 
the reviewer, placed near the end of the book. 
That test scores have no known zero origin and 
that the units are unlikely to be equal are dis- 
cussed in the final chapter, though many teach- 
ers would prefer to deal with this material rela- 
tively early, as a basis for introducing the point 
that test scores become meaningful only in terms 
of their ordinal position in a distribution of 
scores for a group of known characteristics. 
Mursell devotes four chapters (144 pages) to 
his discussion of the concept of in and 
to methods for its measurement, His treatment 
is sound and rather thorough, departing insuf- 
ficiently from tradition to arouse i 
controversy or adverse criticism, His coverage 
of the New Stanford-Binet is much greater than 
that of the Wechsler-Bellevue, an emphasis 
which is usual but which the reviewer would 
prefer to see discontinued in books of this nature. 
In his treatment of tests in other areas than in- 
telligence the author seems to betray a less than 
desirable intimacy with his materials. This point 
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becomes э orea apparent in the 
voted to aj testing, wherea Cms. 
treatment of the difficult ін. м 
might have been expected. The рон 
more evident in the author's discussion od par 
sonality testing, where there 
tempt to cope with the complex conceptual 
lems involved in thís pe аю rev 
somewhat unusual criteria for 

ticular tests. For example, the author миш 
pressed with the Humm-Wadsworth Ti 
ment Scale partially because of ie таа 
while he neglects to mention the MMPI, wise 
was constructed on a similar basis (meting 
with Mursell's approval) but standardized ams 
carefully and less obscurely, Moreover, theme 
viewer cannot imagine that students ane Mily 
to develop interest in the fundamental 

of personality testing when a test is b 
as an “undefined hodge-podge" or as ^a plat 
pseudoscience.” The final chapter "à 
what Mursell regards as evolving improwm 
in testing practice and theory which are "ride 


in psychological significance," Here ant enm 
such topics as the concept of ране Г 
various suggested techniques for test 


terpretation, some standardization 

factor analysis, and projective technique 
discussion amounts to a fairly adequate 

tive introduction to these problems, 

the reviewer would disagree that whens ^ 
psychometric test is committed to mem 
ment... a projective test is committed M 
nosis.” It would seem more " 


mit that "measurement" is not 
either, and to regard both merely as 
айкала aimed at olicking responsie M r4 
tion by the psychologist. In general. 
Said thes the Look wil be wet pr 
courses in which the traditional 
intelligence is maintained, that it apposti 
as nearly adequate as any book of 2s 
toa , but awaiting 
ppear, | that, - 
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Brožek. * manifestly designed to sere 
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Pub Opinion Q 12:143-4 +P LÀ 
Schrader. * Discussions of many oí 0 
tests are apparently based on the 
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advantage, for basically it is sound and scholarly. 
It can be read with profit even by one who 
doesn’t yet know what this “IQ and mental test 
stuff” is all about. As to topics, Mursell devotes 
far and away the greatest share of space to the 
main stream of intelligence testing. The treat- 
ment of personality and motor-functions is cur- 
sory, as is the treatment of special analytic pro- 
cedures such as factor analysis. Nor are the 
test developments of the recent war adequately 
covered. Nevertheless, it must be admitted that 
adequate coverage of all topics would require 
a much larger book than would be practicable. 
Until the definitive treatment appears, this book 
will serve as a useful substitute. 


For an additional review, see 3:1043. 


[ B283 ] 

*MunsrLL, James L. Psychological Testing, Sec- 
ond Edition. New York: Longmans, Green & Co., 
Inc., 1949. Pp. xvi, 488. $4.00. * (PA 24: 190) 

Austral J Psychol 2:128 D'50. W. M. O’ Neil. 
* a valuable addition to the several available stu- 
dent texts on mental testing. * The material is 
well arranged and in general clearly presented. 
* The selection of tests for discussion is good 
and the aspects discussed are really important. 
This may also be said of the next two chapters 
on tests of aptitudes, interests, attitudes and 
personality, although the treatment of the var- 
ious aptitude tests tends to be brief. * The more 
technical and controversial issues are brought 
out clearly and are discussed in a balanced, un- 
dogmatic way. However, the discussion at times 
is not pushed far enough (presumably both to 
conserve space and to temper the wind for the 
lambs). Consequently the more able student is 
likely to be left with the impression that psycho- 
metric theory is vaguer and more insecure than 
it is. Further, most students will be left with a 
few wrong impressions. Some of these may arise 
from brevity. of discussion or from a failure on 
the part of the author to think certain matters 
through. For instance, the impression is posi- 
tively given that transforming raw scores into 
standard scores ensures equal units. Again, some 
students will be left believing that I.Q. and Co- 
efficient of Intelligence are very much alike 
apart from a different manner of computation. 
On the whole, the student is encouraged to take 
a critical attitude toward tests and data derived 
from them and is given a fair impression of how 
far the mental testing movement has got and how 
far it has yet to go. 
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Ed Forum 14:247 Ja’50. Harold H. Abelson, 
* essentially similar to the 1947 edition, It offers 
a solid diet of information about tests and dis. 
cussion of testing concepts and issues, with but 
a dash of the spicier ingredients associated with 
the newer dynamic and clinically oriented ap- 
proach to the study of mental life and personality; 
In this scholarly and careful work intelligence 
tests receive major consideration * Aptitude 
tests are succinctly reviewed in one chapter * 
The book is invaluable in conveying in a well 
ordered and mature way the major theoretical 
and practical contributions of the testing move- 
ment іп its more literal psychometric sense, Al- 
though conservative in his attitude toward the 
new approaches to personality study, Mursell 
makes sufficient reference to these trends to 
permit the interested reader to pursue the sub- 
ject further. * Particularly commendable is the 
fine balance between concise description of well 
selected tests and incisive critical discussion of 
tests and testing problems. This is no “cook 
book” manual of testing techniques, but a serious 
analysis of tests and testing. There is ample 
evidence that new tests and pertinent research 
have been examined conscientiously. The tone 
of the presentation is one that will command 
respect for the subject. Teachers should read 
this book and then go on to further study of the 
basic problems it raises. Р 

J Consult Psychol 13:383 О '49. eei in 
Shaffer. The revision....corrects a number of the 
shortcomings of a generally well-organized an 
useful book. * "T 

Psychol B 47:365-6 Jl ’50. Frederic i 
Davis. * The author writes well and has a 
good deal of the literature in the field pe 
urement ; the text is liberally sprinkled wit ҮҮ, 
note references to such sources. It 1s S 
that students will find the book readable nk i 
formative. * however, the book is not АШ 
їп any sense of the term. It is easy di 
point out specific errors of fact and in л 
tion in it, but it is time consuming to pir 
strate how fundamental misconceptions jm. ; 
understandings of psychological testing ™ . 
to the subtle distortions and misrepresen ntent. 
that are woven inextricably into the gi А 
It reads as though an intelligent Rn din p9* 
flair for writing but with no baa ie еа" 
chology ог statistics had surveye red of the 
ture on testing, taken a look at a aa udi 
most commonly used tests, and then Ps dil 
a book. To illustrate the sorts oi 6 
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characterize the text, let us consider the section 
on reliability in chapter 2. This section starts 
of “Е unchanging subjects are measured 
twice with a perfectly reliable instrument by a 
perfectly reliable agent, the correlation between 
the two sets of scores is 1.00” (Walker, p. 365). 
This statement briefly summarizes what is meant 
by reliability of measurement." The quotation 
from Walker is true, but it does not at all sum- 
marize what is meant by reliability of measure- 
ment. The classic definition of reliability by 
Spearman refers to the correlation of two sepa- 
rate but hypothetically equivalent measures. 
Later in the chapter (at the bottom of page 47), 
the writer contradicts an essential element in 
the formulation of reliability in terms of analy- 
sis-of-variance techniques when he writes, “In- 
terdependent items, i.e., those which present the 
same problem in different forms tend to lower 
reliability.” On the contrary, as item intercorre- 
lation is increased by testing the same point in 
several ways, test reliability tends to increase. 
On page 54, the author computes a standard 
error of measurement, finds it to be 9.36, and 
then interprets it as follows: “This tells us that 
there is a 68% chance that a person making any 
given score on the first testing will score within 
a range of +9.36 to —9.36 of that score on the 
second testing. That is, if a person makes a 
score of 170 on one testing, there is a 68% 
chance that his score on another testing (which 
is often called his ‘true score,’ i.e., his score on 
any other testing) will fall between 161 and 
179.” These statements will not meet with the 
approval of educational statisticians. If anyone 
supposes that the interpretation of test scores 
for practical purposes can safely be made by the 
author in spite of his apparent misunderstanding 
of the concepts of reliability, he should examine 
the statements at the bottom of p. 110 and the 
top of p. ттт concerning the interpretations of 
IQ's derived from the revised Stanford-Binet 
Scales, The author states, “Thus general relia- 
bility coefficents cannot be worked out for this 
scale, or for any such scales.” The truth is that 
a general reliability coefficient for Stanford- 
Binet IQ's could easily be computed, but stand- 
ard errors of measurement at several IQ levels 
are much more valuable for the informed test 
user than the general reliability coefficient. In 
the reviewer’s opinion, the faults of this book 
greatly outweigh any merits that it may possess. 
Tt does not seem to come up to the standard of 
the author’s previous writings or to accomplish 
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M Purpose; in the reviewer's judgment, 
PLU. never have been published. 
Walijo Been M МЕРЫ JI-Ag '50. Samuel 
A Hh it prias | ed for the beginning student. 
dcorum rui E : sound and comprehensive 
* Mursell h eld of psychological testing. 
quii as sought to include the more re- 
Eid im EH dos d in the fields enumerated 
e porate all that is pertinent in 
the literature. In this he has not been entirely 
successful. Many will doubtless feel that his 
treatment of projective testing is far too sketchy 
in light of the tremendous activity that has oc- 
curred, of late, in this quarter. He is further 
open to criticism on the basis of his very limited 
discussion of the complex interrelations between 
intellectual function and personality, particu- 
larly as these have been recently investigated 
by "scatter analysis," Even in an introductory 
text these topics would seem to warrant more 
extended presentation than they have received. 
On the other hand, the impact of factorial analy- 
sis seems to have been more nearly accorded its 
due. The reviewer concurs with the author in 
his decision to omit a statistical section from the 
book. All too frequently the fragmentary presen- 
tations that accompany handbooks of mental 
testing are either irrelevant or misleading or 
both. Particularly commendable are the critical 
discussions of such basic concepts in testing as 
validity, reliability, stability, etc. The circum- 
spection and lucidity with which these basic 
issues are covered should be of much help in 
orienting the beginner. In addition Mursell has 
thought intelligently and constructively about 
many of the assumptions underlying mental 
testing, and the sobriety of his views regarding 
both the possibilities and limitations of testing 
should exercise a salutary effect on the neo- 
phyte. 
For reviews of the first edition, see B282 and 
3:1043. 
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yer, Oscar N. The Langua 
and How to Read It. New 
Press, 1951. Pp. 207. $2.50. * 
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XNarINAL Counci ох MEASUREMENTS USED IN 
Epucation. The Fifth Yearbook of the National 
Council on Measurements Used in Education, 
1948-1949. Fairmont, W.Va.: the Council, Fairmont 
State College, 1949. Pp. v, 56, viii. Paper, mimeo- 
graphed. $1.00, * 
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Council on Measurements Used in Education, 

1948-1949. Fairmont, W.Va.: the Council, Fairmont 

sate College, 1949. Pp. v, 135. Paper, mimeographed. 
.00. 
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*NationaL CoUNCIL ON MEASUREMENTS USED IN 
Epucation. The Seventh Yearbook of the National 
Council on: Measurements Used in Education, 
1949-1950. Fairmont, W.Va.: the Council, Fairmont 
State College, 1950. Pp. v, 56, ix. Paper, mimeographed. 
$1.50. * 
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*NatIoNAL COUNCIL. ON MEASUREMENTS USED IN 
Epucation. The Eighth Yearbook of the National 
Council on Measurements Used in Education, 
1950-51. Fairmont, W.Va.: the Council, Fairmont 
State College, 1951. Pp. vii, 58, xviii. Paper, mimeo- 
graphed. $1.50. * 

[ B289] 
*Nationat EDUCATION ASSOCIATION, American As- 
SOCIATION FoR HEALTH, PHYsICAL EDUCATION, AND 
RECREATION, RESEARCH SECTION, NATIONAL RESEARCH 
Counc. Measurement and Evaluation Materials 
in Health, Physical Education, and Recreation. 
Washington, D.C.: American Association for Health, 
Physical Education, and Recreation, 1950. Pp. xi, 138. 
$2.50. * 

[ B290] 
*NarIoNAL EDUCATION ASSOCIATION, AMERICAN EDU- 
CATIONAL RESEARCH ASSOCIATION, COMMITTEE ON EDU- 
CATIONAL AND PsvcHoLociCcAL Tests, J. RAYMOND 
GERBERICH, CHAIRMAN. Educational and Psycho- 
logical Testing. Review of Educational Research, 
Vol. 20, No. 1. Washington, D.C.: American Educa- 
tional Research Association, February 1950. Pp. 99. 
Paper. $1.50. * 
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KNATIONAL EDUCATION ASSOCIATION, AMERICAN EDU- 
CATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
METHODS ОЕ RESEARCH AND APPRAISAL IN EDUCATION, 
J. Wayne WRIGHTSTONE, CHAIRMAN. Methods of 
Research and Appraisal in Education. Review of 
Educational Research, Vol. 18, No. 5. Washington, 
D.C.: American Educational Research Association, 
December 1948. Pp. 369-523. Paper. $1.00. * 
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*NATIONAL EDUCATION ASSOCIATION, AMERICAN EDU- 
CATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
METHODS or RESEARCH AND APPRAISAL IN EDUCATION, 
PauL Вгоммевѕ, CHAIRMAN. Methods of Research 
and Appraisal in Education. Review of Educational 
Research, Vol. 21, No. 5. Washington, D.C.: Ameri- 
can Educational Research Association, December 1951. 
Pp. 323-501. Paper. $1.50. * 
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*NarroNAL EDUCATION ASSOCIATION, ASSOCIATION 
FOR SUPERVISION AND (CURRICULUM DEVELOPMENT, 
COMMISSION ON TEACHER EVALUATION, GILBERT S. 
WiLLEv, Carman. Better Than Rating: New 
Approaches to Appraisal of Teaching Services. 
Foreword by Gladys L. Potter. Washington, D.C.: 
Association for Supervision and Curriculum Develop- 
ment, 1950. Pp. 83. Paper. $1.25. * 


Childh Ed 27:440-1 My ’5r. Increase in the 
use of teacher-rating schemes these last few years 
has given rise to serious consideration on the 
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part of those interested in good education a; tp 
desirable and efficient means of improving edu- 
cational programs. We know we want better 
schools, we know good teachers are the key to 
better schools, but whether we can have better 
teachers as a result of rating has been questioned 
by many. This thought-provoking pamphlet will 
help many teachers and administrators in an- 
lyzing some of the issues involved. * Better 
Than Rating meets a real need in American edu- 
cation today. It is so logically and clearly written 
that it makes a very convincing document. 

J Teacher Ed 2:148 Je ' 51. * The leadership 
of ASCD is to be congratulated for facing di- 
rectly a controversial problem which occurs 
periodically in every state and local community. 
There are unquestionably differences between 
effective teaching and ineffective teaching, and 
both laymen and professional educators occa- 
sionally assume that it ought to be a right and 
relatively simple matter to determine which is 
which, reward the “good” and penalize or elimi- 
nate the “bad.” Unfortunately, the systems de- 
signed to accomplish the above purpose have 
succeeded mainly in creating an atmosphere of 
mutual mistrust, suspicion, fear, and conformity. 
Better Than Rating lists the various systems of 
rating and the claims made for each. On the 
assumption that progress in democratic educa- 
tion depends on behavioral changes in the @ 
rection of creativeness and invention, the pam- 
phlet then describes the type of environment 1 
which such changes are likely to take place. The 
rating scales do not stand up against such crt 
teria, The process which is "better than rating 
thus emerges as one in which the school com 
munity organizes to foster professional gro i 
through cooperative evaluation, wide eset 
professional experiences, classroom aa 
mentations, help during probationary ien 
adequate physical and mental help, and s а) 
In short, there appears to Бе по quick an v, 
road to teacher evaluation ; only the long, ae 
and continuous process of total school Sad | 
ment appears likely to bring the results 0 
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XNaTIONAL EpUCATION ASSOCIATION, Division 
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Teacher Rating, Revised . Edition. үе, 
Pamphlet, No. то. Washington, D.C.: the Depa 
April 1950. Pp. 24. Paper. * 


[ B295 ] p. Exp 
X*NarIoNAL INDUSTRIAL CONFERENCE port Board 
ence With Psychological Tests. Соп 02. New 


Reports, Studies in Personnel Policy, ^9 
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York: the Board, 1948. Pp. 32. Paper. * [Distribution 
is limited to associated companies and to educational 
institutions, and governmental agencies.] 
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NATIONAL SOCIETY FOR THE Srupy or EDUCATION, 
CoMMITTEE ON THE MEASUREMENT OF UNDERSTAND- 
1NG, WILLIAM A. BROWNELL, CHAIRMAN. The Meas- 
wement of Understanding: The Forty-Fifth 
Yearbook, Part I. Edited by Nelson B. Henry. Chi- 
cago, Ш.: University of Chicago Press, 1946. Pp. xi 
338. Paper, $2.25; cloth, $3.00. * (PA 21:1272) ў 

Chicago Sch J 29:93 Ја-Е ?48. Denton L. 
Geyer. Addressed to the classroom teacher and 
written in non-technical style, this Yearbook 
explains how to appraise outcomes of educa- 
tion which go beyond acquisition of factual in- 
formation and involve understanding of prin- 
ciples and relationships. Ably prepared and 
simply phrased, it meets a real need in bringing 
the theoreticians and the practitioners together. 

Childh Ed 23:295 F "47. Edgar M. Draper 
and Alice Н. Hayden. * The measurement pro- 
gram has direct effect upon instructional prac- 
tices, and if these effects are to be beneficial, 
teachers and administrators will do well to study 
the suggestions and implications presented in 
this book. 

High Points 28:76-7 O '46. J. Wayne 
Wrightstone. * Because. it pioneers into the 
realms of evaluating such objectives as the inter- 
pretation of data and the application of princi- 
ples, as well as stressing tests of growth in 
thinking, this volume contributes something 
noy to the abundant literature on measurement. 

The committee preparing this yearbook is to 
be commended for the comprehensive, syste- 
matic, and yet simple manner in which they have 
brought together and presented the techniques 
for the measurement of objectives related to 
understanding and thinking in the various sub- 
ject areas. The volume points the way to à 
responsibility that all schools share to empha- 
Size and to implement in instruction and in 
evaluation real: understanding and thinking 
rather than the mere acquisition of information 
and skills, Teachers and supervisors are still 
prone to pay lip-service to the objectives of 
understanding and thinking. Admittedly, in- 
Struction and evaluation of these objectives are 
difficult. Here is an educational document that 
Will be of practical help to the teacher and super- 
Visor in planning more intelligently how to 
assess and to guide pupil growth in the under- 
standings of subject areas. This book is the 
first, most recent, and most usable but not the 
final statement on this important topic. The chal- 
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lenge is to move forward in emphasizing under- 
standing and thinking as objectives of the cur- 
riculum. Every teacher and supervisor can read 
this book with profit—read it critically and 
thoughtfully—then select and apply suggestions 
that are appropriate to his teaching situation. 

Nerv Child 7:123 Ja’ 48. This important hand- 
book for teachers provides material to help in 
solving special problems and stresses the need 
for understanding educational tasks. We should 
have been glad to find a long chapter on child 
health problems, especially the neglected mental 
hygiene aspects. 

For an additional review, see 3:1063. 

[8297 ] 

Union or Teachers, Transfer From 
Primary to Secondary Schools: Report of a Con- 
sultative Committee (G. B. Jeffery, Chairman) 
Appointed by the Executive of The National | 
Union of Teachers. Published for The National 
Union of Teachers. London: Evans Brothers Ltd. 
1949. Pp. 189. Paper. 75. d. * 

A.M.A. 44:214 Je-J1 49. This work attempts 
to consider with impartiality the whole problem 
of the 11+ Examination. A few paragraphs 
(104-107) make gratuitous charges of snob- 
bery against Grammar Schools and their Heads. 


In all the rest, we can i 
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gladly recognise the sin- 
cerity and industry of the compilers. We can 
whole-heartedly accept many of the arguments 
and some of the Recommendations : ‘the rejec- 
tion of “aptitude tests,” for example, and the 
wish to secure the co-operation of parents. Un- 
fortunately, we cannot believe that the N.U.T. 
Committee have surmounted their main prob- 
lem. Two schemes are propounded. The first 
assumes the usual practice of applying one 
s of selection throughout the area of a 
Local Authority. In this case, the Committee 
wisely suggest that some weight should be given 
to the opinions of the primary school teachers, 
but otherwise they rely wholly on Intelligence 
Tests and Objective Tests of Attainment. We 
do not share their faith in such tests, which 
favour the slick and superficial, offer little op- 
portunity for connected thought, and none for 
its coherent expression. It is admitted that spe- 
cial coaching does much to vitiate the results, 
and gravely distorts the work of the primary 
school. On the question of cramming, the Com- 
mittee show some inconsistency : they condemn 
with disgust all special training for Intelligence 
Tests, but leave to the conscience of the teacher 
the question of how much to train for Attain- 
ment Tests. We concede to Intelligence Tests 


proces 
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no sanctity that other tests do not share; and, 
whatever the moral truth of the matter, the 
practice of cramming is widespread and will 
continue so long as examinations encourage it. 
A much better plan, we think, would be to make 
every test a satisfactory educational experience, 
so that, at worst, pupils will get too much of 
good things, not of bad. The Committee's second 
scheme, called the “Quota Scheme," tries to 
minimise the evils of cramming. Briefly, primary 
schools will compete by an Intelligence Test for 
quotas of places, and individuals to fill a school's 
quota will be selected by adding standardised 
scores representing intelligence, primary school 
attainment (internally assessed), and teachers' 
assessment. Each educationist should read for 
himself the arguments of the Report. In our 
view, it is repetitive to give separate marks for 
School attainment and teacher's estimate; the 
Scheme relies too much both on Intelligence 
Tests and on the judgment of teachers ; and it is 
further invalidated by misuse of statistical 
method. Mathematical standardisation of marks 
is justifiable only if large numbers—say of 2,000 
or more—are considered: the absurdity of ap- 
plying it within a school is illustrated by the fact 
that a school with less than 20 pupils in the age- 
group each year would never be able to claim 
one of high ability. Our own opinion has been 
little changed by careful study of the Report. We 
consider the practices of the modern “examin- 
ing caste" to be in many ways unsound, and be- 
lieve that education would be best served by a 
substantial return to traditional methods. It is 
to be hoped that the Education Sub-Committee of 
our own Association will soon make public the 
fruits of its long study of the question. In the 
meantime, the N.U.T. Report should be care- 
fully and critically studied. 

Brit J Ed Psychol 19:144-5 Je '49. G. K. * 
a careful study both of the practical procedures 
actually in vogue and of the theoretical evidence 
submitted by educational and psychological ex- 
perts * How can the standards of the different 
teachers, each of whom knows only his own 
pupils, be fairly compared? To solve this prob- 
lem the Committee has put forward a "quota 
scheme." As a footnote indicates, a scheme on 
these lines was proposed by Professor Valentine 
many years ago in his pamphlet on Examina- 
tions and the Examinee (1938, see esp. p. 11). 
It was put into action, with Professor Valentine 
himself as examiner, by Mr. V. J. Moore, Di- 
rector of Education at Walsall (see his article 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 


1006 
in the present Journal). The procedure at Wa. 
sall is actually described in a note to Table ҮШ 
in an appendix to the Committee’s report, It is 
however, curious that no explicit reference i; 
made to this experiment to show that the scheme 
is really workable, and that Professor Valen. 
tine’s initiative in putting forward so ingenious 
a principle is not more warmly acknowledged, 
The essential feature of the scheme is that the 
primary school teacher would be required: to 
draw up “priority lists” of pupils who, in his 
opinion, are most suited to grammar, or techni- 
cal, education ; the actual number of places avail- 
able for pupils from this or that primary school 
would be determined by quotas based on intelli- 
gence scores, and the final selection of the pupils 
within a given primary school would be deter- 
mined by a blending of the order in intelligence 
and the order of merit drawn up by the head 
teacher. Not the least valuable part of the Report 
consists in the detailed appendices drawn up 
by the Secretary (Mr. W. Emrys Davies) and 
the Chairman (Dr. С. В. Jeffery) on the pro- 
cedures in use under various education author 
ties, on the development of policy as indicated in 
official publications, and on the standardization 
of marks, and on the statistical theory of the 
quota scheme. The appendix by Dr. C. M. Flem- 
ing on aptitude testing does not seem altogether 
in keeping with the recommendation of the re- 
port. Since the statements she has made here 
and elsewhere have already been widely quoted, 
especially by those who decry the value of tests 
and statistical methods, it is desirable to € 
phasize that its general trend does not eun 
with the opinions of most educational pem 
gists in this country at the present day. А. k 
opening years of the twentieth century, a ka 
us, psychologists were dominated by an 
tion of innate or hereditary abilities, pis j 
cepted with excessive enthusiasm RET 
"separate education for children of «gat 
levels of ability,” as determined by tests 0 p 
is commonly called intelligence"; later, n " 
sult of “the procedure known as factor "m Tit 
psychologists “talked much of types Quse Put, 
verbal, numerical, mechanical, or puru 
we gather, "the evidence of the ein older 
largely reversed the conclusions © t T: told 
investigators. “The human mind," we ere fo 
is “relatively structureless”; what je it 
merly supposed to be inborn and perma nmental 
ferences in mind are due chiefly to ел0? 
influences, and can, therefore, rea Пу 
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come by teaching and improved social environ- 
ment, “In the light of recent evidence,” she con- 
cludes, "there seems very little justification for 
an acceptance of earlier conceptions as to fixity 
of relative status and consequent assumptions 
as to prediction of later performance.” Who the 
"clinicians" may be and where they published 
the researches which led to such “drastic 
changes” in psychology is not quite clear, No 
doubt it is important that teachers should be 
warned not to regard test-results as infallible 
or test-methods as the sole method of investigat- 
ing personality. But this warning is not some- 
thing new, to which older educational psycholo- 
gists were blind. Nor have there been any such 
“drastic changes” in educational theory as the 
tone of Dr. Fleming’s memorandum might im- 
ply. Indeed, the psychologists who have most 
strongly criticized the allocation of eleven-year- 
olds on the basis of aptitude have been the very 
psychologists whom Dr. Fleming names as re- 
sponsible for the introduction of mental testing 
and factorial analysis. The report itself concludes 
with a strong plea for further research, which 
all educationists and psychologists will readily 
endorse. “Local education authorities,” it is 
urged, “should encourage their officers to un- 
dertake research into problems of allocation as 
they arise in their areas and to experiment with 
new methods under scientifically designed con- 
trols.” Considering the exceptional difficulty of 
the problems raised, the Committee are to be 
heartily congratulated on the attitude they have 
taken and the recommendations they have 
worked out. Even those who do not altogether 
agree with their conclusions, will acknowledge 
its unusual combination of practical common 
sense with an appreciation both of the humani- 
tarian and of the scientific issues involved. 
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goon, С. W. An Introduction to Educational 
tatistics, New York: Prentice-Hall, Inc., 1946. Pp. 
xiii, 269. "Text edition, $3.75; trade edition, $5.00. * 
(Answers to Exercises and Problems in An Introduc- 
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3112077) For reviews, see 241 in Statistical Method- 
ology Reviews, 1941-1950. 
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Am J Psychiatry 105:799-800 Ap ’49. Har- 
old M. Hildreth, * a valuable final AME out- 
lines coe for the guidance of other 
individuals or organizations faced with the prob- 
lem of selecting highly qualified personnel for 
specific tasks. * A basic weakness of the work 
reported, and so stated by the authors, arises 
from the fact that the assessment unit was set 
up to perform a service rather than primarily as 
a research organization. Adequate validation 
data are, therefore, lacking in many instances,* 
and the reader is asked to accept the qualitative 
judgment of the assessment staff and OSS ad- 
ministrative personnel that the utility of the 
program was worth the time and cost. It appears 
evident from the data presented, however, that 
assessment contributed to the efficiency of OSS 
operations by reducing the number of personnel 
failures, and that the methods developed have 
great promise for future use in situations where 
more of the factors of selection and validation 
can be controlled. 

Am J Psychol 62:140-3 Ja '49. T. A. Ryan. 
This detailed report....is cautious, frank, and re- 
markably free from attempts to persuade the 
reader upon the importance of its results, * The 
assumptions underlying both the specific pro- 
cedures and the general program of assessment 
were extremely general, * Nevertheless, certain 
assumptions are clearly implied by the methods 
described in this report; (1) that, in the ab- 
sence of opportunity to make preliminary valida- 
tion studies, a method which has “face validity 
for the assessor has some probability of success 
even without a formulated theoretical basis ; 
and (2) that the observations of psychologists 
and psychiatrists aré of more value than those 
of laymen, even when those psychologists and 
psychiatrists are unable to agree upon a theoreti- 
cal formulation and are using essentially com- 
mon-sense methods. The first assumption 18 
clearly that which underlies all common-sense 
and traditional procedures in the assessment of 
men. The Staff can scarcely be blamed for adopt- 
ing it in view of the practical situation in mo 
they found themselves, and the present state " 
our knowledge in this field. Perhaps we shou 


B301 


not blame them, either, for their second assump- 
tion. We wish, however, that more light could 
have been thrown upon the manner in which 
the psychological training of the assessors con- 
tributed to their skill in making these ratings. 
Аз stated above, the authors have been ad- 
mirably frank in their statements of the short- 
comings of the validation procedures, and in 
listing the errors which were made in assess- 
ment. Even the most perfect possible appraisals 
(as they term their criteria of validation) would 
not, however, bear upon the point now under 
consideration. A group of completely untrained 
laymen might produce just as satisfactory pre- 
dictions once the testing situations had been 
set up. Even the testing procedures themselves 
or others just as adequate might have been in- 
“vented by a group of laymen who set out to 
assess a group of men by putting them into as 
many different situations as possible. * we may 
ask whether there is anything here which would 
not be found if a group of intelligent engineers, 
business men, or physicists were to set up a simi- 
lar program and work at it with the same dili- 
gence and interest which characterized the OSS 
staff. Perhaps a similar interest could not be 
readily generated in а group of non-psycholo- 
gists, and this may be the main contribution of 
the psychologists to the program at the present 
time. The repórt closes with a recommendation 
that assessment programs similar to that used 
by OSS be set up for peace-time research and 
application. * The first research problem to be 
solved in such a project might well be the ques- 
tion whether the psychologists are contributing 
anything to the prediction which is based upon 
their professional background rather than upon 
common sense and interest alone. If they are, 
then what have they added over and above the 
common-sense approach? In what ways do their 
judgments differ from those of the layman who 
has the same information to work with? Do 
these facts suggest other revisions of common- 
sense approaches? The above questions refer to 
possible modes of judgment which the psycholo- 
gists as a groitp may have in common. It is also 
evident from the OSS report that there were 
many variations from psychologist to psychol- 
ogist. The proposed research program would 
also afford an opportunity for testing these 
more controversial theories. If the first result 
showed that what the psychologists have in 
common is only common sense and is not unique 
for psychologists, there still remains the vast 
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program of testing the various approaches to 
judgment, and also of testing the predictive 
value of the common-sense judgment itself, The 
writers of the report are to be commended for 
the excellent description of the operation of their 
assessment program. It is certainly to be hoped 
that their recommendations and suggestions for 
further applications of the method will be fol- 
lowed out that many of the gaps still inherent 
in the war-time program may be filled. 

B Ind Psychol & Personnel Prac 7:46-8 Mr 
°5т. D. Macleod. * The book has been written 
in the form of a detailed research report and is 
packed with examples of the principles of assess- 
ment used. For the technical reader the length 
seems far greater than is necessary, but the au- 
thors were no doubt anxious to record as much 
as possible of their techniques and experiences. 
For those interested in selection methods and 
techniques the chapters on Principles of Assess- 
ment and Conclusions and Recommendations 
are well worth reading. * 

Can J Psychol 4:92-3 Je ’50. William Line. 
* important volume * It is impossible to assess 
either the work as described, or the description 
of it. There could be no known outcome, from 
the very nature of the circumstances under 
which the project was conducted. About all that 
can be said is that a number of intelligent st 
entists came together and thought intelligent! 
about people who were potential candidates for 
responsible and difficult assignments. This n 
itself is significant. One can only hope that id 
larly careful consideration will spread to mi 
ters of peace-time leadership. Since the descrip 
tion is on the whole, non-technical, it cannot 2 
expected to reflect much іп the мау of e 
scientific insights. The recommendations d 
tion is somewhat disappointing, however, MC 
have a right to anticipate that so rich an d " 
gation-experience would lead to ш " 
judgment about selection generally. d ml 
dices add an air of statistical respectabin ү is 
little purpose appears to be served card 
Factor-analysis of gross data meets ү i 
reward. Psychology is still seeking 4 m ‘i 
way of handling the concept “persona” j^ p" 
reliance on the layman's opinion 


be the productive starting point mi 
cal analysis. Yet this is what the aut A | 
to cling to, in their final confession 07, p. Hor 

J Abn & Social Psychol 44:1302 a A иш 
old E. Burtt. This book should be exce 
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interesting to psychologists working in the field 
of personnel or in clinical psychology. * The 
description of the program is intriguing. * The 
general idea is not new, of course. Thorndike 
long ago suggested something of the sort in 
connection with measuring social intelligence. 
And there are sales conventions where they de- 
liberately rough up a cub salesman with a view 
to noting whether he can "take it." But the situ- 
ations used here seem very good for the purpose 
and of course had a military flavor. * The de- 
compression proposal caught the reviewer's at- 
tention and suggests an effort to control the 
amount of frustration involved or to standardize 
itin a way, so that it could be turned off when 
enough had been applied. T his is a rather tricky 
feature but one which is unavoidable. It is ob- 
vious that one difficulty in situational testing is 
keeping the situation as constant as possible from 
subject to subject. As a result of all this observa- 
tion the staff analyzed such variables as motiva- 
tion, energy, initiative, effective intelligence, 
emotional stability, social relations, leadership, 
and ability to observe and report. Many of these 
obviously are variables which cannot be evalu- 
ated by any objective test at present in existence. 
This does not mean that we can never devise 
such an objective test but at present the situa- 
tion test seems one of the best leads. The story 
regarding the validity of the procedures is not 
too cheerful. * The appendix gives a lot of inter- 
correlations between the variables. * It is 
scarcely in order to attempt to pick the work to 
pieces in a critical fashion, for the writers make 
no undue claims. They merely present the re- 
sults for what they are worth, and are well aware 
of shortcomings and urge more investigation. 
It would seem well to emphasize one of their 
points, namely, the importance henceforth of at- 
taching more significance to the criterion aspect. 
In the pressure of the war this was not handled 
very well. In any subsequent study it should be 
an initial consideration. The reviewer is also a 
bit worried about the problem of standardizing 
the situations when different staff members par- 
ticipate. Some of the dialogue quoted in the 
book is of a character which would not be apt 
to crop out every time a given situation was 
presented. He wonders, also, whether there 
might be some possibilities of doing more than 
using a rating scale after each situation, per- 
haps quantifying some behaviors—for example, 
counting the number of times a futile request 18 
repeated, The book is quite exhaustive and de- 
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tailed. The reader may not want to cover every 
word but he will find much of it, especially the 
situation tests, very interesting. It represents 
the first extensive systematic attempt to explore 
the possibilities of situation tests in assessing 
personality. The reviewer hopes that some of 
the staff get a chance to try out their proposed 
institute. 

J Appl Psychol 33:511-5 O '40. Donald E. 
Super. * This volume is as distinctive in literary 
style and as easy to read as any treatise by spe- 
cialists in fantasy should be; it is as full of in- 
sights into personality structure and measure- 
ment as one would hope from personologists and 
clinicians ; and it reveals the quick grasp of prac- 
tical matters which was demonstrated time and 
again during World War II by academic and 
laboratory psychologists who turned to the prac- 
tice of psychology. We might ask, What was the 
unique contribution of this group of World War 
II psychologists to personnel psychology? The 
orientation of these psychologists was unique 
for, unlike the other groups working on per- 
sonnel selection, known to this reviewer, this 
one included practically no personnel psycholo- 
gists: they were laboratory men or clinicians, 
their interests were in personality theory and in 
psychotherapy. They therefore approached a 
personnel assignment with a freshness and orig- 
inality, with a freedom from attachment to ex- 
isting methods, which resulted in unusual cre- 
ativity. Among the innovations are a willingness 
to use subjective methods as a supplement to 
objective tests, the development of projective 
and situation tests, and the routine use of the 
intensive case conference. At the same time, lack 
of familiarity with the principles and practice 
of personnel psychology resulted in some un- 
necessary mistakes and in some “findings” which 
have long been familiar to workers in the field. To 
cite just one of several possible examples, despite 
an excellent analysis of the complexity and un- 
reliability of their own criteria, they state (on 
page 397) that “If the job is running à lathe in 
a large factory the rate of piece-work production 
is a satisfactory criterion...” whereas various 
studies have shown that even as “objective” a 
criterion as output may be an unreliable and 
invalid index of a worker's success. The organts- 
mic as opposed to the elementalistic approach to 
measurement is described and valent ad 
tail, for perhaps the first time in a practical situa- 
tion. n of the techniques used by the OSS 
Assessment Staff were not new, and were not 
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proclaimed as new, but they were thoroughly 
tried, revised in the light of experience, written 
up in detail, and statistically analyzed. The psy- 
chologist or personnel worker interested in the 
testing possibilities of social and practical situa- 
tions, or in the use of the interview and of the 
case conference in evaluating leadership ability 
or ability to adjust to trying situations, will find 
many worthwhile suggestions in this account 
of the trial of these methods. This source is 
unusual among clinical studies, because the As- 
sessment Staff was self-critical, checked for per- 
sonal and systematic bias, and made all possible 
statistical analyses of their data as their work 
progressed. The organismic method attempts to 
predict future behavior by inductive thinking 
from a set of observed facts to a conception (the 
hypothetical formulation of a personality), and 
then inductively to predict behavior in an antici- 
pated situation, whereas the elementalistic ap- 
proach attempts to predict future facts directly 
from observed facts. It is especially interesting 
to compare the conclusions concerning the rela- 
tive validity of the two methods reached by the 
authors with those of some workers in one of 
the other major personnel selection programs of 
the Armed Forces. The OSS psychologists state 
on page 227: “It is the contention of S (one 
OSS unit) that more can be learned of people 
by being with them as ‘good Joes’ than by testing 
them as professors.” (Perhaps the reviewer will 
be pardoned for stating that this seems to him 
to be the discovery of someone who has pre- 
viously done his testing as a professor, a labora- 
tory man or clinician detached from the situa- 
tion into which he injects his tests, rather than 
the wisdom of a person who has worked in a 
practical situation, who has used tests as only 
one method of gathering data, and who has had 
to be a “good Joe” all along in order to do any 
kind of testing!) In contrast with this state- 
ment concerning the superiority of observational 
and projective techniques, one should consider 
the conclusions drawn from military testing ex- 
perience by factor analysts such as Guilford in 
a recent article in the Psychological Review, pro- 
posing a selection procedure which is even more 
elementalistic and quantitative in its details than 
the traditional methods decried by the organi- 
cists of the OSS. * The OSS authors state that 
no adequate comparisons have been made be- 
tween organismic and elementalistic approaches, 
but a few inadequate comparisons could be 
made. The AAF Aviation Psychology Program 
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attempted to validate a number of clinical or 


- organismic tests against success їп flying train- 


ing. Although time did not permit assessment 
by case conference procedures using all avail. 
able information, single organismic techniques 
(e.g., interviews and Rorschach interpretations) 
were validated and proved to have no predic. 
tive value, while a number of single objective 
tests had validities in the .30s and .40s. In an- 
other study a psychiatrist and a psychologist 
interviewed cadets with borderline test scores 
and found that their clinical or organismic eval- 
uations were no better than chance in predict- 
ing flying success in this group. * the average 
validity for the two principal assessment sta- 
tions, for a sample of 171 men followed up and 
rated for job performance overseas by the Over- 
seas Assessment Staff, was .45. There were no 
control groups to permit a coinparison of the 
effectiveness of these with other methods, but 
taken by itself this is not a negligible validity 
coefficient. * the final chapter of the OSS book 
reads almost like a request for a research grant 
ж It is to be hoped that....organismic psycholo- 
gists, both theoretical and applied, will have a 
real opportunity to develop and exploit the pos- 
sibilities of their approach. * This is a valuable 
book, worthy of careful study by students of 
personality and by students of man at work, 

J General Psychol 41:133-40 Jl "4p. H. Ј 
Eysenck. * [the] lengthy and detailed discus 
sion of the actual tests, ratings, interview pro 
cedures, and so forth,....is so vivid and well done 
that to one reader, at least, it has given the al- 
most hallucinatory experience of having m 
through it all—an experience probably айе 
his having watched the rather similar Britis 
procedures at close range. These 300 pages d 
descriptive writing are of a very high order p 
the pictures included invariably help the rea е 
to gain an accurate picture of precisely v 
went on. It is, perhaps, a pity that no deta { 
account of their experiences is given by gm 
the assessees. It is just barely possible rence 
description might have been less ent n te 
than that of the assessors. * By and wo a 
average forecasting accuracy of the О i 
validated against the four methods of ай 
described, appears to have been in the pes d 
hood of .25. It is odd that on the W seit 
one-day rating procedure at Station pe three 
to have given higher predictions than t апай! 
day rating methods at Station S. No exP 
is forthcoming. * In addition to the 
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praisal, certain more detailed figures are gi 
for more “atomistic” qualities. Taking” 
median of correlations for the three meth 


appraisals for which these figures were calc к 


lated we find the following correlations betwee! 

assessment and appraisal ratings: motivation 
= 12; effective intelligence = .32; emotional 
stability = .08; social relations = :06; leader- 
ір = .16. These values are disappointingly 
low. In -addition to these validity results 
there is an appendix giving reliabilities and 
intercorrelations for the various factors rated, 
as well as a factorial analysis. These fig- 
ures are difficult to evaluate because of the “ог- 
ganismic" attitude of the authors, which makes 
it impossible for any one rating not to be in- 
fluenced by ratings on all the others. Any evalu- 
ation of this book is extremely difficult. It 
reports a study and a method of approach of ob- 
vious importance. The results are intriguing but 
impossible to validate on апу reasonable basis. 
The reasons given by the authors for the lack of 
high positive correlations between assessment 
and final criteria are convincing to the reviewer ; 
he does not believe that because these correla- 
tions are low the procedures are necessarily in- 
valid, but neither should it be assumed, of course, 
that because the reasons for a low correlation are 
acceptable on a common sense sort of basis, 
therefore the procedures have proved their value. 
I think the verdict must be rather like the Scot- 
tish one of "not proven." The reviewer feels that 
more skilful statistical treatment might have 
salvaged something. He also feels it is possible 
that slight changes in the original set-up might 
have provided research data of much greater 
interest, but it is easy to be wise after the event 
and conditions may genuinely not have been 
Such as to make this possible. What makes the 
reviewer cautious in accepting the procedures 
With as much enthusiasm as the writers of this 
book seem to do is his familiarity with follow-up 
studies conducted in Britain at War Office Selec- 
tion Boards which are very similar in many 
Ways to the OSS Assessment Stations. These 
showed that in spite of the high hopes and great 
claims which accompanied the setting up of these 
Stations, the reliability of their estimates was 
disappointing and their validity very much lower 
than one would have hoped. * We must now 
discuss, at least briefly, a section of the book 
Which is clearly of great importance to the men 
Who wrote it, namely, the theoretical, "organis- 
mic" framework within which all this work has 


. been set. Considering the fame of the writers 


(mainly, D. W. Fiske, Eugenia Hanfmann, D. 


№. MacKinnon, J. б. Miller, and Н. A, Mur- 


ray), it is astonishing to find that this part of 
the book is probably the weakest and the one 
most open to serious criticism. To the present 
reviewer, this part of the book seems to be 
almost entirely given over to a Tweedledum- 
Tweedledee sort of battle between what the au- 
MM pe ааз and the “elementalis- 

c” points of view. It is full of cliches which have 
characterized the writings of Gestaltists for the 
last 40 years (“the whole is greater than the 
sum of its parts”; or “the whole and its parts 
are mutually dependent”) and which to some 
people at least must still be almost completely 
meaningless. The reviewer has often been 
tempted to write a book on the proposition that 
the whole is less than the sum of its parts on 
the basis of the well-known experiment that 
oysters are good and jam is good, but oysters 
with jam are pretty awful. The writers of this 
book follow the fashion set by the early Ge- 
staltist in another way also. They show a com- 
plete incapacity of understanding and dealing 
with the actual views expressed by their oppo- 
nents. In every way these are twisted and stated 
in such a form as to be representative of no 
living “elementalistic” or “atomistic” writer. To 
the present reviewer, this seems a serious 
charge; psychology. is probably not the only 
science in which Aunt Sallys are set up for the 
special purpose of being knocked down again 
with a great show of rejoicing, but there can 
be few other disciplines in which this practice 
is so widespread. It is very confusing to the 
student and probably, in the long run, also to 
those who are responsible for this practice. Let 
us take up these points in some detail. What is 
one to make of the following statement? 

The elementalistic method is abstract and unrealistic, 


i attempt is made to reproduce the conditions un- 
oe which елап will eventually perform * In adopt- 


inj this method the sychologis akes à adi sub- 
g — p t mak 1 r ie 

he very start by electing to a stract 
judgment at tl sí y 


arately, an 


is even though he kn 
бое y ren add sequences of elementary processes to 
produce results, but organtzes them into effective forms. 


ў ЖАШ А 

11 the questionable points 1n this statement, 
s p just rm a few. Firstly, there is the ques- 
tion of adding. Now surely the elementalist does 


nothing of the kind. If we take, as an example 


of the elementalistic approach, the fifth report 
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of the Army Air Forces Aviation Psychology 
Programme, dealing with the prediction of fly- 
ing efficiency, we find a great deal of effort in- 
volved in finding out something about the or- 
ganization of abilities which are thought to 
correlate with the criteria. There are innumer- 
able correlations and factorial analyses reported 
in those pages, and what is the coefficient of 
correlation or the scatter diagram on which it 
is based but a measure of the relation obtain- 
ing between two mental traits—in other words, 
an attempt to investigate their organization? 
Again, scores are hardly ever simply added. 
They undergo a suitable process of weighting 
according to the usual multiple regression for- 
mulae which indicates the respective importance 
of each trait to the final criterion. And lastly, if 
the elementalist does choose to add, he does so 
merely as a first approximation. Other func- 
tions, such as multiplication, etc., are possible 
and are often used. There is no mathematical 
difficulty in finding a function which will repre- 
sent reality, however complex reality may be, 
and the principle of least squares ensures that 
this function shall be the best possible combina- 
tion of elementary traits. How extremely ef- 
fective this "elementalistic" method can be is 
clearly shown in the above mentioned report, 
which shows predictive accuracies much higher 
than one would have thought possible at the 
beginning of the war. It seems to the reviewer 
that "organismicists" would do well to hold their 
fire until they have produced predictive accura- 
cies of a similar order. The criticism that the 
*elementalistic" method is abstract and unrealis- 
tic seems to the reviewer simply the statement 
that the “elementalistic” method is scientific, be- 
cause, surely, the most obvious features of sci- 
entific method are its abstract nature (an elec- 
tron is an extremely abstract kind of concept) 
and that it is unrealistic (an element is prac- 
tically never found in natural conditions and 
again, metals, chemicals, and so forth, are nearly 
always tested under conditions other than those 
to which they will be exposed in due course. 
Why psychology should not follow the usual 
scientific method then is by no means clear 
from this.) All through the argument presented 
by the authors runs rather an odd view of “re- 
ality.” Thus, they maintain that a scorea person 
obtains on a test is not a representation of re- 
ality, or again, "Surely the essential criterion of 
a good test is its congruence with reality" and 
they go on to say that it seems a little peculiar 
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that psychologists should have devoted so much 
time to improving the reliability of their test 
and so little time to improving their validity. | 
is difficult for the reviewer to understand t 
point of this argument, seeing that it is лу 
agreement among all psychologists that relia 
bility is only a means to an end and that validity 
is the ultimate aim. A criticism of this kind seems 
to come particularly badly from a group of peo- 
ple who give little evidence of the validity of 
their procedures, while the despised “elemental- 
ist” (as, for instance, J. P. Guilford in the Army 
Air Forces Report already mentioned) gives 
indices of validity on almost every page. In fact, 
surely this argument moves in “cloud cuckoo- 
land” and bears no relation to reality. Quite 
often the authors’ data contradict their conclu- 
sions. On page 40 they say that “it must be clear 
that an elementalistic intelligence test could not 
possibly be an adequate measure of effective in- 
tellection." However, it is clear from data given 
on page 432 that a combination of two of these 
despised “elementalistic” tests (matrix and vo 
cabulary) are no whit inferior when compared 
with the validation criterion than are any of the 
“organismic” tests used by the authors. (It is 
assumed that the correlation between the matrix 
test and the vocabulary test for such samples 
as were tested at OSS Camps would be similar 
to correlations between these tests found in 
British War Office Selection Boards; the writ 
ers themselves do not give the multiple R from 
these two tests.) It seems to the reviewer, there 
fore, that the “organismic” theories and pri- 
ciples discussed at such length are really quite 
immaterial to the main part of the book a М 
nothing but confuse the issue. This нк. 
strengthened considerably by noting what tht 
writers seem to consider their main theoreti 1 
finding, namely, that the "conception of the m 
sonality as a whole' points to a gon a 
force or conation, as the chief unifying ipe 
tegrating factor in personality." This pel М 
identical with that expressed by the reve 


с : ; А esi! 
his book, Dimensions of Personality as ® n 4s 
alistic” methot 

there 


these procedures are suppose 
mentally. The authors of the boo 
as well as other Gestaltist writer 
before them, have failed to bring 


s who 
forward 9! 
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objective evidence whatsoever, but there are ef- 
fective differences in validity between “organis- 
mic” procedures, so called, and “elementalistic” 
procedures, 50 called; this being so, then surely 
it is premature to argue so vehemently and at 
such length in favour of these allegedly “or- 
ganismic" principles and their superiority, Per- 
haps the authors will, in time, give us scien- 
tifically acceptable proof of this superiority; 
until then, we cannot regard their argument as 
anything but an expression of faith. 

J General Psychol 41: 309-14 O "49. Cecil A. 
Gibb. * One of the great achievements of the 
O.S.S. Staff reported in this volume is the suc- 
cess (however limited) they achieved in as- 
sessing personalities which had been moulded in 
such widely different foreign cultures as the 
Burmese, Thai, Javanese, Chinese, and Korean. 
Such facts as this suggest that the group method 
of assessment represented by the O.S.S. pro- 
cedures has characteristics which are well 
worthy of the attention of those psychologists 
who recognize, as part of their professional task, 
the assessment of personality in real life situa- 
tions. * very readable * The procedures, which 
are here described in greater detail than in any 
previous publication, derive largely ігот the 
work of the German psychologist, Simoneit. It 
is almost certain that they owe rather more to 
the British adaptations of that work than this 
report admits. But there are, no doubt, other 
researches too upon which these procedures de- 
pend. In many respects it is a pity that the au- 
thors judged it unprofitable to review the litera- 
ture and the research which contributed to the 
construction of the program. Had rather less 
space been devoted to simple accounts of simple 
situations as they varied slightly from one sta- 
tion to another and had that space been given 
to even a brief review of the work which con- 
stitutes the scientific basis of this technique a 
greater service would have been rendered. More 
work and more time would, of course, have 
been involved. The detailed descriptions of the 
test-situations employed—the construction test, 
the murder mystery and the like—are very m- 
teresting and they do provide a prototype for 
future situation-tests. However, relatively little 
information is available to enable one to appraise 
each situation in its relation to others and most 
of the situations being pseudo-real in the 95:5: 
setting are quite inappropriate for other pro- 
grams. It would seem that a conscientious at- 
tempt to get at the psychological elements of 
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those situations would have been of greater in- 
terest to psychologists and probably also to 
those intelligent laymen who are expected to 
read the report, and certainly to all those thou- 
sands of candidates who will constitute the most 
certain element of the reading population. One 
of the major criticisms to be made of this report 
is that it holds too rigidly to a policy of factual 
reporting. With this criticism many will not 
agree, especially since purely factual reporting 
has been a relatively rare treat in psychological 
literature. The reviewer, however, believes that, 
considering the circumstances, there has been 
too little theorizing and too little generalization 
here. It may be a long time before a similarly 
strong group of psychologists, psychiatrists, and 
psychoanalysts shares similarly significant ex- 
periences and works so closely together in the 
detailed assessment of men. These experiences 
must have provided a common basis for a good 
deal of high-powered theorizing. Many hy- 
potheses well worthy of communication and 
further research must have been formulated at 
the several assessment stations. Psychologists 
would like to know of these hypotheses, of the 
inspired guesses of the men who worked in this 
very significant program. The facts reported, the 
correlations given in the appendices are, in the 
circumstances, not enough. * in keeping with 
the organismic approach a personality sketch is 
to be prepared and presented rather than a mere 
set of scores or personal profile. The difficulty 
one sees here is that such sketches quickly be- 
come stereotyped and as easily apply to a dozen 
other candidates as to the one the writer had in 
mind. Such was certainly true of similar sketches 
used by the Australian Officer Selection Board 
and the likeness of those to the illustrative 
sketches of pages 363-4 suggests that it could 
not have been very different with O.S.S. Few, 
if any, psychologists will quarrel with these basic 
principles of assessment but many will be less 
satisfied with the qualitative, and often sub- 
jective, personality sketches than the writers of 
this report appear to have been. Personality is a 
dynamic whole. It is true that it cannot be ade- 
quately described in terms of a profile of quanti- 
tative trait scores, however carefully these traits 
may have been isolated and defined. But it is 
also true that a subjective compounding of traits 
may be quite misleading. What is needed | a 
great deal of research into the established rela- 
tions of personality traits with one another and 
with characteristics of the situations in which 
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they are displayed. The group methods, the situ- 
ational tests, and the many other techniques 
which this volume describes are necessary but 
advance is more likely to come through a quanti- 
tative approach to these procedures than through 
a subjective organicism. * The problem of evalu- 
ation is of signal importance. The difficulties 
faced, in this regard, by O.S.S. are by no means 
peculiar to it. They are difficulties frequently 
met in officer selection, in criminal psychology, 
in educational psychology, in personnel selec- 
tion, in short wherever personality assessments 
are made and used as a basis for predicting fu- 
ture behavior. The most outstanding of such 
difficulties are—(a) The frequency of unpre- 
dictable changes in the jobs and in the environ- 
ments to which men are assigned. As is here 
pointed out this is frequently enough to invali- 
date the whole evaluation procedure. (b) The 
considerable differences between the frames of 
reference of the original assessors and of those 
whose assessments become criteria of success. 
(c) Mechanical difficulties, largely but not en- 
tirely, peculiar to the war situation make it im- 
possible to have appraisals of all men. As a re- 
sult of these influences the evaluation statistics 
are relatively unsatisfactory. The intercorrela- 
tions among the several appraisal techniques are 
low. The correlations between assessments and 
appraisal are likewise low, but there is no evi- 
dence to suggest the extent to which this is a 
function of makeshift appraisal or of inadequate 
assessment. The simple fact is that, like officer 
selection in the British forces, the O.S.S. assess- 
ment program was affected by the urgency of 
war. Time did not permit adequate planning for 
follow-up and when this could be done, make- 
shifts alone were available. Thus it is plain that 
the research psychologist who judges this emer- 
gency experiment worthwhile must, at his lei- 
sure, set it up again, in part at least so that ques- 
tions now unanswered can be tackled with the 
advantage of proper planning and of adequate 
controls. The one insoluble problem seems to 
be— How shall we ever again secure comparable 
motivation of our subjects—and of our staffs? 

Mgmt R 38:59-61 Ja’49. Walter R. Mahler. 
* The authors wisely devote Chapter II to the 
Principles of Assessment as they applied them. 
The busy executive who has but little time should 
concentrate his attention on this chapter. * The 
reader is taken through the entire process in 


processed, except that the former has the ad- 
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vantage of an explanation for each of the some. 
times strange and stressful situations, Pictures 
reproductions of some of the tests and forms 
used, and numerous anecdotal accounts provide 
an interesting explanation of the complete 
process. * Chapter IX provides the authors’ 
evaluation of the assessment program, They 
readily admit that the final verdict is a question 
mark because of the numerous complexities in- 
volved. The performance rating process was es- 
tablished belatedly and with much less regard 
to scientific principles, which accounted for a 
large though undetermined proportion of the 
discrepancies. This chapter is the second most 
important one to be read by busy executives. * 
A personnel director recently asked this reviewer 
whether or not this book would be of value in 
practical operating situations. The answer went 
somewhat as follows: “Yes, the book does have 
something. How much it has to offer, no one 
knows, not even the authors. This book's im- 
portance does not lie in what it has proved but 
in the promising approach it suggests to a ma- 
jor problem of business and industry. This ma- 
jor problem is the adequate selection, develop- 
ment, and evaluation of key personnel, especially 
executive personnel. Few companies can point 
with pride to anything like an adequate approach 
to this major problem. Many are making valiant 
efforts. Those companies that are serious about 
their executive problem can profit greatly from 
the discussion of this unique method of assessing: 
While the authors do not point it out, it will be 
obvious to many personnel directors that a mod- 
ification of the processes described can be uti- 
lized for selecting executives for promotion as 
well as for initial selection.” * an approach utiliz- 
ing the services of highly trained clinicians, à 
variety of tests including situational tests, inter- 
views, and observations to formulate à pe 
of the complex personality of an executive Ш 
action and providing а word-picture of it, "I 
tainly deserves investigation by organization 
that earnestly desire to improve the caliber j 
their executives. As in all applications of y^ 
chological principles to a dynamic i assit 
most critical point is the selection of oy 1 
personnel to guide the process. Already, Е 
apparent that certain inadequately ша m 
sons are jumping on the bandwagon: Such one 
dividuals usually restrict themselves ї0 
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complex that a variety of methods is essential 
to secure adequate information upon which to 
base sound predictions. Oversimplification of 
the assessment process and overselling of its 
accuracy of prediction are two major dangers; 
disregard of these dangers will bring the entire 
process into disrepute before it has been given 
a chance to profit from adequate development 
and evaluation under sound professional per- 
sonnel. * The organismic approach to the ap- 
praisal of the whole personality in action is 
admittedly an infant, but future experimentation 
will eventually assure a mature technique, capa- 
ble of fulfilling (in part, at least) the expanding 
demands of business and industry for a more 
adequate means of understanding and predicting 
behavior on the executive level. 

Occupations 27:282-4 Ja*49. Wilse B. Webb. 
* should evoke wide-spread interest in the field 
of psychology if only because of the eminence of 
the authors. The chief contributors to this book 
are listed as H. A. Murray, James G. Miller, 
Donald MacKinnon, Eugenia Hanfmann, and 
Donald Fiske. Co-workers who took part in the 
assessment program and contributed to the re- 
search results of the volume include such names 


as D. K. Adams, R. Chin, D. Kresh, O. Н. 


Mower, P. Symonds, E. C. Tolman, C. C. 
Tryon, and many others. It is for a number of 
additional reasons, however, that the book will 
generate considerable interest and controversy. 
The task—the selection of the personnel for the 
OSS—is exciting in itself. The comparability of 
this problem and the personnel selection prob- 
lems of the Air Force and Navy programs is 
immediately apparent. The sharp and rather 
pointed critiques of the classical approaches to 
selection at an objective level, and the develop- 
ment and use of more “holistic” and subjective 
techniques in selection, causes this book to stand 
out among the more recent reports of selection. 
Finally, the results and recommendations evolv- 
ing from this program, and the ominous possi- 
bility of the necessity of reinstituting such pro- 
grams, should command the attention of persons 
alert to such problems. * The chapter giving an 
“analysis of variables” provides some of the 
more provocative data of the book. * Of par- 
ticular interest to the general psychologist is 
the stimulating re-evaluation of the concept of 
“intelligence” to form a definition of “effective 
intelligence.” The concepts of “emotional sta- 


bility” and “social relations” seem particularly уа d улен 
Well analyzed in the light of the approach em- - in group situations, and derives directly from 
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ployed and the task of evaluation required. The 
operation of other evaluation stations are de- 
scribed in some detail. Included are a one-day 
testing program in Washington and an overseas 
evaluation program used in China, India, and 
Burma. The latter report seems somewhat 
marred by rather dogmatic references to racial 
differences (pp. 351-354, 368, etc.) which “hin- 
dered” the efficiency of assessment. * Four dif- 
ferent ratings of success were found available on 
a satisfyingly large number of the assessees. 
When these criteria of success or failure were 
correlated with the over-all predictive scores of 
success or failure, the assessment program 
yielded validation coefficients ranging from 0.08 
to 0.53 (the coefficient being maximized to ac- 
count for restricted samples). Although the lat- 
ter of these scores seems respectably high in 
view of the obviously unstable criterion, assign- 
ment discrepancies, and the usual difficulties 
inherent in personality evaluation, the authors 
devote a large portion of the validation chapter 
to what seems a somewhat subjective condemna- 
tion of the criterion scores and equivocation of 
the obtained scores. Here, as elsewhere, one be- 
comes somewhat disconcerted by the consistent 
tendency to "damn the figures, let's stick to 
theory." Too frequently it seems possible to join 
the authors and to be "left speculating in the 
valley of uncertainty." The concluding chapter 
lists the reasons for failure in the program in the 
categories of “unavoidable,” “unavoidable in the 
specific OSS program,” and “avoidable.” Such 
a listing could well serve as a constant reminder 
to the director of any program in assessment. * 
The present book is interesting and frequently 
provocative reading. In regard to its contribu- 
tion to the problem of assessment of men, how- 
ever, it is felt that one must agree with the au- 
thors, who say: "Thus there is no tangible proof 
that the OSS assessment staffs produced effects 
which more than balanced the expenditure of 
time and money.” WT 

Psychol B 45:460-1 S *48. William A. Hunt. 
* This book sets forth clearly and in great detail 
the selection procedures which.... [the OSS 
assessment] staff utilized. It is well. written, 
effectively illustrated, and presented in pleas- 
ing format. The program set forth is “organis- 
mic” rather than “elementalistic,” designed to 
reveal the dynamic organization of the total 
personality. As such it relies strongly on obser- 
‘vation of the individual performing under stress 
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the methods used by Simoneit in Germany and 
the War Office Selection Board program for 
officer candidates in the British Army. The book 
not only describes these procedures but attempts 
a justification of their use. Its description is 
eminently successful, its justification is less so. 
In reading the book one must carefully set aside 
the question of whether these assessment meas- 
ures sound interesting, stimulating, and revela- 
tory of personality, and ask instead whether 
their successful application to a special selection 
task has been objectively demonstrated by the 
data set forth. There is no final answer in this 
book. The assessment program was instituted 
without provision for any adequate criteria 
against which its operating efficiency could be 
evaluated. Such criterion data as were gathered 
later were fragmentary, impure, and of little 
scientific value. The force of this criticism is 
somewhat disarmed, however, by the complete 
honesty and candor with which the authors rec- 
ognize and stress this weakness. For this reason, 
the critical reader will find so many points that 
he might wish to attack that a careful discussion 
of them all becomes impossible within the con- 
fines of a book review. One problem may serve 
as an illustration. In justifying their assessment 
techniques the authors compare the rate of neu- 
ropsychiatric breakdowns among unassessed as 
against assessed personnel. They state that the 
precise total of OSS personnel was never de- 
termined, and use as a base figure a number 
“which is probably correct within то per cent.” 
No mention-is made of the facilities for recog- 
nizing neuropsychiatric conditions or of the 
availability of channels for their disposal, al- 
though both these factors influence the validity 
of any recorded figures for incidence. More- 
over, one assumes that the assessed group is 
temporally a later group, and we know that 
length of service is a factor in producing mal- 
adjustment. Finally, we must assume that since 
the initiation of a screening program came after 
the recognition of a personnel problem, general 
recruiting procedures also were improved and 
the group available for assessment must have 
been more highly selected than the earlier re- 
cruits. Two general criticisms are valid and may 
be raised in the form of questions: (1) Was a 
determined effort made and genuinely fought 
through administrative channels to provide for 
evaluative criteria at the initiation of the pro- 
gram? (2) In view of the great reliance upon 
the personal evaluative judgments of the indi- 
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vidual staff members, what criteria were set u 
for participation in the program, and what E 
ning checks, if any, were made upon the efr. 
ciency of staff personnel? The total impression 
of the OSS assessment program that one gets 
from this book is that it was instituted under 
an arbitrarily selected philosophy of assessment, 
conducted by individuals enthusiastically ies 
mitted to this point of view, and sustained by 
faith rather than concrete evaluative data. This 
is not to say that it was not a good program, nor 
that a better one could have been developed. It 
is merely to wish that belief were reinforced by 
scientific data. The reader probably will agree 
when the authors say “....something definitely 
useful has come out of these expert labors, But 
it is not possible to say that they have added 
anything to our knowledge of the components 
and determinants of personality." He would be 
happier, however, had he been presented with 
more adequate data upon which to base his 
judgment. 

Psychosom Med 11:63-4 Ja-F ’49. F. L 
Wells. * An interesting comparison is afforded 
between this volume, and the similarly based Ez- 
ploration in Personality, preceding it by а decade. 


_It represents a distinctly fuller maturation of the 


organismic viewpoint. Little use is overtly made 
of the trait-schemata which were so prominent 
a feature of the earlier volume; a valid reason 
therefor being that these are in essence ele- 
mentalistic,” and lose much of their meanmgs 
when they enter into configuration with eac 

other. Just as the interview (upon үр; 
anamnestic foundations) proved the most Ko 
fective single instrument of assessment, 50, á 
most effective account of that assessment 15 in 
terms of an organic whole, not in terms of s 
series of discreet scores or ratings. The infil 
on American psychology of the work neh j 
ume represents should be great and enduri i 
the direct influence of the volume itself may i 
be commensurate. Aiming to give а eona 
of the entire project, as carried out 1n pase. | 

fering environments, its interest wie 
according to the interests of the reader. 

the most culturally valuable are cer 


tain passages 
in clarification of the clinical aspect ‹ 


of psycho” 
nismic’ 50 largely 


hardly © 


> E im 
clei as follows: Page 40: stress on ШО 


i d j 
“the psychologist as observer am ical in- 
rather than the improvement of mechan! 
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struments and tests materials" ; p. 41: “the sub- 
jectivity of the elementalist comes in at the be- 
ginning when he plans his procedure, and at 
the very end when, despite protestations to the 
contrary, he is inclined to rely on his test as a 
valid measure of over-all ability”; pp. 48—53: 
the position of psychology as in intellectual dis- 
cipline (no better pages in the volume) ; pp. 70, 
114: place of the interview, as above; pp. 264, 
273: the concept of “effective intelligence”; p. 
447: the differentiation between “mastery of 
social skills and the possession of genuinely posi- 
tive attitudes towards others” (cf. the pertinent 
case material); p. 468: “the frequency of such 
traumata and such complexes in the past his- 
tories of very effective personalities”; p. 475: 
“perhaps the most disqualifying attributes for 
a member of an assessment staff are rigidity of 
mind and rigidity of feeling.” Among matters of 
more specifically psychometric interest may be 
mentioned: p. 40: the comparatively low ceil- 
ing of the "classic" intelligence test; P. 276: the 
correlations cited for “Energy and Initiative”; 
p. 279: the breakdown of intelligence categories; 
b. 306: the correlations of “Leadership”; $. 
432: the singularly low rating among the rele- 
vant variables of “Mechanical Comprehension” 
(by test). There is other technical material of 
this sort, especially in the appendix; although 
the book stands as an effective apologia for clini- 
cal principles, quantitative refinements are not 
unrecognized. The volume is self-critical to an 
unusual degree, and recognizes as an important 
(and inherent) limitation of the work, a lack of 
validating data; which may arise from a) ab- 
sence of follow-up information, b) assignment 
different from that contemplated in assessment, 
ог c) inadequate controls. Yet one cannot es- 
cape the feeling that those who passed through 
the experience of working on these assessment 
teams must have acquired (William James) “the 
power of judging in all that class of matter” to 
a degree that few might hope to reach without 
like experience, One is led also to speculate on 
the values of some of the assessment technics as 
educative procedures. To some extent at least, 
the qualities one is attempting to assess are such 
as one would like to develop ; and the organis- 
mic outlook in education is not less relevant 
(needed?) than in assessment. While this could 
be of little immediate concern to the project, 


it is nevertheless broadly true that assessment 


Should concern itself with what one шау be- 


come, as well as what one is; and a volume with 
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the perspective of this one, might well have given 
in Chapter X somewhat more attention to these 
implications of psychogenesis. An Osler, a 
Welch, or a Cushing, any Olympian with a med- 
ical perspective on the history of knowledge, 
could well look down on today's yeasty workings 
of American psychology with mild amusement, 
Medical science and clinical medicine have long 
been at virtual peace with each other; it is not 
even clear that there was a time when, as such, 
they were in such overt opposition as may be 
seen in contemporary psychologic criticism. For 
reasons psychodynamically not far beneath the 
surface, the "scientific" wing has been the more 
productive in this respect. The present volume 
answers them as well as they need answering, if 
less violently than Havelock Ellis did Karl Pear- 
son. It is reasonable to expect that a maturing 
discipline will achieve an insight into these dif- 
ferential values comparable to that which today 
characterizes the older field of medicine. 
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*Perers, Атлзох S. How to Pass College Entrance 
Tests. New York: Arco Publishing Co., Inc., 1950. 
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Chicago Sch J 32:172 Mr-Ap ' 51. M (ax) D. 
E(ngelhart). * provides the prospective college 
student with information pertaining to just what 
tests are administered for admission or place- 
ment purposes by all the colleges and universi- 
ties in the United States. Numerous sample 
exercises are given from the American Council 
on Education Psychological Examination, the 
College Entrance Examination Board Aptitude 
and Achievement tests, and from other widely 
used achievement tests. Sample exercises are 
also given from spatial relations and pre-engi- 
neering tests and information is included with 
respect to personality and interest inventories. 
The book concludes with suggestions on how to 
prepare for a psychological examination and 
with suggestions and practice exercises in read- 
ing comprehension, vocabulary, verbal analo- 
gies, mathematics, number series, and figure 
analogies. The best preparation for the College 
Board and other examinations given by col- 
leges and universities for admission or for place- 
ment is preparation inherent in effective ele- 
mentary and high school instruction rather than 
in intensive study of sample exercises. Some ex- 
perience with the latter, however, may be worth- 
while in giving the student confidence. In the 
opinion of this reviewer the book can best be 
used by students under the supervision of a 
high school guidance counselor. Furthermore, 
counselors will find it a valuable though not 
always accurate source of information in advis- 
ing prospective college entrants. 

Clearing House 25:379 Е ’51. Clifford W. 
Hall. * a useful book for any guidance officer in 
high schools, as it brings together much ma- 
terial on college-entrance examinations—mate- 
rial available in nearly every school but scattered 
in different files. The book is so arranged that 
it should save hours of time for any counselor. 
The title, however, is misleading. While the 
book presents samples of college-entrance exami- 
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nations, it does not give any valuable Suggestions 
for passing them. It is, however, an excellent 
compendium bearing on the subject of the па. 
ture of the examinations. Probably the most 
helpful section of the book is that which sets 
forth the tests which each college gives, This is 
material which has hitherto not been available 
to most guidance directors. 

J Bus Ed 26:182 D '50. * useful not only to 
high school seniors but also to anyone who must 
pass a rather comprehensive type of examina- 
tion such as the preliminary examination for 
the doctorate. 
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perimental psychology for neglecting SHE i 
as a technique of personality study. md 
hemence of the attack tended to put the "s jn 
on the defensive before the first сае " 
book had been read. The author, an sie B 
cusses graphology both as science |n osis de 
Graphological interpretation and diagno sci 
pend upon signs and their теапшёб үү ath 
ence; judging trait interrelations pS ny i 
His frankness in postulating this dic i lent 
commendable, but until the art" pes a sc 
clearly understood and systematize a ue ca 
entific discipline, the diagnostic ba the oe 
easily become charlatanism, frequently 
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jective hypothesis is involved in handwriting ; 
as one aspect of behavioral response, it has sig- 
nificance in the interpretation of personality. 
However, the author's own statement that “in- 
terpretations vary in accuracy because so much 
depends upon experience, study, and individual 
ability" points up the need for greater refine- 
ment in understanding this technique. One must 
concur with the author in the need for more re- 
search in this field before it can be accepted into 
the armamentarium of psychological techniques. 
Insufficient stress is placed on training in human 
personality for researchers and practitioners in 
this field, although lip service is given to this 
concept in the last sentence of the book. As in 
the study and application of all projective per- 
sonality techniques, knowledge of psychology as 
it applies to normal and abnormal personality 
development is essential. 

Brit J Psychol, Gen Sect 38:105 D '47. * 
Some of Mr. Rand's interpretations seem in- 
genious and convincing ; others, such as those of 
the moral qualities, singularly far-fetched. Since 
this is intended as a handbook for students, he 
himself gives no validatory data. But an intro- 
duction by Prof, A. A. Roback implies that the 
expert graphologist may produce character as- 
sessments which would be at least as accurate as 
those obtained from other projection tests. It 
would, however, be interesting to know if the 
various graphological experts agree in their in- 
terpretations of the various characteristics of 
hand-writing. 

J Abn & Social Psychol 43:123 Ja 248. Jo- 
seph Zubin. 'This is a very clear, concise manual 
of directions, giving specific detailed instructions 
in graphological analysis of handwriting. Any- 
one capable of following simple directions can 
perform the analysis indicated and anyone who 
wishes to see what graphologists are up to and 
how they work.can get a pretty clear picture of 
the procedure from this book. It belongs to the 
cook-book variety of publications, adds nothing 
to previous knowledge, and throws no further 
light on the validity and reliability of graphology. 
The various assumptions and working hypoth- 
eses that are now in vogue are accepted as facts 
and the reader is not delayed by any scientific 
scruples or empirical evidence. Despite these 
facts, the book does offer to an inquiring mind a 
good opportunity for becoming acquainted with 
a neglected area in the psychological field. If 
one were to set up experiments to verify or dis- 


prove the series of assumptions which are . 


B315 


naively propounded as facts by the book, a good 
start might be made in the experimental evalua- 
tion of graphology. 

J Clin Psychopathol 8:753-4 Ар '47. V. C. 
B (ranham), * The author in a rather unscien- 
tific manner has touched upon psychiatric con- 
cepts of introversion, extroversion, ascendance 
and submissiveness. It is perhaps in this rather 
unscientific use of psychological concepts that 
the author is at his worst. His exposition of the 
subject of graphology would have been bene- 
fited by clinical study of the individual produc- 
ing the writing and then the applying of this 
knowledge against the revelation of the hand- 
writing itself through its Cluster Traits. In 
other words a controlled group of studies made 
by psychiatrists should have been checked 
against the data obtained through а similar 
group, whose handwriting formed the chief 
source of data. To offset this defect, the author 
has resorted to the grouping of the dominant 
graphological signs representing various psy- 
chological qualities such as social, volitional, 
temperament, emotional, ego, cultural and moral 
traits. Some attempt has been made to apply 
these groups against certain types of known in- 
dividuals, such as the thief, the forger, the black- 
mailer and the murderer. The validity of such 
procedure is highly questionable. * The format 
„is not particularly appealing. The quality of 
the paper, the binding and the print itself are 
not up to required standards for scientific pub- 
lications. The appeal of the book probably will 
be confined to a large group of lay individuals 
interested in near and pseudo-scientific litera- 
ture. This is regrettable because the subject de- 
serves to be lifted toa high plane, and may within 
the near future become an integral part of that 
body of scientific knowledge which is being so 
effectively applied to the field of Forensic Medi- 
cine. 

J Nerv & Mental Dis 106:612 N47. Douglas 
M. Kelley. This little book begins with an inter- 
esting introduction by A. A. Roback, which 
mentions, in support of graphology, some of the 
Grant studies which are being undertaken at 
Harvard University. The book itself, however, 
seems much too superficial for use as a text in 
this field. It unfortunately has been prepared in 
the “dream book” style, wherein a list of traits 
is given, each trait being followed by the specific 
signs which indicate the trait. For example, 
“anger = T bar high, heavy, pointed at termi- 
nation.” "Calmness — 1 dot low, long, T bars 
\ 
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low, forms round, speed slow, pressure even." 
The body of the book describes these various 
graphological signs and then gives a summary 
of interpretive and diagnostic procedures. Again 
these interpretive groupings are extremely su- 
perficial and the entire impression is that this 
book was turned out for the lay reader who cer- 
tainly, if he follows it closely, is bound to get 
in trouble diagnostically. Books of this type are 
rarely informative because graphology is a young 
and comparatively new technique in this country 
and as yet has not been widely accepted as a 
standard psychological procedure. Superficial 
evaluations serve only to give its opponents 
material to refute any values which it may really 
have. There seems little doubt but that grapho- 
logical analyses properly carried out are of con- 
siderable usefulness in personality description. 
The usefulness of graphology, however, can 
never be expected until oversimplification is dis- 
carded and the technique can be reduced to 
scientific systematology which can be acceptable 
to any sound clinician. 
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Rapaport, DAVID; WITH THE COLLABORATION OF MER- 
ton Сил. AND Roy Scuarer. Diagnostic Psycho- 
logical Testing: The Theory, Statistical Evalua- 
tion, and Diagnostic Application of a Battery of 
Tests, Volumes I and РР Chicago, Ill.: Year Book 
Publishers, Inc., 1945, 1946. Pp. xi, 573; xi, 516. $8.00 
per volume. * (РА 20: 929, 1712) 

Am J Psychiatry 104:587-8 F '48. W. Line. 
These volumes are by this time well known to 
the public for which they are intended. They 
have been extensively reviewed ; and several re- 
search publications have appeared in current 
journals, bearing upon a few of the issues raised. 
Without attempting a "review of reviews," it 
may be said that the reception by clinical psy- 
chiatry and psychology has not been without 
serious reservation; and research observations 
to date have on the whole added to the evaluative 
uncertainties, rather than to the clarification of 
fundamentals. * Rapaport....has examined pa- 
tients of known psychiatric colour by means of 
a battery of tests ( Wechsler-Bellevue, Babcock, 
Hanfmann-Kasanin, Goldstein and Scheerer, 
Rorschach, T.A.T., Szondi, etc.) ; has attempted 
to furnish his own conceptual rationale, and to 
crystallize certain profile or scatter patterns 
characteristic of various diagnosed psychiatric 
conditions. Some of the appended bibliogra- 
phies on particular tests are very useful and 
selectively complete as of the date of publication. 


The over-all impression would be good if these. 
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volumes represented a teacher’s attempt to get 
started with his students on a serious study of 
the field in question; or a research leader’s ini- 
tial compilation of material, with a view to sys. 
tematic planning of experimental investigation 
and conceptual refinement. Presented, as the 
are, as the outcome of long search and rigidly 
disciplined enquiry, they are disappointing, to 
say the least. There is hardly a point in the 
present-day confusion which pervades clinical 
psychology that is honestly tackled; certainly 
none that is clarified either by research evidence 
or systematic thought. Psychology and psychi- 
atry are not brought into closer partnership in 
any scientific sense; nor has the methodological 
contribution of psychology been made to serve 
any notable purpose in the direction of satisfying 
psychiatry’s yearnings for scientific purifica- 
tion. To the reviewer, the outcome suggests mud- 
dled teamwork. The setting is psychiatric. Test- 
ing is fitted to the patient population, with a view 
to showing some possible interpretation of the 
test data that conforms to psychodynamic theory 
and diagnostic classification. The rôle of the 
psychologist becomes that of a pattern maker, 
or a special type of symptomatologist. His cate- 
gories of classification are set for him by psy- 
chiatry. His professional sense is challenged 
only to the degree that he must rationalize his 
findings in psychodynamic terms. His field, thus 
defined, is misleadingly intriguing to the inex- 
perienced student ; for does it not provide a short 
cut to the mysteries of psychiatry? Ore aa 
ally, a given type of psychiatric liability 1s : 
test performance, shorn of its artistry an di 
duced to its stereotyped form and content. 
Szondi test is perhaps the nearest example 0 
the reductio ad absurdum in this com 
Even at the level of this definition of the x | 
psychologist's róle, the work under pii: 
not very impressive. The statistical veneer g А 
on the whole, a misleading suggestion arid 
jectivity ; and the “rationale” is ern a 
inatively discursive, largely without En di 
relationship to factual material. beber ii 
doubt about the fertility of the mind em i 
the presentation; but much doubt E. vii 
the discipline behind it. Psychiatry 08 jin. 
vast strides since the epochal work of S ang 
Isn't psychology, in Rapaport 5 tao 7 

of superimposing а neo-Kraepelinia not tt 
on the modern dynamic scene? Mig 
techniques of measurement be better is 
the clarification of fundamental дула 
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cepts in psychology, rather than concerning 
themselves with the matching of outcomes, and 
then justifying the always-very-approximate 
matching by referring (ultimately) to Freud? 
And in what direction should these fundamental 
concepts be sought? The answer to this ques- 
tion must surely be that which pertains to every 
scientific discipline—namely, within the frame- 
work of its own postulates. True, these postu- 
lates reflect a chosen purpose, preconceived 
clearly or vaguely, but none-the-less precon- 
ceived. To argue the case for psychology in 
terms of the “enhancement of living,” the de- 
lineation of personal assets, as against the 
medical-psychiatric interest in liabilities, would 
take us too far afield at the moment. However, 
it would appear that Rapaport's frame of ref- 
erence is confused by his tacit acceptance of 
psychiatry as the ground of his conceptual struc- 
ture. He has been busy and stimulating ; but not 
very constructive so far, in the important field 
of psychology, judging by the extensive report- 
ing these volumes represent. 

Erasmus 1:135 F 1 "47. M. J. Langeveld. * 
Those, who have to face actual persons and not 
laboratory preparations of human origin will 
be most grateful for the tremendous amount of 
work, done by Rapaport and his collaborators, 
in such an admirably competent way. After an 
Introduction, the Bellevue Scale and the Bab- 
cock Test are exposed with a wealth of experi- 
mental material and intelligent discussion. * The 
appendix giving a tabulation on the Rorschach 
Scores of each subject of Rapaport’s research 
population is in itself a valuable contribution to 
Rorschach research work. As is clear from this 
summary such an undertaking can easily be 
called a venture, And a venture it was, open to 
much criticism. The authors are well aware of 
this fact. But we may be grateful for their com- 
prehensive book, intelligently prepared and well- 
balanced in the interpretation. One feature de- 
serves to be specially mentioned. The authors 
are holding statistical methods in due respect 
without being in such a way under their charm 
that they are covering the lack of insight in the 
teal problematic implication of a topic in ques- 
tion with a state of calculations, starting from 
nowhere and consequently ending on a not very 
convincing conclusion. On the contrary: the 
book is as a whole of a gratifying understanding, 
that qualitative facts can make use of statistics 
only as an incidental illustration of peripheric 
aspects or in a limited category as à heterono- 
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mous form of proof which doubtless is very use- 
ful in many cases, but an enormous waste of 
time in other cases as well. 

J Nerv & Mental Dis 106:206-7 Ag '47. D. 
Kelley. In these two large volumes, Rapaport 
has presented his techniques of clinical testing 
covering a battery composed of the major clini- 
cal tests in current use. After reading the books, 
one must agree with his foreword wherein he 
states, “On any single test or single nosological 
category much more could have been done.” 
This statement is true both quantitatively and 
qualitatively. * If we realize, then, that Rapa- 
port feels his results herein presented are not 
necessarily end-points and his concepts and find- 
ings may well be modified by his own further 
research or research by other workers, we can 
utilize his findings and accept his theories with 
considerable appreciation. * we....indicate ap- 
proval of Rapaport's decision not to “be lim- 
ited by the initial diagnosis of the patient at ad- 
mission but rather re-evaluate the case in view 
of the course of the patient in treatment and 
post-treatment follow-up.” This concept indi- 
cates the careful consideration made of taxo- 
nomic method—a consideration which is fre- 
quently lacking in similar studies. * Volume IT 
„begins with an excellent introduction * The 
Word-Association Test is next introduced, and 
Rapaport has made excellent use of this tech- 
nique. The major section of this volume, com- 
prising 210 pages out of 507, deals with the 
Rorschach Test. Rapaport has his own ideas 
about the Rorschach method and has definite 
differences of opinion from Beck, Klopfer and 
other major Rorschach workers. He differs even 
in the administration of the test, preferring “о 
do the bulk of inquiry after each card is fin- 
ished.” His scoring symbols are also different 
from those of Rorschach or Klopfer or Beck 


or any one else. His deviations, however, are 


not profound and simply represent that dif- 
ference in opinion so commonly found in experts 
in any field. His scoring differences are essen- 
tially indications of finer meaning which do not 
affect the functioning of the test in any way. It 
is impossible, of course, to indicate all of the 
other points which Rapaport has enumerated. 
One can only recommend that any serious Ror- 


schach worker should study this entire section. * 
bitious and com- 


Rapaport has presented an amb › 
prehensive system of diagnostic psychological 
testing. His findings and conclusions represent 


his point of view and are presented in a rather 
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dogmatic fashion. These books are designed 
ideally for the expert who is cognizant of the 
other methods and theories dealing with the 
same tests. Such an expert can gain tremen- 
dously from Rapaport's observations without 
being mislead by his assumptions of finality in 
many of [his] conclusions. Beginners in this 
field can also read these books with considerable 
profit, provided they constantly bear in mind 
the fact—frequently lost sight of in the books— 
that there are other psychological methods and 
other clinical techniques besides those empha- 
sized. 

Nerv Child 5:276 Jl '46. This is by far the 
most important contribution to the problem of 
test psychology published since the beginning of 
this decade. * The result of the research has been 
to show that the weaknesses and impossibilities 
of one test can be overcome by the use of com- 
plementary ones. 


For additional reviews, see 3: 1102-4. 
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*Rause, S. Avery. Experience With Employee 
Attitude Surveys. Conference Board Reports, Stud- 
ies in Personnel Policy, No. 115. New York: National 
Industrial Conference Board, Inc., 1951. Pp. 120. Pa- 
per. * [Distribution is limited to associated companies, 
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*Rayner, S. A. The Special Vocabulary of Civics. 
Foreword by Alice Hoy. A.C.E.R. Research Series, 
No. 65. Published for the Australian Council for Edu- 
cational Research. Melbourne, Australia: Melbourne 
University Press, 1951. Pp. x, 105. Paper. 10s. * 
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*REMMEns, H. H., Eprrog. Studies in College and 
University Staff Evaluation, Purdue University, 
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Education [No.] 70; Further Studies in Attitudes, Se- 
ries 15. Lafayette, Ind.: the Division, March 1950. Pp. 
96. Paper. $1.25. * 
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kRemmeRs, H. H., AND Gace, N. L. The Abilities 
and Interests of Pharmacy Freshmen. The Phar- 
maceutical Survey, Monograph No. 1. Reprinted from 
The American Journal of Pharmaceutical Education, 
Vol. 12, No. 1, January 1948. Washington, D.C.: Amer- 
ican Council on Education, [1948]. Pp. 65 plus 13 in- 
serts. Paper. $0.50. * 


[ B321] 

*Rıccıuti, Henry N. AND Frenc, Joun W. De- 
velopment of УК БҮ Tests for Naval Officer 
Selection: I, Analysis of U.S. Naval Academy 
Criterion of Aptitude for Service. Office of Naval 
Research Contract N9 onr 98200, Task Order N9 onr 
98201, Project NR 151-030. Technical Report No. т. 
Princeton, N B LE Educational Testing Service, March 
iod Pp. iv, 34. Paper, lithotyped. Gratis. * (PA 25: 
7714 & 
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*RosENZWEIG, SAUL; WITH THE COLLABO 

КАТЕ Levine Kocan. Psychodiagnosis: Prid 
duction to Tests in the Clinical Practice of Psy- 
chodynamics. New York: Grune & Stratton, Inc, 
1949. Pp. xii, 380. $5.00. * (РА 23:3761) 1 


Am J Orthopsychiatry 20:656—7 Jl’ 50. Adolf 
С. Woltmann. This excellent volume is further 
proof of the fact that clinical psychology is grow- 
ing up and establishing itself as an independent 
as well as an indispensable discipline among 
those professions which are concerned with the 
preventive and curative aspects of mental health, 
* abstract formulations, background hypotheses, 
and general discussions about validity and relia- 
bility have been omitted * Included under pro- 
jective methods are the Rosenzweig Picture 
Frustration Test and the World Test. These 
two tests, to the best of this reviewer's knowl- 
edge, have not been included in such descriptive 
form in previous books on clinical psychology. 
* It is hoped that a revision of Rosenzweig's 
book will contain more about the diagnostic use 
of the *human figure drawing." The Bender 
Visual Motor Gestalt Test also is established 
as a valid and diagnostically important tool. 
This test likewise should be included in a future 
revision. Part III is of special interest to the 
clinical psychologist because it deals with the 
knotty problem of diagnostic integration. A 
brief summary of psychodiagnosis as 4 science 
concludes this well-written, informative book. 
Dr. Rosenzweig's approach is “a constant " 
plication of dynamic postulates intended to m 
together the older psychometric and the n 
projective techniques." He has succeeded t 
mirably. It should be noted that part 0 ion 
credit for the smooth and successful duce. 
of the subject matter goes to Dr. Kogan, ji ps 
rich clinical experience and thorough fam! t id 
with the diagnostic functions of the рї m. 
gist are reflected in the chapters which she EA 
pared. The graduate student and the xd 
logical intern cannot afford to 10185 шү it 
To the psychiatrist and the social Td 
will bring enlightenment and understan m 
will tell them what clinical psychology тейле 
and how it may help toward diagnosis. will 
ment, The practicing clinical psycholog 
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enjoy this volume because it contains not only a. 
wealth of information but also very valuable 
hints in the difficult field of diagnostic integra- 
tion and interpretation. 

Am J Psychiatry 107:397-8 N "50. Alexander 
Simon. * The author, of necessity, was limited 
by space from including many tests, such as the 
Bender Gestalt or the Szondi. He may be con- 
gratulated, however, on providing an authorita- 
tive reference for students, written in compara- 
tively untechnical language, in an understanding 
fashion and easily understood. 

Am J Psychol 63:477-8 Jl’ 50. Frank 5. Free- 
man. * It is not clear....what professional groups 
the volume addresses. * the author states that 
the book should meet the needs of beginning 
students of professional psychology as well as 
the needs of those in allied fields of medicine, 
nursing, occupational therapy, and social work 
* The professional groups mentioned are so 
highly heterogeneous in regard to preparation 
and competence in psychology that it seems ex- 
tremely doubtful, indeed, whether any single 
volume can or should be addressed to all of them. 
The psychologically non-professional reader will 
not have had the necessary preparation in the- 
ory and practice of psychological testing, nor in 
psychological theory of development and be- 
havior—both normal and abnormal—to under- 
stand, value, or benefit from, the excellent tests 
and case-materials that Rosenzweig presents. 
Apparently recognizing the limitations and 
handicaps of the naive reader, Rosenzweig has 
tried to overcome these in some degree by pre- 
senting the sketchiest kind of descriptions of 
еасһ of twenty-two tests * For the reader who 
is unfamiliar with the characteristics, the theory, 
and the construction of these....tests, such very 
brief descriptions are inadequate to an under- 
Standing of the case-materials that follow; not 
to speak of a valuation of them. For the reader 
who is equipped with a knowledge of the tests, 
such descriptions are unnecessary. Fortunately, 
the description of each test is followed by one ог 
more very good illustrative protocols, so that 
even the inexpert reader may see how each in- 
Strument is used and how the results are re- 
Ported. But, for an adequate appreciation and 
evaluation of the results and reports derived 
Tom tests, an understanding of their psychologi- 
cal and statistical rationales is essential. At an- 
other point in the preface, Rosenzweig тз 
that “the advanced student and the professional 
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worker will find in these pages a consistent ap- 
plication of dynamic postulates intended to knit 
together the older psychometric and newer pro- 
jective technics.” Herein lies the value of this 
book; for the illustrative protocols are well 
chosen and well presented. The cases cover a 
wide range of types; the results and interpreta- 
tions of the several tests in each case are clearly 
presented and interrelated. On the whole, the 
case materials are, in their content and organiza- 
tion, of the best that have appeared among the 
numerous publications in clinical procedures 
and diagnosis within recent years. An experi- 
enced clinical psychologist has here written out 
his views of clinical procedures and illustrated 
them through case presentation and interpre- 
tation. Other clinicians and advanced graduate 
students should find the volume of professional 
value. 

B Menninger Clinic 14:188 5 '50. Richard S. 
Siegal. This is a valuable introduction to and 
survey of the field of diagnostic testing. The 
reader is given some of this eminent clinician's 
views on the function of diagnostic tests, the 
scope of their contribution to the full under- 
standing of the patient and some inkling of the 
process by which the psychologist arrives at 
his diagnostic conclusions. * concludes with a 
number of interesting case studies. 

Digest Neurol & Psychiatry 17:714 D 4p. 
R. S. McK. * This volume is carefully organized 
both in scope and presentation, and progresses 
with increasing complexity. Although not all- 
inclusive, it offers in Part 2 a wide diversifica- 
tion of tests in functional grouping. * A valuable 
section....is Part 3 in which the process of psy- 
chodiagnostic integration is demonstrated. * 
Three widely diversified cases are cited, con- 
crete responses from each test being given to- 
gether with the conclusion drawn from. each. 
Particularly informative is the explanation of 
the rationale behind the selection of the tests 
for the particular cua * The book serves its 

ion as outlined. 
crt & Social Psychol 45:779-80 О "о. 
Sibylle Escalona. * written in agreeably non- 
technical language. Fortunately the style is fac- 
tual rather than pseudoliterary throughout. The 
author describes the chief function of the е 
as follows: “to explain the purpose, pn 
instructions, obtained data, scoring Soter an 
interpretation of a representative group Ms 
chological tests as these are currently applied. , 


i 
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The book lives up to its purposes as stated. It 
is to be noted that the author does not promise 
to provide a rationale for psychodiagnostic test- 
ing, nor does he attempt to do so. This reviewer 
doubts the wisdom of this omission since the 
selection of tests for discussion, the recom- 
mended devices for administration, scoring and 
interpretation of tests, and the diagnostic sum- 
maries inevitably reflect a theoretical bias. As it 
is, the author's theoretical preferences are dis- 
cernible only to the sophisticated reader. * It 
would have been clearly impossible for the au- 
thor to discuss all available tests in each of 
the general areas receiving consideration. In 
view of this fact one wonders why the detailed 
discussion of selected instruments was not sup- 
plemented by at least an enumeration of other 
tests widely used in the same area. Such tests as 
the Cornell-Coxe Performance Ability Scale, the 
Szondi test, the Merrill-Palmer Scale of Mental 
Tests, or the Sentence Completion test do not 
appear in either the text or the index. One also 
regrets that the bibliography is extremely lim- 
ited. The book makes its greatest contribution 
in the liberal use of case material and test proto- 
cols. While rarely referring to general principles 
of psychodiagnostic testing, it succeeds in giv- 
ing a coherent and orderly picture of the inter- 
relationship between clinical data, test findings 
and the characteristics of personality function- 
ing which are reflected by both. It succeeds in 
removing the element of mystery from the 
process by which the clinical psychologist arrives 
at his conclusions and frankly acknowledges the 
decisive importance of the clinical judgment 
and theoretical viewpoint of the psychodiagnos- 
tician. It is this reviewer's impression that the 
author's analysis of test data emphasizes con- 
tent analysis and surface manifestations in the 
form of conscious attitudes rather more than 
formal modes of analysis and the search for 
deeper-lying aspects of personality functioning. 
This applies especially to the discussion of the 
Rorschach and the Thematic Apperception Test. 
“Psychodiagnosis” is recommended as a text- 
book for clinical psychologists in training. It 
should also prove useful to psychiatrists and 
those allied disciplines who wish to acquaint 
themselves with the work of the clinical psychol- 
ogist. mS 

J Consult Psychol 13:309-10 Ag ’49. Lau- 
rance F. Shaffer. Not since Well's Mental tests 
in clinical practice (1927) has there been quite 
. -$0 satisfactory an introductory book on psycho- 
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logical diagnosis. Its greatest value is in the 
thoroughly clinical flavor, with many case stud. 
ies that illustrate first the use of each test sepa- 
rately, and later, in integration with others, * 
With all its merits, the book is not without faults, 
Many test descriptions—e.g. 35 lines to describe 
the Rorschach—are so scanty as to be meaning. 
less without other references. There is question, 
too, of the value of including very brief and 
superficial accounts of mechanical and musical 
aptitude tests. Clinical psychologists will like 
this book. Their students will use it widely, both 
as a framework for beginning study in psycho- 
diagnosis, and as an integrative summary for 
more advanced instruction. 

J Crim Law & Criminology 41:209-10 JI-Ag 
'so. Nathaniel Thornton. As a primer the 
purpose of which is to acquaint the reader or stu- 
dent with the essential fundamentals of psycho- 
logic testing, Rosenzweig’s book merits com- 
mendation no less than attention. * a volume... 
marked not only by fine technical accuracy but 
also by a most ingratiating clarity of exposition. 
It is the reviewer’s opinion that the present vol- 
ume definitely fills a gap in the existing litera 
ture on methods in modern clinical psychology, 
because he recalls no book giving the uninitiated 
reader a more adequate understanding of such 
methods. The frontispiece very appropriately in- 
cludes portraits of Francis Galton, Alfred Binet, 
Sigmund Freud, and Hermann Rorschach, for 
it is largely, if by no means exclusively, from the 
contributions of these four scientists that the 
basic idea of all psychologic testing has been de- 
rived. It would be a mistake, however, to 1916 
out of consideration that debt which we ow? t 
C. G. Jung, in this connection, for his кү 
nently valuable studies in word-associatom 
say nothing of the Jung Word-Association 2 
that is currently employed. Within the d 
of approximately four hundred pages the 4d 
thors set forth the inherent features of abi 
used intelligence and personality tests. we 
time to time, the authors introduce Шш А 
protocols which not only interest the rea ЕП 
aid him greatly in his efforts to pap. 
more fully the results obtained by suc viene! 
gence and personality tests. What ет bool 
ought to add here, perhaps, 15 that t i fro 
despite its somewhat technical character, | 
beginning to end a readable book. On 4 this 
nated even while one is instructed. ai 
as it ought to be; for it would be di ion 


otio! 
to deny that the study of mental and em 
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processes is one of the most engrossing things on 
earth. The final chapter, entitled Uxoricide on 
an Unconscious Basis, will be quick to engage 
the attention of those interested in the crimino- 
logic aspects of psychologic testing. Here the 
authors do much towards pointing up the tre- 
mendously important part that unconscious fac- 
tors so often play in the perpetration of crimes. 
Certainly this proves to be one of the most re- 
warding chapters in the entire volume. When 
the psychologic testing had been completed in 
this case of uxoricide, it transpired that the 
crime had been committed essentially as a de- 
fense against possible “castration” by the wife. 
One cannot help speculating upon the good 
which might result if in other cases involving 
criminal actions some extensive psychodynamic 
investigation could be made of the subjects’ 
mental and emotional processes. It is easy 
enough to see how much valuable or even Vital 
material might come to light in this way. Con- 
sidering how admirably Rosenzweig's book 
serves the purpose for which it obviously has 
been designed, the reviewer can recommend this 
volume without qualification. 

J Proj Tech 15:111-4 Mr ’51. Robert R. 
Holt. * clearly and simply written, well organ- 
ized and reasonably comprehensive in its cov- 
erage of methods in current usage. The point of 
view is generally progressive and sound. I liked 
particularly the attention that was given to the 
problem of integrating the results from several 
methods, and the stress on the need for a thor- 
ough synthesis of inferences from tests with data 
of other kinds, historical and observational. The 
introductory chapter orients the reader to the 
psychodynamic point of view in diagnostic work, 
bringing out a strength of the book which is also 
a weakness. “Diagnostic labels are clearly 
secondary to psychodynamic considerations. 
Amen, most of us will chorus. But in the name 
of abandoning mere pigeon-holing, clinicians 
these days too often underemphasize structural 
in favor of dynamic considerations. Throughout 


the book, the psychodynamic inferences drawn. 


and integrations of cases are admirable; but by 
contrast the discussions of the structural aspects 
of personality are disappointing. The lack of 
any systematic concepts of ego-structure makes 
the discussion of results in this area seem un- 
duly vague and inconclusive especially when 
non-projective tests are treated. * Particularly 
commendable in the brief and excellently or- 


dered exposition of each test is the inclusion of 
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one or more actual cases. Some facts about the 
patient, reasons for testing, the test results and 
their interpretation are given, making the na- 
ture and uses of each instrument vivid and con- 
crete. It is disappointing to find no section de- 
voted to interviewing or history-taking, and no 
explicit recognition that diagnosis rests basically 
ona consideration of the patient’s difficulties in 
living, as reported by himself and those who 
have been in contact with him in everyday life. 
It is there implicitly, but for the benefit of en- 
thusiastic neophytes or persons who are dazzled 
by the mystique of projective testing, it would 
have been better to have said it plainly, The 
book’s principal weakness as an introduction to 
psychodiagnosis, particularly for students who 
will do no further reading, lies in its avoidance of 
the crucial issue of validity. It is too easy to get 
the impression that the authors accept com- 
pletely the claims that are made for the tests, and 
that they use certain interpretative principles 
uncritically, though the validity of these princi- 
ples has never been demonstrated. For example, 
in presenting the Babcock test they state : “Com- 
parison of the vocabulary score with the other 
scores reveals the degree to which the indi- 
vidual's present functioning level coincides with 
his original level,” ignoring Feifel's work in 
demonstrating that the apparent relative stability 
of vocabulary is an artifact of scoring method. It 
is not to be expected that an introductory book 
would spell out all of the conditions under which 
Babcock’s assumptions may be (cautiously) ac- 
cepted. It would have been easy to include a 
little critical evaluation throughout, which would 
have helped prevent that overselling of tests to 
which clinical psychologists are unfortunately 
prone. For example, the exposition of the Bern- 
reuter inventory could easily have included 
Flanagan’s factor-analytic finding that three and 
not five separate traits are measured by this 
test. The discussion of the Rorschach test has 
a similar defect, besides being so brief as to be 
almost misleadingly general. Despite the lack 
of positive evidence and in the face of Roe's and 
Zubin’s findings, human movement responses 
are blandly presented as an indication of the 
subject's “capacity to be creative,” and in gen- 
eral the various scoring categories are treated 
as if their significance had been thoroughly es- 
tablished. An almost opposite criticism may also 
be made: that in their discussions of the sepa- 
rate instruments, too often the authors seem to 
get 50 little out of a test interpretatively that one 
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might well wonder what purpose had been 
served by using it. The brilliance of some of the 
case demonstrations in the latter part of the book 
—notably the chapter on “Охогісійе оп an un- 
conscious basis"—is proof enough that Rosen- 
zweig and his co-workers really are able to learn 
a great deal about a person from diagnostic 
tests, particularly when taken in conjunction 
with life-history materials. It is of course very 
difficult to steer a true course between the en- 
couragement of wild speculation and the over- 
conservative error of being unwilling to con- 
clude more from a test than a possibility that the 
patient may be sick. In such a situation, it is 
probably easier to err on the side of caution. 
Chapter 7 on Projective Methods (Word As- 
sociation, Rorschach, TAT, World Test, Pic- 
ture-Frustration) contains two solid contribu- 
tions of interest to readers of this Journal. In 
fourteen pages there is an excellent and com- 
prehensive treatment of the senior author's own 
test, the Picture-Frustration Study, with ex- 
amples showing clearly how it is scored and in- 
terpreted, and how it may be useful in clinical 
work. And then, in 20 pages there is the first 
published exposition of Rosenzweig's technique 
for analyzing the TAT, which he calls the Com- 
posite Portrait method. Promising a detailed 
presentation in a future publication, he sketches 
out the six stages that are involved in his analy- 
sis of a set of stories. * The examples given show 
this to be an eminently practicable method of 
getting at the meat of the TAT, one which is 
directly applicable in everyday clinical practice 
(unlike so many other, more ambitious interpre- 
tative schemes). Here, as elsewhere, Rosen- 
zweig demonstrates his keenness in following the 
essential psychodynamic threads of the case 
and tying them together meaningfully, and with 
it a characteristic weakness: he fails to utilize 
fully the formal characteristics of the story pro- 
ductions. It is true that formal aspects are con- 
sidered in the method, and that inferences about 
the cases are drawn from them throughout the 
book. They tend to be used in two ways: as 


“complex indicators" (e.g., note is taken of ~ 


blocking as pointing to an area of conflict), and 
as a basis for inductions about the cultural and 
intellectual level of the subject. What is missing 
is their use to furnish hypotheses about ego- 
structure, the patterns of coping with the world 
that are characteristic of the person. This is a 
minor criticism, however. Rosenzweig's method 
promises to have the great virtue of being more 
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easily teachable than most others, which recom 
mends it strongly to the many clinicians ho 
have won for themselves an understanding of 
the TAT but have found it very difficult to 
develop similar proficiency in students, There 
area number of minor errors, such as occasional 
incorrect computation of weighted color sums 
and inconsistent scoring of the Object Sorting 
Test, but they do not detract greatly from the 
usefulness of the book as an introductory text, 
The advanced worker in the field of projective 
techniques will not discover a great deal that 
is new or instructive for himself in this book, 
but he may find it quite useful as a way of pre- 
senting a quick and intelligible overview of psy- 
chological testing to students or colleagues. 
Psychoanalytic О 19:114-5 Ja '50. William 
F. Murphy. * an up-to-date introduction to psy- 
chodiagnosis which can be easily read and un- 
derstood. The language is nontechnical and 
discussion of test standardizations has been 
avoided. The purpose, material, instructions, 
data, methods of scoring and interpretations of 
the more important psychological tests...are 
adequately and interestingly described and ex 
plained. The author has a style which is refresh- 
ingly clear and direct, and a good understanding 
of the capabilities and limitations of psycholog- 
cal testing. There are many well-detailed clinical 
examples showing how the results of the various 
tests are collated, and a picture of the living ir 
dividual reconstructed from the data. Chapter 
VIII, which discusses how the psychologist 
should present the picture of the individual, de- 
serves special mention and praise. There a 
few psychologists or psychiatrists who pen 
cases before clinical conferences who could nol 
read this chapter with profit. * Students, i 
fessional psychologists, and psychiatrists pe 
wish to become acquainted with or ret 6 
aims, scope, advantages, and limitations 0 А 
chological testing will find this book of 8 
value. Hun. 


.с25_7 N ^50. Howard F. PV 
Psychol B 47:525-7 N '50 ty of concrete il- 


ceptua an 


clinical operations actually performec 
modern, or dynamically orien 
chologist. The authors de-emphasize 
formulations of theory and method P 
cause it has seemed premature to m A 
principles that are as yet largely e osti 
tentative in the work of the psych | formi 
cian" (p. viii). Abstract and theoretica 
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lations have by no means been disregarded, how- 
ever, in the attempt to avoid jargon. * includes 
anice discussion of the art and practice of writ- 
ing psychodiagnostic reports * In the closing 
chapter on “Psychodiagnosis as a Science" the 
authors view with favor the beginning rap- 
prochement between experimental and clinical 
psychology. Here they make a case for the in- 
dividual, in all his complexity, rather than the 
"generalized average man" (sic) as the more 
profitable unit for purposes of scientific scrutiny. 
The discussion in this chapter seems almost to 
identify scientific enterprises devoted to the dis- 
covery of general laws of behavior with the pro- 
duction of propositions about "the generalized 
average man" or about isolated segments of 
behavior which will be irrelevant to the task 
of understanding the total human individual. 
Though such an outcome has been frequent, 
possibly even modal, this reviewer is inclined 
to question any implication that it is either nec- 
essary or unavoidable. After all, psychoanalytic 
theory, which is widely conceded to be relevant 
to the clinical enterprise of understanding the 
individual, is a general theory of human behavior 
and development. And recent advances in learn- 
ing theory, based largely upon isolated segments 
of the behavior of the rat, appear not only to 
confirm some aspects of psychoanalytic theory, 
but even promise to furnish clues as to how it 
may be revised and extended. In general, the 
discussions as well as the test and case materials 
are well formulated and lucidly expounded. 
They should prove effective as an introduction 
to the sophisticated clinical approach to psy- 
chological diagnosis. The authors deserve com- 
mendation for their attack on this difficult edu- 
cational task. Though a text of this length 
cannot cover, or even mention, all of the popular 
‘testing techniques, the selection in this book 
Seems sufficiently broad and representative for 
introductory purposes. For all its virtues, the 
book has several serious limitations in its role 
as an introductory text. For one thing, the in- 
terpretations and inferences included in the dis- 
cussions range from the probably valid or widely 
accepted and canonical, through the hypotheti- 
cal, to those which the authors would, no doubt, 
be willing to class as frankly speculative. Even 
зо, alternative interpretations and qualifying 
Statements appear only infrequently. The begin- 
her, without extensive background in clinical 
psychology, will find it difficult to evaluate the 
validity of these interpretations and inferences. 
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Little in the book would even suggest that such 
a problem exists. Furthermore, the bibliogra- 
phy includes only very scanty reference to the 
critical research and clinical literature. Though 
space limitations imposed on such a book pre- 
clude detailed examination of the validity of the 
interpretations presented, the general problem 
deserves more emphasis than it received. For 
pedagogical reasons some attempt should have 
been made to differentiate between standard, 
widely accepted interpretations and those which 
are hypothetical. This could have been achieved, 
without swamping or discouraging the begin- 
ner, by means of brief, organized discussions 
illustrated by some research data and argument, 
all presented with the same lucidity the authors 
achieved in delineating their approach to psy- 
chodiagnosis. 

Q R Biol 24:267-8 S '49: Wendell Muncie, * 


“an excellent textbook * can be highly recom- 


mended. 

Scientific Mo 69:202 S '49. Stanley B. Zuck- 
erman. * among the projective techniques, the 
Szondi and the Bender Gestalt tests, on which 
considerable research material became available 
sufficiently prior to publication, would seem to 
warrant attention, along with more ample treat- 
ment of the personality implications of human 
figure drawings. Similarly, graphology, the “ac- 
tion techniques," and the revealing play in 
«control-situations," "frustration techniques," 
expressive finger painting, plastic апі balloon 
games, might well have been included within the 
scope of the book as implements both of diag- 
nosis and of the larger therapeutic process. Nev- 
ertheless, although the really advanced clinical 
psychologist will gain relatively little in new or 
refined skills by study of the work, even for him 
it has value as a convenient compilation, which 
offers as well good illustrative case material. 
Most significantly, though, in providing stu- 
dents, younger people in the field, and profes- 
sionals with a scientific interest in psychody- 
namics with an up-to-date, readable, compact 
yet authoritative review of techniques employe 
by clinical psychologists today to gain à better 
understanding of the functioning, endowment, 
interests, drives, conflicts, and problems of their 
patients, Rosenzweig has made an important 

ntribution. 

Т acil Forces 28:219-20 D ^49. James W. 
Layman, * written for use as an introductory 
text. As such, the reviewer believes it to ‘fall 
short of meeting the needs of students ordinarily 


j 
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enrolled in introductory courses in clinical psy- 
chology. Although the writer clearly indicates 
his intent to omit “the conventional discussion 
of test standardization," it is questionable that 
beginning students are ready for exposure to 
test interpretations without discussion of their 
limitations. Also, the text suffers from such 
sweeping generalizations as the clinical psychol- 
ogist “leans heavily upon the formulations of 
psychoanalysis” without further qualification as 
to “whose” psychoanalytic theory. Although 
cogently critical of the “academic psychologist” 
and the early “psychometricians,” no criticism 
is made of the more speculative, or theoretical, 
assumptions utilized as a basis for psychoanaly- 
tic theory, or of the unsubstantiated aspects of 
the “older psychiatry.” Personality descriptions 
such ds “this girl is easily stimulated by the 
events and relationships of her environment but 
she cannot accept or integrate these experiences ; 
they thus become tension-provoking” are made 
without reference to which of the data are used 
as a basis for the interpretation or citation of 
demonstrable evidence (clinical, experimental, 
or otherwise) showing any relationship between 
whatever data are used and the behavior at- 
tributed to the subject. On the other hand, by 
the time the student of clinical psychology is 
ready to use the techniques discussed he should 
have passed the point where he would accept 
generalizations without wanting to know their 
bases. It is to be questioned also that students 
or persons already trained in the “allied fields” 
should be exposed to an uncritical presentation 
of these same techniques. 


[ B324] 

*RosxiLL, О. W., Ерток. Methods of Recruitment 
and Selection of Personnel in Industry and Com- 
merce. London: O. W. Roskill & Co. (Reports) Ltd., 
1951. Pp. v, 67. Paper, mimeographed. 21s. * 

Occupational Psychol 25:150-1 Ap ’51. B. 
Ungerson. This report attempts in 48 mimeo- 
graphed quarto pages, plus 20 pages of appen- 
dices, to deal with methods of recruitment, job 
study, methods of selection, and the co-ordina- 
tion of recruitment and selection with other as- 
pects of personnel work. It states its own objec- 
tive in the following words: “The aim of this 
report is to assist business men to devise systems 
of personnel selection which, having regard to 
the expenditure of time and money involved, are 
really effective in finding men and women who 
will perform particular jobs with maximum ef- 
ficiency.” Such a task is clearly impossible in 


such a short report, even if it were attempted by 
someone expert and experienced in the subject 
Unfortunately, there is ample evidence that the 
report is the work of a person or persons with 
very superficial knowledge of the subject, and 
with a non-scientific approach to personnel 
selection problems. The report is full of “psycho- 
logical” generalizations, and it over-simplifies 
every problem which it discusses. For example, 
in defending “the good old-fashioned way of 
taking on new work people and staff which was 
to rely on your own judgment and choose the 
man you liked the look of best,” the following 
statement is made: “The judgment of one man 
making an appointment might go wrong, but 
given, as boss, a good judge of men, the chances 
are that he will be right more often than not.” 
To any vocational psychologist accustomed to 
expressing test validities in terms of statistical 
correlations, such a statement must seem la- 
mentably crude. * The problem of selection is 
„discussed on the assumption, now long dis- 
proved, that good selection can be achieved by 
deciding the "qualities" needed for апу job 
and then using tests which measure those quali- 
ties. Psychologists will recognize here “the nam- 
ing fallacy,” which should be well known to any- 
one claiming to write with authority on this 
subject. A psychologist reading the report will 
find unsound statements or excessive generali- 
zations on nearly every page. * The need for 
validation of selection methods is occasionally 
mentioned, but then only in such general ап 
over-simplified terms as to show that the m" 
has no experience of the methodological difi- 
culties involved. * The author seems to have n 
appreciation of the technical difficulty of ds 
ing selection procedures, and suggests at 0 
: their ow? 
point that managements should devise tX m 
tests. Test construction and the establis ur ; 
of norms are both despatched in nine i 
glorious generalization. To sum up, the И 
abounds in psychological fallacies, and cer ay 
cannot be recommended in preference 0 5 
recognized text book on personnel S ^ 
fortunately, there are deficiencies Ms ret 
psychological in the report. Many 0 ^, sy. 
ences to sections or pages are nactus ар 
tably page 11, where relevant page? hn foris 
pendices are misquoted. The spe an 
given in the appendices are badly der in it 
certainly inferior to a great many 1 ae mos 
dustry and commerce to-day. Perhaps nd in the 
useful section of the report is to be fou 
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$a e 
appendices, where lists are given of University 


Appointments Boards and other possible sources 
of recruitment. This information, however, is 
quite easily available elsewhere to those who 
need it. 


[ B325] 
Ross, C. C. Measurement in Today’s Schools, Sec- 
ond Edition, New ork: кеи a a 1947. 
Poi Е , $4.75; trade edition, $6.35. 

Ed Adm & Sup 34:250-2 Ap ' 48. Carroll D. 
Champlin. * It remains for the classroom in- 
structor of educational measurement to put to 
the pragmatic test of teachability this sincere 
and psychologically sound recommendation for 
a more thorough, functional and profound study 
of measurement. 

J Appl Psychol 33:99—100 F ’49. Walter W. 
Cook. * The organization, chapter and section 
headings, and approximately ninety-five per 
cent of the content remain unchanged. In gen- 
eral the references have been brought up to date 
and the content modified sufficiently to incorpo- 
rate them. * The text....is well organized to give 
the beginning teacher and partially trained 
school administrator an integrated insight into 
the theory of measurement, descriptive statis- 
tics and individual differences as they relate to 
school organization and instruction. * Appropri- 
ate emphasis is placed on the construction of 
informal teacher-made tests. While the ap- 
proach to the use of measurements is functional 
and integrative it is also traditional and uncriti- 
cal, In the cataloguing of research the point of 
view is that of the professor of measurement 
rather than that of the director of school or- 
ganization and learning. The extent to which 
our present knowledge of individual and trait 
differences in the schools points to new uses of 
measurement and needed reforms in school or- 
ganization and practice is not sensed. Neverthe- 
less....[this book rates] high among the teach- 
able books available in the field. 

J Ed Res 41:391-2 Ja’48. Richard M. Drake. 
* does not represent as complete a revision as 
the publicity would lead one to believe. It is 
Possible, though not probable, that the first book 
was so good that little needed to be done. One 
cannot overlook the fact that rapid strides have 
been made in this field since 1941. * It is a ques- 
tionable procedure to incorporate in one chap- 
ter, the entire contents of a course in elementary 
Statistics, This is more than is necessary for 
those who have had statistics and insufficient 


explanation for those who have not. In an ele- 
mentary course in testing designed for class- 
room teachers and administrators, classification 
of data, computation of measures of central tend- 
ency, and centiles is all that is necessary. Meas- 
ures of variability, correlation techniques, etc., 
could then be made functional in a more ad- 
vanced course in measurement. The author 
devotes one chapter to the discussion of evalua- 
tion, Since measurement is a part of evaluation, 
it would seem undesirable to segregate the latter 
and thus leave the impression with the reader 
that they are separate and distinct concepts. The 
chapter on general principles of test construc- 
tion is exceptionally well done and should prove 
helpful to classroom teachers. Administrators 
will be able to obtain much valuable assistance 
in establishing a testing program from Chapter 
VII, “Steps in the Testing Program.” The book 
as a whole is well written, The explanations 
should be understandable to most students and 
the analyses of the topics are well thought out. 
One unusual characteristic of this book which 
sets it apart from other similar treatises is that 
it is interesting to read and is interspersed with 
bits of appropriate humor. In the opinion of the 
reviewer, the late Dr. Ross's book is among the 
best of the available texts in this field. 
Occupations 29:479-80 Mr 'sr. Florence S. 
Dunlop. Those who are familiar with the first 
edition....will welcome the second edition, in 
which the author has revised almost all of the 
material from the first edition and includes 
much altogether new material. AIL bibliogra- 
phies and citations have been revised, * The 
book is scholarly yet practical. It reviews and 
gives exact references of the contribution made 
by the leaders in the field and shows how these 
have been and can be applied in today's schools. 
The book is replete with illustrative material, 
tables, graphs, charts, statistical computations, 
case studies, cumulative records, etc. At the end 
of each chapter is a list of carefully selected read- 
ings for further study. This is one of the most 
inclusive, practical, and up-to-date texts in the 
field of measurement. It is highly recommended 
as a college text as well as a text for in-service 
training of teachers and school administrators. 
Psychometrika 13:12374 Je 48. Frederick B. 
Davis. * designed to provide a practical intro- 
duction to the problems of educational measure- 
ment for students in teachers’ colleges and 
schools of education. This it probably does rea- 
sonably well, though the reviewer cannot help 
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feeling that the obvious is often belabored. То 
cite just one example: On page 195, where the 
author is discussing the administration of 
standardized tests, he writes, “Who should ad- 
minister the tests? It goes without saying that 
only competent persons should administer 
standardized tests." The reviewer couldn't help 
thinking, "Then why does he say it." And this 
same reaction cropped up again and again 
throughout the book. * it is regrettable that in 
many instances the book presents outmoded and 
sometimes erroneous concepts. For example, in 
discussing the use of the Spearman-Brown for- 
mula, the author says (p. 86), "It must be 
emphasized that the formula requires the use of 
chance halves of the test, not just any halves." 
Now the fact is that it is highly desirable to use 
halves matched for content and variability (as 
the author has actually suggested, in part, on the 
preceding page). No warning of the most severe 
limitation of the split-half, Spearman-Brown 
technique—namely, its inappropriateness for 
speeded tests—is given. Statisticians will object 
to some of Ross's definitions and descriptions of 
statistical terms and procedures. The range is 
defined several times as “the distance between 
the lowest and the highest scores." (Instead of 
the difference between the highest and lowest 
scores plus one.) On page 248, we read, “Ап 
adequate sampling must be chosen in a random 
manner ;...." This is not always true. In general, 
it must be said that this book, though originally 
published in 1941 and revised for republication 
in 1947, is of the vintage of the late thirties. No 
suggestion of the impact of war-time research 
on psychological measurement appears in the 
book. Consequently, the author's discussions of 
measurement in guidance and of other topics, 
such as the level of reliability required of tests 
used for selection purposes, seem dated. 


For reviews of the first edition, see 3:1131. 
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*Roruney, Joun W. M., AND Roens, Bert A. Coun- 
seling the Individual Student, New York: William 
loane irinte Inc., 1949. Pp. vii, 364. $3.00. * (PA 
24: 1187 


Calif J Sec Ed 25:62-3 Ja ’50. H(enry) 
C(lay) L(indgren). * Although the title is re- 
dundant, it is nonetheless indicative of a recent 
trend in guidance thinking—a trend which dif- 
ferentiates sharply between the individual and 
the group approach to personnel services. Roth- 
ney and Roens are interested primarily in the 
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former. They are keenly aware that Very often 
generalizations based on experiments with 
groups of people are being applied directly t; 
the individual without being qualified, Without 
further modification, and particularly without 
consideration of whether the generalizations in 
question are appropriate for use with the par- 
ticular individual involved. The authors are con- 
cerned with the superficial nature of much of the 
activity which passes for guidance in the schools 
* written primarily for those who have com- 
pleted an elementary course in guidance * will 
probably be more useful in the hands of the 
person who has had some teaching or, prefer- 
ably, some counseling experience. It makes no 
attempt to cover “counseling” completely, di- 
dactically and systematically ; rather it attempts 
to present the authors’ points of view docu 
mented partly with quotations from “guidance 
experts,” but mostly with excerpts from case 
histories. The amount of space devoted to the 
authors’ viewpoints is kept to a minimum; in- 
deed, most of the book is devoted to case his- 
tory and biographical material. All of this makes 
for an interesting and highly readable book. The | 
only criticism which comes to mind on this point ` - 
is that the authors might have derived more 
instructional clues, techniques, or recommenda- 
tions from the case material —much of it seems 
somewhat unrelated to the authors' main theses 
Another aspect of the book which is attuned to 
recent trends in guidance literature 15 the a 
phasis on educational and emotional dp 
students as contrasted to a corresponding laci 
of emphasis on problems involving Ager 
choice. Books of this sort should do much to к 
the guidance profession in breaking ps 
traditional concept of the secondary-sc* 
counselor: one who is concerned with Wa 
square pegs into square holes and round p 
into round holes. It is difficult to see how, " 
educator could read Counseling the Indiv F 
Student and come away with the Шү, Wi 
fortunately too prevalent, that counse p 
relatively mechanical process which any po 
with a free hour can do, and that КҮДҮК 
take place аз a sort of mass-production р tive a 
non without benefit of those admini han ait 
rangements which result in providing О? 
ficient counseling time for all those $ 
who need individual assistance. 
Rothney and Roens have written 
text—one which grows out of many ad writ 
counseling, experimenting, teaching, 
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ing, and one which can be recommended as in- 
teresting and rewarding to persons who are be- 
yond the first step in guidance. 

J Consult Psychol 13:451 D ?49. F(red) 
McK (inney). * the authors....recognize that pro- 
fessional work in the field requires extensive 
training, understanding of behavior dynamics, 
and psychological techniques. They further re- 
alize the inadequacies of present day tools to 
produce this broader understanding. They do 
not ignore the fact that there are teachers who 
have the responsibility of counseling and there 
are students who look to them for assistance with 
their problems. The book attempts to meet this 
dilemma by a critical yet practical use of acces- 
sible information about counselees. * For many 
untrained persons the text will raise, as it should, 
more problems than it solves, and possibly show 
their need for further knowledge. For other bet- 
ter qualified persons it will furnish attitudes and 
the simpler elastic procedures for the collection, 
collation, and evaluation of facts and tentative 
ideas about their students. It is not a volume on 
psychological first aid or a collection of simple 
do's and don'ts. Some will feel that the authors 
could have gone even farther in emphasizing the 
scope of the task of understanding human indi- 
viduals particularly the deeper psychodynamic 
trends. 

Sch R 58:57-8 Ja ' 50. Thomas E. Christen- 
sen. * divided into six parts. The first five are 
concerned with describing various techniques 
for the study of individuals. It is only in the last 
chapter of the book that any discussion of coun- 
seling per se is found. While it is the authors' 
contention that studying the individual and 
counseling are inseparable, it is somewhat dif- 
ficult to reconcile this contention with their illus- 
tration of the “fact-finding” technique. In this 
illustration the counselor asks questions which 
include the following: “What courses are you 
taking and why?” “Did you know that when 
you were in the ninth grade you had as good a 
vocabulary record as the average Freshman in 
this university? Did anyone ever tell you that p 
“What type of problems are you coming up 
against these days?” To this last question the 
client replies, “Hmmmm—mm.” and the coun- 
selor then supplies the answer : “One is your 
difficulty of concentration” (рр. 140, 143, 147). 
Such questions do not meet the commonly ac- 
cepted standards for questions to be used in pro- 
fessional counseling. Rothney and Roens decry 
what they describe as *overemphasis on science 
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and statistics" (p. 14) in the study of the indi- 
vidual. On the other hand, counselors are urged 
to manufacture “locally constructed inventories" 
of interests. True, test results have their limita- 
tions in counseling, but their effective use is not 
to be gained by substituting the homemade va- 
riety for carefully standardized instruments, The 
reader of this book may find that the many foot- 
notes—over two hundred—impede his speed 
of reading. Many of these appear to have been 
inserted as afterthoughts, and the order in which 
they are presented is often confusing. * Numer- 
ous references to another book by Rothney and 
Roens which was unpublished at the time Coun- 
seling the Individual Student was written do 
not add to the book's usefulness. Careless editing 
further detracts from its effectiveness, * Publi- 
cation of the book may serve a useful purpose if it 
stimulates ultra-statistically-minded counselors 
to include in their kit of counseling tools some 
of the informal techniques described by Rothney 
and Roens. However, if Counseling the Indi- 
vidual Student strengthens the opinion of some 
untrained counselors that they do not need to 
acquire a sound statistical background in order 
to become professional counselors (and this 
seems a more likely consequence), it will con- 
tribute little to the development of counseling 
as a profession. 
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B Menninger Clinic 13:106 My '49. Walter 
Kass. With this Menninger Foundation mono- 
graph, Schafer rounds out a monumental trilogy 
begun with the publication in 1945 of the two- 
tomed treatise, Diagnostic Psychological Test- 
ing, by Rapaport, Gill, and Schafer. The earlier 
volumes present a diagnostic rationale, valida- 
tion and normative data for a battery of tests in 
a variety of psychiatric syndromes. Schafer now 
filters the distillate of this clinical research into 
a crystallized compendium of diagnostic sum- 
maries and exemplary case studies. This book, 
like its predecessors, is a pioneer text in the 
still nascent field of clinical psychology. It ad- 
vances diagnostic test practice from the realm 
of the intuitive and artistic by making explicit 
a methodology and body of demonstrated in- 
sights which can be identified as a common core 
of clinical experience. * This book demonstrates 
in concrete step-by-step fashion the process of 
diagnostic inference from test finding to clinical 
correlates. It is the most lucidly, compactly, suc- 
cinctly written and illustrated discussion on di- 
agnosis with a battery of tests available today. 
It will fill the hitherto acutely felt need in clini- 
cal psychology teaching and training programs 
for a text and manual of prototypical case ma- 
terial. 

Brit J Psychol, Gen Sect 40:232-3 Je '50. 
G. A. F(oulds). The present trend in clinical 
psychology in this country appears to be to- 
wards using larger and larger batteries of tests 
on more and more people. This trend may have 
at least two unfortunate consequences : first, the 
introduction of tests into the battery, which, 
though of proved reliability, are of unproven 
validity and of doubtful significance; secondly, 
the delegation of tests to untrained personnel. 
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The psychologists at The Menninger Founda. 
tion prefer to rely on a relatively small ba 
of tests which they consider to be of real signif. 
icance and to extract from them the maximum 
information. The second great merit of the book 
is that complete protocols are presented for typi- 
cal examples of most of the chief nosological 
groups and these are followed by the psycholo- 
gist’s full interpretation. Despite the fact that 
the tests in the Menninger battery (Wechsler. 
Bellevue, Learning Efficiency, Sorting Test, 
Word Association, Rorschach, T.A.T.) have 
achieved a certain venerability (not all, in the 
present reviewer’s opinion, justifiably), the in- 
terpretations occasionally suggest that there has 
been too much raking among the ashes. Can the 
fact that the Digit Span score does not drop 
really reflect the blandness of the psychopathic 
character disorder (p. 54) ? Can the fact that In- 
formation is two points below Vocabulary really 
suggest some weakness of application in this re- 
spect—perhaps superficiality of interest and 
poor integration of achievements (p. 110)? Or 
could it suggest that it is dangerous to attempt 
to draw very subtle conclusions from the a» 
swers to twenty-six questions designed to sam- 
ple the whole range of human knowledge? One 
suspects that the author often projects into his 
interpretations clinical impressions formed in- 
dependently of the tests. One also suspects that 
these clinical impressions are probably right. The 
cases presented are intended to be unequivocal 
examples of particular illnesses. To many din 
cal psychologists the Schizophrenics may appear 
relatively unflorid. The description of the i 
cissistic character disorders (p. 46) ns 
seems to provide an adequate differentiat? 
from hysteria (p. 32). Finally, this Se ia 
supplement to Diagnostic Psychological ү d. 
should serve to remind us that Clinical ae i 
ogy in the adult field is still grossly neglect? 
Britain. (29 

Int J Psycho-Analysis 30:209-10 P Fan 
Joseph Sandler. * Very fortunately Scha ofthe 
not restricted himself to the disgusto р 
well-worn diagnostic categories of the "ne 
tive psychiatrist. Rorschach workers T. "m 
come the inclusion of typical results wey p 
ber of character disorders—the ner "noil 
chopathic, schizophrenic, schizoid, zn dad in 
characters. * constitutes a definite hajer jg 41 
diagnostic testing technique. Mr. ж pis wor 
experienced clinical psychologist, = st 16909 
with its accent on interpretation of te 
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should prove invaluable to the practising psy- 
chologist. Psycho-analysts should find this book 
easy to understand if they are interested in the 
translation of test results into a final clinical 
picture ; the use of concepts which require a 
specialized knowledge of the theory underlying 
each of the tests is cut to a minimum, * 

J Abn & Social Psychol 44:294-6 Ap ’49. 
Robert R. Holt. To anyone engaged in teaching 
the diagnostic use of psychological tests, it has 
been obvious for some time that there was an 
urgent need for a book which would spell out in 
concrete detail and with actual examples just 
how one goes about this difficult art that is try- 
ing to become a science. There are books which 
tella good deal about the variety of tests that are 
available : the statistical details of how they were 
created and validated, how they are to be ad- 
ministered, and scored, and even (in the case 
of a few) how one test may be interpreted to 
yield understanding of personality and its dis- 
orders. There is even no dearth of published case 
materials in the field of abnormal psychology, 
due in no small part to the yeoman efforts of the 
editor of this Journal. Yet it seems surprising 
that the lack of a book to integrate such«ma- 
terials into a single usable presentation did not 
sooner call forth something with which Schafer's 
book might be compared. At any rate, he has 
met this need, and met it handsomely. In a char- 
acteristically direct and simple first chapter, the 
author presents seven propositions on which 
clinical psychological testing is based. The brev- 
ity and unpretentiousness of these formulations 
belie their very great importance. They are cer- 
tainly not all new; Mr. Schafer would probably 
be the first to deny them originality, but he has 
done a job of bringing together and cogently 
Presenting basic principles that excites admira- 
tion. Surely every diagnostic tester needs to 
familiarize himself with these ideas and make 
them a part of his working tools, if they are not 
already. The seventh proposition is worthy of 
being singled out for special comment, because 
of its great methodological importance, and be- 
Cause it is all too seldom made explicit. What 
the interpretation of tests yields is a series of 
Predictive statements about behavior or thought. 
АЕ diagnostic conclusions, though en- 
itely dependent on this interpretative step, 15 
not ‘reasoning from the raw test material....it is 
Sheer clinical reasoning and must stand or fall 
With the criteria it accepts and uses.” Interpreta- 
tions are not pinned to any particular nosology ; 
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diagnostic conclusions necessarily are. Tests can 
(and some would argue, should) be reported en- 
tirely in terms of statements about the person, 
without mention of a diagnosis though some- 
times they are used as if they give direct indica- 
tions of specific “diseases.” One implication of 
this distinction is so important for clinical re- 
search that it is a pity that it was relegated to a 
footnote : 

Unfortunately most research into the clinical useful- 
ness of tests has attempted to correlate test “signs” with 
diagnoses, and not with characteristics of thinking or 
behavior. These studies, when they have obtained posi- 
tive results, have then tried, by reasoning rather than 
by experiment, to establish which personality character- 
istics assumed to be widespread among the members of 
any diagnostic group were responsible for the signifi- 
cantly frequent occurrence of the established "sign" or 
“signs.” This is a fault of the statistical investigations 
in Diagnostic Psychological Testing [by Rapaport, 
Schafer, and Gill—a work to which the present book 
is a sequel]. It is a roundabout method and can never 
yield conclusive results. 

To which the reviewer can only add a hearty 
Amen. A further reason that this point has been 
presented in such detail is that some sophisti- 
cated clinicians may, after glancing at the book 
long enough only to see that it is organized in 
terms of diagnostic categories, reject it on just 
such a basis, Actually, Schafer makes a real 
effort to discuss the test indications for any 
nosological entity in terms of the characteristics 
of ego-organization about which the tests do 
give information and which are the clinical basis 
for diagnostic reasoning, Nevertheless, the book 
is made up almost entirely of a long chapter of 
diagnostic summaries and two sets of case his- 
tories, all classified in terms of such clinical 
familiars as obsessive-compulsive neurosis, acute 
paranoid schizophrenia, etc. If one takes seri- 
ously the implications of the first chapter, that 
our thinking about tests (and our teaching) 
should be oriented toward such matters as emo- 
tional lability, methods of dealing with passivity, 
mechanisms of defense, and the like, the book is 
not very well adapted to this kind of use. No one 
is more acutely aware of this shortcoming than 
the author; in his own words, “the reader will 
have to hop around among the discussions of 


the various groups in order to gather together 


all the references to any ma jor personality char- 
acteristic.” The principal reason he offers for 
following a traditional organization is the in- 
fluence of routine practice at the Menninger 
Clinic. As he points out, however, the index has 
been designed with an eye to helping out this 
“hopping around.” The influence of a particular 
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clinical setting is felt throughout the book. The 
tests used (Wechsler-Bellevue, Rorschach, 
TAT, Sorting, Learning Efficiency, Word As- 
sociation), the range of clinical material covered, 
the nosology, the independent rather than inte- 
grated presentation of findings from the tests 
and from clinical examination, as well as the 
emphasis on diagnosis as such, all are deter- 
mined by the fact that the author’s experience 
has been built up primarily in a particular clinic, 
with the inevitable limitations that are thereby 
implied. Probably the book will therefore be 
most useful to psychologists who are working 
under fairly similar circumstances. But when 
this much has been said, it has to be admitted 
that it is a richly variegated clinical background 
to which the author has been exposed, and that 
he has made the most of it. In the chapter on 
diagnostic summaries, there are nineteen sub- 
divisions covering a wide variety of nosological 
groups (seven of these subdivisions deal with 
schizophrenia and its form-variants). Typically, 
such a section begins with a concise statement of 
the essential features of each disorder; there is 
then a presentation, test by test, of the diagnostic 
features to be found in each instrument of the 
battery. The Wechsler-Bellevue and Rorschach 
tests get the most emphasis and space. These 
summaries alone are worth the price of the book 
to the working diagnostic tester; they are lucid, 
well organized, comprehensive, and eminently 
helpful. Without concrete illustrations of just 
how the principles laid down are applied to ac- 
tual cases, however, the summaries would lose 
much of their value. The ten long and the ten 
short case studies which make up the next two 
chapters are in many ways the book's heart and 
its greatest merit. Each presents complete scored 
protocols of the tests given (from all six down 
to only one), detailed discussions of each test 


and the conclusions to be drawn from it, an in- ' 


tegrated test report, and a summary of the 
independently obtained clinical findings and di- 
agnosis. Beck has given us materials of this kind 
for the Rorschach test; Stein presents one TAT 
analyzed in a similar way ; but, aside from scat- 
tered case reports in the periodical literature, 
these cases are otherwise almost unique. The 
discussions illuminate problems of administra- 
tion as well as of interpretation and differential 
diagnosis; they remove any aura of mystery or 
magic that may surround psychological testing 
for some persons ; they show an easy mastery of 
the many dimensions available for interpretation 
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in most of the tests. Only one omission is seri- 
ously felt. The final integrative Step of reasoni 
back and forth from one test to another, recon, 
ciling and synthesizing indications and sugges. 
tions gleaned from the different parts of the bat- 
tery—of this most subtle and important Step we 
learn relatively little. After the consideration of 
each test separately (and sometimes somewhat 
atomistically, as with the story-by-story analysis 
of the TATs), there is a jump to the finished 
test report, which presents us with an admirably 
integrated product that hardly can be said to 
emerge automatically from what precedes it, But 
this is straining a point somewhat; there is still 
more here about how to take advantage of the 
super-summative interactions of a battery of 
tests than the reviewer has been able to find any- 
where else. In a final chapter of “Concluding 
Remarks," the author himself makes the most 
serious criticisms that should be leveled against 
his book, and counterbalances them with an ex- 
cellent statement of its compensatory merits. He 
recognizes that in many ways it would have been 
better if the book could have been organized “in 
terms of the indications of various personality 
characteristics, and only secondarily in terms of 
diagnostic implications,” and that it would have 
been good to have presented an integration af 
data from the tests and from clinical examine 
tion for each case, but cites the constraining in 
fluence of the tradition in which he worked 454 
principal reason for the conservative type of or- 
ganization used. To mitigate these MOD 
he points to these features of his book: rx 
has attempted throughout to emphasize that " 
pect of tests which is least specific to any Neh 
strument: the patient's style of verbaliza " 
and its many significances. Secondly, in k i 
variety of cases presented with the rest КП 
battery of tests affords an unusual bro " 
to appreciate the great potentialities oft t v 
chological tests used simultaneously. M 
he has tried “to verbalize the step-by-step Tio 
ess of interpretation of tests results and t a » 
ration of differential diagnostic argumen m 
well has he succeeded in these three Ru 

in the reviewer's judgment, the metn 
book far outweigh its faults. A convo 
the present book with the preceding tw ү contri- 
work to which Schafer made importa! yea 
butions is inevitable. Those who Were, ri ; 
with the latter on the grounds of Ais will tà 
rigor and unconvincing statistical d 7? 

be much better impressed with Scha 
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book. It is quite innocent of any attempt to back 
dp the statements about the significance of test 
patterns with proof. The reader is simply given 
an opportunity to sharea rich and well-organized 
store of clinical experience presented frankly as 
such. The student, teacher, or practitioner of 
diagnostic testing will find it for this reason and 
because of its simple, easily comprehensible style, 
an invaluable handbook for almost daily refer- 
ence. The psychiatrist who has some curiosity 
about the way clinical psychologists use their 
tests can hardly find a better book for his pur- 
pose. The scientifically oriented psychologist will 
be disappointed if he looks for research results, 
though he will be rewarded if he cares to look 
for some sound ideas about clinical research. 
Finally, the reader who has any appreciation for 
fine book-making will be disappointed as soon 
as he penetrates below the binding. With this 
book the International Universities Press has 
continued its curious policy of printing excellent 
matter on indifferent paper, with poor typogra- 
phy and generally inferior format, at a rather 
high price. When Knopf can put out a handsome 
book like Kluckhohn and Murray's recent Per- 
sonality in Nature, Society and Culture, which 
isa delight to the eye, is nearly twice as long, and 
costs only two-thirds as much as this one, one 
wonders why his publisher could not have done 
better by Mr. Schafer. 

J Consult Psychol 13:147 Ар '49. Laurance F. 
Shaffer. * There can be no doubt of the consider- 
able value of the volume for advanced instruc- 
tion. It also serves as an abundant source of hy- 
potheses for further research studies. Schafer's 
viewpoint shows decided growth from that of 
the two volumes of 1945, and may mark a transi- 
tion from the often fruitless search for relation- 
ships between "scores" and "diagnoses" to the 
more promising realm of the correlations be- 
tween test responses and real-life trends of char- 
acter, 

J Mental Sci 96:543-4 Ар '50. M. B. Brody. 
* This book....begins promisingly enough by in- 
dicating firmly though discreetly where experi- 
ence has shown its progenitor to be at fault both 
in practical details and in method of approach. 
The main sections of the book, however, disap- 
pointed me. Hitherto, there has been more con- 
flict of opinion than agreement about the consti- 
tution—even the existence—of different mental 
test characteristics in different psychiatric con- 
ditions, and Schafer advances no evidence that 
his formulations are any more reliable than those 
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vip have previously appeared and have been 
puted, He relates “the various (psycho- 
metric) indications to specific identifying char- 
acteristics of each syndrome and not to the 
name of the syndrome,” but fails thereby to 
sharpen differential diagnosis, which, he admits, 
pi often difficult. Qualifying terms, such as 
sometimes,” “generally,” “usually,” “a fair 
number of cases,” etc., recur disconcertingly of- 
ten. The absence from the patient population of 
manics, catatonic and hebephrenic schizophren- 
ics, mental defectives, and patients with organic 
brain disease surely casts on the arguments more 
than the shadow of doubt which is reluctantly 
conceded in a footnote. The chief interest in the 
case studies lies in the elaborate and meticulous 
exposition of each step in the process of inter- 
preting the test responses. Schafer makes it all 
seem deceptively simple, like an illusionist ex- 
plaining his tricks, He hopes that such exposi- 
tions will help to reclaim mental test interpreta- 
tion from the “domain of private insights and 
art.” It is a pity that the Rorschach and The- 
matic Apperception Tests appear so prominently 
in his work, for many psychologists believe that 
it is precisely that type of test which dooms this 
hope to inevitable failure. 

J Proj Tech 14:75-6 Mr ’50. Theodore R. 
Sarbin. Because the case studies are terminated 
at the point of diagnosis the reviewer is left with 
a disappointed feeling—there is a lack of closure 
—he wants to know something of the relation- 
ship of the psychological test findings to thera- 
peutic recovery, or to management of the patient 
within a hospital or clinic, or what happens if 
the clinic, therapist, or hospital does not inter- 
fere in his adjustment-efforts. Because the au- 
thor does a good job of demonstrating how psy- 
chological tests may contribute to classification 
and descriptive diagnosis, one must still question 
the assumptions underlying the validity of the 
diagnostic process per se. To engage in such 
high-powered inductive procedures merely to 
classify a patient or to provide a literary descrip- 
tion (even in dynamic terms) seems exceed- 
ingly wasteful of human resources. Elsewhere 
the reviewer has argued that a diagnostic state- 
ment without a future referent is scientifically 
meaningless—it is a literary description. When а 
diagnostic statement or interpretation refers 
(expressly or by implication) to a prediction 
of behavior under alternative circumstances, then 
we have a meaningful statement with some prac- 
tical utility. No one will deny that certain diag- 
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nostic or descriptive statements imply a state- 
ment of prophesy; for example, deteriorated 
schizophrenia. Here we have had enough ex- 
perience with outcomes to be able to predict that 
the patient will require hospitalization and that 
psychotherapy of the usual sort will be fruitless. 
But what of descriptions of patients who are 
diagnosed with psychoneurotic or with charac- 
ter-malformation labels—can the psychological 
testing lead the clinician from a statement of 
status to a predictive statement? Can the de- 
scriptions help the therapist select from among 
the various therapeutic weapons now in his 
armamentarium such as group therapy, sup- 
portive therapy, superficial or deep insight 
therapy, use of authority, etc.? Or, can these 
descriptions help even the clinic director to as- 
sign cases to therapists who have demonstrated 
success with certain kinds of patients? If the 
answer is in the affirmative, Schafer does not ad- 
duce any data to support such a conclusion. In 
the introductory pages the promise is held out 
that clinical data can be specified with at least 
some crude approximative designations, such as 
“ ‘more than’ or ‘less than’ other people in gen- 
eral." The case studies are disappointing in this 
respect. Conclusions that have almost universal 
validity or are undifferentiated from application 
to non-clinical populations, are not hard to find. 
One might also mention as a deficiency of this 
book the complete absence of statistical pro- 
cedures. In summary, the reviewer would repeat 
that this book will be welcomed by diagnosti- 
cally-oriented clinical psychologists. The criti- 
cisms mentioned above are not so much directed 
to the author as to the outworn tradition which 
places such great weight on descriptive diag- 
nosis, both of the nosological and interpretative 
variety. This is an incubus that is carried over 
from earlier attempts to regard psychological 
maladjustments as disease entities. Schafer does 
take some steps away from this tradition—but 
not enough. While literary descriptions couched 
in so-called dynamic language may lull the un- 
critical therapist into a belief that he “under- 
stands" the case better, they too often íail to 
provide a background against which to make 
predictions of behavior. The book abounds in 
hypotheses which can give the eager graduate 
student suggestions for research. It will be read 
by diagnostically-oriented psychologists. How- 
ever, it will not find an important place on the 
shelves of mature psychologists who eschew lit- 
erary description in favor of correlations be- 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 10% 
cr e inam and predictions of future be- | 
E. J Proj Tech 14: 76-8 Mr ’50. Roy Schafer, 

Rejoinder to Dr. Sarbin.” Dr. Sarbin’s critical _ 
discussion centers around the point that test — 
reports of the type I have presented make little 
or no contribution to therapy; the reports are 
not oriented toward selecting the therapy of 
choice and indicating probable areas of thera 
peutic difficulty and progress. It is tempting in | 
reply to say no more than that research in this 
area has in general been woefully neglected, that 
we lack well-tested evidence on which to base — 
such predictions and recommendations, that it 
would therefore be premature to present as 
knowledge what is still essentially in the realm — 
of vague, uncertain conjecture, and that it would 
be a sad state of affairs in our science and pro- ~ 
fession when we would have to blush to say, 
“We don't know." * This is a point worth mak- 
ing, but to let it go at that would be to neglect 
an important problem implicit in Dr. Sarbins 
review. That is the problem of the psychological — 
basis on which therapeutic and prognostic evalu- _ 
ations are made. There seems to be no other | 
way open to us in making these evaluations than 
that of surveying what the patient is like, how — 
he tries to adjust, his dominant needs and 
defenses, his expectations, attitudes, values, 
achievements, and images of himself and others 
To describe a person with emphasis on the inttr 
play of forces within him and on his major 0 — 
characteristic adjustive trends is intrinsically 
predictive, for once our characters are formed _ 
we are not so malleable by situations that р. | 
sonal identity and continuity аге easily p | 
a very great extent we are all able, as the the 


J. consu i e 
here is that of correlating test results with the оне a fef 
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pist repeatedly and to his dismay discovers, to 

rpetuate our historically-rooted expectations 
about the world by what we ourselves do in 
action and fantasy. To say of a patient that he 
is characteristically an intellectualizer means 
among other things that the patient will try to 
define and limit the therapeutic relationship and 
to defend himself against the very feelings ther- 
apy is after by the use of the mechanism of de- 
fense, intellectualization. To say that he is mark- 
edly schizoid will imply among other things that 
a basic mistrust and fear of relationships will 
limit and define his behavior in the therapeutic 
situation. To say that one patient is tremendously 
labile emotionally and that the next is inhibited 
and cut off from his emotional experience sets 
two very different baselines for a therapist when 
he evaluates emotional display in the course of 
therapy. These examples could be extended at 
some length, touching on needs, fears, values, 
identifications, etc. Dynamic description thus has 
its future referent, though the specificity of this 
referent may be limited in many respects. In 
writing a test report we may if we please verbal- 
ize the implications for therapy of inferred per- 
sonality characteristics but these implications 
have not been read out of the test results; they 
are inferences we can make on the basis of the 
description of the patient we have mined out of 
his test responses and of what we know or think 
we know of the relationship between personality 
and its expression in psychotherapy. The pro- 
spective therapist might do this part of the job 
or the tester might; this is immaterial. If the 
prospective therapist needs these implications to 
be verbalized by the tester before he himself can 
make therapeutic sense of the personality de- 
scription, if he cannot be sensitized by the test 
report to the major patterns of response of his 
patient, if he needs thus to be spoonfed, one 
would wonder about his understanding of what 
he as a therapist does. Limits exist, no doubt, 
but this does not detract from the significance of 
the contribution a descriptive test report can 
make to a preliminary therapeutic orientation 
and sensitization to the patient. These considera- 
tions also help absorb the criticism that testing 
à patient before therapy is under way, or testing 
him at all, is préjudicial to therapy, inasmuch 
as it implies a lack of respect for the patient's 
desire to determine how much he wants to tell 
of himself and at what rate: whatever loss of 
therapeutic rapport is likely to result from pre- 
liminary testing—and this loss seems to vary 
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considerably from one patient to the next—is 
offset by the therapist’s gains of initial orienta- 
tion and sensitization to the patient. In these 
days of pressure on the clinical psychologist to 
produce more than he is ready to, it is difficult, 
except under the most favorable circumstances, 
to take a stand and insist that we are, like our 
psychiatric colleagues, still learning, The temp- 
tation is to begin to weigh ideas in terms of how 
well they help meet impossible or difficult de- 
mands. When “future referent” becomes equated 
with “What on earth shall I do with this pa- 
tient?” we see the pressures insidiously cloud- 
ing our understanding of where we are and what 
we are doing. We see this again when it is 
claimed that it is scientifically meaningless, a 
waste of human resources, and practicing out- 
moded descriptive diagnosis to explore the ways 
in which personality make-up is expressed in 
patterns of thinking and the ways in which we 
can make psychological inferences backwards 
from thinking to the internal forces condition- 
ing it. It is by now outmoded to think in terms 
of gross correlations between “signs” and diag- 
nostic labels, but, though it may be spoiled by 
the use of outmoded concepts, the search for a 
rationale of correlations is in itself never out- 
moded. 

Lancet 257:564 S 24 49. * Regrettably, the 
author has not been able to provide data on the 
abnormalities found in organic brain disorders 
or mental defect; and, among the insanities, 
mania and both catatonic and hebephrenic types 
of schizophrenia are omitted, which reduces con- 
siderably the value of the summaries in differ- 
ential diagnosis. The book is mainly notable for 
the full records of what Mr. Schafer's patients _ 
said (“their verbalised end-products,” as he puts 
it) in the various test situations, These will un- 
doubtedly be useful to the clinical psychologist 
for reference, and they are detailed enough for 
the reader to assess for himself the interpreta- 
tions which are founded on them. 

Psychoanalytic О 18:396-7 Л 749. Adolf G. 
Wolimann. * Schafer takes each clinical group 
and describes how each reacts to the Bellevue 
Scale, Babcock Test, Sorting Test, "Rorschach, 
Word Association and "Thematic Apperception 
Tests. * Schafer has prepared brief diagnostic 
summaries for each clinical entity. Added to this 
are case studies which contain full descriptions 
of scores obtained on each of the tests, an analy- 
sis of the test results, clearly stated conclusions 
and clinical summaries. This novel approach is 
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of great help to tlie clinical psychologist, but it 
is by no means a substitute for thorough train- 
ing, clinical knowledge and perhaps a necessary 


dash of intuition. Alcoholic addiction and nar- 
cissistic c r disorders have been added as 
new clini ps. The terms "coarctated pre- 
schizoph: and "overideational preschizo- 


phrenia” have been replaced by new designations 
such as schizoid character, incipient schizo- 
phrenia, and schizophrenic character, It is re- 
grettable that the whole field of organic involve- 
ments and perhaps some borderline cases in 
psychosomatic medicine have not been included, 
A minor criticism might be leveled against the or- 
ganization of the book, Typographical contrasts 
between diagnoses and the names of the tests 
used is insufficient and there are no topical head- 
lines on each page. This makes the use of the 
book a bit cumbersome because one must always 
consult the table of contents to find a diagnostic 
summary or the respective case study, This book 
is written primarily for the clinical psycholo- 
gist ; however, it will acquaint psychiatrists and 
psychoanalysts with the methodology and termi- 
nology of clinical psychology and should help to 
make psychological reports and summaries more 
meaningful to them. 

Psychosom Med 11:248 Jl-Ag '49. William 
Guy. The present volume is a sequel to Diag- 
nostic Psychological Testing. It endeavors to 
overcome many of the debatable aspects of the 
preceding work; and it succeeds admirably in 
doing во. * Typical records that one is likely to 
encounter in the diagnosis of psychological dis- 
orders or of personality at large are presented 
in all necessary detail, yet in a concise and clear- 
cut fashion. One of the many good qualities of 
the book is the brevity and the precision with 
which subtleties of differential diagnosis are 
demonstrated. The compactness of the diagnos- 
tic summaries may give to the novice the im- 
Pression that this is a “cook-book.” * It is most 
certainly not the intent of Mr, Schafer to pre- 
sent a series of “recipes” to the reader. * The 
most instructive and, in a way, the novel contri- 
bution of this book consists in the manner in 
which careful, detailed reasoning from the prem- 
ise of test results to the final appraisal of per- 
sonality is illustrated. А psychiatric study at 
the end of each case corroborates the psycho- 
logical findings. I have but one complaint to 
register—one which in no way reflects upon the 
quality of Mr. Schafer's work. The book is a 
terrible example of the book-maker’s art. All in 
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all, the text is admirable but the paper is of pone 
quality and the typography and format are ож. 
pletely lacking in taste. The book makes a оњ 
siderable advance toward the refinement м 
systematization of test techniques and deme. 
strates the best achievement of clinical 
ogy to date, It brings together, in one volum. 
a helpful discussion of diagnostic r anl 
its illustration through a wealth of са = 
mulated clinical material. Schafer's book will be 
come a handbook for the clinician, a guide be 
the student and an introduction to modern (бө 
cal psychology. 

Q R Biol 24:173 Je’ 49. Richard S. Lasers. 
* should be a welcome beginning for anyone is 
terested in determining how the experienced 
clinician goes about his diagnostic job. The Ф 
aim of the book is to describe accurately the op 
erations by which the interpretative conclusions 
made by the clinician from diagnostic tests are 
evolved, One is certain that the complex job o 
evaluating and validating what has been com 
ceded to be essentially an art cannot progres 
without such a clear statement of these ted» 
niques and principles. The difficulties underly: 
ing such a task as Schafer has undertaken are 
extreme, While the book does not represent à 
final achievement in this direction, it does ию 
ceed in being a first and most essential step # 
the right direction. * The theoretical orientats 
is psychoanalytic, The ticklish question of vil 
dation is not dealt with. The number of сазе ® 
the sample on which the interpretative 99 
clusions are based is severely limited, and there 
are no really original or overwhelming ideas pre 
sented. These are not serious criticisms, 
within the scope and frame of reference dr 
book, Schafer has written an exceedingly =ч 
and instructive clinical manual. * the "gen 4 
portant section of the volume... [deals] oe 
series of diagnostic summaries of most 
major clinical syndromes. * [The third) сне 
provides excellent and complete case st ading бе 
emplifying each of the syndromes, incl p^ 5 
complete test protocols, their analyses, e 
short clinical history. * However one might | 
agree with the principles and 
made by a practising clinical psycholog* 
their limitations, it is hardly possi ^4 
Schafer's presentation that what he does pn 
evaluation is not clear, This is his ив 
tribution, that he has provided а clear $97... 
of his methods. What he does is two mt 
ject to repetition and verification—the 
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Rorsch Res Exch & J Proj Tech 13222204 
m2 49. Theodore R. Sarbin, * will prove useful 
t diagnostically-oriented clinical psychologists 
asd advanced graduate students, Since many 
dinics and hospitals precede their therapy and 
management programs with diagnostic proce- 


dures, those who participate in these 
will fnd many helpful hints in formu diag- 
nostic pictures, Schafer is to be for 


going beyond the boundaries of traditional psy» 
chiatric classification in stating the need for dif- 
ferentiating diagnostic conclusions from test in- 
terpretations, * The criterion of validity seems 
to be behavior which солаи in the 

hiatric interview. In this book, one 
often impressed with the correlation between 
the psychological interpretations and the case 
summary (written on the basis of the intake pro» 
cedure) as the criterion variable, * Schafer's em- 
phasis on contexts is noteworthy, He attempts to 


and TAT analyses are unusually clear and com- 
plete and. indicate the author's real interest in 
trying to discover the meanings behind test re- 
sponses. He goes beyond test 
diagnostic labels to reveal the “dynamics” of 


vior. 
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improvement of teaching, and filling the gaps in 
the information and attitude of those graduates 
who have been "raised" on an "integrated" pro- 
gram greatly to the detriment of their definite, 
professional training. 


[ B338] 
ЖЅсотт, ВовевтА. The Group Oral Test in Select- 
ing Public Employees. Foreword by Kenneth O. 
Warner. PAR No. 503. Chicago, Ill.: Civil Service 
Assembly, October 1950. Pp. iiij 25. Paper, mimeo- 
graphed. $1.50. * (PA 25:7715) 

[ B339] 
XSrcrL, Davin. Intellectual Abilities in the Ado- 
lescent Period: Their Growth and Development. 
Foreword by Galen Jones. Federal Security Agency, 
Office of Education, Bulletin 1948, No. 6. Washington, 
D.C.: U.S. Government Printing Office, 1948. Pp. v, 
41. Paper. $0.25. * (PA 24: 129) 

J Consult Psychol 13:223 Je’49. Laurance F. 
Shaffer. A compact summary of the distribu- 
tion, growth, differentiation and stability of in- 
tellectual traits during the adolescent years, 
presented mainly by graphs and tables. Implica- 
tions for the curriculum and for guidance are 
suggested, and there is a description of a differ- 
ential aptitude test for use in high schools. 

Occupations 27:504-5 Ap 49. Thomas E. 
Christensen. * Part II....will provide ammuni- 
tion for school administrators who are concerned 
with the initiation of personnel procedures ap- 
propriate for all American youth. * well-selected 
bibliography of 68 titles * this comprehensive 
outline of the "fundamental principles of mental 
ability and mental traits of adolescents" may 
be purchased for the modest price of fifteen 
cents. 


[ B340] 
*Srcexr, Davm. State Testing and Evaluation Pro- 
grams. Federal Security Agency, Office of Education, 
Circular No. 320. Washington, D.C.: U.S. Office o£ 
Education, 1951. Pp. ii, 38. Paper, lithotyped. Gratis, * 

[ B341] 
*SzczL, Davin. Measurement and Cumulative Rec- 
ord Index. Miscellaneous 3405. Washington, D.C.: 

-S. Office of Education, February 1952. Pp. i, 25. Pa- 

per, lithotyped. Gratis, * 

[ B342] 
XSELOVER, MARGARET; TOWNSEND, AGATHA; Jacoss, 
ROBERT; AND TRAXLER, ARTHUR Е. Introduction to 
Testing and the Use of Test Results. Educational 
Records Bulletin, No. 55. New York: Educational 
Records Bureau, August 1950. Pp. vii, 107. $2.00. * 
(PA 25:2675) 


Clearing House 25:312 Ja’51. Jules Kolodny. 
This brief bulletin....admirably accomplishes 
what its authors intended. * it is a "practical 
down-to-earth handbook." The bulletin, written 
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from the guidance point of view, Suggests types 
of tests, describes how they should be adminis. 
tered, indicates how they may be Scored апі 
analyzed, and concludes with a brief discussion 
of the uses to which test results may be put, One 
interesting feature is the inclusion of a case study 
showing how a testing battery may be applied in 
a specific situation. * The chapter on statistics 
is exceptionally well done. * The bulletin merit 
the attention of both old-timers whose training 
antedates recent testing trends, and of new. 
comers to teaching who have not yet become 
aware of the guidance possibilities inherent ina 
good testing program. 

Sch Sci & Math 51:412 My ' 5r. Kenneth E, 
Anderson, * an excellent nontechnical treatment 
of testing and the use of test results, for use 
primarily in independent schools ; however, pub- 
lic schools with large numbers of college-bound 
students will find it most useful. The guidance 
point of view including application to specific 
cases receives considerable emphasis. A number 
of forms and illustrations help to clarify the 
explanation. The bulletin provides a practical 
handbook for schools contemplating the use of 
objective tests. Teachers of introductory classes 
in tests and measurements should find it an 
invaluable supplementary reference for prospec 
tive teachers. In addition, teachers in service will 
find the bulletin an excellent refresher on the 
simpler fundamentals of testing. 


[ B343] 4 
*Suaw, L. Scholarship Intelligence Ted 
England: E. J. Arnold & Son Ltd., [1951]. Pp. e ‘a 
per. 1s. 6d. * Accompanying Teacher's Book. Pp. 
Paper. od. * 


[ B344] rin 
Ж5нїмвЕвс,‚ BENJAMIN. The Development. tool 
Needs and Problems Inventory for Hig! Pie Uni- 
Youth. Foreword by H. H. Remmers. риш Gies in 
versity, Division of Educational Reference in Atti 
Higher Education [No.] 72; Further Stu jea 1950 
tudes, Series 17. Lafayette, Ind.: the Division, 
Pp. 75. Paper. $1.25. * (PA 25: 4836) 


[ B345] LAU 
XSnNEIDMAN, Epwin S. “Schizo hrenia Paycto- 
MAPS Test: А Study of Certain Wen Schizo- 
Social Aspects of Fantasy Production i on th 
phrenia as Revealed by Performance » Genetit 
Make a Picture Story (MAPS), ТЕЗ 23:0 
Psychol Monogr 38: 145-223 ЇЧ '48. * ( 

For reviews, see 114. 


[ B346] TION 
XSHNEIDMAN, EDWIN S.; WITH THE corta no Туе, 
оғ WALTHER JoEL AND KENNETH В. 3A Murray: 
matic Test Analysis. Foreword by Henry Pp. xi, 320 
New York: Grune & Stratton, Inc., 1951. 
$8.75. * 


ww 
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J Clin Psychol 8:102 Ja’52. * presents the re- 
gults of an interesting study in which fifteen 
clinical psychologists contributed blind interpre- 
tation of test results of a single subject on the 
Thematic Apperception Test and the Make A 
Picture Story Test. The collected interpretations 
are compared with additional test and behavioral 
data, and also subjected to preliminary statistical 
work-up. The results are significant in that they 
make possible a comparative study of the work 
of various “experts.” 

Psychiatric О 25:709-10 О '5т1. * a book of 
considerable interest to the clinical psychologist 
* an interesting and needed contribution to the 
field of psychological testing. Perhaps outstand- 
ing, is the insight given the reader into various 
methods of approaching test analysis by those 
who are well known in the field. Also of note, are 
the excellent interpretations provided in the ad- 
ditional testing. The author is careful in draw- 
ing conclusions and recognizes well the limita- 
tions of the study. Although the question of 
whether the psychologist should give personality 
evaluations with a maximum or minimum of ma- 
terial beyond test results is not gone into, the 
book definitely removes the “blind” analysis 
from the “parlor game" setting. A possible criti- 
cism might be directed against the lack of con- 
trols in the present work. This reviewer cannot 
understand, with the richness of clinical material 
available, why a subject could not have been 
chosen and testing done by the same examiner 
on the same day (or why this was not done with 
the subject used). The two months’ delay and 
the possible influence of the six insulin comas 
has limited a work which might well have been 
far richer in potentialities. 


2 [8347] 

SEILVEY, Hersert M. Change in Status of Iowa 
tate Teachers College Students as Revealed by 
cop sating Placement Tests. Iowa State Teachers 

Buy lege, Research Report No. 58. Cedar Falls, Iowa: 

pureau of Research, the College, July 20, 1949. Pp. 27. 
aper, lithotyped. $0.35. * 


X [ B348] 

SINGER, Eric. The Graphologist’s Alphabet. 
wauwings by Gertrude Elias. London: Gerald Duck- 
Pritt & Co. Ltd., 1950. Pp. 118. 8s. 6d. (New York: 

ilosophical Library, Inc., 1951. $375. *) 
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*Sincer, Err 
NGER, Erc. Graphology for Everyman. Lon- 
gon: Gerald Duckworth & Co. Ltd., 1949. Pp. vii, 104. 


xs [ B350] 
eem Martın. The Validity of a Multiple- 
oice Projective Test in Peychopathologital 
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Screening. American Ps: i iati 

ning. ychological Asso 
Shological Monographs: General and Applied. Vol d 
pia , Whole No. 314. Washington, D.C.: the Asso- 
Pro Yn Inc., 1950. Pp. v, 40. Paper. $1.00. * (PA 25: 


XSLoAN, WiLLIAM. TH Ea Adaptation of th 
ену Tests: А Measure of Motor Profi. 
а CRT 
print. * (РД 23:1547, title only) о gei 
xS i [ B352] 

мт i 
dean Children in Gaelic and English. With “A Sta 
tistical Analysis” by D. N, Lawley. Foreword by God- 
frey Thomson. Publications of the Scottish Council for 
Research in Education, [No.] 27. London: University 
of London Press, Ltd., 1948. Pp. 42. Paper. 1s. * 

Brit J Ed Psychol 19:148 Je ^49. W. R. J. * 
an admirable account of two experiments on bi- 
lingualism undertaken in the island of Lewis in 
1943 and 1945. It appears from the results that 
bilingual children respond more adequately when 
tested in their native tongue—Gaelic. Even in 
these circumstances, their mean Intelligence 
Quotient was depressed by approximately 15 
points below 100. The discussion of the under- 
lying causes of this and other related facts is 
one of the many important contributions to the 
study of bilingualism contained in the booklet. 
The 1945 experiment is particularly significant 
because it indicates that the Analysis of Variance 
technique promises to be a fruitful line of ap- 
proach to the problems of bilingualism. The 
booklet will be especially valuable to those read- 
ers who are interested in the questions of ex- 
perimental design and statistical treatment in 
“bilingual” research. 

Brit J Psychol, Gen Sect 39:115 D 48. P. E. 
V (ernon). Miss Smith....has carried out a pains- 
taking, though somewhat inconclusive, investi- 
gation into the intelligence test performance of 
Hebridean children, who speak Gaelic in their 
homes but are taught in English at school. She 
used Cattell’s non-verbal Scale I, Forms A and 
B, with instructions either in English or in 
Gaelic. After a preliminary experiment in which 
Form A was given in English and Form B in 
Gaelic and a large increase was found at the 
second test, each of the four possible pairs of 
tests was applied to about 100 children. The re- 
sults showed а large practice effect of about 
eleven Т.О. points on taking any test second, but 
no significant difference between tests taken in 
English or in Gaelic. Possibly this is what one 
would expect with a non-verbal test. The Report 
contains a most interesting account of the dif- 
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ficulties of testing in the Hebrides, of the atti- 
tudes to tests among children in this remote cul- 
ture, and of the unsuitability of many of the 
pictorial items of the tests in such a culture. 


[ B353] 
ж5мітн, G. Mitton. A Simplified Guide to Statis- 
tics for Psychology and Education, Revised Edi- 
tion. New York: Rinehart & Co., Inc., 1946. Pp. xiv, 
109. Paper. $1.25. * (PA 21: 670) For reviews, see 275 
in Statistical Methodology Reviews, 1941-1950; for re- 
views of the first edition see 38: B697 and 2:271. 

[ B354] 
*SwirH, Water D. Manual of Sociometry for 
Teachers. Foreword by Willard C. Olson. Ann Ar- 
bor, Mich.: Child Development Laboratories, Univer- 
sity of Michigan, 1951. Pp. i, 11. Paper, lithotyped. 
$0.20. * 

[ B355] 
XSoNNEMANN, Шімісн. Handwriting Analysis as а 
Psychodiagnostic Tool: A Study in General and 
Clinical Graphology. Foreword by Bela Mittelmann. 
New York: Grune & Stratton, Inc., 1950. Pp. x, 276. 
$5.00. * (PA 24: 3741) 

Am J Orthopsychiatry 21:215 Ja "51. Samuel 
J. Beck. * The volume here under review is es- 
pecially timely now when the scientific study of 
personality is being so vigorously prosecuted. * 
Especially satisfying is the inclusion of many 
samples of handwriting, with interpretation. The 
clinical student will be especially intrigued by 
the broad nosological range which is repre- 
sented: the major severe mental diseases (af- 
fective disorders, the schizophrenias), the neu- 
roses including their subgroups, and some 
organic diseases. It thus makes a useful intro- 
duction to the science of graphology in the field 
of personality. It provides a hypothesis and a 
method, both of which deserve testing out. The 
book may well be a start toward the goal stated 
by Dr. Bela Mittelmann in the Foreword, “Op- 
erating as a whole, graphology can take its place 
as a full equal in personality diagnosis with other 
projective methods, e.g., the Rorschach test, 
that have stood the test of time.” The bibliogra- 
phy, while not extensive, includes the well- 
known names in the science of graphology and 
also those of students in related areas bearing 
on projective tests as applied in the field of per- 
sonality. 

B Menninger Clinic 15:112 My '51. Н. M. 
Graumann. This book is more than the first seri- 
ous introduction to modern graphology in this 
country. By freeing his principles of investiga- 
tion from their vitalistic philosophy and by giv- 
ing new psychological rationales for most of the 
basic graphological interpretations, the author 
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shows—theoretically and in many practical ex- 
amples—how to apply them to the existing 
standard groups of classification of the mentally 
ill. Not a reference book easily to be used as an 
encyclopedia of pathognomic signs in handwrit- 
ing, but certainly useful for graphological train- 
ing, not only on account of its richness of phe- 
nomenological descriptions and experience in 
abnormal psychology but also because of its the- 
oretical clearness and soberness. 

J Abn & Social Psychol 46:445-7 Jl’ 51. John 
E. Bell. * based upon a sound appreciation of 
the critical attitude towards graphology cur- 
rent in American psychology, and of the exceed- 
ing difficulty that the clinical psychologist who 
uses a configurational approach to the analysis 
of personality productions must face in convinc- 
ing a more quantitatively-minded psychologist of 
the validity of such a method. The author does 
not attempt to rationalize away these problems, 
but attempts to face courageously and straight- 
forwardly the limitations of his method, while in 
a logical and dispassionate way advancing the 
theory and principles upon which his systematic 
analyses are based. No one who reads this book 
will dismiss handwriting analysis as a super- 
ficial and simple procedure. That it demands art- 
istry and must be both complex and time-con- 
suming is readily evident, but that it is also a 
communicable method is clearly demonstrated by 
the author's thorough and detailed description of 
the various aspects of handwriting and of the 
personality factors operating in their production. 
The definition of the factors in handwriting 5 
so well supported by illustrations of actu 
scripts that application of the analysis to xi 
samples could be accomplished with considerab Ё 
confidence by the psychologist who 1s willing to 
master the text and to accept the underlying 
theories upon which it is based. In hi tei 
the text is outstanding as a communication ora 
clinical technique. The author's style, hovers 
is not a simple one; the sense of many [iri 
is only forthcoming after rereading. * Per e 
it is understandable that the theoretical gu 
view of a psychologist concerned with motor F 
pression should be dominated by concern WI, 
intra-organic phenomena. But one mi 
Sonnemann's approach a sensitivity to ^ro 
ational factors that may affect handwr! je 
While he does make a passing reference Ré T 
fact that samples produced spontaneously a 
greater value than those made by s sul 
the immediate purpose of graphologic4 
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sis, he does not show either in his theory or in 
his interpretations the importance of taking into 
account the sensitivity of movement to the cir- 
cumstances under which it is made. This is the 
more surprising since his analyses are consist- 
ently based on the recognition of the specimen 
of writing as a Gestalt where single movements 
gain meaning from their relationship with the 
unity of the writing as a whole. * While the case 
materials are well developed from the point of 
view of demonstrating the interpretative method 
used by Dr. Sonnemann, the lack of supporting 
independent clinical material leaves the reader 
with no other check on the validity of the inter- 
pretations than the authority of the writer. Those 
who are familiar with the European. sources on 
handwriting, especially the writing of Klages, 
will find much that is familiar in this treatment 
of graphology. This presentation is no mere 
translation, however, and it expands and ad- 
vances both the theoretical and practical aspects 
of graphology in such a way that we can easily 
overlook the few errors in names in the text and 
bibliography. In addition, it makes available for 
English readers a technique that, as Bela Mittel- 
mann has said in his Foreword “....can take its 
place as a full equal in personality diagnosis with 
other projective methods, eg. the Rorschach 
test...” 

| J Ат Med Assn 144:5050 7 '5o. * The book 
is interesting and well written. Some of the ma- 
terial and many of the interpretations are, how- 
ever, of undoubtedly controversial nature. In the 
several examples presented, wherein interpreta- 
tion is provided on the basis of a single sample of 
handwriting, interpretations appear far too de- 
tailed and all too inclusive. Much further re- 
search is needed to clarity theory and practice in 
this field. There seems little doubt, however, that 
handwriting analysis, properly conducted, can 
hold a place along with other more formalized 
projective technics as an adjunct in personality 
diagnosis. 

J Consult Psychol 14:236-7 Je ' 50. Laurance 
F. Shaffer. Why has graphology so long re 
mained a rejected stepchild of psychology in 
America, while receiving more recognition in 
Europe? A thoughtful study of Sonnemann's 
book perhaps gives an answer. Graphology, at 
least the “systematic” graphology of Klages that 
18 expounded here, is content to describe the 
character of a person in terms of the character 
of his handwriting. The test and the referent are 
9ne and the same, whereas the empirical culture 
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of American psycholo 

referent. DI s vd rane ы, iori 

а Chapter "Twelve Lines ot 
, Ing the application of his analy- 


sis, are many phrases such as these: “it seems to 


Riesa ue rather than neglect the letter 
movements is iT nha of the 
volved in ‘crippling’ j It never gets in- 
үй ng Over-scrupulous detail...” 
psychology wants to know somethi 
more. Does Joan “caress and cultivate” her chil 
dren and her friends? Does she “never get in- 
dap bs over-scrupulous detail" in her home 
pint t m Such questions remain unan- 
mistic letter- no solution to regress to ato- 
D totu M in order to be objective 
ems ve, for, as Sonnemann rightly says, 
approach has been unrewarding. However. 
TEN global approaches being developed for the 
validation of projective techniques may well 
prove applicable to graphology also. Meanwhile, 
kae Psychology may withhold judgment, 
Pe EME ja graphology as quackery, nor 
dence becomes eio until the further evi- 
, 7 Grim Law & Criminology 41:251-2 JI-A 
50. ТК Thornton. This пан бы 
Vue ik Г rezder's serious attention, because it 

tit us scientific a manner as possible, with a 
su sin itherto exploited largely by entertain- 
ов isin charlatans, * Much of the material pre- 
м Mieres rather technical character, with 
yes | at the book will perhaps have its 
i p among those already knowing some- 
t Lei h ut the more scientific principles of 
graphology. But with the help of Sonnemann's 
lucid exposition, as well as with that of the nu- 
merous illustrations given, nearly any intelligent 
person who reads the book in a sober and un- 
prejudiced frame of mind is certain to gain a 
clearer conception of the value which. handwrit- 
ing analysis may proye itself to have in the study 
and assessment of personality dynamics. More- 
over, the psychologist or psychopathologist 
ought to find especially interesting the two chap- 
ters entitled Graphic Expression of Psychopa- 
thology and Abnormal Samples. 

Psychiatrie Q 24:616 Jl °$0. This reviewer 
must agree with Dr, Sonnemann that graphol- 
ogy—the study of handwriting—is an important 
psychodiagnostic method that is insufficiently 
used in America at present. When the psychia- 
trist realizes that handwriting, according to the 
experts, is as characteristic of the individual as 


П 
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the nose on one’s face, and that after the original 
school training period is over, the handwriting 
becomes an expression of the individual, as char- 
acteristic as his speech and actions, he will give 
the analysis of handwriting its proper niche in 
projective methods. * 

Psychosom Med 13:65-6 Ja—F ’51. Elizabeth 
C. Anderson. This book is the first one in the 
English language to give a systematic descrip- 
tion of graphology as a tool in the diagnosis of 
mental diseases. * Sonnemann concludes his 
book with the blind analysis of ten abnormal 
handwritings to demonstrate the accuracy and 
the value of the graphological method as com- 
pared with the results of clinical case studies. 
We regret that the scope of his book does not 
permit the author to explain how he has arrived 
at his very elaborate graphological diagnoses. 
Sonnemann has based his book essentially on 
the concept of Klages but has modified and 
broadened those concepts to some degree. New 
observations and ideas are presented, some of 
which will have to be tested and proven by fur- 
ther studies before they can be accepted as facts. 
Whatever the final result may be there can be 
no doubt that this is an exceptionally stimulat- 
ing book that will promote further research in 
the graphological field. The complicated style of 
the author makes hard reading at times, but 
one’s efforts will be rewarded by the interesting 
material presented. Also his criticism on pres- 
ent-day methods of objectifying and validating 
graphology appears very appropriate. The book 
can be recommended to psychologists who have 
already some graphological experience. 
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*SPEARMAN, C., AND Jones, Lt. Wynn. Human Abil- 
ity: A Continuation of “The Abilities of Man.” 
London: Macmillan & Co. Ltd., 1950. Pp. vii, 198. 16s. 
* (New York: Macmillan Co. $2.50.) (PA 24:5721) 

Brit J Med Psychol 24:79 pt 1°51. B. Seme- 
onoff. * It is a difficult volume to assess without 
constant and detailed reference to the earlier 
work. In style it is extremely allusive and con- 
densed, almost as if the senior author were re- 
plying in person to questions and criticisms 
raised following an oral exposition of his theory. 
Consequently, it would appear that some other 
mode of presentation might have been prefera- 
ble—as an Appendix to The Abilities of Man, 
or incorporated in a revised edition of that work, 
or, if as a separate book, at any rate with a fuller 
and Ceaser restatement of its fundamental teach- 
ing. 
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Brit J Psychol, Gen Sect 41:87 S ^50. James 
Maxwell. * this latest volume....cannot be said 
to add much to the principles already set forth 
in the earlier volume * It is the central part of 
the book which is most interesting. Here is ex. 
amined the evidence for the "group" factors, 
verbal, mechanical, number and the like, The test 
applied is a strict logical one, the method of 
agreement and difference, and the authors find 
the evidence not wholly adequate to establish 
clearly such factors both logically and psycho- 
logically. Those who perform various arithmet- 
ical operations on a correlation matrix, and too 
readily label the resulting factors as space, rea- 
soning and such like, could read these chapters 
with profit. Prof. Spearman did not live to see 
the book completed; the final preparations for 
publication were made by his friend and collab- 
orator, Dr. Wynn Jones. A certain unevenness 
in presentation has resulted, the style being in 
places of an almost cryptic conciseness. But an 
assessment of later progress by two of the pio- 
neers in the field of factorial analysis canot fail 
to be of interest; and the application to these 
investigations into human ability of the basic 
principles of logic and psychology by a mind so 
clear and critical as Prof. Spearman's cannot 
fail to be of value. А 
Brit J Psychol, Stat Sect 3:190-1 N 50. 
E. К. C. B. This slender volume....has been long 
awaited * The chapters suffer a little from the 
attempt to compress so much material into. $0 
small a space ; and at tiinc:, in the effort to avoid 
excessive technicality, the writing drops into 
picturesque metaphor and loose colloquialisms, 
which often obscure the meaning instead of € 
fying it. The manuscript was unfortunately le 
in an unfinished state; and, partly no doubt a5 
a result, the text is somewhat inadequately an- 
notated, and the references are not always je 
plete or exact. * Those who are familiar ms 
recent literature will be somewhat disappom Ч 
with the outcome of the discussion, for the ҮШ, 
expressed are rather confused, and show 7h 
advance beyond the position adopted va 
Abilities of Man some twenty years ago. ii 
it will be remembered, Professor Spem" 
while still defending his celebrated theor b 
factors, at the same time tried to leave ph he 
hole for “exceptional cases,” should t Ant 
finally confirmed. In the present volume, rains 
assured, “the psychological picture still va ai 
essentially the same." Nevertheless, 1 the? 
mary to chapter II (headed “Theory 0 
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Factors") the writers tell us that "the statistical 
results of this method....consist of (1) a general 
factor; (2) an unlimited number of narrow spe- 
cific factors ; and (3) very few broad group fac- 
tors." This formulation is almost identical, word 
for word, with the statement of the "three-factor 
theory," put forward by Burt over thirty years 
ago, which Spearman at the time so strongly 
criticized. But how, the puzzled reader is 
tempted to enquire, can a two-factor method 
yield three kinds of factor? And, even if "very 
few" group factors are to be accepted, this never- 
theless means that the two-factor theory, which 
postulates general and specific factors only, is 
no longer adequate. Elsewhere, it is true, the 
authors try to set up a distinction between the 
kind of supplementary factors they are prepared 
to admit and the "group factors" or "special 
abilities" of earlier writers. Group factors, they 
say, must always depend on the tests put into 
the battery, and consequently cannot represent 
innate capacities. Their own factors they prefer 
to describe as “broad specific factors” which, we 
are told, “must be sharply distinguished from 
what have been called group factors,” although, 
a little inconsistently, they are termed “group 
factors" in chapter II. A new feature in their 
discussion is the development of a “Method of 
Excess Correlation,” which is put forward as 
an alternative to existing group-factor methods. 
To illustrate the procedure, the writers print an 
artificial table of correlations arranged in such 
a fashion that all the highest correlations appear 
in the diagonal submatrices. This, of course, is 
the way in which an empirical correlation table 
is usually arranged before Burt's group-factor 
method is applied ; and, as he has himself pointed 
out, the underlying principle goes back to Karl 
Pearson, who distinguished, in tables or corre- 
lations between physical measurements, two dif- 
ferent levels of correlation, namely, the high 
positive correlations between “like” traits and 
the comparatively low positive correlations be- 
tween “unlike” traits respectively. The general 
scheme had already been set out and criticized 
by Spearman and Hart in their well-known pa- 
per (Brit. J. Psych., У, 1913, p. 57), where they 
claimed to show that it is not encountered in 
correlations between mental measurements. On 
this point, therefore, Spearman seems plainly to 
have changed his view. As regards the new 
method here proposed, an obvious defect arises 
from the fact that the formula necessarily as- 
sumes that all the cross-correlations, in which 
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only the general factor is Operative, are equal in 
» 


numerical value. Such ORS 
fulfilled, even арна assumption is rarely 


à pr ely, by the fi һ 
are found in empirical tables ] : ФУ Ж 
eich tas bios bles, Here and elsewhere 


e played by Pearson in de- 
oe the product-moment formulae, the prin- 
ciples о n tiple correlation, and the method of 
mu axes, nor yet his many constructive 

КЫП S, аге ever explicitly mentioned. In an 
early chapter on “Some Criticisms,” and later in 


the text, objections raj ; 
followers are окуса RUE я боя 
Бана ера refutations are 
анн Thi imself is never referred 
о by name. This ban (as older psychologists in 
this country are well aware) dates froma heated 
controversy between the two which arose as far 
back as 1906. But by now it could surely b 
lifted. These anon ipm 
à ymous references, to say the 

least, are highly misleading to the younger stu- 
dent. To the statistical reader the most interest- 
ing section will doubtless be that which reviews 
and criticizes the methods of factor analysis 
developed by more recent workers. Spearman 
naturally takes first the “bifactor method" put 
forward by Holzinger, since that claims to be 
based on the principles underlying his own two- 
factor theory : the method he accepts as valid in 
certain conditions where the Two Factor analy- 
sis ceases to be applicable at all.” He turns next 
to “simple and weighted summation.” Here he 
makes quite clear his main objections to Burt’s 
procedures. Chronologically even earlier,” he 
writes, came another outstanding attempt at 
extending the scope of factorizing ; this was con- 
tributed by Cyril Burt: whereas the older 
method dealt with every tetrad separately, the 
newer way took the sum of each whole column of 
correlations. All such attempts, however, he 
considers to be unsuccessful. His chief objection 
is to "the step whereby the vacant diagonal cells 
are filled out (by stopgaps) so as to make the 
whole table into a solid rectangular block." 
Thirdly, he briefly examines Thurstone’s cen- 
troid | method, and quotes his "fundamental 
equation." "This," he says, “is Burt's formula" ; 
hence much the same criticisms are urged. And 
finally he glances at the method of “principal 
components,” in which “the axes of the (corre- 
lation) ellipsoids correspond to the factors.” 
This is attributed to Kelley and Hotelling, and 
no reference is made to the earlier outline of the 
procedure developed by Karl Pearson. It is sum- 
marily dismissed on the ground that, like the 
centroid method, it produces negative as well as 
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positive saturations in the supplementary fac- 
tors. Burt’s bipolar factors Spearman appears to 
accept, at any rate in the temperamental field 
(p. 60). Up to now, he says, unlike the group 
and general factors, “they have attracted com- 
paratively scant attention: but recently they 
have come more and more into their proper es- 
tate as research is turning to the other side of 
human nature—feeling and volition.” In the 
cognitive field he rejects them, arguing that in- 
tellectual abilities cannot have a negative influ- 
ence on actual performance. This criticism is 
plainly due to the fact that for Spearman a factor 
is always a causal factor, never a principle of 
classification. Consequently, he fails to under- 
stand that, in the view of the writers he is criti- 
cizing, when (say) a nonverbal test receives a 
negative saturation, what is negatived is the 
verbality, not its influence on actual perform- 
ance. With Holzinger, he tells us, “the concept 
of ‘bipolar factor’ has been greatly extended" ; 
and he thinks that “їп this sphere of factor we 
must look to him for the fullest development." 
What he calls “indirect methods," such as “ro- 
tation" and the use of "oblique axes," are re- 
jected because they too readily permit a subjec- 
tive choice of whatever factors are required by 
the investigator's personal theories. His own 
method, he says, has been throughout that of 
partial correlation, with an external criterion 
rather than an internal.’Taking external criteria 
(such as teachers' assessments for g or the Binet 
tests of intelligence) the general factor is par- 
tialled out by Yule's formula, and the residuum 
examined for supplementary factors. This, how- 
ever, is scarcely factor analysis as now under- 
stood, since that proceeds primarily by seeking 
internal criteria derived from the table of inter- 
correlations themselves. To those who knew 
Professor Spearman, it will be clear that he had 
lost none of his statistical ingenuity and little 
of his fighting spirit. To stress too strongly the 
shortcomings of the book would perhaps be un- 
fair. As a pupil and devoted follower of Profes- 
sor Spearman, Dr. Wynn Jones has evidently 
had a difficult task in preparing an incomplete 
manuscript for the press; and the labour which 
he has expended on the work shows how loyally 
he has carried it out. Everyone will feel deeply 
indebted to him for the way in which he has en- 
deavoured to make Professor Spearman's final 
views available to the psychological reader. 

J Consult Psychol 14:332 Ag "50. Laurance 
F. Shaffer. * Spearman's final formulation of 
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his theory of abilities....reveals a narrowing of 
the gap between his original two-factor theory 
and the multi-factor theories pópular in America, 
especially in the discussions of the "broad" spe- 
cific factors, including the verbal and the me- 
chanical. The nonintellective factors such as 
perseveration, fluency and oscillation receive 
brief mention, reminding us again of challenging 
areas of human behavior that have yet to be 
explored adequately. 

Nature 166:707 О 28°50. A. D. Harris. *Hu- 
man Ability" was designed by the late Prof. 
C. E. Spearman as a continuation of the “Abili- 
ties of Man," published in 1927. Of the earlier 
book, an anonymous reviewer (Nature, 120, 
181; 1929) regretted that “....it is distinctly 
written for the layman, and he is expected to 
take many things on faith. He has to trust Prof. 
Spearman's mathematics and still more Prof. 
Spearman's arithmetic." Prof. Karl Pearson— 
for anonymity was no disguise, and the review is 
included in Morant's bibliography of his writ- 
ings—was not prepared to trust either, and ex- 
pressed his lack of confidence at some length 
elsewhere. It is to these controversies of the 
twenties that readers of “Human Ability" will 
find their attention often returning. It is not 
that later work is unacknowledged—the litera- 
ture of the past twenty-five years has, in fact, 
been very fully ingested, if not always completely 
digested—but that this work is presented as a 
series of purely technical extensions of method 
and that important differences of opinion which 
have emerged about the status of mental factors 
are scarcely touched on. Briefly, Spearman s$ 
thesis was that the uniformly positive correla- 
tions observed between different mental tests 
could be—and, for economy of hypothesis, 
should be—explained by two kinds of factor, 4 
general factor G common to all the tests, anc 2 
series of specific factors S, each related to a sin- 
gle test. Later, other factors were admitted, 
which were intermediate in range, being repre 
sented in several but not all of the tests. “Human 
Ability” is partly a recapitulation and partly an 
attempt to bring the story up to date, describing 
the development of factor analysis at the чс 
of Spearman and others, and replying to n 
various criticisms which have been raised. In | 
latter part of its purpose, the book is not re 
successful. It is disappointing to find the in T 
pretation of factors considered only in а ur 
sense, and the biological reference of pee 7) 
tistical entities dismissed as “metaphysic 
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Similarly, the objections considered are largely 
those raised by Pearson and Thomson to the 
original two-factor theory—not the more radical 
criticism which in recent years has attacked the 
whole conception of factor analysis. On the tech- 
nical side, the book will scarcely stand compari- 
son with the accounts given by Thomson or by 
Holzinger and Harman, and it has less general 
interest than Burt’s “Factors of the Mind” or 
Spearman’s own earlier works. For the student 
some introduction to the subject from other 
sources is indispensable before reading “Human 
Ability.” The defects of the book are to some 
extent accounted for by the fact that it was un- 
finished at the time of Prof. Spearman’s death ; 
and Prof. Ll. Wynn Jones, who as part-author 
prepared the book for publication, has treated an 
incomplete text with almost excessive reverence. 
Many parts of the book seem to lack any consec- 
utive theme ; and it is difficult to believe that the 
brief paragraphs on the use of matrix algebra— 
to take one example—were meant to be pub- 
lished as they stand. If the book is in places little 
more than a skeleton, the bones are at any rate 
elaborately clothed. Prof. Spearman's crusading 
zeal is fully evident—in his comments, for ex- 
ample, on the extensive use now made of mental 
tests: Voices are becoming more and more in- 
sistent,” he writes, “....that the sway of the tests 
must be extended over the whole national man- 
power, woman-power, and child-power,” a sit- 
uation which “would appear to be promising, 
but not free from danger.” Many psychologists 
are more impressed by the danger than the prom- 
ise; but whether we agree with Prof. Spearman 
or not, his views have been extremely influential, 
and no one interested in human abilities can af- 
ford to neglect his writings. Pearson credited 
him with “the merit of directing attention to the 
subject,” and it is a subject in which interest 
has grown continuously. Pearson’s fear that 
Spearman’s advocacy would “do more harm 
than good if it leads the non-mathematical psy- 
chologist to believe that the author has proved 
his hypothesis” has fortunately not been con- 
firmed. Non-mathematical psychologists who 
have interested themselves in the theory have 
been as active in criticism as in acceptance ; and 
latterly, non-psychological mathematicians have 
made their contribution to the debate. Both have 
found in Spearman’s work the stimulus of a 
Coherent theory, but neither has been disposed to 
take much on trust. 

Occupational Psychol 24:187 Jl ’50. P. E. 
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Vernon. It is of some historical interest to see 
how far Spearman’s views had changed between 
1927, when The Abilities of Man appeared, and 
his death in 1945. This posthumous volume, in 
the writing of which he was assisted by his 
friend and disciple, Dr. Wynn Jones, is largely 
a summary of the earlier work. It does take some 
cognizance of centroid, principal axes and bi- 
factor techniques (though not of Burt’s group 
factor methods). And it includes the main re- 
sults of a number of investigations subsequent to 
1927, chiefly by his own followers. Thus scarcely 
any of the work published in Psychometrika is 
mentioned. Considerably greater emphasis than 
formerly is laid on broad group factors. But it 
seems a little ingenuous to state that from 1906 
onwards such factors “were placed in the fore- 
front of enquiry,” and to accord prime place to 
the verbal group factor, when in the 1927 volume 
Spearman clearly denied the existence of any 
such group factor. He also refuses to recognize 
that multiple factor analysis yields results en- 
tirely congruent with general + group factor 
analysis, when allowance is made for the greater 
homogeneity of most of the populations studied 
by Thurstone and his followers. There is no 
mention of second-order factors which provide 
an obvious rapprochement with g-theory, and 
Simple Structure is described as “incredibly 
complicated and arbitrary.” The book does con- 
tain some wise passages on the identification of 
the content of factors, but otherwise its interest 
lies almost solely in the light that it throws on 
Spearman’s rigidity. One could hardly recom- 
mend it conscientiously to students who wish to 
gain a general acquaintance with the techniques 
and applications of factor analysis. It is written 
in his characteristically ornate style and, being 
more compressed, it is much more difficult to 
follow than The Abilities of Man. 
Occupational Psychol 25:67 Ja’ 51. Ll. Wynn 
Jones. [А reply to the above review by P. Е: 
Уегпоп.] * аз Ѕреагтап 1з по longer with из, 
it is fair and indeed imperative that I should deal 
with P. E. Vernon's review. He remarks: “Соп- 
siderably greater emphasis than formerly is laid 
on broad group factors." Naturally, for the 
Spearman system was responsible for their dis- 
covery. Incidentally, we did not accord prime 
place to any group factor (cf. pp. 109, 191) and 
emphasized throughout the need for more exact 
interpretation. He says also: “Spearman clearly 
denied the existence of any such group factors. 
I sometimes despair of scotching this perversity 
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and can only ask the reviewer to read p. 342 of 
Thomson’s book on Factorial Analysis. He will 
there find that “The Spearman system has 
sought and found ‘verbal’ and ‘practical’ factors 
to add to g.” I have selected Thomson as he is 
one of the very few who entered the lists when 
Spearman’s discoveries were making history. I 
can assure the reviewer it was a feat demanding 
rare skill and courage. Complaint is also made 
that there is no mention of second-order factors. 
Why should there be? It is well if Thurstone’s 
brilliant researches provide a rapprochement 
with g theory, but the aim of Human Ability was 
not “to give students a general acquaintance 
with the techniques and applications of factor 
analysis.” That has been admirably done by 
Burt, Holzinger and Harman, Thomson, Thur- 
stone and others. As explicitly stated on the 
cover, and in the preface, Human Ability is a 
continuation of The Abilities of Man. It is just 
too bad if Vernon finds it difficult to follow. I 
prefer to accept the views of well-known psychol- 
ogists who have written to me. As I wrote sev- 
eral chapters myself I was both flattered and 
amused when I heard that the book was written 
in Spearman’s “characteristically ornate” style. 
There are many other faults, but I will conclude 
with an anecdote. Twenty-one years ago at Yale 
a young psychologist rashly criticized Seashore’s 
scheme of research in music. Seashore merely 
asked him: “If you had a free hand, how would 
you set about the task?" A grand opportunity. 
But there ensued only silence. Will it be silence 
again? For I am going to repeat the Seashore 
technique. Is it true that Spearman framed the 
first scientific definition of intelligence ever 
given, as I made bold to assert in a recent broad- 
cast? Would the reviewer be able to carry on his 
work without reference to Spearman or to his 
discoveries? 

Occupational Psychol 25:67-8 Ја '51. P. E. 
Vernon. [A rejoinder to the above reply by LI. 
Wynn Jones.] I regretted having to write an 
unfavourable review of Human Ability, both be- 
cause I have always had the greatest respect for 
Dr. Wynn Jones and his work, and because I 
entirely agree with him that Spearman's contri- 
bution to the theory and practice of mental test- 
ing was fundamental. * But I persist in my judg- 
ment that Human Ability is a condensation of 
The Abilities of Man much more than it is a 
continuation which would show how the discov- 
eries of 1904 to 1927 require modification in the 
light of later researches by Thurstone, Burt and 
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others. By “prime place,” I mean that v is the 
first group factor to be described by the authors, 
and the only one to receive a chapter to itself, 
The “perversity” which Dr. Wynn Jones finds 
difficulty in scotching arises because Spearman 
wrote in 1927: “The so widely assumed special 
ability for verbal-abstract operations has here at 
any rate no appreciable magnitude.” But by 1932 
he had altered his views and added: “Ordinary 
tests of the so-called ‘general intelligence’ do in- 
volve verbal factors in large degree.” May I be 
permitted to continue Dr. Wynn Jones’s anec- 
dote? The young psychologist was indeed some- 
what embarrassed at being asked a very large 
question for which he was unprepared. But he 
did give an answer, and then proceeded in the 
next few years to publish eight articles on the 
psychology of music. Finally I should like to 
congratulate Dr. Wynn Jones on his successful 
simulation of Spearman’s style of writing. 

О J Exp Psychol 2:198-9 N ’50. A. W. 
H(eim). This book is both interesting and dis- 
appointing. Its interest is largely historical since, 
as Wynn Jones tells us in his Preface, Spearman 
planned it to be a continuation of his “Abilities 
of Man”; moreover, the book refers extensively 
to writings whose appeal is to the historically- 
minded only. Its disappointing quality is most 
striking in the light of the authors’ Concluding 
Remarks. Here we read, with approval and hope, 
that “по one can say that a final interpretation of 
any broad factor has been obtained” and that 
“an ultimate explanation..is still anybody 5 
guess." But this air of caution and fluidity is not 
justified by the body of the book. “Human Abil- 
ity" consists largely of an inventory of the ied 
of many psychologists whose views were dit- 
ferent from Spearman's; but no recogniti 
accorded the underlying presuppositions (siis 
they share with Spearman), and no space 5 iu 
voted to those whose views were so fun е7 
mentally other that they share virtually en 
mon ground with the authors of "Human à ШШ 
ity." For instance, Binet (apart from be 
mentions in the first chapter, entitled pE 
Scientific Notions”) is dismissed in the bo ie 
the book with one comment: “It is well ae 
that the various speculations of Binet as el 
nature of intelligence were hardly co 
Few writers on psycho-pathology would be pt 
tent to limit their evaluation of Freud to el н 
reflection. The problems treated concern i i 
most part such questions as the ie pend- 
ceptable factors, the extent of their inter 4 à 
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ence, the eventual compatibility of the various 

statistical methods which have yielded incom- 

patible results ; and emphasis is persistently laid 
on the exactitude of the computations, the im- 
pressive numbers of subjects participating and 
the variety of tests which have been used. * The 
writers make convincing criticisms of Thurstone, 

notably (in Chapter X) on the question of test- 
ing "the power of abstraction," but they appear 
unaware that such criticisms could be applied 
with equal cogency to some of their own conclu- 
sions—in particular those which depend on tak- 

ing the name of a test at its face value. Perhaps 
the most startling jump from a name to a "fact" 

is to be found in the chapter on Other Broad 
Qualitative Factors: “....the noegenetic theory 
has taken a hand, by showing that the essential 
form of all mental creation or invention consists 
in educing correlates." The matter of naming 
tests brings us to one of the fundamental prob- 
lems of the factor analyst—or rather of his crit- 
ics, “What’s in the name?" they ask, “What is 
meant by ‘G’ and ‘p’ and ‘O’?” “Nothing,” would 
appear to be the answer if the critics look for 
instance at page 25: “Not for us....is the luxury 
of exploring the depths of ontology and episte- 
mology. We are under no obligation to answer 
such questions as: Whether ‘factors’ have any 
‘real existence’ ? do they admit of genuine ‘meas- 
urement’? does the notion of ‘ability’ involve at 
bottom any kind of cause, or power?” (It seems 
odd that the last question is not for the writers, 
in view of the title of the book.) This viewpoint, 
though sterile, would be unexceptionable if it 
were maintained consistently. Elsewhere, how- 
ever, very definite implications are made as to 
the existence, the measurability and the meaning 
of factors. On page 111, for example: “Failing 
such an assumption....they would present some- 
thing not susceptible of genuine measurement * 
In order to extricate the tests from all this em- 
barrassment....the sole effective resource is to 
pass from the ancient concept of faculties to the 
very modern one of ‘factors.’ * The first [step] 

is to determine a statistical coefficient of ‘load- 
Ing . The second step is to supply this determina- 
tion with an interpretation." In this excerpt may 
be found the source of many of the doubts and 
fears which give pause to the unconverted. These 
suspect that the few factors which are generally 

accepted by all mental testers are precisely those 
(under new names) postulated by the “ancient” 
faculty psychologists; that the “very modern” 

factor analysts have lost in psychological under- 
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standing what they think to have gained in math- 
ematical precision; and that the apparent 
modesty and objectivity of naming factors sym- 
bolically—relegating "interpretation" to a sepa- 
rate chapter—cloaks an inconsistency at least as 
heinous as that of Binet. 


[ B357] 
*SrEDMAN, Louise А. An Investigation of Knowl- 
edge of and Attitudes Toward Child Behavior. 
Purdue University, Division of Educational Reference, 
Studies in Higher Education [No.] 62. Lafayette, Ind. : 
the Division, March 1948. Pp. 66. Paper. $1.00. * 

[ B358] 
*Srern, Morris І. The Thematic Apperception 
Test: An Introductory Manual for Its Clinical 
Use With Adult Males. Foreword by James G. Mil- 
ler. Cambridge, Mass.: Addison-Wesley Press, Inc., 
1948. Pp. viii, 95. Lithotyped. $3.00. * (РА 22:4959) 
For reviews, see 140. 

[ B359] 
*Sremer, M. E. The Psychologist in Industry. 
Springfield, Ill.: Charles C Thomas, Publisher, 1949. 
Pp. vii, 107. $2.00. * (Oxford, England: Blackwell 
Scientific Publications, Ltd. оз. 6d.) (РА 24: 3909) 

Am J Orthopsychiatry 21:214—5 Ja’ 51. Fred- 

erick A. Zehrer. * The author attempts to cover 
too much in too sketchy a manner at too rapid a 
pace in too little space. The uninitiated (for 
whom it is intended primarily) may well formu- 
late a markedly confused concept of the indus- 
trial psychologist. Also the inclusion of such 
items as how to compute mean, median, mode, 
quartile, standard deviation, probable error, Chi- 
Square, and coefficient of correlation (рр. 
22-25) has as little place in this volume as have 
some of the quotations selected concerning Ror- 
schach (pp. 56-64) and "constitutional types" 
(pp. 45-46). The author's selection of a quoted 
definition of personality (p. 46) is indicative of 
the random use of quotations tending to con- 
found rather than clarify. She employs extensive 
quotations with little critical comment through- 
out the presentation. The ultimate of something 
or other is attained on page 95 where she suc- 
cinctly describes (in the lower half of the page) 
the psychoneuroses and then uses two pages 
(96-97) to quote Cameron on “tensional states.” " 
This is followed by Strecker's differentiation of 
psychoneuroses from psychoses and Lichten- 
stein and Small’s description of “psychopathic 
personality” (p. 98). Without preamble, the 
foregoing is followed by comments on “directive 
and nondirective counselling methods.” It is 
this reviewer’s opinion that the author was not 
discretely selective, did little to point up under 
what conditions various techniques described 
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might best be employed, and beclouded the issue 
to the degree that the stated goal is not reached. 
The author's last sentence in the Preface (p. v) 
is presented without comment: “Though most 
of the material may be familiar to psychologists 
and psychiatrists, the section on Testing Tech- 
niques in Part I may offer some new informa- 
tion." This book is recommended for reading if 
used as reference material in orientation sessions 
conducted by a qualified industrial psychologist. 

Am J Psychol 63:650 O ' 50. M. Bentley. * Al- 
though its materials are—as its editor, M. R. 
Harrower, suspects—familiar to psychologists, 
it should be useful in the company of its origin, 
and it may be to many other industrial corpora- 
tions as well. * 

B Menninger Clinic 15:32 Ja’51. B. Steinzor. 
* an itemized catalogue of the tools, concerns 
and trappings of some industrial psychologists. 
It is so exceedingly short on discussion and in- 
terpretation that it is difficult to see what useful 
purpose the book can serve. 

J Am Med Assn 144:280 S 16750. * a splendid 
presentation is made in the way of a thumbnail 
sketch of the interviewing technics and testing 
methods utilized in the employment procedures 
in American business and industry * The bro- 
chure is of value not only in that it summarizes 
in a brief manner the broad subject of testing 
which it covers, but, in addition, the author has 
included approximately 150 references repre- 
senting a good cross section of the literature deal- 
ing with the subject matter presented. * This 
brochure is recommended to any one interested 
in the psychologic aspects of employment pro- 
cedures, The reviewer would like to see more 
space allotted to the consideration of groups and 
the study of group morale and attitude in the 
management of employees. The implications of 
the broad field of human relations seem to have 
been given too little space. 

J Appl Psychol 35:371 О 'sr. C. E. Jurgen- 
sen. * Although parts of the book are excellent, 
the total effect is disappointing. It is doubtful 
that a brief book for employers should include 
definition and description of average deviation, 
standard deviation, standard error of the mean, 
probable error of the mean and chi-square ; and, 
if they should be included, certainly they should 
require more than the single page allotted. Also 
questionable, in a book of this type, is the listing 
of names and addresses of test publishers and 
listing of tests (including twenty “better known” 
personality questionnaires) with no information 


other than what they purport to measure and the 
publisher. Description of how to compute 10% 
seems out of place in a book for employers, Em- 
phasis on projective techniques, especially the 
Rorschach, does not describe the work of 

cal industrial psychologists. In general, other 
books will give employers a better description —— 
and understanding of the psychologist in in- 
dustry. | 

J Consult Psychol 14:237 Је '50. Laurance P, 
Shaffer. * For so small a volume, the summaries — 
of research are surprisingly full. The digest of 
the applications of the Rorschach to industrial 
situations is especially complete, although some- 
times uncritical, The newer responsibility of the 
psychologist as a personal counselor in industry 
is recognized adequately. 

Lancet 258:166 Ja 28 ’50, * well worth read- 
ing ; * contains useful accounts of the many psy- 
chometric procedures employed in estimating in- 
telligence and aptitude. He is best when deserib- 
ing the Rorschach and similar tests ; but whether 
the battery of tests to which an industrial worker 
may be subjected have real value in industr i 
not clearly stated. The importance of the 
metric approach to human relationships lies in 
the attempt to measure abilities, even if....this re 
sults in a purely technical approach. Parado: 
ically, the greatest danger to the future ind 
worker may be the numerous and complex t 
designed to ensure his correct placement: 
ciency becomes the measure of health. For the 
interested in psychometric procedures, the b 
will prove a useful summary; but Mr. Stein 
damages his own case by advocating counse 
(perilously close to therapy) by non- 
qualified psychologists. 
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*STEPHENSON, WILLIAM. Testing 
An Essay in Educational and 


London: mans, Green & Co. Ltd., 1 
14s. * yy York: Longmans, Green 
$275.) (PA 24:2095) 


B Int Bur Ed 23:128 q 3'49. А most cle 
invaluable account, addressed to teachers 
parents, of the main forms of psychological 18 
preceded by an acute discussion of the va? 
the examination system * In compelling W® 
he compares the new secondary st v 
England with the secondary system in ^^^ 
which is alike for all children and is free О 
taint of class distinction or inequality of op 
tunity. 2% 
Bri J Ed Psychol 20:68-9 F '50. 6 1 
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B(urt). * As a result of his experience both in 
the Army and in schools, Dr. Stephenson has 
been led very considerably to modify his earlier 
views. Formerly he appeared as the most en- 
thusiastic champion of Spearman's two-factor 
theory. But he now cites evidence, which, he 
says, seems to “lend support to Professor Thom- 
son's views rather than to Professor Spear- 
man's," He describes himself as “one of those 
who hold that, even though the child has been 
formed partly by heredity and partly by environ- 
ment, heredity is but a spark to the conflagration 
that constitutes environment * As for intelli- 
gence, this is an empty concept in itself, which 
we don't even measure in practice: it should be 
replaced by Galton's notion of natural ability, 
with zest and honest endeavour at its roots." 
Thus, when applying intelligence tests to chil- 
dren of elementary and preparatory schools, he 
himself has found that the former, at any rate 
during their earlier years, fall far below the 
latter. Nevertheless, though he does not cite his 
reasons, he holds that the elementary children 
аге really "no less innately intelligent," and that 
their poor performance is due chiefly to “in- 
. hibitions.” He has become a great admirer of the 
Rorschach test. It is, he declares, “а telling probe 
into the sufficiencies and insufficiencies of per- 
sonalities" ; it can disclose “the degree to which 
а child is well-adjusted for its age," and many 
other suggestive characteristics. But here again 
the evidence that he offers is far from convinc- 
ing. * The whole book is written in a lively and 
challenging style; and it would be unfair to criti- 
Size the sweeping or dogmatic character of some 
of its statements as though it was addressed to 
scientific specialists, One of its most interesting 
features consists in the plates and appendices of 
test-materials. These include several ingenious 
forms of the familiar matrix test, a coloured ink- 
blot fest, which certainly produces far more dis- 
tinctive and varied responses than a Rorschach 
blot, coloured “art-form” tests, and а good deal 
of other test-material, designed for children aged 
en or thereabouts, which should certainly 
Prove helpful to those experimenting in psycho- 
| methods of selection. 
‚ 4 Ed (London) 82: 460-2 Ag '5о. C. M. Flem- 
ing. * In refreshing fashion... [Stephenson] 
maintains that, although abilities and talents may 
rooted in innate capacities and aptitudes, their 
development depends upon environmental inter- 
Play and the satisfaction of psychological needs. 
ling matters; no native capacity runs un- 
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changed or merely matures thros 
and adolescence ; and educational 


K, in 
tests he includes as 
mens of períormance 
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which retains in its tripartite classification the 

taint of unnecessary class distinction, * 
Scottish Ed J 33:434 Je 23 '50. By setting 

forth in clear and simple language the 

tions of testing technique, the author makes a 


F 


worthwhile contribution to the subject of e: А 


testing. * A most interesting experiment in 
which the psychologist co-operates with teachers 
in determining personality traits is outlined. As 


? 


- 
- 


a review of current problems in psychol- 
ogy and a guide to future ex this book 
should be welcomed by teachers. ^ * 
&Srourre, ^ Tou : Louis; Suome 
i , Lovis; | 

MAN, Ze 5 Maram Pa Р, Stan, r^ 
LEY А.; AND сорин, ES \, Y, 
bi uber ЕЛЫ * (London: T. 
University Press, бу.) (PA 25:3037) 


Am Acad Pol & Social Sei 272:261-8 
Олы McClung Lee. In a brief review of 
a complex work, one must confine oneself to a 
few fundamental points briefly and almost dog- 
matically ‚ * The first eleven chapters 
therefore describe various aspects of scalogram 
and latent structure (not to be confused with F, 
Stuart Chapin's somewhat more specific and ten- 
able theory of "latent culture patterns; see 
American Journal of Sociology, 40| 194735]. 
61-68, or his Contemporary Social Institutions, 
New York: Harper & Bros, 1935) — 
the first eleven chapters with a constant sense 
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cial science should be so divorced (a) from so- 
cial data of observation, (b) from integration 
with substantial contributions of others to social 
science, and (c) even from internal integration 
within itself and within the four-volume series 
except on purely abstract statistical bases. There 
is а chasm between such abstract statistical vir- 
tuosity as one finds exhibited in this symposium 
—only casually and inadequately related to data 
of observation—and social statistics. Specula- 
tion is something other than science even when 
it takes the form of sophisticated hypothetical 
models. The basic job of science—whether phys- 
ical, biological, or social—is to describe in pre- 
cise, simple, and preferably mathematical terms 
sense data on reality. To do this, one most de- 
pendably proceeds from the specificities of sense 
data to its generalities, with constant verifica- 
tion and reverification of the latter in terms of 
the former. Stouffer and his associates appear 
to bein a stratosphere far above this fundamental 
characteristic of science. As near as the writers 
come to a recognition that men have relation- 
ships to social structure is the statement by La- 
zarsfeld, “Implied in some social theories is the 
idea that society is stratified into a number of 
groups with rather distinct cleavages between 
them." One need not dig for such as an implica- 
tion in the writings of C. H. Cooley, W. G. 
Sumner, or W. L. Warner. But the Research 
Branch's treatment of men as discrete psycho- 
logical units, without social matrices or life- 
histories, makes their ideas of stratification the 
products of attitude scales on relatively narrow 
areas rather than a matter of social structuring. 
Their treatments of item sampling, of “levels of 
generalized intensity,” and of test-retest relia- 
bility emphasize this unreal atomization of the 
populations studied. A careful student of society 
or of social psychology knows that there is no 
such thing as “а test for the single meaning of a 
series of questions.” 

Eug R 43:52-3 Ap’51. R. F. T. * In the first 
part of the book considerable thought is given 
to the role of questionnaire research in attitude 
and opinion measurement; and scale analysis, 
which provides a rank order for individuals be- 
ing questioned, is regarded as valuable. The con- 
ditions under which scales vary are fully dis- 
cussed, and obviously every attempt is made to 
subject these hypotheses and methods to the 
most rigorous tests. This section will be of ex- 
treme interest to all concerned in the production 
of questionnaires and will give them much food 
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for thought, though many—even those whose 
mathematics are adequate—may find that the 
arguments are set out without conciseness, and 
that consecutive reasoning is somewhat obscured 
by divergent ideas. The second part of the book, 
on prediction, is open to more criticism. Again 
there is—to English standards of paper economy 
—an almost incredible wealth of descriptive de- 
tail, hampered by cumbersome and ill-assorted 
terms. We are given the account of the tests used 
for screening psychoneurotics and for studying 
soldiers’ post-war plans. The value of both pro- 
cedures is now generally accepted; but the ac- 
count given here may paradoxically give some 
ammunition to old opponents; for the reader, 
a little bemused perhaps by the profusion, is re- 
warded just before he gets to page 700 by hear- 
ing that “considerable support is given to Dol- 
lard’s common-sense statement that a man will 
best predict what he will do in a future situation 
if he has been in about the same situation be- 
fore.” But the writers want to do more than sup- 
port common sense, and they point out that it 
isn’t enough to say that “people will do what 
they said if they don’t change their minds first 
—a phrase which is so refreshingly clear that one , 
starts with joy, only to be told that this "carries 
also the implication that once the individual has 
stated his intentions, he remains in this respect 
static unless some crucial experience leads him 
to change his intentions." The crux of the ques- 
tion, what constitutes cruciality, is in any case 
ignored. Finally, we do not know if any record 
was made of any validation of these predictions 
by observation of results ; what did the геш 
soldiers go to? It is now five years later—an 
some work may have taken place. я 

Int J Opin & Attitude Res 5:95-102 SP 57 
H. J. Eysenck. [See Eysenck's review for criti 
cal comments too lengthy to excerpt.] ЗЫ 

Int J Opin & Attitude Res 5:103-20 ido 
Louis Guttman. [See Guttman’s reply 0 2 
senck's review for critical comments too lengthy 
to excerpt. Я 

Int дет & Attitude Res 5:124-5 sh е 
Steuart Henderson Britt. * this volume ca" in 
considerable literacy and even sophistication Я 
mathematics and in experimental niin” 
highly technical nature would limit its rea ; ae 
a great deal. Yet it is perhaps the most A : 
cant of all four volumes. It gets down = 
fundamentals on which all the studies гер à 
in the other books in the series were base "arte 
the basic methodology is presented. * One 
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most significant aspects relates to the field of 
ychiatry. An amazing variation in psychiatric 


diagnoses is demonstrated throughout different 


rts of the United States. Unfortunately the 
findings may not be heeded, for this book is not 
easy reading. Most people, in fact, will know 
about it only secondhand. The one difficulty with 
the volume is the lack of summary statements or 
conclusions for each of the chapters. Such sum- 
maries would have enabled the reader to grasp 
the conceptual structure as a whole much more 
readily, And there is a real need for a chapter 
at the end, as an over-all evaluation of the en- 
tire project. This book, however, increases our 
knowledge of social science considerably because 
of its unique contributions to the field of meas- 
urement and prediction. 

J Consult Psychol 14:505 D '50. Laurance F. 
Shaffer. * To read these chapters requires math- 
ematical literacy, but specialists in measurement 
will find them important and nonexperts will be 
sufficiently enlightened by Stouffer's excellent 
introduction. The last five chapters record two 
experiments in prediction, both of great interest 
to clinical psychologists, relating to the screen- 
ing of psychoneurotics and the postwar plans of 
soldiers. Full data are given concerning the de- 
velopment of the NSA—the Neuropsychiatric 
Screening Adjunct—a short questionnaire in- 
troduced late in the war that gave promise of use- 
fulness at induction stations. The constancy of 
NSA scores from one station to another in the 
face of startlingly varying rates of psychiatric re- 
jection should be studied seriously by all who 
are concerned with subjective methods of clini- 
cal evaluation. 

J Social Psychol 33:297-312 My '51. He 
Eysenck, * contains the long awaited presenta- 
tion by Guttman of his method of scale analysis, 
as well as an account of the equally novel latent 
Structure analysis by Lazarsfeld. These writers 
fake issue with the common preoccupation 0 
psychologists with quantitative material and be- 
lieve that qualitative data, such as those usually 
obtained in questionnaire and attitude studies, 
should be analysed by techniques appropriate to 
these data, and not requiring transformation into 
tetrachoric correlations, or other statistics de- 
Pendent on unverifiable assumptions. Because of 
this rejection of correlational procedures, Gutt- 
man and Lazarsfeld also object to the use of fac- 
tor analysis in the field of qualitative data and 
Present their respective methods of analysis as 
“apable of doing in this field what factor analysis 


‚ but depends on the margina 


does in the quantitative field. Unfortunately, the 
writers neglect to define these concepts of quali- 
tative and quantitative data in a manner which 
would make it possible to construct a rigid math- 
ematical argument, but leave them vague and 
lacking in precision. In particular, they neglect 
to discuss the difference between data which 
are genuinely qualitative and data which are 
quantitative, but, for various external reasons 
connected with the measuring instrument, ap- 
pear to be qualitative—i.e., presumably, the 
greater part of the data with which social psy- 
chology deals. To deal first with Guttman's con- 
tribution, he puts psychologists in his debt by 
presenting an excellent description of his method 
of scale analysis, which will undoubtedly become 
the loc class for any discussion of his system. For 
those, however, who had hoped that he would 
answer the criticisms made of his technique, this 
contribution will come as a disappointment. We 
may perhaps briefly note what the main objec- 
tions to his scheme are. He attempts to dispense 
with the concept of a continuum and substitutes 
for it the concept of "scalability," i.e., the dis- 
covery of a certain patterning, or arrangement of 
responses, implying that from the respondent's 
total score it should be possible to deduce which 
items had been endorsed. As perfect reproduca- 
bility has never been achieved in psychological 
work, an arbitrary limit is set at the 9o per cent 
level, so that only questionnaires with reproduca- 
bility above this figure are considered as scales. 
Where reproducability (sic) is less, and where 
this lack of reproducability can be shown to be 
due to many small uncorrelated error variables, 
Guttman talks about “quasi scales,” which, like 
true scales, have the property that the multiple 
correlation of any outside variable on the area 
under discussion will be exactly equal to the 
simple correlation with a rank order on the area, 
The following criticisms are not anticipated or 
answered in Guttman's treatment. In the first 
place, the 90 per cent level is a completely arbi- 
trary one; there is no fundamental distinction 
between the 89 per cent and the 91 per cent of 
reproducability, say, such as would be implied 
in Guttman's use of the term scale, and quasi 
scale characterizing questionnaires having these 
two different reproducabilities. In the second 
cannot be argued that теч jo 

of this kind are often made in statistics 
ae this 5 per cent level of significance), be- 


rite sU : 
roducability is not an invariant concep’ 
pen " 1 totals for each item. 


place, it 
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This is partly recognized by Guttman because 
he demands that no item should have more error 
than non-error ; he does not recognize, however, 
that the meaning of a 90 per cent reproducability 
differs from a scale in which the marginal totals 
are high as compared with a scale in which they 
centre round the 50 per cent level. In other 
words, any comparison of scales having different 
marginal totals, in terms of the level of reproduc- 
ability, becomes meaningless. In the third place, 
Guttman's claim that if the item in a question- 
naire form a scale, then the whole universe from 
which these questions are drawn would form a 
scale, is inadmissible. We have no basis for as- 
suming that the particular set which is scalable 
is representative of the universe, unless the uni- 
verse is defined in a trivial fashion by specifying 
that each item in it must form part of the scale 
observed. Thus, in practice, no generalization 
about any universe becomes possible from the 
knowledge that a particular set of items is scal- 
able. Guttman's claim, therefore, is either tauto- 
logical or incorrect. In the fourth place, there is 
a scarcity of validation studies in this book. It 
is claimed, for instance, by Guttman that the 
second principal component generated by his 
model can be used as a measure of intensity, thus 
defining the true zero point for the scale. No in- 
dependent evidence is given for this claim, and 
as it can be shown that the zero point determined 
by this second principal component will not be 
invariant, it is difficult to accept this claim, even 
on the theoretical level. Another claim made by 
the authors of this volume, namely, that Gutt- 
man type scales have value in practice, cannot be 
verified from the data given as no comparison 
has ever been made, apparently, between validity 
achieved by Guttman type scales and validity 
achieved by other types of scales. The one pos- 
sible exception to this statement is a psycho- 
neurotic inventory derived by means of scaling 
techniques and shown to be valid in a later 
chapter of this volume; its discriminative ability 
appears to be neither better nor worse than that 
of the best instrument available, constructed 
without the use of scaling analysis. It is much 
more difficult to discuss Lazarsfeld’s contribu- 
tion than Guttman’s as it has hardly ever been 
used, as the mathematics are more complicated, 
and as practical methods for using it have not 
been worked out yet. In many ways, it avoids 
the criticisms made of Guttman’s technique; 
thus, for instance, Lazarsfeld shows that Gutt- 
man’s quasi scale can be derived analytically as 
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a special case of latent structure analysis, Gutt- 
man's perfect scale then being a limiting case of 
the quasi scale. In general, Lazarsfeld’s approach 
and that of the factor analytical school appear 
very similar, and if latent structure analysis can 
be worked out into a usable technique, it will be 
very interesting to compare its results with those 
achieved by the older methods. The last chapters 
of the book are taken up by two studies in predic- 
tion, one dealing with the screening of psycho- 
neurotics, the other being a study of post-war 
plans. The first of these two investigations will 
be of much interest to clinical psychologists as 
it contains information on the extreme incon- 
sistency of psychiatric diagnoses. In comparable 
selection centres, for instance, proportions of 
psychiatric rejects classified as neurotic varied 
from 3 to 9o per cent, while proportions classi- 
fied as psychopathic disorders varied from zero 
to 81 per cent. The total proportion of men re- 
jected on psychiatric grounds varied from one- 
half of one per cent to 51 per cent. The writers 
give evidence that these differences could hardly 
be due to sampling differences in the population 
studies and it seems that it can only be due to 
variations in psychiatric judgments from one 
centre to another. It may be added, parenthet- 
ically, that inconsistencies among individual psy- 
chiatrists would be expected to be much greater 
even than differences between classification cen- 
tres using several psychiatrists each, and thus 
averaging out at least some of the specific vari- 
ance. The last part of the book relates attempts 
to forecast what men would do after the war and 
is of much interest because of the practical im- 
portance of the conclusions and because of the 
obvious methodological difficulties. The task was 
tackled in a sensible fashion and prediction 
seems to have been more accurate than one pd 
have expected at first glance. It is not quite i i 
why these prediction studies were include d 
this book. They seem to go much more m sd 
with the prediction studies at the end of Vo "d 
2. For those of us who are interested in EN y 
odology and theoretical models with ШЕЕ 
integrate our experimental findings, this vo rie 
is probábly the most interesting of the i E 
and the criticisms made here, particular И : 
Guttman's contribution should not deter y 
reader from examining the questions rais ib. 
these authors for themselves, and to oe 
the field of practical application results т e 
by these methods with those obtained Be 
more usual techniques of item selection, 
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analysis, and so forth. On the whole, there can 
ре no doubt that the four volumes reviewed here 
mark a milestone in the development of social 
psychology, as much from the point of view of 
the importance of the practical contribution as 
from that of the technical skill and competence 
shown in the actual investigations. They take a 
worthy place beside the 19 volumes of the Army 
Air Forces Aviation Psychology Program Re- 
search Reports as an indication of the great 
contribution which psychology can make to our 
national affairs, and also of the way in which 
the practical application of scientific techniques 
can, themselves, lead to improvement in those 
techniques and to important theoretical ad- 
vances. 

Psychol B 48:512-9 N ’51. Daniel Katz. 
[Joint review of Volumes I-IV of Studies in So- 
cial Psychology in World War II; see original 
review for additional comments not excerpted. ] 
It is difficult for the most enthusiastic supporter 
of quantitative methods in the social studies to 
overvalue the contributions of these four vol- 
umes to social science in general and to social 
psychology in particular. The contributions are 
both profound and numerous. They are essen- 
tially of four kinds: (1) the raising of standards 
for basic and applied research both through the 
excellence of the research carried on and the full 
reporting of what was done; (2) substantive 
contributions to factual knowledge and theory ; 
(3) methodological and technical improvements 
including both the development of new tools 
and the ingenious application of old methods to 
new problems; and (4) advances in the knowl- 
edge of the strategy and tactics for conducting 
social research. * The effect of these volumes 
should be to strengthen the social sciences at 
one of their weakest points—their general lax- 
ness with respect to research standards. The 
work of the Research Branch forms a marked 
contrast to much of applied social research in its 
wholehearted interest in methodological im- 
provement and in the use of adequate scientific 
communication to achieve this purpose. * The 
quality of the scientific reporting is... high. * The 
reporting reflects the care and sophistication 
present in the conduct of the research. In spite 
of the many practical obstacles to good research 
in the military organization during the war, à 
Systematic attempt was made to build in checks 
and controls wherever possible. * The net effect 
of the interest in methodology and in adequate 
Scientific reporting should be to raise the level 
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of scientific standards and aspirations of social 
scientists both in applied and basic research. * 
The major methodological contributions are in 
the field of scaling. The pioneer work of L, L. 
Thurstone in developing the logic, the mathe- 
matics and the techniques for the measurement 
of qualitative data some twenty years ago was 
perhaps premature in relation to the develop- 
ment of the field. Since the work of Thurstone, 
social psychologists and social scientists have 
tended to deny, ignore or postpone a systematic 
consideration of these problems. The Research 
Branch, however, has boldly given this area of 
measurement the central attention it deserves. 
Two models are presented in some detail—the 
scalogram analysis of Guttman and the latent 
structure analysis of Lazarsfeld, In his introduc- 
tory chapter to the volume Stouffer takes the 
statesmanlike position that the intent is to ac- 
celerate the development of this methodological 
area and that revisions of the models presented 
—or, indeed, completely new models—are 
needed. * The scalogram analysis of Guttman 
has been criticized for its limited practical uses. 
Reproducibility of an order to give a scale or a 
quasi-scale is not easy to obtain save for fairly 
repetitious questions. A more fundamental criti- 
cism of the Guttman method is its empirical and 
mathematical character. Though useful as a 
technique one hesitates to endow it with as much 
theoretical importance as does its author. It does 
not represent a social-science or psychological 
model, in that there is no theoretical scheme for 
the interrelationship of variables. The dimen- 
sions of attitude are to be derived operationally 
without reference to psychological theory. Thus 
the attitudinal variable about which Guttman 
seeks unidimensionality is defined merely as a 
delimited area of behavior. What the “more or 
less" signifies on the Guttman scale in terms of 
psychological processes is conspicuously by- 
passed. Thus when the intensity function is sug- 
gested as a second dimension of attitudes we do 
not know to what extent this is a repetition of 


the first dimension. More empirical tests are 


needed to validate the independence of the con- 
ts, The summary 


tent and intensity components. © © °. 
dismissal of Thurstone’s contributions n Gutt- 
man’s presentation seems unwarranted. Thur- 
stone did more than develop scales on the basis 
of the evaluation of judges. He also addressed 
himself to the problem of the ordering of people 
ой the basis of their own reactions, as in his 
rank-order and paired-comparison procedures 
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and especially in his method of similar attri- 
butes. The Lazarsfeld latent structure analysis 
is another procedure for discovering the com- 
ponents of attitudes through mathematical 
means. Lazarsfeld does not dismiss the illogical 
opinions of people on a problem as error from 
the point of view of scaling but assumes a more 
complex pattern of underlying attributes than 
the unidimensional scale. To uncover these latent 
attributes he employs the logic of factor analysis 
but not its correlational techniques. In its pres- 
ent stage of development, however, latent struc- 
ture analysis is still a laborious method. The 
values of these scaling procedures is two-fold: 
(1) they contribute methods of determining uni- 

. dimensionality of the attribute being measured. 
"Thus they can be used in their simpler forms to 
improve measures where we seek interrelation- 
ships between variables defined by existing psy- 
chological knowledge and theory; (2) like fac- 
tor analysis they can suggest basic factors 
responsible for observed relationships and thus 
lead to hypotheses for experimentation. They are 
basically inductive, however, and seek to dis- 
cover variables or components through their 
differential mathematical effects. * 

Pub Opinion Q 14:710-28 w '5o-5r. M. A. 
Girshick and Daniel Lerner. “Model Construc- 
tion in the Social Sciences—An Expository Dis- 
cussion of Measurement and Prediction.” [See 
original review for critical comments too lengthy 
to excerpt. ] 

Social Forces 29:207-9 D ’50. Daniel О. 
Price. This long awaited fourth volume of Stud- 
ies in Social Psychology in World War II 
proves itself to have been well worth the waiting. 
The material presented here should be assimi- 
lated and utilized by research workers in many 
fields of social science besides attitude and opin- 
ion research. While some of the material has 
been published in other places, the sources are 
so widely scattered that this volume, which 
brings the material together in an organized 
form, would be a major contribution even if 
nothing new were presented, * Whether one 
agrees or disagrees with Guttman’s approach to 
the problem of defining attitude and opinion, his 
approach will be found stimulating. * The scale 
and the scalogram analysis technique, as devel- 
oped in this volume, should have wide applica- 
tion and utility in many branches of social re- 
search. * Lazarsfeld has developed the method of 
latent structure analysis (presented in Chapters 
IO and 11), which in its general form includes 


the Guttman scale structure as well as the Thur- 
stone multiple factor structure as special cases, 
In analogy the method does for qualitative data 
what multiple factor analysis does for quantita- 
tive data. * As the authors point out, the reading 
of these two chapters requires “mathematical lit- 
eracy,” and this reviewer feels that authors have 
the right to expect mathematical literacy in an 
increasing proportion of social scientists, An in- 
consistency in symbolism was slightly confus- 
ing—the symbol [ij] is used with two different 
meanings (see pages 372 and 419). Also, the 
subscripts of elements in a determinant are not 
arranged in the conventional row-column order 
but in numerical order (p. 389). Since the ele- 
ments are taken from a symmetrical matrix, 
however, no confusion results. It would also 
have been well to have some additional interpre- 
tation of these latent subclasses rather than just 
a demonstration that the system could be so 
structured. * the last four chapters....are im- 
portant because of their work on the problem of 
prediction and interesting because of their sub- 
stantive material. * The Appendix contains some 
material that might well have been expanded and 
made a part of the body of the book, particularly 
the section on Questionnaire Administration. P 
In summary, it is felt that this is one of the more 
important books in methodology, and this review 
is in no way intended to cover all of the major 
points. Many of the methods elaborated here 
should be integrated into the kit of tools of every 
worker in social research. 
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*SrRANG, RurH. Counseling Techni 
lege and Secondary School, Revise 
New York: Harper & Brothers, 1949. Pp. ХШ, 
$4.00. * (PA 23:6430) 

Col & Univ 25:146-7 О '49. John M. d 
naker. * can be used as a text-book in a а 
in counseling or it can serve as a manual m 
background use in actual counseling pur 
The independence of the chapters, each wit 
own bibliography, makes for easy as 
The chapter on projective technics (ш Nile 
references) will be of especial interest 


cs in Col- 
d Edition. 
i ont. 


; inical 

the treatment will seem too isa to ied 
ist, i in 

psychologist, it should prove helpfu E technics 


the counselor unfamiliar with thes 
Registrars as a group may have only hnics 
terest in or concern with projective poem 
therapeutic methods, case studies, and o sinit 
tion, but the chapters on personal IH of 
cumulative records, and the interview W! 


limited dn 


105 ] BOOKS & REVIEWS: STRANG 


B363 


direct concern to all who spend time in inter- 
viewing and in dealing with records and recom- 
mendations. The most obvious lack in the book 
is an adequate treatment of the use of tests in 
counseling and an emphasis on their importance. 
The author justifies this omission because the 
subject is treated adequately in other books. In- 
deed, Arthur E. Traxler, for example, in his 
book on Techniques of Guidance, devotes major 
space to appraisal of aptitudes, achievement, and 
personal qualities by tests and testing programs, 
and his book supplements the present volume in 
this regard. Miss Strang’s presentation is clear, 
although in the text-book style. The need for 
further research in almost every phase of the 


work of counseling is repeatedly pointed out and · 


will, it is to be hoped, both stimulate the coun- 
selor to examine critically what he is doing and 
create in him an attitude of wholesome doubt as 
to the value of much of what now passes as “ex- 
pert counseling.” 

Nerv Child 8:539 О "50. * the most important 
and sincere attempt yet presented to clear the 
basic concepts and techniques of counseling. The 
author is one of the few workers who has really 
brought thoughts into the wide regions of brain- 
less working spheres of social work writing and 
teaching. Nobody in the fields ranging from so- 
cial work to psychiatry can miss reading this 
book. 

Occupations 27:602-3 My ’49. Iona R. Logie. 
* offers a new wealth of professional enrich- 
ment. As a teacher of university courses in 
counseling techniques, the author has been aware 
of a need for more concrete illustrative material, 
and so has presented in almost every chapter of 
the new edition more comprehensive data for 
both student and experienced practitioner. At 
the same time, the author maintains an admirable 
professional balance in analyzing the counsel- 
ing art: that indispensable balance between the 
values of a counselor’s techniques and those in- 
herent in his or her personal qualifications. In 
a different order, as compared with the first 
edition, the present volume describes and evalu- 
ates with careful documentation the more fa- 
miliar counseling techniques: observation, the 
interview, rating scales and cumulative per- 
sonnel records, and the use of autobiography and 
case studies. Two extremely valuable new chap- 
ters present (1) the use of “Projective Tech- 
nics” (e.g, Rorschach, TAT, the incomplete 
Sentence technique and play techniques), and 
(2) the use of such more recent methods as 


environmental therapy, group therapy, and the 
little-known techniques of the socio-drama and 
sociometry. At various points in the book, the 
moot question of "non-directive" counseling is 
inevitably discussed, and with fairness to all— 
we can hardly say both—points of view. The 
burden of the evidence, however, points to a 
middle-ground position, in which the counselor 
must intelligently blend his methods of leading 
a client to decisions from within, This viewpoint 
is not a denial of the long-range trend the author 
discerns, namely, the shift from appraisal to 
understanding and from counselor to client. 


For reviews of the first edition, see 40: B1088 
and 38:B493. 
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xSrurr, Dewey B. Eprror. Personnel Research 
and Test Development in the Bureau of Naval 
Personnel, Prepared by the Staff, Test and Research 
Section, in cooperation with N.D.R.C. Project N-106 
and the College Entrance Examination Board. Prince- 
ton, N.J.: Princeton University Press, 1947. Pp. xxiv, 
513. $7.50. * (PA 22:877) 


Occupational Psychol 23:254 O '49. М. A. B. 
Wilson. Not the least merit of this book is its 
conscientious thoroughness and comprehensive- 
ness. A student of applied psychology who con- 
templated employment in a military service 
could learn much of what he needed to know to 
fit himself for his job by careful study of its 
thirty-two chapters. * Stuit’s team of writers 
is a large one, and, inevitably, the style is in 
places a little heavy. But there is no padding. 
‘And when recording is left for discussion, the 
tone becomes lively and frank as well as contin- 
uing modest and reasonable. * Part V is a mis- 
cellaneous but important section which takes up 
in turn four topics: criterion measures (a some- 
what abstract chapter) ; follow-up of some 2,000 
men aboard ship (of first-rate interest, dealing 
with the formidable problem of the criterion 
this time in a highly concrete manner) ; informa- 
tion surveys (with comments upon the circum- 
stances in which these are useful or not useful) ; 
and problems for further study (including some 
discussion of the highly pertinent questions, Are 
the findings of peacetime research relevant to 
a wartime situation, and vice versaf). Many of 
the findings reported by Stuit and his collabo- 
rators will be a familiar story to Service psychol- 
ogists in this country * Among the less familiar 
material, special mention must be made of the 
chapter by Milton Wexler, on measures of per- 
sonal adjustment. The reviewer found it most 


B363 


illuminating. Other features of special note are 
the chapter....on follow-up aboard ship; and the 
enviably long list of professional achievement 
tests and advancement examinations prepared. 
It is evident from the description given that the 
American naval psychologists were not only 
very energetic and competent in constructing 
standard achievement tests but also extremely 
successful in integrating them into the training 
process. In the result training came to be im- 
proved and accelerated, because trainers and 
trainees had, at each stage, a clear and—they felt 
— just estimate of progress made. 

Occupations 27:289-91 Ja '49. C. Gilbert 
Wrenn. * of major value for the research- 
minded in the fields of personnel selection, test 
construction, and test validation. It is a solid 
technical book and no sugar coating upon the 
part of the reviewer can make it popular read- 
ing for the average man. * The book as a whole 
is aimed at psychologists and research workers 
* Readers of Occupations other than those al- 
ready mentioned will, nevertheless, find partic- 
ular chapters of significant value. All who use 
personality measures of any form would profit 
from reading Chapter IX, written by Milton 
Wexler, on "Measures of Personal Adjust- 
ment." By so doing the administrator or coun- 
selor will learn much of the care used in the 
proper construction of so-called "personality 
tests," the care used in selecting validation cri- 
teria, and in separating validation groups of sub- 
jects from those used for other standardization 
purposes, as well as the limited but positive re- 
sults to be obtained when validation is carefully 
performed. The chapter reports a variety of 
types of measurement used but concludes that 
"the general impression is gained that most of 
these inventories, regardless of item format or 
even the specific symptomatology investigated, 
produce about the same sharpness of prediction. 
It is almost as if a kind of plateau in prediction 
had been reached through the symptom ques- 
tionnaire approach. Sharply different methods 
will probably have to be experimented with be- 
fore new significant advances are made beyond 
the present accomplishments." The careful, 
accurate statements in this objective analy- 
sis will contrast sharply with the florid and 
"promising" language all too often found in the 
manuals of published personality tests. The 
reader of this chapter will hereafter be a little 
more cautious in his expectations of paper-and- 
pencil personality measures. Of almost equal 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 1098 


value to school and industrial users of tests, or 
to those who participate in any form of person- 
nel selection or prediction, are Chapters VI and 
VII on basic tests for enlisted and officer per- 
sonnel. These are written by Guy L. Bond, Jo- 
seph Miller, and William A. Owens. These sixty 
pages will introduce even the casual reader to 
a short course in careful item selection and test 
validation. Those who never engage in test re- 
search but who use tests constantly will greatly 
profit from these chapters. Two other chapters 
are commended to the general reader, Those 
who find it difficult to understand why so much 
attention is given to the selection of criteria for 
the validation of tests will do well to read Chap- 
ter XIX by Harold P. Bechtoldt on "Problems 
in Establishing Criterion Measures." This pro- 
vides a careful analysis of the difficulties faced 
and, even if the reader has no general concern 
with the validation of tests, he will be better 
equipped to investigate whether the tests he is 
using are appropriately validated. And fre- 
quently they are not, even though using an in- 
adequately validated test is similar to giving a 
dose of medicine from a bottle merely labeled 
“Take 2 teaspoonfuls every four hours!" Chap- 
ter XX....is good reading for any concerned with 
vocational guidance and vocational prediction. 
The chapter provides an illustration of "predic- 
tion on the job." The analysis of what is success 
on the job is applicable to other job situations. 
This volume will be used for reference purposes 
in many university graduate schools and 
many professionals in the field of measurement 
and personnel selection. It has marked values 
for other readers provided such reading 15 di- 
rected at certain chapters—and is done slowly. 
The chapters pointed up in this review have no 
adequate counterpart in other literature 
should be considered marked additions to the 
literature of counseling and of personnel 

in general. Incidentally, the reviewer was na 
second reserve officer to report for duty ( 
tober, 1942) to the Standards and Curriculum 
Section of the Bureau from which the Test sos 
Research Section was an outgrowth. He 
there working on Officer Procurement an s 
lection problems when Guy Bond de 
construction research. The pressure and кө 
sion of those early days was considerable 
is gratifying to see how much careful r (em 
was carried out in spite of wartime frustra zi 
Given reasonable support and freedom from ^ 
itary irritation, scientists in physics 07 ™ p? 
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n" y can produce research that will last | 

| E. military need for it has disap iara 
Personnel Psychol 1:117-9 sp’48. E. Donald 
- Sisson. * Part II contains four chapters, outlin- 
ing the construction, standardization, and use of 
the various selection and classification tests, Of 
special interest here is the account of the Basic 
Test Battery administered to enlisted personnel 
at “boot camp.” * Perhaps the most significant 
chapters in the volume are those contained in 
Part V, under the heading “Follow-Up Studies 
of Training and Classification Techniques,” The 
discussion of criterion measures correctly indi- 
cates the crucial importance of this factor in all 
prediction studies. * As indicated in the editor's 
preface, the book presents “few distinctly new 
techniques.” The conditions of wartime pressure 
жеге not exactly suited to such developments, 
The volume does, however, summarize a wealth 
of experience with the application of established 
methods and techniques to the typical problems 
of personnel management. As such the volume 
should be instructive and interesting to workers 
and students in this area. 

Psychol В 45:144—50 Mr '48. John C. Flana- 
gan. * the editor and contributors...are to be 
tommended for presenting this material * The 
book is divided into five parts, consisting of four 
or five chapters each. * The first section is histor- 
ical and descriptive. The second section is related 
to the selection and classification tests. The third 
Feports projects on the prediction of success in 
training. The fourth describes the achievement 
measures developed, and the final section dis- 
cusses follow-up studies and surveys. * [In Part 
2, the] reliabilities and intercorrelations of the 
Various sub-tests are reported, It is unfortunate 
that although speed is a substantial factor in 
many of these tests, many of the reliability co- 

‘ents reported are computed from odd-even 
‘ores or by the Kuder-Richardson formulas. 
1е only discussion of this fact which came to 
this reviewer's attention was the following state- 
ment on page ло: “The alternate-form reliabili- 
ties of all tests are lower than the odd-even esti- 
nates. This result is to be expected in view of the 
fact that a test typically correlates with itself 


higher than with a » Tt is doubtful 
nother test." It is dou 
Whether there is any value in computing Kuder- 


Tdson reliability coefficients on such а 


‘tas the clerical aptitude test. Certainly, where 


this ig done, the unsophisticated reader should be 
wared that the coefficient has very little prac- 
Value, * The chapter on “Measures of Per- 


sonal Adjustment" by Milton Wexler is of 
special interest. Although the problems of meas- 
uring personal adjustment were certainly not 
solved during the war, the extensive experience 
in this field laid in the groundwork for a very 
productive period of research in the next few 
years, On pp. 140 and 141, Mr. Wexler pre- 
Sents an excellent statement of some of the pit- 
falls inherent in research in personal adjust- 
ment. These include the fallacy of comparing the 
Scores of individuals tested in hospital wards 
with those of recruits, the contamination of the 
criterion by knowledge of the test results, and 
the application of a scoring key to the same pop- 
ulation on which it was developed, * The ac- 


entitled, Constructing and Using Achievement 
Tests. This bulletin was very competently done 
and appears to have been effective, * Because of 
the central i of criterion and follow-up 
studies in all personnel research..,.[Part s] will 


ishing Criterion Measures,” by Har- 
old Bechtoldt. It includes suggestions for devel- 
oping an acceptable criterion. These are dis- 
cussed under the headings of definition of the 


answer all the problems 
^ suitable criterion, this discussion is 
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Dewey Stuit, it discusses prediction of perform- 
ance of enlisted personnel aboard ship. It is 
almost exclusively concerned with a follow-up 
study of about 2,000 enlisted men in six dif- 
ferent types of duty assignments on 27 ships. 
After trying and rejecting ratings scales and an 
adaptation of the nominating technique, a method 
based on the order of merit rankings made by 
supervising petty officers was adopted. The ac- 
count of the many problems. involved in the 
analysis and use of these criterion data provides 
an excellent example of the many difficulties, 
complexities, and unsolved problems in this gen- 
eral area. * [A factor] which is not discussed in 
this chapter, is the extent to which the judgments 
of supervising petty officers were really inde- 
pendent judgments. In many situations such 
ratings are found to have been based on reputa- 
tion. The individual's reputation is frequently 
derived from a pooling in informal conversation 
of relevant and irrelevant bits of information 
concerning him. The authors are certainly to be 
commended for carrying out this project and 
reporting it in considerable detail. Certainly 
much more work of this type is essential if prog- 
ress is to be made in personnel research. The 
remaining chapters include one on "Information 
Surveys as Evaluative Devices" by Robert Pace, 
and a chapter on “Problems for Further Study" 
by Norman Frederiksen, Eugene Caistater, and 
Dewey Stuit. The chapter on surveys includes 
an excellent discussion of factors influencing the 
value and usefulness of opinion research. The 
last chapter discusses unsolved problems in con- 
nection with techniques of selection, classifica- 
tion, and training, and various methodological 
issues. The problems defined and the considered 
opinions of these individuals with respect to 
promising research are well worth reading. In 
conclusion, it can be stated that much evidence 
is provided by this book of the great value of 
such a personnel research program. Further 
tangible evidence of the effectiveness of the 
work of this group is provided in the decision of 
the Navy to continue a program of personnel re- 
search and test development during peacetime. 
This book constitutes a valuable reference for 
all those interested in personnel research, 
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*Srurr, Dewey В. ( CHAIRMAN).; Dickson, GWENDO- 
LEN S; Jorpan, Tuomas F.; AND SCHLOERB, LESTER. 
Predicting Success in Professional Schools. Fore- 
word by E. G. Williamson. A report of a subcommittee 
of the Committee on Student Personnel Work of the 
American Council on Education. Washington, D.C.: 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK 


[ 1060 


American Council on Education, 1949. Рр, xii, 18; 


$3.00. * (PA 24: 3410) 
Col & Univ 26:128-9 О ’50. Ordway Tead, 
* serves a useful, even if limited, purpose. The 
self-imposed limitation of the study is that it 
is “limited to a treatment of prediction of success 
in training”—and does not extend to prediction 
regarding “success on the job itself.” It is inter- 
esting to observe the unevenness of existing 
predictive techniques when the problems of en- 
gineering, law, medicine, dentistry, music, agri- 
cultural work, teaching, and nursing are all 
reviewed. The advances in these techniques in 
a generation are certainly imposing and promis- 
ing. But the most cursory examination of the 
findings indicates that scientifically more re- 
mains to be done than has already been achieved, 
However, all concerned with the selection of 
students for the professional schools here stud- 
ied will find valuable assistance in this report. 
J Appl Psychol 34:445-6 D ҳо. Barbara A. 
Kirk. * based on a series of technical bulletins 
prepared by the Veterans Administration for use 
by vocational advisers * The already compre- 
hensive compilation of information in the Tech- 
nical Bulletins has been extended in its useful- 
ness by a very good editing job. A valuable 
introductory chapter has been added on problems 
and techniques of prediction. This contains such 
a good discussion of the problems of prediction, 
and limitations of research, that it would merit 
reading and re-reading by all practitioners. Men- 
tion is made of the absence of predictive knowl- 
edge relative to such factors as interest, OE 
tion, and personality, but this is not, m ү 
writer's opinion, strongly enough берила 
since the burden of effort is first to establis 
the importance of what quantitative йоду 
exist. There is, however, a good discussion 0 
the dearth of investigation relative to the m 
fluence of environmental factors in the psec 
of professional success. The authors wisely ut 
tion continuously, not only in this chapter, ү 
also in the concluding statement for each pro i 
sion, against applying generalizations ee E 
diction to individual cases. The general ий 
sion of tests as predictive indices рана e 
the question of whether tests measure T yo 
purport to measure. It is also somewha «a tie 
pointing that although there is фора (іе, 
restriction of the use of clinical orae ie i 
personality tests) to professional wor cholog' 
adequate background in Clinical Psy rding 
there is not equal caution expressed reg? 
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the interpretation and clinical use of any testing 
in the counseling process. For each profession 
helpful general information concerning national 
requirements precedes the presentation of re- 
search findings. The way in which this material 
isorganized for presentation is particularly good. 
- For example, the effect of length of pre-training, 
native vs. transfer training, etc., are considered 
- separately, followed by an evaluation of each 
— kind of test and combination of indices. А sum- 
mary and comment on implications for counsel- 
- ing completes each section. The book is in highly 
readable form, an excellent job having been done 
in sifting out and simplifying the important find- 
ings. There is a useful table of essential qualifi- 
cations for each field of professional training, 
with a suggested battery of tests. These test 
recommendations are open to question, espe- 
cially recommendations given in the personality 
area, In addition, no mention is made of the 
fact that, in generalizing regarding the lack of 
predictive value of interest inventories, such as 
the Strong, only a single key has been used, when 
itis generally considered that patterning or con- 
stellation is more diagnostically significant. De- 
spite these few omissions, this book is an invalu- 
able compilation for any high school or college 
counselor. It would seem also that in carefully 
selected cases it might profitably be made avail- 
able to counselees who are considering one or 
another of the professional fields covered. Inany 
case, filling a major gap, it is an imperative addi- 
tion to the working materials of the professional 
counselor. 
J Consult Psychol 14:160 Ap ’50. Laurance 
F. Shaffer. * The monograph will be of value in 
Vocational counseling for the professions, and 
also gives a concise background for thinking 
about the development of selective procedures 
lor students of psychology. 
Occupations 28:416-7 Mr '50. Walter Wat- 
son. * it would be less misleading if authors 
would avoid the word prediction in book titles. 
The authors of this volume have tried to soften 
the error by using the opening paragraphs to 
te-define the word to mean “estimating the 
chances of success.” A question might also be 
raised on the use of Professional Schools when 
agriculture, music, and nursing constitute three 
of the eight fields listed. * written as à reference 
handbook for counselors and educational ad- 
Ministrators, With its 65 tables and 17 pages of 
fine print bibliography it will not have much 
appeal for those outside the fields of counseling 


B364 


and selection of students. The editors have done 
a fine task of arrangement and notation во that 
the reader may readily locate both the original 
and the related research on any of the eight 
topics listed. A similar arrangement of materials 
is used to report each field of training : First, а 
very brief description of typical job categories 
and training; next, a brief listing of usual ad- 
mission requirements. Both of these summaries 
are so brief and so general that they are of little 
value to the counselor who must help individual 
students prepare for specific training schools, 
The third and major section of each chapter is 
devoted to tables of correlations showing the 
relationships between possible predictors and 
grades while in training. In the fourth and final 
section of each chapter the authors attempt to 
analyze the tabular material and to sort out a 
list of essential qualifications for each type of 
professional training matched with appropriate 
tests or other measures of the essential qualities. 
Preferred tests are marked. Comparison of the 
preferred measures and the research findings 
highlights the major weakness of the counseling 
task today as well as the weakness of the book. 
For example, in the tabular material on engineer- ` 
ing we find only one interest inventory reported 
and for that we see low correlations of 0,13 to 
0.32. In the text the authors say on page 37, “Al- 
though these findings demonstrate that interest 
tests alone are of relatively little value for pre- 
dicting success, the other benefits which may ac- 
crue from their use in the counselling situation 
should not be overlooked.” Yet in the table of 
essential qualifications we find both this test and 
another which was not even mentioned in the re- 
search data, listed as “recommended predictive 
measures.” Granted a need to consider interests 
when estimating the probabilities of success, 
aren’t we doing counseling a real disservice when 
r or unknown instruments are recommended 
for use rather than making a frank acknowledg- 
ment of our inability to make reliable estimates 
of interest? The authors have hewed so closely 
to the line of frequently published research that 
they have missed the inspiration to others work- 
ing in the field which they might have given by 
extending their coverage to such studies as the 
one by Ruth Monroe on the use of the Rorschach 
Test in improving the prediction of academic 
success; or studies of The College Board, now 
the Educational Testing Service, —: — 
larger numbers over à longer period o ime 
om the studies reported in their book. Pub- 
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lished reports of the armed services on selection 
of Army and Navy trainees also contain many 
clues for the improved use of current tests and 
development of new ones while the OSS re- 
search covered in their volume Assessment of 
Men is bursting with suggestions for counselors 
and admissions workers. As a summary of the 
unreliability and doubtful validity of current 
counseling, selection and advisement procedures 
the work represents an adequate and rather com- 
plete tombstone of undoubted historic value. As 
a signpost to the improvement which must come 
in the future if counselors are to live with their 
consciences, the book fails to note even the faint 
traces of hope which do exist in the not-too- 
hidden corners of current research. 

Personnel Psychol 3:229—30 su ' 50. Marjorie 
McGillicuddy. * 'The Committee has chosen to 
confine its study to summarizing the verified 
facts on the significance of tests for diagnosing 
aptitude for professional training. It begs the 
question of hew to predict success in the practice 
of the professions. * the editors appear to ad- 
dress counselors and their clients * Because of 
its orderly arrangement, the book should be very 
convenient to use. The recognized need for a 
myriad of additional validation studies calls at- 
tention to the fact that the potential practitioner 
of a profession must still be guided by a multi- 
plicity of highly specific standards linked to the 
year he plans to begin his training, the localities 
in which he may possibly practice, the schools 
he may wish to apply to, and the laws of the 
states in which he may hope to obtain a license. 
Progress in the art of measurement has empha- 
sized the confusion as to what things are worth 
measuring and to what extent each of them is 
important. The condition which this book so 
candidly reveals requires the joint attention of 
the professional schools, professional societies, 
licensing agencies, and employers in each of the 
professions. Neither the individual vocational 
guidance client nor the community can afford 
the risks of his entering professional training 
which may end in elimination before training is 
completed, or worse, when an attempt is made 
to obtain a license to practice, or still worse, 
when several years of practice have led to final 
discouragement. 
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*StutsMAN, Racuet. Guide for Administering the 
Merrill-Palmer Scale of Mental Tests. A reprint 
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urement of Preschool Children published in 1931. Yon- 
kers, N.Y.: World Book Co., 1948. Pp. 139-262. Paper. 
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$1.40. * For reviews of the Merrill-Palm 
Mental Tests, see 40: 1406. "ш of 
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*Sumner, W. L. Statistics in School. Oxford, Eng. 
land: Basil Blackwell & Mott Ltd., 1948. Pp. vii, 183, * 
For latest edition, see B367. For reviews, see 284 in 
Statistical Methodology Reviews, 1941-1950. 
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*Suprer, Donatp E. Appraising Vocational Fit- 
ness: By Means of Psychological Tests. New 
York: Harper & Brothers, 1949. Pp. xxiii, 727. $6.50. 
* (PA 24: 2130) 

Am J Psychol 63:122-4 Ja * 50. Josef Brožek. 
* The sections on general aspects of testing are 
thinner than the body of the text and cannot 
compete with such specialized treatises as R. L. 
Thorndike’s Personnel Selection, 1949. Informa- 
tion on the selected vocational tests is thorough 
and up-to-date. The data on validity are particu- 
larly extensive. They constitute, in the review- 
er’s opinion, the most valuable feature of the 
text. The book will occupy, together with Buros 
Mental Measurement Yearbooks, a place of 
prominence in the library of those concerned 
with vocational guidance, especially as other 
available surveys of tests and measurements are 
devoted to narrower segments of the guidance- 
field, are less directly concerned with guidance, 
or are somewhat out of date. The student should 
derive from the book both information and stim- 
ulation for thought. Throughout the volume 
needed research is frequently pointed out. The 
book is definitely oriented toward North Amert- 
can readers and the literature cited is eia 
completely limited to the United States. | T 
is a rather widespread but nevertheless irrital 
ing, “isolationist,” and shortsighted oP 
One more critical comment, on a mannerism ! 
the citation of references, unfortunately, г? 
not wholly unique in Super. Where a pu pn 
tion has more than three authors, only the "^ 
author's name is given. It is thoroughly nae à 
able to cite a publication in that way 1 knee 
for purposes of economy, even though nu inh 
cal references are preferable (and jani: » 
used by Super). It is preposterous, d eid 
do so in the bibliography appended to t P cit 
With increasing frequency the truly M id 
research projects demand cooperation 0 pi à 
authors, having an essentially equal share 
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хх. Singling out one of them is an unaccept- 
able and unethical procedure. 

Col & Univ 26:133-4 О '50. S. A. Nock. * 
although Mr. Super's book deals primarily with 
vocational aptitudes, there is much in it that col- 
lege officers can use. There is, in the first place, 
a thorough consideration of test construction, 
standardization, and validation, along with 
chapters on aptitude and aptitude tests, adminis- 
tration and scoring of tests, and intelligence. 
There is an excellent account of the nature and 
validity of each of numerous specific tests, of all 
kinds. There is much on interest, attitude, and 
personal counseling. And there is an appendix 
on statistical concepts and prediction and proba- 
bility. What college officers need to know about 
testing and tests is here, and with it a good deal 
of illustrative material taken from college test- 
ing and performance. The book is one that will 
answer many questions, and guide college people 
in what is too often a bewildering mass of facts 
and figures. If the admissions office can’t afford 
it, the library should ! 

Ed 70:275-6 D '49. William P. Sears. * a 
volume...that is truly "global" in its approach. 
The book is much more than a manual of cur- 
rently usable tests and their applications and 
limitations. Throughout an attempt is made to 
familiarize the reader with the bibliographical 
sources and to take him through the process of 
collection of data and synthesis of findings, so 
that he may develop the work habits and thought 
processes necessary for one to have in order to 
evaluate these instruments of measurement and 
to make new applications with the results ob- 
tained from these devices. * an excellent book for 
reference and for use * the vocational counselor 
will find it a practical aid in his work. 

J Appl Psychol 34:136 Ap ' 5o. J. R. Witten- 
born, The primary purpose of the author is to 
Provide an objective and detailed evaluation of a 
number of the most important tests employed by 
vocational psychologists. This has been realized 
Ma series of chapters which are thorough, 
straightforward, and with few exceptions free 
Tom ambiguity, * The work is scholarly. A 
Second objective of the book is to provide the 
ы With work habits and thought processes 
Ms шау enable him to evaluate new testing 
jn "ments and new research. The realization 
i i ма objective does not appear s de 
jective, ie the realization of the et xil 
limited hi hough Dr. Super has ce y d 

imself to the cataloguing of tests an 
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their essential statistics in the manner which is 
so prevalent among texts in this field, he does 
not make explicit many of the important con- 
cepts and psychological assumptions which are 
implicit in the evaluation of human behavior, He 
tends to avoid the discussion of controversial 
topics and does not strive to illuminate for the 
student the broad psychological basis for and 
general relevance of the samples of behavior 
elicited by tests. His discussion of the basic psy- 
chological significance of some of the test classi- 
fications is weak and his appendix on the 
fundamental quantitative principles of mental 
measurement is not sufficient to its purpose. Dr. 
Super's work is critical and intelligent but his 
discussion is no more profound than his sources. 
The implications of a test score are usually dis- 
cussed in terms of an average relation with some 
isolated criterion and (from the standpoint of 
most clinicians) with insufficient regard con- 
sistently manifested for such variable factors as 
peculiarities of the individual's motivational pat- 
tern, his characteristic reaction to competition 
and the probable future enrichment or impover- 
ishment of his environment. Dr. Super's insight- 
ful and stimulating discussion of these considera- 
tions is familiar to those who have read his 
well-known Dynamics of Vocational Adjust- 
quent. In his new book, however, Dr. Super is 
primarily concerned with the portion of the 
total test variance which is related to some 
known criterion. The possible determiners of 
the residual variance (that which makes for er- 
rors in predictions) receive relatively little at- 
tention. The author's privilege to make this 
differential emphasis would require no justifi- 
cation and would draw no comment had he not 
in the closing chapters of his book discussed the 
techniques of vocational guidance and the con- 
versational devices whereby tests results are to 
be presented to the individual. The book’s 
greatest value lies in its careful delineation of the 
most important respects in which a test may be 
used. The primary emphasis is on the use of 
tests. Only the most obvious or experimentally 
illustrated misuses of tests are emphasized ; the 
subtle conditions under which tests may be mis- 
used are not strongly illuminated. In presenting 
his sources, the writer's usual critical discern- 
ment was on a very few occasions in abeyance; 
and although the work is not free from minor 
errors, these are few and do not weaken the book. 
In general, Dr. Super’s А book адо Кш 
thorough adherence to the literature and he has 
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lished reports of the armed services on selection 
of Army and Navy trainees also contain many 
clues for the improved use of current tests and 
development of new ones while the OSS re- 
search covered in their volume Assessment of 
Men is bursting with suggestions for counselors 
and admissions workers. As a summary of the 
unreliability and doubtful validity of current 
counseling, selection and advisement procedures 
the work represents an adequate and rather com- 
plete tombstone of undoubted historic value. As 
a signpost to the improvement which must come 
in the future if counselors are to live with their 
consciences, the book fails to note even the faint 
traces of hope which do exist in the not-too- 
hidden corners of current research. 

Personnel Psychol 3:229—30 su ' 50. Marjorie 
McGillicuddy. * The Committee has chosen to 
confine its study to summarizing the verified 
facts on the significance of tests for diagnosing 
aptitude for professional training. It begs the 
question of hew to predict success in the practice 
of the professions. * the editors appear to ad- 
dress counselors and their clients * Because of 
its orderly arrangement, the book should be very 
convenient to use. The recognized need for a 
myriad of additional validation studies calls at- 
tention to the fact that the potential practitioner 
of a profession must still be guided by a multi- 
plicity of highly specific standards linked to the 
year he plans to begin his training, the localities 
in which he may possibly practice, the schools 
he may wish to apply to, and the laws of the 
states in which he may hope to obtain a license. 
Progress in the art of measurement has empha- 
sized the confusion as to what things are worth 
measuring and to what extent each of them is 
important. The condition which this book so 
candidly reveals requires the joint attention of 
the professional schools, professional societies, 
licensing agencies, and employers in each of the 
professions. Neither the individual vocational 
guidance client nor the community can afford 
the risks of his entering professional training 
which may end in elimination before training is 
completed, or worse, when an attempt is made 
to obtain a license to practice, or still worse, 
when several years of practice have led to final 
discouragement. 
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For latest edition, see B367. For reviews, see 284 in 
Statistical Methodology Reviews, 1941-1950, 
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edition, see 284 in Statistical Methodology Reviews, 
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Am J Psychol 63:122-4 Ја ' 50. Josef Brožek. 

* 'The sections on general aspects of testing are 
thinner than the body of the text and cannot 
compete with such specialized treatises as R. L. 
Thorndike’s Personnel Selection, 1949. Informa- 
tion on the selected vocational tests is thorough 
and up-to-date. The data on validity are particu- 
larly extensive. They constitute, in the review- 
er's opinion, the most valuable feature of the 
text. The book will occupy, together with Buros 
Mental Measurement Yearbooks, a place of 
prominence in the library of those concerned 
with vocational guidance, especially as other 
available surveys of tests and measurements are 
devoted to narrower segments of the guidance- 
field, are less directly concerned with guidance, 
or are somewhat out of date. The student should 
derive from the book both information and stim- 
ulation for thought. Throughout the volume 
needed research is frequently pointed out. The 
book is definitely oriented toward North pen 
can readers and the literature cited is Mn 
completely limited to the United States." . п 
is a rather widespread but nevertheless s 
ing, "isolationist," and shortsighted dnt 
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work. Singling out one of them is an unaccept- 
able and unethical procedure. 

Col & Univ 26:133-4 О '50. S. A. Nock. * 
although Mr. Super’s book deals primarily with 
yocational aptitudes, there is much in it that col- 
lege officers can use. There is, in the first place, 
a thorough consideration of test construction, 
standardization, and validation, along with 
chapters on aptitude and aptitude tests, adminis- 
tration and scoring of tests, and intelligence. 
There is an excellent account of the nature and 
validity of each of numerous specific tests, of all 
kinds. There is much on interest, attitude, and 
personal counseling. And there is an appendix 
on statistical concepts and prediction and proba- 
bility. What college officers need to know about 
testing and tests is here, and with it a good deal 
of illustrative material taken from college test- 
ing and performance. The book is one that will 
answer many questions, and guide college people 
in what is too often a bewildering mass of facts 
and figures. If the admissions office can’t afford 
it, the library should ! 

Ed 70:275-6 D ’49. William P. Sears. * a 
volume...that is truly "global" in its approach. 
The book is much more than a manual of cur- 
rently usable tests and their applications and 
limitations. Throughout an attempt is made to 
familiarize the reader with the bibliographical 
sources and to take him through the process of 
collection of data and synthesis of findings, so 
that he may develop the work habits and thought 
Processes necessary for one to have in order to 
evaluate these instruments of measurement and 
{0 make new applications with the results ob- 
tained from these devices. * an excellent book for 
reference and for use * the vocational counselor 
vill find it a practical aid in his work. 

J Appl Psychol 34:136 Ap ' 5o. J. R. Witten- 
бот, The primary purpose of the author is to 
Provide an objective and detailed evaluation of a 
number of the most important tests employed by 
Yocationa] psychologists. This has been realized 
Na series of chapters which are thorough, 
straightforward, and with few exceptions free 
rom ambiguity. * The work is scholarly. A 
“cond objective of the book is to provide the 
ia With work habits and thought processes 
Шр шау enable him to evaluate new testing 
^ ae and new research. The realization 
i i д objective does not appear to be i 
т А as the realization of the primp o | 
ШДЕТ Mthough Dr. Super has certainly no 
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their essential statistics in the manner which is 
so prevalent among texts in this field, he does 
not make explicit many of the important con- 
cepts and psychological assumptions which are 
implicit in the evaluation of human behavior. He 
tends to avoid the discussion of controversial 
topics and does not strive to illuminate for the 
student the broad psychological basis for and 
general relevance of the samples of behavior 
elicited by tests. His discussion of the basic psy- 
chological significance of some of the test classi- 
fications is weak and his appendix on the 
fundamental quantitative principles of mental 
measurement is not sufficient to its purpose. Dr. 
Super's work is critical and intelligent but his 
discussion is no more profound than his sources. 
The implications of a test score are usually dis- 
cussed in terms of an average relation with some 
isolated criterion апа (from the standpoint of 
most clinicians) with insufficient regard con- 
sistently manifested for such variable factors as 
peculiarities of the individual's motivational pat- 
tern, his characteristic reaction to competition 
and the probable future enrichment or impover- 
ishment of his environment. Dr. Super's insight- 
ful and stimulating discussion of these considera- 
tions is familiar to those who have read his 
well-known Dynamics of Vocational Adjust- 
ment. In his new book, however, Dr. Super is 
primarily concerned with the portion of the 
total test variance which is related to some 
known criterion, The possible determiners of 
the residual variance (that which makes for er- 
rors in predictions) receive relatively little at- 
tention. The author's privilege to make this 
differential emphasis would require no justifi- 
cation and would draw no comment had he not 
in the closing chapters of his book discussed the 
techniques of vocational guidance and the con- 
versational devices whereby tests results are to 
be presented to the individual. The book's 
greatest value lies in its careful delineation of the 
most important respects in which a test may be 
used. The primary emphasis is on the use of 
tests. Only the most obvious or experimentally 
illustrated misuses of tests are emphasized ; the 
subtle conditions under which tests may be mis- 
used are not strongly illuminated. In presenting 
his sources, the writer's usual critical discern- 
ment was on a very few occasions 1n abeyance ; 
and although the work is not free from aye 
errors, these are few and do not weaken the book. 
In general, Dr. Super’s book comprises a 


thorough adherence to the literature and he has 
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presented its implications for many important 
uses of tests in an expert manner. Appraising 
Vocational Fitness will be a very desirable re- 
quired text for courses in measurements, indi- 
vidual differences, guidance, and vocational psy- 
chology. 

J Consult Psychol 13:384 О ' 49. Laurance F. 
Shaffer. No previous work has surveyed the 
application of psychological tests to vocational 
guidance and selection so comprehensively. Af- 
ter orienting chapters on the general problems 
of the field and on test construction and adminis- 
tration, Super gets to the core of his material 
in fourteen chapters that deal with the research 
evidence on particular tests. One handicap of 
his approach is that the older tests with a large 
research literature receive the most space, not 
always in proportion to their merit shown by 
the evidence. The emphasis on experimental 
findings excludes vague generalities, however, 
and often points to gaps in our knowledge that 
should stimulate research. The book concludes 
with chapters on the use of tests in counseling, 
with illustrative case studies. The bibliography 
has 955 entries. Psychologists who use tests in 
vocational counseling will find the book valuable 
for their own reference use as well as indis- 
pensable for their advanced students. 

Mgmt R 30:364 Je '50. Bernard J. Muller- 
Thym. Within a relatively short compass....Pro- 
fessor Super has summarized most of the useful 
existing knowledge on readily available psycho- 
logical tests and what (principally as individual 
tests) they have been proved to contribute to 
finding the job for which a man is fit. There is 
a subtle and deceptive craftsmanship about this 
book, which conducts the reader through signifi- 
cant research in testing so that even the non- 
professional gets not only facts, but approach and 
interpretation—learns how to think about test- 
ing. These two objectives—to bring together, in- 
terpret, etc., the findings of significant research 
in the field and to show the reader how to work 
with, think about or evaluate tests—are clearly 
stated in the author's preface. Both are as im- 
portant for the personnel executive as they are 
for Students who would use Appraising Voca- 
tional Fitness as a textbook in an introductory 
graduate course. There is no question that Pro- 
fessor Super has attained those objectives. He 
has turned out a thorough, professional, read- 
able book, and one which unquestionably belongs 
in a basic working personnel library. * while the 
author naturally points out the usefulness of 
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test batteries, his emphasis is heavily on what 
the individual test has been proved to contribute 
to appraising vocational fitness, This approach 
not only makes the book a more valuable work. 
ing source of information, but has the айды 
advantage for the average personnel executive 
of showing him how much or how little actually 
has been proved about a particular test he may 
be considering. * Obviously, this book is valuable 
to the typical personnel executive for the in. 
formation which it brings together and inter. 
prets. (There is little in the book which is not 
already known to a professional worker in the 
field, but not much of this information has hith- 
erto been readily available to the executive who 
needs it.) The greatest value of the work, how- 
ever, will be in the orientation it will provide 
the personnel executive. In simple, straightfor- 
ward language, it gives him a brief view of how 
tests are constructed, of the range of tests readily 
available, of the limits of what individual tests 
may be expected to do. Professor Super's ap- 
proach cuts a clean path between the naive en- 
thusiasms and the equally naive scepticisms 
which many business men have about testing. It 
should do much to encourage the sound use of 
tests, for vocational counseling and selection, not 
only in special guidance centers, but in busines 
itself. 

Occupations 28:135-7 N ’49. Mitchell Drees. 
* The title of the book, Appraising V. ocationd 
Fitness, is somewhat misleading. Fitness ordi 
narily connotes present status, such as 
fitness, while the emphasis throughout the E 
is on appraising vocational promise. In fact ! 
author does not use the concept of fitness except 
in the title. The sub-title “Ву Means of P 
logical Tests" is important in indicating 
scope of the contents, for except for four page 
in Chapter 1, there is no systematic treatment 
of non-testing means of estimating vocation 
potentialities, Presumably the treatment was 
stricted to the testing approach in order t 
the presentation within the scope of да ib 
Since many courses in the analysis of 
vidual for purposes of counseling cover 
ods of individual analysis, xe a d 
be supplemented by other references. 
vA ees and vocational counselors have long 
sired such a book which would have the eem 
to appraise critically the better known te. 
to point out their usefulness and Бана зи 
the guidance process. The present vo tal Mest 
designed as a substitute for the Men 
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urements Yearbooks of Oscar Buros, but makes 
a real contribution in surveying the available 
research. evidence concerning a representative 
number of widely known tests and characteriz- 
ing the value of such tests in Vocational Counsel- 
ing. The writer has been careful not to do this 
ina dogmatic way but rather has marshalled the 
evidence and interpreted it so that the reader 
may appreciate the basis of the judgment. * 
Throughout the book, the emphasis is upon the 
use of tests in vocational counseling, and inter- 
mingled with a description and evaluation of 
tests is a sound counseling philosophy which 
adds much to the treatment, * One brief chapter 
deals with the use of tests in vocational selection. 
The principal value of this chapter is to differ- 
entiate clearly between the use of tests in guid- 
ance and in employee selection. In the short 
space of 13 pages there is little opportunity for 
a careful consideration of the role of testing in 
selection, The chapter on methods of "Test Con- 
struction, Standardization, and Validation" is 
clear and concise and is designed to give the 
reader a critical awareness of test construction 
techniques as a basis for evaluating test instru- 
ments, The chapter on “Test Administration and 
Scoring” is a familiar review of basic principles 
and could have been dispensed with in a volume 
which assumes the fundamentals of measure- 
ment, * The tests selected for intensive study 
were selected on the basis of wide usage, and 
availability of research data pertaining to them. 
It is hoped that the readers will not restrict their 
choice of tests to the ones mentioned in this 
volume, but will apply the principles of test 
evaluation gleaned from a study of these tests 
to the wider range of tests on the market, The 
author, quite wisely, did not attempt to provide 
à résumé of all likely testing instruments, The 
counselor on the job using this book as a ref- 
erence, however, may be unduly influenced by 
the inclusion or exclusion of a particular test. * 
ты role of interest tests receives а dispropor- 
lonate amount of , тоо pages, as соп 

with 14 for Койду tests. Chapters XX 
to XXIV dealing with the use of tests results in 
counseling and the preparation of reports of test 
results contain invaluable information not read- 
ily available in other printed sources. The seven 
illustrative cases which contain case summaries 
and test profiles, counselor's interpretations and 
immediate outcomes of counseling, and follow-up 
reports make test data alive and functional in 
the counseling process. They will be most useful 
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teaching aids in a course in vocational testing. 
The Appendix dealing with Statistical Concepts 
and Prediction and Probability provides an un- 
usually clear non-technical presentation of sta- 
tistical concepts with which a vocational coun- 
selor must be familiar, at least from a consumer's 
point of view. The comprehensive bibliography 
of 955 references attests to the careful survey of 
available literature which the writer undertook 
as a basis for this text, The citations are intro- 
duced so opportunely that they make their con- 
tribution to the unfolding story of the book with- 
out resulting in a heavy academic treatment, 
The reviewer has already tested the interest 
value of the book with graduate students in coun- 
seling and predicts that it will enjoy a wide, 
well-deserved popularity, Fortunately the au- 
thor, in his Preface, has already made plans for 
continuous revision for within three years many 
of the sections dealing with specific tests will 
be in need of modification. With such periodic 
revisions, Appraising Vocational Fitness is 
likely to be the basic reference for vocational 
counselors in the area of vocational testing. 
Personnel J 28:434-5 Ap "50. К. С. Rogers. 
* one of the best single reference texts available 
in the field of vocational guidance * The tests 
selected for inclusion seem to be those on which 
the most research data are available or those 
which are currently enjoying greatest popularity 
with vocation counsellors, Obviously such а se- 
lection has led to the exclusion of some tests 
which might have more promise than many of 
those included, However, limitations of space 
would prohibit a much wider coverage. * an 
attempt has been made to point out the gaps in 
our knowledge of (1) what a particular tes 
measures, (2) its relation to other instruments, 
and (3) its correlation with various criteria of 
occupational success and job satisfaction, As 
such, this book is a gold mine of suggestions for 
further research, * assumes a knowledge of the 
fundamentals of test construction, statistics, and 


and job satisfaction from test 


during these early years. Few 
E follow-up studies have been conducted. 


There is also virtually no reference to the use of 
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tests in industrial counselling (as contrasted 
with selection) programs. Clearly more research 
needs to be done in these areas if vocational 
counselling is to perform its full role. In the 
meantime industrial personnel men must con- 
tinue to tailor-make their own test batteries or 
restandardize existing tests in terms of local 
criteria, 

Personnel Psychol 3:225-8 su ' 50. J. B. Boyd. 
* 'The author suggests that "interests are the 
product of interaction between inherited apti- 
tudes and endocrine factors on the one hand, and 
opportunity and social evaluation on the other." 
The rather inconclusive statements that interest 
"is not necessarily a predictor of success," that 
it "seems to be much more likely to be important 
to satisfaction and stability," lead one to ques- 
tion the amount of space devoted to measures 
of interests in this volume. But possibly it is 
because the author feels that their wide use re- 
quires that this inconclusiveness be carefully 
documented. * Appendix A, devoted to statis- 
tical concepts, begins with an explanation of 
basic statistical terms, which is probably too 
compressed for the uninitiated, unnecessary for 
others. The latter part of this appendix, how- 
ever, contains a timely discussion of prediction 
and probability so important that it might have 
been included in the main body of the book. * a 
very comprehensive bibliography of 955 items. 
Conservative though most of his conclusions are, 
the author occasionally suggests use of a test 
on the basis of what many would regard as rather 
slight evidence of predictive value. More open 
to question is his undervaluation of the inter- 
view for purposes of selection. In this he appears 
largely influenced by an AAF study in which 
the interview averaged eight minutes in length. 
As in so many studies relating to its validity, the 
interview was placed under such extreme limita- 
tions as scarcely to justify the name. However, 
his challenge to users of the interview to test its 
validity under varying conditions is thoroughly 
justified. Such adverse criticism, however, is far 
outweighed by the book's merits. Not a com- 
plete reference on available tests like the Mental 
Measurements Year Books, nor an index of tests 
applicable to various occupations, it is, as it pro- 
fesses to be, an advanced textbook on testing in 
its main practical applications. As such, it takes 
a broad critical sweep of the current status of 
testing in such a manner that the reader cannot 
emerge without a good general knowledge of 
the scope of available tests, For the student of 
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testing, it is an essential. For all but the wel 
informed, it provides an economical Way of 
getting abreast of the subject. 

Psychiatric Q 24:199 Ја ’50. * This book 
seems to be extremely well put together, It is 
clearly written, easily understood and an out- 
standing contribution in its field. 

Psychol B 47:178-80 Mr ' 50. Milton Hahn, * 
a book important to psychologists in general 
clinical practice, particularly those in the areas 
of personnel psychology. The author has at- 
tempted to answer the question, “what does this 
test, and the score made on it by this person, tell 
me about his vocational promise?" Actually the 
publication goes far beyond the answers to such 
questions. The reviewer is especially impressed 
with Dr. Super's willingness to venture clinical 
judgments based on his long clinical experience. 
The author has also understated his scope by 
using the term vocational in his title. The Colum- 
bia group has tended to use this term in a very 
broad sense, and it is thus that Super has used 
it. For those prospective readers who, like the 
reviewer, consider vocational an extremely lim- 
ited concept, Super might well have substituted 
psychological fitness. Appraising Vocational Fit- 
ness should be a widely used book. It is an essen- 
tial addition to the libraries of clinical psycholo- 
gists whether they be engaged in industrial or 
educational applications or in the practice of 
psychotherapy. The administrator who employs 
or works with the clinical psychologist will find 
pertiņent information. Because of the somewhat 
informal methods of presenting materials, upper 
division as well as graduate students should fin 
it readable and interesting as a text or reference. 
The reviewer does not believe that Super и 
attempted to replace such books as Bingham 
Aptitudes and Aptitude Testing or has s 
duced a volume in competition to such а publi " 
tion as Buros’ Mental Measurements Kaarin 
Rather, he has contributed interpretive id of 
which make other presentations 1n ш ү The 
psychological measurement more use чү Ай 
book moves at a fairly rapid pace; its eva ШП Rd 
of counseling theory and practice pi ar 
Super's comments on Rogerianism ande objec- 
proaches to counseling are sensible pre the 
tive. The number of years during W js evr 
manuscript must have been in prepare qp 
denced by the carefulness of the pn raphy: 
and the comprehensiveness of the КК E ecte 
Significant research has been carefully 5s calle 
and well presented. Especial attention 
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to chapters XVI, "The naturé of interests," 
XXI, "Using test results in counseling," and 
XXII, “Preparing reports of test results." These 
chapters contain pertinent materials which, in 
the opinion of the reviewer, are presented well 
above the general level of many such presenta- 
tions. Readers will find Appendix A, “Statistical 
concepts," a valuable review of oft forgotten 
basics. Some readers will hope with the reviewer 
that when Super comes to the inevitable re- 
vision period he will have at hand a solution to 
the problems of consistent and psychologically 
acceptable terminology. Some of the chapter 
heads follow the general trait names so much 
used by clinical counselors with their clients. 
Examples are "clerical aptitudes," “musical tal- 
ents,” and “mechanical aptitude.” However, a 
separate chapter is presented on “Manual Dex- 
terities" and another on “Spatial Visualization," 
concepts which are frequently presented in the 
literature as factors or traits which are aspects 
of mechanical aptitude and ability. This is not 
criticism but rather a hope that Appraising Vo- 
cational Fitness when revised may help us to 
escape from a semantic wilderness. In conclu- 
sion, this book is highly recommended as a pro- 
fessionally sound, well-written, authoritative and 
thoughtful presentation of the use of tests in 
clinical psychology. 

Social Ed 14:92-3 Е ’50. Marian Rayburn 
Brown. * Super is well equipped to present a 
careful appraisal of the testing field. * he brings 
a wealth of experience and a rich background to 
his present task * While this book does not 
pretend to supplant organized courses and super- 
vised experience in testing, it does orient the 
reader to the nature and purpose and to some of 
the research in this area. Teachers, counselors 
and administrators who are interested in, or 
deal with, testing cannot afford to overlook this 
volume. 

[8369 ] 
*Surcurre, A., AND CANHAM, J. W. The Heights 
and Weights of Boys and Girls. London: John 
Murray, 1050. Pp. ix, 84. тоз. 6d. * (PA 25: 3749) 

J Ed (London) 83:106 Е ' 51. C. M. Fleming. 
* offers conclusions based upon the careful and 
detailed recording of the heights and weights of 
17,000 boys and girls between the ages of 5 and 
17. The figures were obtained for the most part 
from single testings of separate groups of boys 
and girls, and suggestions are made by which 
their results can be expressed in indices similar 


in type to the standardized scores 1n which in- 
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telligence test findings are often reported. The 
information collected is both interesting and 
important; but the manner of its collecting has 
encouraged a somewhat undue emphasis on the 
probable constancy or equivalence of status in 
height, weight, school work, or intelligence. 
There is little record of repeated measurements 
of the same individuals, and in consequence little 
interest is shown in the variability and dishar- 
mony of individual growth to which recent long- 
term studies have drawn attention. It is to be 
hoped that the authors will continue their in- 
quiries and at a later date will present findings 
based on the annual or biennial measurement of 
individuals over a period of at least ten years. 


[ B370] 
*Swrnerorp, Frances. A Study in Factor Analy- 
sis: The Nature of the General, Verbal, and Spa- 
tial Bi-Factors. Supplementary Educational Mono- 
graphs, No. 67. Chicago, Iil.: University of Chicago 
Press, November 1948. Pp. xi, 71. Paper. $2.00. * (Lon- 
don: Cambridge University Press, 1949. 105. 6d.) (PA 
23: 4668) 

[ B371] 
*Symonps, PercivaL M, Adolescent Fantasy: An 
Investigation of the Picture-Story Method of 
Personality, Study. New York: Columbia University 
Press, 1949. Pp. xii, 397. $6.00. * (London: Oxford 
University Press. 325. 6d.) (PA 24: 650) For reviews, 
see 133. 


[ B372] 
ЖТАВА, HitpA; BRADY, ELIZABETH Harr; ROBINSON, 
Joun T.; AND VICKERY, WiLLIAM E. Diagnosing 
Human Relation Needs. Studies in шеси рр Re- 
lations. Washington, D.C. : American Council on Edu- 
cation, 1951. Pp. xi, 155. Paper. $1.75. * 


[ B373] 
*Tavior, Cour. These Music Exams! A General 
Survey for Teachers, Parents, Candidates, and 
Other Students. Foreword by G. Dyson, London : 
J. Curwen & Sons Ltd., [1950]; Pp. xii, 96. 75. 6d. * 
(New York: G. Schirmer, Inc. $2.25.) 


[ B374] ia 
* х, Сорғв Н. The Factorial Analysis о! 
КЫ APIE ‚ Second Edition. London: niver- 
sity of London Press Ltd., 1946. Pp. xvi, 386. * (Bos- 
ton, Mass, : Houghton Mifflin Co., 147) (P. 
2529). For latest edition, see B377. For reviews, see 
in Statistical Methodology Reviews, 1941-1950; 
for reviews of the first edition, see 2: 288. 


[2375] Ae 
son, Govrrey. The Factorial Analysis о 
eae Ability, Third Edition. London; niver- 
sity of London Press Ltd., 1948. Pp. xvi, 392. For lat- 
est edition, see B377. For reviews, see 291 in (€ 
cal Methodology Reviews, 1041-1050] also see 290 for 
reviews of the second edition; for reviews of the first 
edition, see 2: 288. 


ч, ial Analysis of 

G . The Factor s o 

AT HOMSOY tity, Fourth Edition. London; р roo. 
sity of London Press Ltd., 1950. Pp. xvi 394. * (Bos- 
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TnowsoN, Соррику, The Trend of National Intel- 
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7 һу exander rr- r$, 
Thomson 


A s , 
t Cassell & Co. Lid. о Бареа, 
chol 18:50 F '48. C. W. VV (alen- 
or Thomson's lecture....is marked 
and by a restraint which makes the 
conclusions all the more telling. Thomson's esti- 
mate is that the negative correlation between the 
intelligence of a child and the size of i 
is approximately —0.25, 

Burt's estimate. The decli 
telligence he reckons to 
two points on the T.Q. scale, 


tine). * Profess 


almost identical with 
ne in the average in- 
approximate to about 
which fits Burt's 
nearer 1.5 than 3.0.” Thomson... 
need for a check of these estimates 
by a direct measurement of succes 
tions, and he reports that there is al 
way a plan to test a complete a 
eleven years in Scotland, as was 
years ago. Clearly... 
serious consideration 
about our national wel 

Brit J Med Psychol 23:124 pls 1-2 '50, R, 

- P. * This is a model о 
and the authors and the Sc 
search in Education and 
are to be congratulated wa 
book which must have a 
well-deserved recogniti 
of collecting data, much im 


[this paper] deserves very 
у all who are concerned 


f organized research, 
ottish Council for Re- 
ponsors of the work 
rmly on producing a 
ide sale and bring 
оп. In the course 
portant information 
logical interest was also obtained 
tabulated. Further books will 
lysis, and there will be valuable 
‚ especially those revealing the 
and occupations of the chil- 
dually with the Terman-Mer- 


deal with its anal 
follow-up survey: 
interests, careers 
dren tested indivi 


[ 
rill Test. In the next half. h 
will have to give much serious 
study of the inheritance of 
uted differences, especially of intelligence 
the numerous interrelationships between | 
ferences of strictly genetic origin and thos 
to the many forms of envi nflue 
Repeated surveys of the kind r 
book, and much more elaborate investig 
along the lines of many which have been, carn 
out in biology, will have to be made, This fs 
important pioneer investigation, and the ро pu 
tion of a country the size of Scotland will 
excellent for further work. While the disco 
of an increase in average intelligence is 
ing, however, there might be several factor 
which the most important is possibly "in 
gence test sophistication" during the 14 у 
interval, to account for the improveme 
served. І 

J Ed (London) 81:232 Ар '49. C. M. ВИ 
ing. * clarity characterizes.... [this] i 
ture * summarizes all that can be said о 
topic before that retesting after the lay 
generation which has more recently been 
tempted in the Scottish Surveys of 19$ 
1947. | 

[ B379] j d 
T 
wi Meses Талаш e dut 

don: Chapman & Hall, Ltd. yar) CP 2: 9 

Austral J Psychol 2:62-3 Je '50. 

Haynes. The subject matter....is much mor 

cific than the title indicates. * deals with the! 

uation, construction and use of psycho 
tests with some reference....to the dminist 
problems associated with maintaining ай 

cient test programme. However, in this ii 

sive treatment lies the strength of this bog 

too long, teachers of industrial psychology 

practitioners in this field have had texts 9 

deal with the many aspects of personnel sele 

superficially and none thoroughly. Texts o 

construction and usage are rare ei ы 

good books, until the appearance of 

Thorndike's work, almost non-existent 

though much of the information put 

could be used for a course on the кє 
measurement, one of the most ри н rc 
of the book is its practical approach. It 

that the author Me el — m а 

niques which he is placing before — 

that by so doing, he has worked out à 
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ance between statistical theory and practice, The 
reader with technical interests notes with relief 
that he commences his book with a chapter de- 
voted to job analysis and although this chapter 
does, of necessity, lack detail, it serves the pur- 
pose of emphasizing the importance of job analy- 
sis in the development of industrial test pro- 
grammes. Of the remaining 10 chapters, the two 
on the estimation of test validity are A 
* The book presupposes at least one year's Uni- 
versity work in statistics, * Many of the chap- 
ters—for example, the first of the two on valid- 
йу, require no statistical Those 
chapters of the book devoted to the administra- 
tive problems of personnel selection, are not per- 
haps as valuable as those on test construction, It 
is reasonable for industrial psychologists to as- 
sume that, although there are principles asso- 
ciated with the administration of test pro- 
gramme, much has to be taken from the actual 
conditions operating within a plant or institu- 
tion, and what is done can be expected to vary 
from situation to situation. However, even if 
this view is supported it cannot in any way de- 
tract from the excellence of this book as a text 
on the evaluation, construction and use of psy- 
chological tests, 

Ed Res B 29:134 My 10 '$о, Frank M. 
Fletcher, Jr. * a major contribution to the field 
of personnel selection and should prove of value 
in both industrial and educational areas * The 
writing is at a level appropriate to the student 
and business executive who is serious in wish- 
ing to obtain a clear and complet ing 
of testing procedures, The book is well organized 
and follows a logical step-by-step outline of 
Presentation. * The chapters on reliability and 
validity are excellent and include a treatment of 


cussions of the concept of reliability, of types of 
criteria, corrections for homogeneity, multiple 


une 
realistic in its demands, and so fails to give suf» 
ficient guidance to investigators 
work on a much smaller scale than did psycholo- 
gists and statisticians in the United States Army 
Air Force. Tests presented by sound-film, ma- 
chine-scoring, try-out groups of at least 400 and 
follow-up groups thousands, 
make one's mouth water in peacetime, His ap- 
proach also leads him to treat the whole of selec» 
tion and allocation as A ipo in logistics 
rather than as a matter which concerns individ» 
ual human beings. He may, in the present writ» 
era opinion, be justified in that 
interview judgments and data derived (rom bio» 
graphical questionnaires should be validated and 
combined with test scores by objective statisti 
cal techniques rather than by intuition. But 
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step procedures required in the validation of an 
aptitude testing program. The sources of infor- 
mation concerning job requirements are pre- 
sented, methods for test development are de- 
scribed, types of criteria are compared, and 
statistical methods and computational routines 
are presented in some detail. As such, it will 
serve as a useful reference in personnel psychol- 
ogy. To consider this report as merely a manual 
of procedures, however, would be a grave in- 
justice. For it performs the much more valuable 
function of taking the reader “behind the scenes" 
in order to see the Why as well as the How. 
Procedures are evaluated as well as described, 
limitations as well as strengths are pointed out, 
and theoretical implications as well as practical 
considerations are presented. Some of the topics 
of particular interest to this reviewer were the 
following: (1) the value of job analysis versus 
trait analysis, (2) objective versus subjective 
criteria, (3) selection versus classification, (4) 
evaluation of various measures of reliability in 
terms of the sources of variance affecting each 
of the measures, (5) problems of restriction of 
range of ability due to prior selection. These and 
other basic problems are treated systematically 
and the rationale as well as the methods of attack 
are described. The editor recognizes (and clearly 
states in his Preface) one limitation of the re- 
port, i.e., its restriction in large part to the se- 
lection and classification program. Another limi- 
tation, in the reviewer's opinion, was the failure 
to discuss in detail the study of a non-preselected 
"experimental group" given aircrew training re- 
gardless of their test scores. Although this 
project is described in Report No. 2 of this 
series, it warranted more than the brief reference 
in this report in connection with the problem 
of restriction of range. From a methodological 
point of view, it represented a major step in 
providing crucial evidence on the validity of the 
classification program, and a discussion of its 
rationale, procedures, and general implications 
for experimental design would have been appro- 
priate. 

Pub Personnel R то:1$о-т Jl "49. Ruth 
Bishop. The book....is something no technician 
can afford to disregard. One thing that im- 
presses the reader is the systematic way in which 
the material is presented and the author's refusal 
to give pat answers to problems. Rather, he 
simply states the strengths and weaknesses of 
techniques developed to date for attacking vari- 
ous problems. Let us hope that his expressed aim 
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in presenting the material in this way will be 
realized, namely, the stimulation of interest in 
the solution of the many problems facing the 
personnel technician. * Various kinds of test 
forms and media are considered, which is re. 
freshing since technicians all too often use writ. 
ten objective tests for mass testing programs 
without due consideration to the purpose of the 
test and/or the characteristics of the subjects, * 
readers not too familiar with test resources may 
find the bibliographical references of assistance 
* His discussion of reliability should give per- 
sons who have a tendency to feel satisfied with 
a test, simply because it has face validity and a 
high estimate of reliability, just cause to re- 
consider and remember that validity is the ulti- 
mate test of the effectiveness of an instrument, * 
Perhaps his discussion of derived scores and the 
combination of measures will seem a bit “аса- 
demic and impractical" for many civil service 
jurisdictions, but it certainly does no harm to 
consider the problems and to recognize their 
value in a research program which may be 
undertaken simply as a means of setting up a 
more adequate “simple and acceptable" system 
of scores. This book should be valuable in train- 
ing technicians who have entered the field with 
little or no formal background in this area. Parts 
of it may be interesting to operating officials and 
civil service commissioners who want to learn 
more about the technical aspects of personnel 
research. It should remind all technicians that 
there is no easy way to effective personnel selec- 
tion and that a responsible technician constantly 
must concern himself with “testing, verifying, 
and improving the adequacy of his pee, 
О R Biol 26:112 Mr '51. Marion R. Bartlett. 
"The recent technical advances which promise to 
convert the field of personnel selection ш 
scientific practice rather than а subjective e 
are well brought together here. The expos! ie 
on the use of measurement techniques 1M i 
selection of personnel for specific types 9 d s 
merits the attention of all psychologists an р " 
sonnel workers interested in this field. AE 
the book deals with the highly scientific e 
of measuring individual aptitudes and pad "t 
ity attributes, the style in which the ү i 
presented is unusually clear and easy (0 logici 
Few books in the complex field of psycholog 
measurement approximate this с 
tation. * presents an interesting апо Со 
sive analysis of the types of error | 
fere with achieving reliability of а $° 
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«ores * one of the few excellent textbooks so 
far published in this feld. 


[ B380] 
жТновѕтоме, L. L. Multiple-Factor Analysis: A 
Development and Expansion of The Vectors of 
Mind. Chicago, Ill.: University of Chicago Press, 
1047. Pp. xix, 535. $7.50. (London: Cambridge Uni- 

f * Ч 

versity Press. 425. ) (PA 21: 2833). For reviews, see 
293 in Statistical Methodology Reviews, 1941-1950; 
for reviews of the first edition, see 36: B263, 38: Bsos, 
and 2: 293. 

[ B381] 
xTuuRSTONE, THELMA GWINN, AND Byrne, KATHA- 
rne MANN. Mental Abilities of Children. Chicago, 
Ш.: Science Research Associates, Inc., 1951. Pp. 48. 
Paper. $0.40. * (РА 25:7983) 

[ B382] 
XTigEMAN, Herman R. Elementary School Tests 
Classified. Bloomington, Ill.: Public School Publish- 
ing Co., 1951. Pp. iv, 17. Paper. $0.50. * 

[ B383] 
XTINKELMAN, SHERMAN. Difficulty Prediction of 
Test Items. Columbia University, Teachers College, 
Contributions to Education, No. 941. New York: Bu- 
reau of Publications, the College, 1947. Pp. vii, 55. Out 
of print. * (РА 22: 4723) 

J Appl Psychol 34:452-3 D ’50. Charles I. 
Mosier. * the author had thirty judges rate the 
difficulty of each item. This rated difficulty (per 
cent of patrolman candidates expected to answer 
correctly) was compared with the actual per cent 
of patrolman candidates passing. * There were 
no significant differences in predictability be- 
tween “judgment” and “information” questions 
or between items with high and those with low 
tem-test coefficients (erroneously termed va- 
lidity coefficients). The problem is introduced in 
а civil service setting—and a rather conserva- 
tively Conceived one, The necessity for basing the 
Selection of items on subjectively judged esti- 
mates of difficulty is mentioned on page 1, but 
nowhere is the need for estimates of validity— 
е or statistical—alluded to. The impos- 
sibility of experimental tryout of items is ruled 
out because of the need for secrecy although pro- 
eps merit systems have found ways of try- 
ae Without violating security. The tone of the 
i ce and implications seems unwarrant- 
б хе optimistic, “The results of this study indi- 
i that judges are probably able to estimate 
* ative item difficulty accurately." This con- 
ФИР does not seem warranted for a median 
5 бу correlation of .53. Moreover, although 
is best" three judges have a combined validity 
n 7 the author fails to note that this is a “back- 
и coefficient, comparable to that com- 

ed for a test on the item analysis sample after 
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the best items have been selected by item analy- 
sis. Although Tinkelman's book does shed light 
on the process of judging difficulty, it will not 
convince the careful examiner that estimated dif- 
pied 18 an adequate substitute for empirical 
i J Ed Psychol 39:448 № '48. Miles A. Tinker. 

This study was well designed and the data 
adequately analyzed. The results will be of value 
to those who need to know something about 
item difficulty prior to giving a test. 

J Ed Res 42:637 Ap '40. Carl Wedell. * The 
author is to be congratulated for a well-con- 
ducted piece of research and a clearly written 
report which is a "must" for all who are inter- 
ested in the problem. 

Pub Personnel R 9:153-4 Jl '48. Fred S. 
Beers. * the implications of Mr. Tinkelman's 
study are more searching than his methodology 
or his findings. They point out inescapably that ` 
the concept of difficulty as applied to the devel- 
opment of examinations is one of the most elusive 
and subtle in the entire field of psychometrics. 
Because the determination of the difficulty of 
either an item or an entire examination has to 
be based upon performance of individuals, any 
index of difficulty is relative to the population 
within which it was determined. Moreover, in- 
dices of difficulty that have been determined on 
one sample cannot be routinely used as bases for 
estimating indices of difficulty in a second sam- 
ple except as they tend to be random samples of 
the same general population. Another complica- 
tion lies in the fact that the significance of an 
index of difficulty is related to the index of valid- 
ity of the item; as items decrease in validity the 
interpretation of their indices of difficulty neces- 
sarily becomes increasingly vague. The evidence 
that is pertinent to this question and now avail- 
able indicates that a competitor is "measured" 
most adequately when the items by which he is 
“measured” are answered correctly by 50 per 
cent of competitors of his level of ability. To 
attempt, as Mr. Tinkelman has done, to attack 
this problem via the route of having judges esti- 
mate the ability of populations and their reaction 
to test items is to be regarded as in the nature of 
a noble experiment. Whether the attempt can 
be more than modestly productive is question- 


able. 
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Ae School Testing Program: A Guide to the 
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York State Education Department, Division of Pupil 
Personnel Services, Bureau of Examinations and Test- 
ing. University of the State of New York Bulletin, No. 
1307. Albany, N.Y.: University of the State of New 
York Press, August r, 1950. Pp. 24. Paper. Gratis. * 

[ B385] 
TOMKINS, SILVAN S.; WITH THE COLLABORATION OF 
ELIZABETH J. TomKins. The Thematic Appercep- 
tion Test: The Theory and Technique of Inter- 
pretation. New York: Grune & Stratton, Inc., 1947. 
Pp. хі, 297. $5.00. * (PA 22: 1658) For reviews, see 
141 and 3: 104a. 

[ B386 ] 
*Travers, Ковевт М. W. How to Make Achieve- 
ment Tests. New York: Odyssey Press, 1950. Pp. xi, 
180. $2.25. * (PA 25: 4857) 

Calif J Ed Res 1:188 S ’50. * simply and 
clearly written. It is well organized and should 
serve as a handy reference or textbook for 
teacher use or in teacher education classes. The 
author succeeds in bridging the gap between the 
theory of test-making and the application of 
known technics. 

J Ed Res 44:633-4 Ap ' 51. William B. Reiner. 
* concise and well written * Classroom teachers 
and others whose function it is to construct tests 
will find this book very useful. The book is prac- 
tical and functional in its organization. It has 
the directness and simplicity of a good laboratory 
manual in that the problems and steps necessary 
to solve them are presented in operational terms. 
It can be easily understood and followed by the 
average classroom teacher. * How to Make 
Achievement Tests is brief, to the point, and 
attains the aim the author set for himself. There 
are no bibliographies or questions at the end of 
each chapter. There are footnotes and explana- 
tory notes to amplify points needing fuller dis- 
cussion. Most of the examples given refer to the 
Cooperative Tests and to the Forty-fifth Year- 
book of the National Society for the Study of 
Education. However, in a concise book that so 
clearly hits its target, these points are purely 
academic. Professor Travers' book will be very 
useful to classroom teachers. As a supplementary 
text for classes in Tests and Measurements, it 
has much to offer to the student interested in 
learning how to construct achievement tests. 

J Teach Ed 1:320-1 D ҳо. A. M. Jordan. * 
While many will miss the rigorous thinking 
incident to the use of well-documented proof for 
each statement, others will welcome the clear 
and readable exposition present in How to Make 
Achievement Tests. We have, then, not only a 
clearly written book which sets forth in simple 
language the fundamental principles of test con- 
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struction applicable to the classroom situation 
but also considerable information on the i 
provement of educational objectives. 

Jun Col J 20:426-7 Mr '50. Herschel Т, 
Manuel. * an excellent guide for constructing ob- 
jective examinations. It is written in terms which 
teachers can understand, and the discussions are 
well illustrated. * The author's statement of the 
relation of evaluation and measurement should 
contribute to a more accurate use of these terms 
* a distinct contribution to the literature of edu- 
cational measurement. 


[ B387] 
*TnAxLER, ARTHUR E. Functions of the Educa- 
tional Records Bureau in Comparable Measure- 
ment, Revised Edition. Educational Records Sup- 
lementary Bulletin H. New York: Educational 
EO Bureau, October 1947. Pp. iii, 31. Paper, litho- 
typed. $0.35. * 
[ B388] 

TRAXLER, ARTHUR E. How to Use Cumulative Rec- 
ords: The Manual for the SRA Cumulative Rec- 
ord for Junior and Senior High Schools. Chicago, 
Ill.: Science Research Associates, Inc., 1947. Pp. 40. 
Paper. $0.75. * (PA 22: 852) 

J Ed Psychol 39:125-6 F '48. Miles A. 
Tinker. * simple enough to be used in the average 
school. The record form is designed primarily 
for junior and senior high schools but much of 
the material may be used at lower or higher 
levels. * Some schools may find the record form 
too extensive to handle. It is suggested that 
these schools use only the more important o 
of the form. It would have been helpful if the 
author had presented a sample abbreviated m 
for this use. The important thing is for the sc fi 
to get started with as full a record as can | 
kept up to date. This is a highly practical b 
very useful pamphlet for the teacher and sc А 
counselor. It should promote wider "s n 
more efficient use of cumulative records 


schools. 


[ B389] 


Its in 
KTRAXLER, ARTHUR E. The Use of Test Resul М, 
Diagnosis and Instruction in the "Tool Subjec 


? ten е Bulletin NO 
Third Edition]. Educational Records Bu т 
[a (Second Revision). New York: ee i 
ords Bureau, September 1949. Pp. ks editions, 5° 


typed. $1.50. * For reviews of eal 
3: 1201 and 36: B270. 


[ B390] Oe M 
*TnaAxLER, ARTHUR E., EDITOR. Meo ducati em 
Evaluation in the Improvement О Conferences 


Report of the Fifteenth Educational , Held 
New York City, October 26 and 27, cords 
Under the Auspices of the Education cation 


А $ E 
Bureau and the American Council on ncil on 
Foreword by Arthur S. Adams. American Cou 
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Education Studies, Vol. 15; Series 1, Reports of Com- 
mittees апа Conferences, No. 46. Washington, D.C.: 


the Council, April 1951. Pp. vi, 141. $1.50. * 


Ed & Psychol Meas 11:656-8 w 51. William 
B. Michael. * Although all nine talks were in- 
teresting and informative, those by Lindquist, 
Conrad, Diederich, and Nelson were particu- 
larly outstanding from the standpoints of pene- 
trating content, technical excellence, and clarity 
of presentation. * In a thoroughgoing discussion 
that serves to challenge existing practices of 
testing, Diederich has proposed a “more system- 
atic, more cooperative, more comprehensive, 
and more continuous” program of measurement 
and evaluation. * What seems to the reviewer to 
be'one of the most complete and satisfying pres- 
entations (yet one of the briefest) of the uses of 
tests in school administration is the discussion 
by Nelson. After stating quite clearly the limi- 
tations that exist and distortions that may occur 
in evaluating teacher performance through use 
of test results, he proceeded to enumerate seven 
uses of tests that appear to be important in school 
administration. * This short book represents a 
distinct contribution in that it reflects current 
thinking by a number of experts regarding the 
role of measurement and evaluation in the im- 
provement of instruction. Numerous problems 
are suggested and several courses of action are 
proposed. No outstanding disagreements in gen- 
eral philosophy of evaluation are in evidence 
among the authors of the reports presented. This 
book is well worth the evening or two that it 
will take one to read it. There is much material 
to which educators at the elementary, secondary, 
or collegiate level can give serious consideration. 


[ B391] j 

XTurzs, Ernest C. An Evaluation of Personality- 
Trait Ratings Obtained by Unstructured Assess- 
ment Interviews. American Psychological Associa- 
Non, Psychological Monographs: General and Ap- 
pia Vol. 64, No. 11, Whole No. 317. Washington, 

.C.: the Association, Inc. 1950. Pp. vi, 24. Paper. 
$1.00, * (PA 25: 6883) 


[ B392] 

X[Tonxzn, луш]. Practice for Army Tests, [1951 
evision]. New York: Arco Publishing Co., Inc., 
1951. Pp, viii, 151. Paper, mimeographed. $2.00. * 


[8393 ] 
XTurrLE, Frorence Рірек. This Here I.Q. Lynn, 


Mass.: Tuttle Teacher Service, 1949. Рр. 52. Paper. 
$1.00, * 


) [ B394] 

Жетт, Сковск. Rorschach Introductory Manual: 
Primer for the Clinical Psychiatric Worker: 
ith Interpretative Diagram to Permit Clinical 

Use While Learning the Ink-Blot Technique. St. 


Louis, Mo.: Educational Publishers, Inc., 1950. Pp. 
$3.00. * (РА 24: 5262) For reviews, see ү ps 


[ B395] 
*Unirep States Civ Service Commission. Pro- 
ceedings of the Second Annual Conference on 
Efficiency Ratings Administration, Nabe ce ia 
pea the Commission], 1945. Pp. i, 61. Paper, Out of 
rint. 


[8396 ] 
ЖОзптЕр States Слуп, Service | COMMISSION, Рго- 
ceedings of the Third Annual Conference on 
Efficiency Ratings Administration. TEN Pi 
D.C.: the Commission], 1946. Pp. i, 35. Paper. Out of 
print. * (PA 21: 1621) 


[B397] 
*xUniren States Сїүп, Service Commission, Pro- 
ceedings of the Fourth Annual Conference on 
Efficiency Ratings Administration. [Washington, 
D.C.: the Commission], July 1947. Pp. i, 66. Paper. 
Out of print. * 


[ B398] 
*Unirep States Слуп, Service Commission. Pro- 
ceedings of the Fifth Annual Conference on Effi- 
ciency Ratings Administration. [Washington, D.C. : 
the Commission], October 1948. Pp. i, 46. Paper. Out 
of print. * 


[ B399] 
*Unirep States Слуп, SERVICE. COMMISSION. Speci- 
men Questions From U.S. Civil Service Examin- 
ations. Pamphlet 11. Washington, D.C.: 0.5, Gov- 
ernment Printing Office, May 1948. Pp. ii, 29. Paper. 
$0.20. * 

[ B400] 
*Unirep STATES MILITARY ACADEMY, DEPARTMENT 
or Tactics. The Operation and Administration of 
the Aptitude for the Service System, U.S.M.A. 
Foreword by Paul D. Harkins. West. Point, N.Y.: 
United States Military Academy, 1950. Pp. iii, 35. Pa- 
per, loose-leaf. Gratis. * (PA 25: 4847) 


[ B401] 

*University OF MINNESOTA, STUDENT COUNSELING 
Burrau. The Counseling and Guidance Use of 
Test Scores. Prepared under the direction of Ralph 
F. Berdie for the Association of Minnesota, Colleges. 
Foreword by E. G. Williamson. Minneapolis, Minn. : 
University of Minnesota Press, 1949. Pp. 24 plus in- 
serts. Paper. $1.00. * (PA 24: 1879) 


[ B402] 


C. W. Intelligence Tests for Young 
Children, First and Second Editions. London: 


& Co. Ltd., 1945, 1946. Pp. xii, 68. * (PA 19: 
Mey (The second edition [^ a reprinting of the first 
а For latest edition, see B403. For reviews, see 


344 and 3:283. 


or 13-уеаг- 
olds."—Preface to the Third Edition. The fourth edi- 
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[ B404] 
*Vax Aren, Rita. You and Your Hand: The Sci- 
ence of Hand Reading Made Easy. New York: 
Greystone Press, 1948. Pp. x, 246. $2.50. * 

[ B405] 
AVAN Овмев, Epwarp B., AND WILLIAMS, CLARENCE 
O. Elementary Statistics for Students of Educa- 
tion and Paychology, Third Printing, New York: 
Longmans, Green & Co., Inc., 1945. Pp. viii, 111. Paper. 
$1.90. * [Except for the Preface and minor correc- 
tions, this edition is identical to the 1940 edition.] For 
reviews, see 301 in Statistical Methodology Reviews, 
1941-1950. 

[ B406 ] 
*Vernon, Puur Е. The Structure of Human 
Abilities. London: Methuen & Co. Ltd., 1950. Pp. xii, 
160. 125. 6d. (New York: John Wiley & Sons, Inc., 
1951. $275.) * (PA 25:6026) 

Brit J Ed Psychol 21:63-8 Е ' 51. Cyril Burt. 
* The book....is a model of condensation ; and to 
do justice to the numerous points of interest 
with which it is packed would be quite impossible 
in a brief review. It is valuable first and fore- 
most as a critical but comprehensive summary of 
facts. * definite conclusions are also reached on 
certain controversial issue$; and in each case 
the inferences are invariably based on a scrupu- 
lously impartial survey of the empirical evidence. 
Everyone engaged in research on educational, 
vocational, or other branches of individual psy- 
chology, should not merely read these chapters, 
but study them most carefully. [See Burt's re- 
view for additional critical comments too lengthy 
to excerpt.] 

Brit J Psychol, Gen Sect 42:204-5 Ag "5r. 
F. W. Warburton. * We have long needed a 
critical survey of the applications of factor analy- 
sis in the educational and vocational fields. Prof. 
Vernon carries out this task with extreme thor- 
oughness and competence. * Some of the con- 
clusions reached are contrary to earlier opinion. 
It is suggested, for example, that the tendency 
reported in previous investigations for intelli- 
gence to be differentiated into more specialized 
abilities during adolescence and early adulthood 
is illusory, the appearance being due mainly to 
greater selectivity in the older groups studied. 
It is also considered that the children most likely 
' to benefit from advanced Schooling are those 
with the best attainment, not those with the 
highest ability, as was formerly believed. 
"Throughout the book great stress is laid on the 
importance of keeping close to the experimental 
evidence, and the author strongly urges the sta- 
tistical approach to the educational and voca- 
tional areas of research. * The more theoretical 
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part of the book also gives the first popular ac- 
count of the hierarchical group-factor theory and 
contrasts it with Thurstone's work on the pri- 
mary abilities. There is an increasing tenden 
for British psychologists to regard abilities аза 
system of hierarchical factors of various degrees 
of generality, the more comprehensive including 
themore specialized, ascountries include counties, 
“Genealogical” tables, headed by “g,” can now be 
drawn, attributing different levels to the various 
factors and to their “descendants.” A structure 
of this kind has the merit of clarifying the rela- 
tion between the hundred or so factors already 
reported by research workers and, in this re- 
spect, is a great advance on earlier theories, 
Prof. Vernon suggests that it follows from the 
notion of hierarchical structure that a person's 
score in a battery of tests, of minor factors, such 
as "v:ed" and “k:m” may be summed to form 
an estimate of a major group factor (“g”) with- 
out actually giving a “g” test. It is difficult to 
see why he explicitly attributes this advantage to 
the group-factor theory, unless the hierarchical 
factors are correlated with one another. This is 
an important point. Nearly all British psycholo- 
gists prefer to keep their factors uncorrelated, 
which means that adding a person’s scores in 
certain factors will not indicate his score in oth- 
ers. The case is different for test scores (as dis- 
tinct from factor scores), since tests are corre 
lated and may be summed. The practice of 
summing test scores that is recommended in 
book may thus afford some reconciliation wit 
those American workers, who use correlate 
factors and, on occasion, sum them to Aun 
composite measure of general intelligence. The 
Thurstonian may protest that his ача О 
rather sketchily given. He will miss the case x 
oblique factors: for simple structure as the К 
ing principle to questions of the вао 
orthogonality of factors; and for the an 
“g” being an average, dependent on the en íi 
lar tests that happen to have been e gin 
battery. American writers are even sat pa i 
adopted a “neo-faculty” theory. This see p 
little hard, whether one contrasts prod 
faculties (Chapter I) or takes them as i iti 
(Chapter V). Yet the author's belief t P call 
both psychologically foolish and gute ofte 
difficult to belittle “g,” as the American p 
do, seems the best hypothesis for рса gains 
poses, and he is surely right in pr od 
“primary” factors of relatively low Ru. 
which .depend largely on difference 
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background of the populations studied. There 
are two footnotes of considerable interest to the 
statistician. One gives the opinion that the 
multiple-regression technique is too efficient, 
since it prohibits the inclusion of rather similar 
tests which might improve the reliability of pre- 
dictions; and the other recommends formulae 
{ог testing factor significance, which compro- 
mise between British caution and American ex- 
pansiveness. The structure of the abilities is il- 
lustrated by some excellent diagrams, although 
the prominence given in some of these (and in 
the body of the book) to Alexander’s “Х” may 
suggest that our knowledge of personality factors 
is sounder than it actually is, and that these 
factors can be compared on equal statistical 
terms with those established in the cognitive 
field. Criticism is apt to give a false impression 
of a book as comprehensive and lucid as this. It 
should appeal to everyone interested in psychol- 
ogy, and to the specialist it is indispensable. Prof. 
Vernon has made a major contribution to the 
literature of educational psychology. 

Brit J Psychol, Stat Sect 4:61-2 Mr ^51. 
E. К. C. B. * The first three chapters contain a 
useful, non-mathematical account of the princi- 
ples underlying different types of factor analysis. 
Professor Vernon devotes most attention to 
three main types, which he illustrates with a clear 
but simple diagram: (1) Spearman's "two fac- 
tor analysis," (2) Burt's "group factor analy- 
sis," and (3) Thurstone's “multiple factor analy- 
sis.” So far as cognitive abilities are concerned, 
he strongly favours the second, 1.е., the method 
which, directly or indirectly, includes both a 
general factor and a set of group factors. He re- 
fuses, however, to identify the general factor, or 
g as he calls it, with the “subjective and inde- 
finable term intelligence,” and prefers to define 
it simply as an objectively established factor 
running through all the tests in an appropriately 
selected battery. In this section there are perhaps 
а few minor points which some readers may find 
unconvincing or difficult to follow; but that is 
largely because the discussion is so condensed. 
The second and main portion of the work con- 
sists of eight chapters describing in some detail 
the various factors for which there is now fairly 
adequate evidence. He takes in turn the main 
fields of statistical research, beginning with early 
work on educational attainments and ending 
with recent work on occupational abilities. Thus 
his second task resembles that of Professor Burt 
in his early survey of The Measurement of 
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M ental Capacities and his more recent review of 
The Results of Factor-Analysis." Burt's ac- 
count was based chiefly on work with children ; 
Vernon is able to draw largely on work in the 
fighting services among adults. The picture that 
he outlines is very similar to the earlier. He fully 
agrees that factors are primarily "principles of 
classification," and that the structure so revealed 
comprises a multiplicity of factors, general and 
group, broad and narrow, arranged roughly in 
accordance with what the logicians call a “Мег- 
archical” scheme, where genera include species, 
species are divided into sub-species, and so on. * 
the list of factors which he enumerates agree in 
a remarkable fashion with those that originally 
emerged from factorial work with children. Pro- 
fessor Vernon does not himself define a “hier- 
archy” ; he leaves its nature to be gathered from 
his concrete instances. This perhaps is an omis- 
sion which the non-mathematical reader may be 
tempted to regret, particularly as the broader 
lines of classification implied by his chapter head- 
ings seem to relate to the main branches of re- 
search rather than to the main branches of ` 
mental ability. For fuller explanations he refers 
to Burt's recent article on 4 lternative Methods; 
but that is a paper which does not seem strictly 
relevant for the present purpose. Curiously 
enough, he omits to mention the earlier papers 
on "The Structure of the Mind" and on “Sub- 
divided Factors,” to which the enquiring reader 
should by preference be sent for a more explicit 
and detailed account of the underlying theory. 
Professor Vernon deals, in a particularly clear, 
concise, and convincing way, with current criti- 
cisms of the main theory and with alternative 
views. As he points out, “Thurstone and other 
factorists in America, from 1938 on, opposed 
this notion of a general factor hierarchy." There 
are, so he suggests, two main differences be- 
tween the approaches of British and American 
factorists: first, “British workers make g as 
large as possible, whereas American writers... 
tend to minimize it," and indeed usually omit 
it from the primary factors altogether ; secondly, 
“British factorists recognize larger and more 
comprehensive group factors, together with sub- 
factors "descended" from them, whereas Amer- 
ican primary factors usually possess much the 
same status and variance." In the Appendix he 
sets out seven main arguments for rejecting the 
conception’ which Thurstone and his followers 
have chiefly favoured. However, as he points 
out, during the last few years American writers 
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have been approaching more and more to the 
British point of view: they now appear, perhaps 
rather tentatively, to accept a general factor as 
a factor of second-order, and even at times sub- 
divide their broader group factors. On these and 
other controversial points Professor Vernon, as 
he says, “almost always agrees" with Burt and 
Thomson, and “usually disagrees” with Thur- 
stone and Spearman. A special merit of the book 
is the detailed account of numerous factorial re- 
sults obtained during the war from tests applied 
to samples of an exceptionally large size. Many 
of these war-time analyses, drawn from the 
Navy, Army, and Air Force, are now published 
for the first time. It is clear that the conclusions 
thus obtained have been of great value, both 
practical and theoretical. They were of definite 
assistance during the war to psychologists en- 
gaged in planning work on personnel selection, 
and have helped to elucidate the puzzling prob- 
lem of educational classification for children at 
eleven plus; at the same time the increasing 
measure of agreement which they show seems to 
‘be rapidly solving many of the long-standing 
issues about the theoretical structure of the mind. 
But, quite apart from its new and original factual 
contributions, these chapters will also be valued 
for their critical and comprehensive discussion 
of factorial researches, particularly from 1935 
onwards. In a book just published Mr. Zangwill 
has assured us that "factorial analysis, despite 
its impressive mathematical procedures, is a bril- 
liant but misguided departure." To this and sim- 
ilar criticisms Professor Vernon's book provides 
an eloquent and a concrete answer. It is the first 
in a new series of Manuals of Modern Psychol- 
ogy that are to be issued by its publishers; and, 
for its intrinsic interest and critical approach, 
. а better volume could scarely have been chosen. 
Eug R 43:53-4 Ap "51. Cyril Burt. * “The 
hierarchical theory," he explains, “was first put 
forward by Burt, under the influence of Mc- 
Dougall” ; and he refers his readers to a recent 
article of mine on "alternative methods of fac- 
torial analysis." I should prefer them to turn 
rather to my previous papers on “The Structure 
of the Mind" in the Brit. J. Educ. Psychol. 
(XIX, pp. 100 et seq.), where, as the subtitle 
indicates, an attempt was made to review the 
"Results of Factor Analysis" up to date. Al- 
though he regards a "'strict hierarchical picture" 
as an over-simplification (with which I should 
heartily agree), he fully accepts the main under- 
lying ideas and briefly outlines their nature. * 
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one of the great merits of his book lies in the 
systematic survey of the results obtained during 
the war, on quite an unprecedented scale, from 
psychological tests applied to men and women 
in the Services * the bulk of his volume is con. 
cerned with detailed summaries and descriptions 
of the factors thus revealed. Their nature and 
interrelations are admirably depicted with the 
aid of statistical tables and ingeniously con- 
structed diagrams. Technicalities for the most 
part are avoided or relegated to the appendix. 
Hence the book should appeal not only to the 
professional psychologist but also to the general 
reader who is interested in the “factors of the 
mind.” It is difficult to praise too highly the 
critical insight shown in marshalling so complex 
an array of data, and the scientific caution and 
impartiality with which the evidence is weighed 
and the more practical conclusions drawn. 

J Clin Psychol 7:387 O ’51. * Students with 
limited backgrounds in mathematics will find this 
to involve a very lucid presentation of concepts 
which other authors have made seem very dif- 
ficult. It is recommended reading for all research 
workers and clinicians in the psychometric field. 

Occupational Psychol 25:147-9 Ap '51. Pat- 
rick Slater. * psychologists have discovered 
great many factors. The number increased grad- 
ually during the twenties and more rapidly since. 
From being conveniently small it has grown 
cumbersomely large. It shows no signs of having 
reached its limit. Professor Vernon has now 
undertaken the gigantic task of reviewing the 
literature on this subject and reducing it to some 
sort of order. His interpretation is in line with 
the main trend of factorial doctrine in this сош- 
try. General intelligence, Spearman's g, 1$ £a 
only general factor he accepts. Directly subordi 
nate to this are two main group factors, а verba" 
educational one and a spatial-mechanical one. 
Others are lower in rank, the numerical e 
perhaps the next most important, at leas s 
the educational field. Finally come minor кот 
factors and specifics. Thus he represents п 3 
abilities in hierarchical structure. * He а) ps 
possible to reconcile an enormous bulk o No 
torial evidence with this general outline. 
one who reads his book will fail to be imprest. 
with the comprehensiveness of his n um 
discussion of alternative methods of peni 
which is relegated to an appendix, і de: ate 
pressive, and amounts, in fact, to Jte T. 
than a statement of personal рее 
also not always very clear what methóds 


1077 ] BOOKS & REVIEWS: VERNON 


used in reconciling other people's findings with 
his general theory. Although he expresses a pref- 
erence for group factor methods, his discussion 
of particular pieces of research shows that he 
also makes use of Thurstone's method, some- 
times with rotation and sometimes without. The 
pliability of the various methods, which might 
disturb some people's confidence in the relia- 
bility of factorial evidence in general, aids him 
in bringing data from different sources into line. 
There are, perhaps fortunately, people pursuing 
other methods of psychological enquiry who 
continue to define terms for classifying mental 
processes independently of the factorists. Stu- 
dents of academic psychology may still be taught 
that within the process of awareness, sensation, 
perception and apperception must precede cog- 
nition, and that remembering implies learning, 
retention and recall. Psychiatrists prefer their 
own system of nomenclature, in which, for in- 
stance, the extent of impairment of memory, as- 
sessed without testing during the clinical inter- 
view, is regarded as diagnostically important, 
although memory does not enter into the fac- 
torists’ system except as a large number of 
specifics. The devotees of the Rorschach, the 
psycho-analysts, the Gestalt psychologists, the 
field theorists, and so on, elaborate their own 
terminologies without troubling to discover 
whether the mental functions they describe can 
be shown to be factorially distinct or not. Lastly, 
the common resources of the English language, 
when ably used, can give an account of a com- 
plete human being which is hard to beat by any 
system of test scores or factors. Still, the in- 
ferences that can be drawn about the structure 
of human abilities from the factor analysis of 
test material is subject matter enough for a 
single book, and perhaps it is unkind to regret 
that other relevant methods and sources of evi- 
dence have not received consideration. My incli- 
nation to criticize Vernon's book on this account 
arises from an impression that he occasionally 
treats them as hardly deserving it. It is tempting 
to visualize a diagram like one of Vernon's which 
would exhibit the relationships between all these 
Systems, with factorists, psycho-analysts, field 
theorists, etc., at suitable distances from one an- 
other, and lines radiating between them. The 
name of Vernon may perhaps not yet appear 
exactly in the centre of this diagram, but it is 
certainly very far from being out on the periph- 
егу among the specifics. His conclusions are 
reached as the result of an extremely scholarly 


B407 


attempt to take a consensus of a prodigious mass 
of psychological research. There is no doubt that 
they will be widely acceptable, especially among 
psychologists in the educational field. They will 
prove helpful when applied to practical problems 
there and in the field of selection and vocational 
guidance, and possibly elsewhere, both in sug- 
gesting sound methods of approach and dis- 
couraging naïve over-optimism. They are, on 
the whole, moderate and reasonable and will be 
treated with respect even by those who disagree 
with them, or, like myself, feel that they may 
not survive the test of time entirely unimpaired. 
Students who want to become acquainted with 
the achievements of factor analysis will find this 
book, in spite of its comfortably small number 
of pages, extremely meaty, perhaps rather tough 
in parts, wholesome and substantial if they can 
digest it thoroughly. 


[8407 ] 
*Vernon, Punir E, anp Parry, Јону В. Person- 
nel Selection іп the British Forces. London: Uni- 
versity of London Press Ltd., 1949. Pp. 324. 205. B 
(New York: British Book Centre, 1950. $4.50.) (PA 
24: 776) 

B Menninger Clinic 15:35 Ja’51. Lester Lu- 
borsky. The authors have done a yeoman’s 
service in summarizing the personnel procedures 
and the results of hundreds of researches in the 
British armed services during World War п. 
It is written in nontechnical language but will 
probably be of interest mainly to professionals 
concerned with problems of selection. Deserving 
of special attention and of being read first, is the 
final chapter giving the main conclusions on such 
topics as the value of classification methods, and 
unreliability of unsystematic interviews in se- 
lection, Since an adequate followup was obtained 
for their test procedures, the British psycholo- 
gists were able to develop methods of known 
validity for selecting many occupational groups. 

Brit J Ed Psychol 19:225 М *49. C. W. V (al- 
entine). The British Psychological Society has 
good reason to be proud of the part taken by 
some of its leading members in the work they 
did for the Forces during the war. In particular, 
we may recall Sir Cyril Burt's chairmanship of 
the Committee of Psychologists acting as gen- 
eral consultants, and his own valuable memo- 
randa; Sir Frederick Bartlett's experimental 
work for the R.A.F. at Cambridge, Alec Rodg- 
er's pioneer work for the Navy, and finally 
Vernon’s own work, much of which is reported 
in this book. Other contributions on the side of 
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instruction methods are known to have had a 
marked effect on methods of training in the 
Army. Now in this valuable book packed with 
careful and critical reports of many tests of all 
kinds applied to thousands of men in the Forces, 
we have a comprehensive survey of those aspects 
of psychological work which had to do with per- 
sonnel selection. We could fill several pages of 
this Journal by merely listing the important find- 
ings after careful examination of the many re- 
searches and reports dealt with. But one striking 
characteristic of the book saves us from the ne- 
cessity of such analysis ; the methods and results 
of the various reports on tests, interviews, etc., 
are so critically considered by the authors, they 
are so cautious in their inferences, and so mod- 
erate in their statements, that when they do claim 
definite results of ‘value, these claims can be 
confidently relied upon. This gives great weight 
to their final summaries, of which we may men- 
tion just two: that the value of classification 
schemes with appropriate tests, etc., has been 
“proved without doubt” in reference to mechan- 
ics and clerks, and in officer selection ; and that 
the average interviewer’s judgments are worse 
than useless. New evidence is provided on the 
reliability and validity of many tests of abilities 
and attainment. The whole treatment is wider 
than the title might suggest. The results gained 
in the Forces are set out in a background of a 
thorough general discussion of tests of all kinds 
and of vocational selection. For example, there 
are chapters on “Principles of Psychological 
Testing,” on “The Biographical Questionnaire," 
on the “Effects of Practice and Age on Test 
Scores," and on “Special Aptitude and Temper- 
ament Tests." Indeed, the book should be at 
least as useful to the advanced student of tests 
of general and special abilities as it should be to 
the expert psychologists dealing with the Forces 
themselves. 

Brit J Psychol, Stat Sect 2:188 N ’49. Alec 
Rodger. Dr. Vernon occupies a unique position 
among Government psychologists in this coun- 
try. It is one that enables him to keep his eye on 
developments in four important Departments— 
the Admiralty, the War Office, the Air Minis- 
try, and the Civil Service Commission. He is 
thus familiar with a variety of problems con- 
fronting applied psychology both in war and in 
peace. Dr. Parry’s work has been mainly with 
the Air Ministry; and his penetrating, well- 
written contribution to this volume is manifestly 
somewhat shorter than Dr. Vernon’s. Although 


Dr. Vernon ranges widely, it may be taken that 
he concentrates on work with which he himself 
has been fairly directly concerned. Those who 
are already familiar with the famous “Р.Е,у” 
reports, prepared during the war for the Senior 
Psychologist to the Admiralty and the Director 
of the Selection of Personnel at the War Office, 
will see in his chapters a much-needed summary 
of the most important of those reports. Inevi- 
tably there are gaps in the narrative. Some are 
substantial enough to raise doubts about its ap- 
parently all-embracing title. The extremely im- 
portant spade-work done by Dr. С. R. Har- 
greaves, particularly in the days before the 
formation of the Army Directorate of the Se- 
lection of Personnel, surely deserves more than 
the mere mention it receives on p. 40; and it is 
surprising to find the Adjutant-General’s Ad- 
visory Committee dismissed with a passing com- 
ment on the same page. This group of eminent 
psychologists undoubtedly performed (and, in- 
deed, still performs) a valuable function, not 
least in providing high-level external support 
for the judgments of the Army’s own psycholo- 
gists. Moreover, one ог two of the Committee's 
members did in fact do very much more research 
on personnel selection, both for the Army and 
for the other Services, than might be suspected 
from the brief references here given. Very little 
is said about statistical techniques, for (accord- 
ing to the preface) it is intended that informa- 
tion about these should be published elsewhere. 
It is clear, however, from other evidence, that 
in his own work for the Fighting Services Dr. 
Vernon has given full rein to his capacity for 
statistical improvisation, and to his taste for ap- 
proximation where it seems apparent that ap- 
proximation will serve. But his enthusiasm for 
simplicity in calculation is scarcely matched bya 
similar degree of enthusiasm for simplicity pi 
the expression of results. Unlike Dr. Parry, E 
is not inclined to present data in diagrammatic 
form, despite the fact that, for at least oir 
the people for whom the book is intended, t б 
is probably very desirable. But the volume 
packed with meat enough to feed, for open 
psychologists and students of psychology ya 
are interested in personnel selection and б 
tional guidance. Whether it will, as the | vi 
hope, serve the purposes of the industria s i 
sonnel officer who wants to learn about p 

experience of the Fighting Services baii 
reviewer’s opinion, another matter. Togicil 
there are not many readers without psycholog 


rS, 


1079 ] BOOKS & REVIEWS: VERNON 


оа 


training who will be able satisfactorily to extract 
what they want from so concentrated a presenta- 
tion. They will be assisted by the admirable ab- 
stracts placed (very sensibly) at the beginnings 
of the chapters; and perhaps would do best to 
start with the excellent ten-page chapter labelled 
“Conclusions” and to work backwards. 

Eug R 42:43-5 Ap "50. Н. T. Himmelweit, 
* The authors....are particularly well qualified 
for this task, since both are distinguished psy- 
chologists who have been intimately connected 
with the psychological work in the three Serv- 
ices. * Selection of officers was carried out in all 
the three Services by more intensive pro- 
grammes of testing and interviewing. The most 
fully developed scheme, which has received 
much publicity, was that of the War Office Se- 
lection Boards (W.O.S.B.), where candidates 
were seen by military officers, psychologists and 
psychiatrists for a period of three days, during 
which time appraisal of their personality was 
carried out by means of interviews, projective 
tests, and observation of group activities. These 
last included leaderless group situations, a 
method of observation introduced by Bion and 
based on Lewin's field theory. Considerable 
follow-up data is available on the value of other- 
rank selection in the three Services. The figures 
indicate clearly the superiority of more scien- 
tific methods of selection over the old methods 
of selecting on the basis of past trade experience 
and a rough assessment of educational attain- 
ments. The superiority of the officer selection 
technique is much less firmly established. Ver- 
non and Parry state: “It should not be supposed 
that the introduction of a few situational tests, 
apparently analogous to situations involved in 
the job, provides the key to accurate assessment 
of personality and job suitability, nor that the 
fallibility of human judgment and the need for 
scientific validation are lessened thereby." These 
remarks must be viewed in the light of the sud- 
den popularity that this method of selection has 
received, and which has been extended to in- 
dustry, often without sufficient caution. Vernon 
and Parry, while clearly stating that the validity 
of the W.O.S.B. methods is not high, show that 
they served two very important functions: (1) 
They had a psychotherapeutic effect in stimu- 
lating the recruitment of candidates, which had 
fallen off because of the dislike of the old meth- 
оЧз of selection; (2) They were certainly su- 
Perior to the standard methods which they 
replaced. The account which Vernon and Parry 


poi Latio iiec of selection and the pro- 
e е three Services is very vivid, 
and gives the reader a clear picture of the prob- 
lems which were encountered. * In the second 
part of the book, Vernon and Parry take up the 
principles of selection in more detail and review 
the merits and defects of different methods. They 
outline the essential steps in vocational classifica- 
tion and show that these are applicable to both 
industry and the Services. Among the points of 
particular interest which they bring out are: (1) 
The need to guard against the “naming fallacy,” 
i.e. the assumption that a test is relevant if it has 
the same name as the quality involved in the 
job; (2) The need to develop some kind of 
central training organization for the thorough 
training of non-technical staff; (3) The need 
for adequate samples for validation and for du- 
plicate studies as confirmation of experimental 
results; (4) The great difficulties in obtaining 
reliable criteria for assessing the value of any 
selection procedure; (5) The necessity for sci- 
entific checks and for caution in generalization ; 
(6) The superiority of cognitive tests measuring 
the “у” and “ed” factors over those measuring 
the “g” factor in predicting occupational success 
in a wide variety of occupations. Many of the 
findings which the authors report throw light on 
important psychological problems such as the 
differential decline of various intellectual abili- 
ties with age, the effects of improved physical 
health upon intelligence test scores, and sex dif- 
ferences in various special abilities. A separate 
chapter is devoted to an interesting account of 
the main findings in the R.A.F. The final chap- 
ter contains fifteen conclusions which the writers 
have drawn from their survey of peace-time vo- 
cational investigations and from the work of the 
Services, This summary is outstanding for the 
manner in which the writers have been able to 
compress into a few pages an entire philosophy 
of the aims and methods of the vocational psy- 
chologist. The book is a mine of information and 
will certainly become one of the major text books 
and reference books for psychologists and others 
concerned with the problem of selection. It is 
very lucidly written, carefully explains whatever 
technical terms are used, and has a novel way 
of presenting information in graph form. The 
Appendices contain a very useful list of abbrevi- 
ations, and also present in tabular form informa- 
tion on all the tests discussed in the text. This 
list reflects the thoroughness with which the 
book has been written ; it gives author, name and 
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use of test, the number of items it contains, the 
time needed to administer it and its reliability 
coefficient. It is to be hoped that the authors will 
publish a series of volumes on this work, dealing 
with many of the problems in more detail than 
has been possible in this book. 

J Appl Psychol 34:288-9 Ag ’50. Dewey В. 
Stuit. * the authors have made three significant 
contributions in the writing of this book. First 
they have presented a brief, clear description of 
selection procedures as they existed in the Brit- 
ish Armed Forces during World War II. The 
student of military and personnel psychology will 
welcome having this material in a readily avail- 
able form. Second, the book provides a good 
summary of the applications of psychology to 
personnel selection before the war. The chapters 
on the biographical questionnaire, the interview 
and principles of psychological testing are par- 
ticularly good in this respect. Third, the results 
obtained during the war are critically evaluated 
and integrated with the knowledge which existed 
prior to the war. In a sense, the book almost im- 
presses one as a’ textbook in personnel psychol- 
ogy rather than being a description of war ex- 
periences with the applications of psychology to 
personnel selection. The book is by no means 
limited to a description of findings. The contri- 
butions of war experiences to the total field of 
personnel psychology and the peace-time appli- 
cations of these experiences are continually kept 
in mind by the authors. As one reads this book 
he is impressed with the similarity of findings 
in the British and American Armed Forces. The 
types of tests which “worked” were much the 
same, the interview was beset with identical dif- 
ficulties and the criterion was a problem for the 
British as well as American psychologists. 
Thorough follow-up studies were equally few in 
number in the armed forces of the two coun- 
tries. The serious student of personnel psychol- 
ogy may be disappointed to find that data are 
presented only in summarized form. It is stated 
by the authors that they hope more detailed re- 
ports can be published elsewhere. The book is 
well written for its intended audience and will 
be widely read by personnel psychologists, per- 
sonnel administrators and guidance workers. It 
will be useful as a reference work or supple- 
mentary text in courses concerned with person- 
nel selection. 

J Mental Sci 06:543 Ap "5o. M. B. Brody. 
From the preface: 
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‘The present book..is addressed both to the ind 
trialist or educationist....and to the student of GENERE 
ogy or to the personnel officer * If for this reason it 
falls between two stools, we must apologise; it is not 
easy to write a simple, readable and interesting descrip- 
tion for the former and at the same time to give the lat- 
ter all the technical details that he needs. Statistical 
terms, formulae and calculations have been kept to the 
tira minimum and will, it is hoped, be published else- 
where. 


I found the book interesting and informative, 
and so, I think, will most psychiatrists, I can- 
not decide whether, in fact, the authors’ fears of 
its falling between two stools are justified. The 
risk should not have been taken. Good as this 
book is, separate books for the different classes 
of readers would have been better. The material 
is rich enough and important enough to warrant 
them. 

Mental Health 9:78—9 sp ' 50. B(en) S. M(or- 
ris). * a welcome feature of the book [is] that 
it covers the work of all the Services. It is doubt- 
ful if anyone is as well qualified for this task as 
is Professor Vernon, both because of his very 
wide previous experience, and because he en- 
joyed the unique position of acting in consultant 
roles to two of the three services. * Part IT is 
devoted...to a thorough review of the main in- 
struments it uses—biographical questionnaires, 
interviews, and the various kinds of tests. Scope 
and limitations, our present knowledge of the 
reliability and validity of the main techniques 
and their more important variants, are clearly 
set out, and ably discussed. Other pleasing fea- 
tures are the abstracts, short, but eminently read- 
able, which precede each chapter ; the appendix 
of tests and their characteristics; the valuable 
bibliography and excellent subject index. * Spe- 
cialists in one or other of the fields describe 
may feel, as this reviewer does, with regard to 
Army Officer Selection, that even the astonis 
ingly efficient survey made, fails to do justice Ж 
the wealth of material with which they had first- 
hand contact, and perhaps more iimport t 
the underlying implications of their work. Bu 


WR iu rievance. The at- 
this is scarcely a legitimate g de-book 


thors have succeeded in producing a gt! i 
of truly encyclopaedic compass an of pen 
merit. It is both comprehensive and aede ii 
cid and terse, as well as stimulating without е М 
of objectivity. Its usefulness goes far RP 
mere description of what was done In the y 
and students of vocational psychology ™ ‘sd 
country aré likely to regard it as the stan 

work for many years to come. It is а truism 
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adequate vocational adjustment is a pre-requisite 
both of efficient social effort and of individual 
health. What contributions toward these aims 
can we expect from the powerful psychological 
technology growing in our midst? Can we pre- 
vent its misuse as an instrument of subtle co- 
ercion in the hands of powerful groups? Already 
our industrial society is becoming aware of a 
number of hitherto unsuspected and potentially 
dangerous concomitants of its growth toward 
the Welfare State. It could be, that the emer- 
gence of human engineering" is also a symptom 
of approaching doom—the unrecognized doom 
which closes round those who fail to perceive 
the shift in values implied in translating "the 
psychological guidance of individuals" into "the 
classification of personnel." As always, the only 
remedy is vigilance and understanding, on the 
part of both the public and psychologists. The 
primary understanding required is that of in- 
sight into human motives, and into the workings 
of a society based on the concept of ordered free- 
dom, with its implications of a balance between 
equality and variety of opportunity, and an ade- 
quate freedom of choice for the individual. Ver- 
non and Parry sense the dangers, and say a 
number of important and courageous things, but 
the issues are so grave and go so deep, that this 
aspect of applied psychology deserves a much 
wider and more prolonged discussion than it has 
yet received. 

Nature 164:978 D ro ’49. * an instructive 
chapter on the rise of vocational psychology in 
Great Britain * As the writers point out, these 
early educational influences should make it both 
easy and appropriate for the knowledge gained 
from the work in the Forces to be re-applied to 
current educational problems. The application of 
such methods to industry is bound to be slower, 
more tentative, and far less systematic. What is 
possible in the school or in the fighting Services 
under war-time conditions is scarcely feasible in 
the factory or the office. Nevertheless, the re- 
sults which the writers report in their own par- 
ticular spheres demonstrate beyond all question 
that, provided the limitations of the methods are 
realized and the actual work is supervised by 
trained psychologists and not by amateurs, then 
а vast improvement in general efficiency can be 
secured by a scientific scheme of personnel selec- 
tion. Perhaps the most valuable part of the book 
is the chapter of conclusions. In the form of 
forty concise generalizations, the authors have 


brought together the main theoretical and prac- 
tical inferences which they believe can be de- 
duced from their data and their general experi- 
ence, Their survey should be of the highest 
value, not only to those who are continuing the 
work in the Services, but also still more to those 
who are concerned with building up similar 
schemes in education, industry or government 
departments. ; 

Personnel Psychol 3:233-5 su’50. E. K..Tay- 
lor. * Officer candidate selection, as is now well 
known, was based primarily on the methods that 
the Germans had abandoned early in the war. 
In the British version the psychologist played 
an entirely ancillary role supplying information 
which the psychiatrist and/or military person- 
nel on the board could use (or misuse) as they 
saw fit. The system was too unquantified and 
too unstandardized to permit any real validation. 
In defending it, the best the authors can say is 
(p. 66) “It was extremely valuable in that the 
army was led to believe [italics added] that it 
was getting the best possible officers even though 
its methodological shortcomings were such that 
it probably made numerous incorrect choices." 
* well written and readable. It appears to be 
aimed more at those who may use the results of 
the psychologists’ efforts than at the technician 
himself. 

Psychol В 47:363-4 JU’ 50. Douglas H. Fryer. 
* The book is written primarily for the Britisher, 
to show the industrialist and educationalist the 
usefulness achieved by psychology during the 
war years. It is limited in scope pretty much to 
British sources * A useful reference will be 
found in Appendix II which lists by title and 
author, "The Main Psychological Tests Used 
in the Forces," giving the number of items, time 
of administration and reliability of tests. * Vari- 
ous "problems" indicated in the text will strike 
a chord of sympathy in American war-time re- 
searchers * The authors include an excellent 
summary of conclusions based on personnel se- 
lection in the Forces which they regard as having 
vocational and educational applications in peace- 
time, No new psychological methods or tech- 
niques'of selection are described, but added 
factual material is supplied for their understand- 
ing. The book is packed with facts from the work 
of selection in the Armed Forces of Great Brit- 
ain and it will probably be the only available 
general source for this material. An extensive 
index has been prepared for reference work, By 
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intent, the book does not deal with the work of 
psychologists on problems of training, the de- 
sign and layout of equipment and the study of 
morale. 
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*Watker, HELEN M. Elementary Statistical Meth- 
ods. New York: Henry Holt & Co., Inc., 1943. Рр. 


xxv, 368. $3.25. * (P.A 17:2565) For reviews, see 308 


in Statistical Methodology Reviews, 1941-1950. 
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*Warner, W. LLovp; MEEKER, MARCHIA ; AND EELLS, 
Kennetu. Social Class in America: A Manual of 
Procedure for the Measurement of Social Status. 
Chicago, Ill.: Science Research Associates, Inc., 1949. 
Pp. xili,-274. $4.25. * (PA 23: 4741) 

J Consult Psychol 13:451-2 D ' 49. E. Lowell 
Kelly. 'This book is much more accurately de- 
scribed by its subtitle * Thirteen of the fifteen 
chapters are devoted primarily to the descrip- 
tions of methods of determining the social class 
structure of a community and evaluating the 
social class status of individuals within that 
community. However, these methodological dis- 
cussions are sufficiently illustrated by data and 
observations from actual field studies to be both 
interesting and informative to the general reader. 
* This book will be of interest to all psycholo- 
gists, not so much for its methodology—but for 
its emphasis on the probable significance of so- 
cial class variables for many psychological prob- 
lems. 

Nerv Child 9:09 Ja ’51. * attempts to con- 
struct a measurement basis and scheme for so- 
cial classes in America. Those who have for a 
generation attempted to find evaluation or meas- 
urement possibilities for collective groups have 
found themselves faced with difficult social psy- 
chological and collective subconscious factors, 
which could never be measured as is done here. 
If a comparison may be made, we are presented 
here with attempts at measuring with a rough 
wooden yard stick where micro-measures are 
needed to give scientifically acceptable results. 

Occupations 28:204-5 D '49. Harold Gold- 
stein. * Two methods are discussed : “Evaluated 
Participation" and “Index of Status Character- 
istics.” The first of these, “E.P.” as it is known 
to the authors, involves asking people what the 
social classes in the community are, and who 
belongs to which class. It might be described— 
in no invidious sense—as the exploitation of 
gossip for the sake of science. This, or something 
similar, is one of the basic methods used in an- 
thropological and sociological research. It is not 
too well adapted, however, for the use of most 
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guidance workers : Long interviews with many 
people are required; moreover, because of the 
subject-matter, the interviewer has to be 
expert at establishing rapport. In an attempt to 
devise a simpler and more practical method, the 
authors worked out the Index of Status Char- 
acteristics, or “I.S.C.” Briefly, this is a method 
of scoring people on several objective and read- 
ily apparent characteristics which reflect or in- 
fluence their social status. The authors select as 
most useful these four: occupation, the kind of 
house and neighborhood one lives in, and the 
source of one's income (e.g. whether from 
wages, salaries, business profits, or inherited 
wealth). * There is one drawback to the wide- 
spread use of the “I.S.C.” by teachers, counsel- 
ors, and personnel workers: The "IS.C." as 
developed and validated for “Jonesville,” may 
not apply to other cities. The authors point out 
that the class structure varies among American 
cities—some have 5 classes, some 6, and there 
is the possibility of more or fewer as well. More- 
over, the factors affecting class status may dif- 
fer among communities—for example, the im- 
portance of family to one’s social status in the 
Old South would not be reflected in the “I.S.C.” 
Or the various factors might be given different 
weights in one city than in another. The authors 
warn the reader that this problem has not been 
sufficiently investigated. For analysis of the class 
structure of other cities, it may be necessary to 
fall back on the more complex and difficult 
“E.P.” method. 

Social Ed 14:90-2 F ’50. Manson Van В. 
Jennings. * Although the average reader might 
well be interested in a more complete presenta- 
tion of the findings, certain conclusions emerge 
that are significant to the student of the soci 
studies. * 

B410 
*Warson, Ковевт I. ч cud Method in T 
chology. 'New York: Harper & Brothers, 1951. {Р 
xii, 779. $5.00. * 

B411 
*Warts, А. F., AND — T The Alloc: 
tion of Primary School Leav oit 
Secondary Education: First Interim pru ig 
tional Foundation for Educational Research 
land and Wales, Publication No. 2. London МЦ 
Educational Publishing Co., Ltd., 1950. Pp. 6S. 7 А 

A.M.A. 46:124, 126 Mr-Ap '51. xn 
ment is very important and very ale P ini 
be closely studied by educationists an a pen 
trators. It offers the first-fruits of ab Sta- 
periment by able minds in search of truth. 
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tistical analyses fill much of the book. They 
demand some special knowledge, but those who 
lack it should not avoid the book therefore: they 
should carefully study Section I, skim through 
Sections II and III ignoring mathematical diffi- 
culties, and then concentrate on the Summary. 
The book has some faults, It is a pity, for ex- 
ample, that in the list of the Advisory Committee 
our honoured Chairman is reckoned a non- 
graduate! There are errors of expression ; “ndn- 
measurable qualities less susceptible to precise 
measurement" is surely a linguistic gem! But— 
thank goodness—the jargon of the psychologist 
is absent, and the book is readable. The investi- 
gators, seeking improved means of allocation of 
pupils to courses, ask first: By what criterion 
shall a method of allocation be judged? School- 
leaving examinations give only an incomplete 
picture for a few of the pupils; better, say the 
authors, to review allocations at 13, when train- 
ing differences will have had less time to distort 
results, and while it is still possible ta correct 
errors of allocation. (On the question of “re- 
view at 13” it is hard to reconcile the Foreword 
with the main text.) “What minimum levels of 
intelligence, aptitude, attainment, and capacity 
for sustained interest and effort” would justify 
the allocation of a pupil to a particular course? 
And, as one facet of the problem, is it possible to 
distinguish “practical” from “academic ability"? 
. The investigators base their present discussion 
on an experiment which used 1,128 children and 
a battery of seven or eight tests. Tentatively, and 
subject to evidence which will appear in an es- 
sential “follow-up,” they estimate that it may be 
possible to distinguish the “practical” from the 
“academic,” But they frankly state that few chil- 
dren fall definitely into either class; that factors 
unexplained appear in the statistical analysis ; 
and that, even if sharp division were possible, 
that might indicate a need, not of differing cur- 
ricula, but of varied teaching method and ap- 
proach. (Incidentally, they give a salutary warn- 
ing against over-facile statistical use of results 
obtained by the Alexander Performance Scale.) 
The candour of the investigators, and their re- 
luctance to anticipate or to overstate conclusions, 
are admirable. We should dogmatise no more 
than they. We may perhaps be permitted to say 
that, at the moment, we are not convinced that 
any “practical” ability, distinct from “g,” exists; 
we are not sure that the factorial analysis of hu- 
man abilities can elucidate any factors having 
teal existence; and we wonder whether the 


Bai 


uniqueness of the individual does not largely in- 
validate the vast statistical computations of the 
“modern examiner.” But wherever the truth 
may lie, investigations such as that we are consid- 
ering must in the end lead towards it. The sincer- 
ity of the investigators, and their willingness to 
admit that their hopes have not always been 
justified, give us confidence that their progress 
towards truth will not be slow. From this point 
of view, it is interesting to compare with the 
Interim Report its Foreword, which is written 
by other hands, We seem to feel in the Foreword 
a certain tendency to dogmatism, a certain re- 
luctance to accept unwelcome facts, At any rate, 
we prefer the Report itself. 

Brit J Ed Psychol 21:159 Je’51. P. E. V (er- 
non). Provided the reader does not expect a 
comprehensive treatment of the problems of al- 
location or of previous relevant research, he will 
find much of interest in this Report. The first 
part is an urbane discussion by Dr. Watts which 
points out the narrowness of grammar school 
examinations or school certificate results as а 
criterion of successful selection. The National 


' Foundation is genuinely concerned with alloca- 


tion, not selection, and the first stage should be 
to explore the range of individual differences in 


‚ abilities and interests among 11 year children. 


Allocation could then be based on the possession 
of minimum qualifications in 1.0)., attainment, 
etc, and the follow-up of "ability to profit" 
should be a continuous process. This section con- 
tains also many wise remarks on the relative 
value of intelligence tests and other more sub- 
jective methods of assessment, the effects of 
coaching, the bad influence of objective attain- 
ment tests, and the Dundee results. In the sec- 
ond section Mr. Slater explores the range of 
spatial and practical as distinct from academic 
abilities, and the possibilities of measuring them. 
The Alexander Scale and several spatial and 
other tests were given to about one thousand 
pupils around 11.0 years. Using Alexander's 
system, 37.2 per cent. would be deemed suit- 
able for technical, and only 2.4 per cent. for 
grammar school, education. This occurs because 
the published norms for the Scale are inaccurate, 
and because it correlates more highly with 
Moray House intelligence tests than Alexander 
believed. A factor analysis was carried out of 
three non-verbal intelligence, three practical, and 
four spatial tests, given to 585 children. This 
curious battery could hardly be expected to yield 
any result other than that spatial-practical abil- 
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ity is complex. The third section analyses some 
follow-up data on sixty boys, collected by Miss 
Lambert, comparing factorial and multiple re- 
gression techniques ; and shows that they do not 
always concur in their specifications of the tests 
that are most valuable for allocation to different 
types of courses. Probably this review, and the 
Report itself, suggest that the Foundation is 
tackling allocation and selection in a somewhat 
piecemeal fashion. Actually Dr. Jeffrey's Fore- 
word, and Dr. Watts's chapter show a well- 
conceived approach, which underlies the series 
of investigations into fundamental aspects of the 
topic. Further reports—on record cards, on 
coaching, on consistency of tests applied at dif- 
ferent ages, etc.—are likely to follow ; and these 
should provide welcome guidance to education- 
ists and psychologists in formulating really 
sound policies. 

New Era 32:159 JI-Ag '51. M. Brearley. * 
the authors....chose, as a beginning point, to seek 
to improve efficiency in the limited sphere of 
allocation to secondary education at 11 plus * 
the Foundation's work, however good of its 
kind and however welcome, may well turn out 
to have involved some waste of effort when we 
turn our attention to gearing what we provide in 


school more closely to children's needs. Further,’ 


there is the possibility, even a danger, that more 
effective selection, with too little consideration 
of the suitability of provision, will further harden 
the pattern and that the ultimate aim, “to define 
the extent to which children vary in their abili- 
ties and interests at 11.0 and hence to determine 
the range of courses required to provide second- 
ary training suited to various types of child,” 
will not be put forward by research on selection. 
In fact, however, struggles such as those de- 
scribed in the report to find ways of measuring 
practical ability lead inevitably to reflections on 
the curriculum: for instance, it was found that 
many tests of practical ability were vitiated be- 
cause of the differences of opportunity children 
have had in gaining practical experience, and 
this led to the recommendation that junior 
schools should give pupils “plenty of exercise in 
the greatest possible variety of useful everyday 
activities so that they may learn for themselves 
and reveal to their teachers something of the 
practical, artistic and social potentialities within” 
so that “we may be able to sample their abilities 
in this field as Binet did in the field of general 
intelligence” in the hope of establishing some 
general factor. It is also pointed out, in the dis- 
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cussion of the use of teachers’ ratings, that 

“where the course followed is of the traditional 

kind with a heavy emphasis on class instruction 

in the three R’s, we can hardly expect the teacher 

to be intimately acquainted with the potentiali- 

ties of their pupils in other directions.” The next 

step, one would hope, will be to investigate the 

uses to which these varying abilities can be put 

in secondary education. These struggles lead, 

too, to the definition of important concepts which 

have been half known or rashly assumed. The 

conclusion, for instance from an experiment 

which was in itself inconclusive, that “the non- 

verbal abilities of children at 11.0 may form a 

series that range from those at one end of the 

scale that are the expression of the general 

mental ability to those at the other end that re- 

quire for their expression the manipulation of 

concrete materials, and that, in between, lie those 

abilities that enable their possessors to solve 

problems involving spatial relationships without 

the need for the actual manipulation of ma- 

terials,” has cleared the way for further experi- 

menting. This experiment, described in Section 

II, to establish whether practical ability can be 

effectively tested at 11.0 was disappointing in 

that the results emphasized the fact that "far 

wider differences appear between children in re- 

spect of the characteristic "g" (general intelli- 

gence) than in respect of any other characteristic 

we have found including those we have denoted , 
“F” (practical ability) and “k” (spatial judg- 

ment). Variations in these two dimensions are 

hardly greater than in others to which we can 

attach no psychological significance. It s 

rightly pointed out, however, that even a slig 4 

bias, when supported by a child’s own gn 

and the parents' wishes, may be of great eco 

icance. Sociological, temperamental and T 

factors may, in fact, be of prime importance. 0 s 

teresting evidence on this point can be foun ye 

the article “Parents’ Wishes in Selection, Pi 

ing details of the careers of fifty-two Gram s. 
School rejects"—odious term—in the ШЕ 
25th May, 1951). The results, too, may ind 
been influenced by the comparatively e «ther 
oped state of tests of practical ability, the tu 


development of which this research prudent 


«Ње evi- 


dence which emerges indicates the possi 
revealing the presence of practica odidi 
age.” Teachers hope great things of e pe p 
tion: there are welcome signs in this T€P 
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the authors have the wood in mind though they 
are at the moment examining one particular tree. 
We must trust that they are aware as we are of 
the dangers traditionally involved in tree study. 


[ B412] 


х\үкзгоск„ Mary H.; Јонмѕ, JosEPH; AND RIGBY, 
Marityn К. Procedures for Evaluating Research 
Personnel With a Performance Record of Criti- 
cal Incidents. The third in a series of reports pre- 
pared under the sponsorship of the Manpower Branch, 
Human Resources Division, Office of Naval Research. 
Pittsburgh, Pa.: American Institute for Research, June 


1950, Pp. vi, 42. Paper, lithotyped. Gratis. * (PA 25: 
2706) 
[ B413] 

*WetstoceL, Mary H.; KoLovos, Ernest R.; AND 
Ricpy, Marityn K. The Development of a Test 
for Selecting Research Personnel. The second in 
a series of reports prepared under the sponsorship of 
the Manpower Branch, Human Resources Division, 
Office of Naval Research. Pittsburgh, Pa.: American 
Institute for Research, January 1950. Pp. vi, 33. Paper, 
lithotyped. Gratis. * (РА 24:4355) 


J Ат Stat Assn 46:277 Je '51. John G. Dar- 
ley. This brief monograph is entirely a descrip- 
tive report of one part of a test development 
project being carried out under contract with the 
Manpower Branch, Human Resources Division, 
Office of Naval Research. This phase of the 
project sought to develop a test that would: se- 
lect candidates for advanced scientific training 
or for placement as junior scientific workers; 
emphasize potentiality for, rather than profi- 
ciency in, research work ; standardize test items 
as predictors of specified "critical requirements" 
for research personnel. The report contains no 


basic statistical material by which the merit of 


the research can be judged. Technical appendices 
containing such material are listed as being sepa- 
rately published, but these were not made avail- 
able for review. Furthermore, since security of 
test items must be maintained, it is impossible to 
evaluate the final form of the test as standard- 
ized. In the light of these restrictions and limita- 
tions, it is not possible to prepare a critical or 
meaningful review ; nor will the potential readers 
of the report find in it material of more than de- 
scriptive import. 


[8414] 

XWrmzMAN, ELLIS, AND McNamara, WALTER, Te 
Constructing Classroom Examinations: A Guide 
for Teachers. Foreword by E. F. Lindquist. Chicago, 
Ill.: Science Research Associates, Inc., 1949. Pp. xvi, 
153. $3.25. * (РА 24: 760) 

High Points 32:78 F ’50. Short, clear, and 
specific, this is a valuable book for those who, 
like the reviewer, can use a good refresher in 


. 


tests and measurements. * unusually practical 
slant * 

J Appl Psychol 34:216—7 Je ’50. Waller W. 
Cook. Of all the guide books designed to assist 
teachers in building effective classroom exami- 
nations this is perhaps the most elementary. Be- 
ginning with validity and reliability and end- 
ing with the statistical analysis of test scores, it 
covers in simple, non-technical language the 
customary topics, briefly and superficially. Its 
greatest value will accrue to those teachers or 
prospective teachers who know nothing about ob- 
jective test construction and who desire to learn 
but little. Those who read this book for the pur- 
pose of becoming informed about important ad- 
vances in achievement test construction during 
the past twenty years will be disappointed. Va- 
lidity and reliability are defined and methods of 
measuring them given but the reasons why they 
are important and the factors which influence 
them are neglected. Instead of emphasizing the 
place of course objectives in the process of item 
construction and sampling, we find recom- 
mended a topical outline of subject matter with 
textbook page references. No instructions are 
given regarding ways of organizing content and 
objectives to facilitate item construction. Many 
of the test items used as examples are excellent 
but there is a preponderance of the more factual 
type. The teacher-made answer sheet and per- 
forated scoring key is not treated, but the con- 
sumable type of test with panel scoring key is. 
Eight pages are devoted to a cumbersome pro- 
cedure of item analysis designed to yield a meas- 
ure of the difficulty of the items. One page is 
devoted to a simple index of discriminating 
power without emphasizing its importance or 
the factors which influence it. Percentile ranks 
are not computed from the mid-frequencies of 
the scores. The student learns to compute means 
and standard deviations but standard scores are 
not mentioned. Certainly, better manuals on the 
construction and use of achievement tests have 
been available for many years. i 

[B415] , 


М Value of an Oral Read- 
*Wetts, Cartes A. The Value 2 апа Difücul- 


Performance, American Psychological Association, 
Psychological Monographs : General and Applied, Vol. 
64, No. 2, Whole No. 308. Washington, DC the 8- 
sociation, Inc., 1950. Pp. iv, 35. Paper. $1.00. (PA 
24: 6517) 
[ B416] ER 
, Joun, AND WELLS, Enm. Prepare for the 
Oficial Us. Army-Navy-Air Force Tests, Tenth 
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Edition. Irvington-on-Hudson, N.Y.: Capitol Pub- 
lishing Co., Inc., 1951. Pp. v, 164. Paper. $2.00. * 

[ B417] 
*Weppner, Mary Tueopore. The Standardization 
of Two Equivalent Forms of a Vocabulary Test 
Used in the Measurement of Various Age Levels 
in the Elementary Grades Three Through Eight. 
Washington, D.C.: Catholic University of America 
Press, Inc., 1941. Pp. 59. Paper. $1.00. * 

[ B418] 
*Wnmrrg Клін K. Value-Analysis: The Nature 
and Use of the Method. New York: Society for the 
Psychological Study of Social Issues, Department of 
Psychology, Columbia University, 1951. Pp. v, 87. Pa- 
per. $1.00. * 

[ B419] 
Wickznr, FnEpERIC, Eprror. Psychological Research 
on Problems of Redistribution. Army Air Forces 
Psychology Program Research Reports, Report No. 
14. Washington, D.C.: U.S. Government Printing Of- 
fice, 1947. Pp. vii, 298. Out of print. * (РА 22: 4652) 

J Consult Psychol 11:339 N-D ' 47. Laurance 
F. Shaffer. * reports work on the selection of in- 
structors, combat leadership, the validation of 
tests by combat criteria, the measurement of 
anxiety reaction to combat, the effects of combat 
on men, and the interests and preferences of re- 
turned personnel. The studies of instructor se- 
lection, of leadership, and of anxiety have impli- 
cations for further research and use in civilian 
life. 

Psychol B 46:521—3 N '49. Donald E. Baier. 
* a competent and interesting account of the 
activities of certain American aviation psycholo- 
gists during World War II. It should prove 
stimulating to academic psychologists and in- 
valuable to applied psychologists concerned with 
similar or related problems in the armed forces 
and elsewhere. * The discussions and conclu- 
sions are generally distinguished by careful at- 
tention to the statistical significance of data based 
on samples whose basic characteristics including 
size, variability, manner of selection, and other 
important relevant factors are specified. Ample 
tabular material presented and referred to in 
readily apprehended fashion contribute greatly 
to the content and readability of the volume. * 


For a review of the series, see B2, 


[ B420] 

*Wiwc, HrmsERT. Tests of Musical Ability and 
Appreciation: An Investigation Into the Meas- 
urement, Distribution, and Development of Mu- 
sical Capacity. British Journal of Psychology Mono- 
graph Supplements, Vol. 8, No. 27. London: Cambridge 
University Press, 1948. Pp. vii, 88. Paper. 8s. 6d. * 
(Chicago, Ill: University of Chicago Press, 1949. 
$1.40.) For reviews, see 231. 
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*Worrr, Снлк.отте. The Hand in Psychologi 
Diagnosis. London: Methuen & Co. Ld, 19st ч 
xv, 218. 325. 6d. * м 
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*Worrr, Werner. Diagrams of the Unconscious: 
Handwriting and Personality in Measurement, 
Experiment and Analysis. New York: Grune & 
DON Inc, 1948. Pp. xiv, 423. $8.00. * (PA 23: 

Am J Psychiatry 106:479-80 D ’49. Joseph 
Zubin. * Despite several ingeniously conceived 
experiments in so-called depth psychology, no 
real change is effected in the present status of 
graphology as a science, The book at best con- 
sists of special pleading for an acceptance of 
the basic principle that graphic movement is an 
expression of personality, but no further system- 
atic proof of this contention is made available 
in the text. Present-day antipathy toward the 
acceptance of graphology is not mitigated in the 
least. Take for example the rather simple ex- 
periment of asking the subject to write the fol- 
lowing 3 phrases: I myself; I father; and I 
mother. If the letter I is written closer to the 
word father than to the word mother a closer 
relationship between the subject and his father 
is inferred. Such simple, naive, analogical think- 
ing cannot help reminding one of Crepieux 
Jamin's initial efforts with the interpretation “ 
isolated graphological signs. The introduction of 
the experiments in depth psychology in the first 
part of the book are extremely interesting but 
give the rest of the book a semblance of (54 
tific validity which it does not possess. The book, 
as a whole seems to be composed of 2 distinct 
parts. The first part deals with a series of jid 
ments performed by various investigators, 1 
cluding the author, in which the nir 
uniqueness, and rhythmic qualities of han ij 4 
ing are analyzed. The second part deals ns 
description of handwriting elements and the ici 
graphical interpretation derived from Л ku 
tive, artistic, and sometimes brilliant analys à 
the characteristics of outstanding prae 
based merely on their signatures. It is to is 
gretted that, in giving the detailed ed fal 
which the interpretation is based, the mW? if 
to provide scales or measures for o jm 
characteristics of each element. Such ieee zi 
as “low pressure, lack of detail, origin? cific 
thin lines,” could be provided with jah d " 
definitions or examples if they are (0 2 vti 
in scientific work. There are no appare" 


1087 | BOOKS & REVIEWS: WEPPNER 


between these 2 parts of the book and it is sur- 
prising that the 2 should appear together under 
one cover. For psychiatrists and psychologists 
this book will make interesting reading since it 
does deal with the individual case in a very in- 
dividualized manner and its excursions into cur- 
rent affairs, as well as references to literature, 
history, and so forth, are always interesting and 
stimulating, though their contribution to graph- 
ology is sometimes not apparent. Furthernfore 
the list of handwriting characteristics and the 
alleged relationship to personality provide a 
challenge for further work in evaluation of the 
author’s methods. For those interested in pro- 
jective techniques and in methods of measuring 
personality this book is almost a must reading 
since it summarizes the most recent work in the 
field of handwriting. 

Am J Psychother 4:345 Ap '50. Herbert 
Boehm. * presents convincing evidence of the 
persistence of significant patterns, often in spite 
of the writer’s deliberate efforts to disguise his 
hand-writing. The first ten chapters represent a 
careful step-by-step presentation and imple- 
mentation of the author's intriguing hypothesis. 
He very properly warns the reader against any 
assumption that each isolated graphologic char- 
acteristic can be directly translated into a per- 
sonality trait. Unfortunately, the need for 
satisfying the popular hunger for a “how-to-do- 
it-in-six-easy-lessons” material drives the au- 
thor into a compromise with his own standards, 
so the book finishes with what the dust wrapper 
describes as “scoring charts and directions for 
analysis.” Character analysis cannot be learned 
by any simple scheme. If this chapter (fortu- 
nately, only 20 pages) can be ignored, the care- 
ful reader will find this book a challenging and 
absorbing treatise. 

Am Sociol R 14:331 Ap "49. * With few ex- 
ceptions, the approach throughout is illustrative, 
the author's statements being supported by ex- 
amples rather than by more conclusive evidence. 
Many signatures and other handwritten ma- 
terials, largely from well-known persons and in- 
triguing in their own right, are reproduced and 
discussed. But loose organization, lack of clearly 
formulated evidence, and a dearth of conclusions 
leaves graphology about where it was before the 
appearance of this book. 

B Menninger Clinic 15:28-9 Ja '51. Paul 
Bergman. * includes some careful experimental 
and statistical work, conclusions based on sam- 
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ples of handwriting that may be representative 
samples, and finally, intuitive interpretations of 
individual handwritings. The experimental part 
makes the reader feel he is on safe scientific 
ground. The transition to the bolder methods of 
the later parts is so gradual that many readers 
may follow there with the trust engendered by 
the careful beginning. The author, no doubt, is 
brilliant, industrious and a good story teller, The 
spirit of this book seems at times to be a mixture 
of real inspiration, selfconfidence and impatient 
ambition. 

J Abn & Social Psychol 44:428-9 Jl’ 40. Ger- 
ald R. Pascal. * the best book I've ever read on 
handwriting movements. He has marshaled a 
tremendous amount of factual data, a great deal 
of it the result of his own experimentation, to 
give his hypotheses a solid basis. He has pre- 
sented testable hypotheses about the diagnostic 
validity of handwriting which, insofar as I know, 
has never before been done in a book about hand- 
writing. * Particularly intriguing is his demon- 
stration of the configurational aspects of hand- 
writing. Thus, he shows in example after 
example that the various parts of a signature 
remain in fixed proportion to each other over a 
period of time and in different contexts. * It is 
impossible, of course, in this review to discuss 
in any systematic fashion the wealth of material 
in the book. In fact, one of my chief complaints 
of the book, if it be one, is that such a mass of 
material is presented that as a reviewer I haye 
had considerable difficulty in assimilating it in 
the time at my disposal. Wolff presents a num- 
ber of ingenious experiments and a number of 
rather startling discoveries. In addition to the 
configurational aspects of handwriting, hitherto 
not elsewhere published, he presents evidence to 
show that there are such things as empathy, pro- 
jection, and symbolization in graphic expression. 
Empathy, for instance, is demonstrated by 
matching given descriptive terms, eg., aggres- 
sion and love, with jagged and curved lines, 
which a great majority of subjects are able to 
do with ease. Projection of family relationships 
is demonstrated in an experiment in which a 
subject writes J myself, I father, and I mother, 
once with eyes open and once with eyes closed. 
There seems to be a remarkable relationship be- 
tween the form of graphic expression and rela- 
tionships within the family. Symbolism is strik- 
ingly illustrated by experiments with doodling. 
There is no question in my mind that Wolff 
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has definitely carried the case for handwriting 
as an important area of measurement in the 
study of what we call Personality. * my review 
has been rather enthusiastic. This enthusiasm is 
in some respects a reaction to expectancy. For 
those who, after reading this review, come away 
from the book somewhat disappointed, I recom- 
mend as a salubrious experience the reading of 
some earlier books on handwriting. Wolff's book 
15, to my way of thinking, the best yet, but I 
do not feel that it has made measurements of 
handwriting valid, objective diagnostic indica- 
tors of behavior. It has merely done the job, a 
remarkably good one, of setting the stage for 
the accomplishment of this task. There is, for 
instance, not one study in the book which shows 
the discriminating power of measurements of 
handwriting for behavior when gross opera- 
tional criteria of behavior are used, such as hos- 
pitalized psychotic patients vs. employed non- 
patients. At least two recent studies show that 
graphologists are not able on the basis of meas- 
urements of handwriting to discriminate between 
such gross deviations of behavior. I do not mean 
to imply that Wolff is a graphologist but atti- 
tudes toward measurement of handwriting are 
based on such evidence. In summary, I feel that 
Wolff has presented in his book powerful argu- 
ments for the expenditure of time and money in 
handwriting research. If his book has done this, 
and it seems to me it has, then certainly he has 
made an extremely valuable contribution to the 
complex and arduous task of somehow quanti- 
tatively defining human behavior. 

J Ат Med Assn 139:1236 Ap 23 ' 49. Graph- 
ology is rapidly becoming a significant field. The 
interrelationship between graphic and bio- 
graphic data enables the worker to approach the 
deeper layers of the personality. Intuitive graph- 
ology has been replaced by scientific approaches. 
The author is one of the leaders in this new field 
and has compiled a fascinating document por- 

. traying his work through the analysis of numer- 
ous examples of handwriting, diagrams and 
drawings. The book is to be highly recommended 
for all dynamic psychologists and psychiatrists. 
It is hoped that in the next edition the examples 
of handwriting will be taken from American sub- 
jects so that they may be more easily under- 
standable by American psychiatrists. 

J Consult Psychol 13:148 Ap 49. Laurance F. 
Shaffer. In this handsome and richly illustrated 
volume, Wolff marshals almost all of the avail- 
able evidence about handwriting and personal- 
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ity, with emphasis on his own studies, His criti. 
cal attitudes toward earlier speculative theories 
and inadequate experiments is commendable, 
but is not always extended to the planning and 
interpretation of his original work. Logical con- 
fusions are numerous. Thus, in a chapter en- 
titled “the consistency of personality,’ the author 
demonstrates, quite adequately, that persons 
have a considerable consistency of handwriting. 
Thé major issue of the relation of personality 
to handwriting is here begged rather than dem- 
onstrated. After similar chapters on symmetry, 
rhythm, configuration, movement, and other 
qualities, the author comes to the core of his 
findings in a series of tables. In these tables, 
personality is analyzed in terms of a series of 
“factors” or trait names which, most conven- 
iently, are exactly twenty-six in number and 
begin with the consecutive letters of the alpha- 
bet, from Aesthetic, Benevolent, Coordinated 
..to eXpansive, Yielding and Zealous. The 
handwriting indicators for each of these “fac- 
tors” are stated in terms of a continuum from 
the positive factor to its opposite. The author 
regards these tables modestly: “....our list of 
factors and indicators cannot be used like a 
cookbook...," “....future research will indicate 
the degree of validity of these observations.” Un- 
fortunately, the publisher does not share this 
modesty, for the jacket states, “Considering 
practical applications, Dr. Wolff includes a scor- 
ing chart, directions for analysis, new tables of 
trait clusters in relation to graphic patterns, and 
complete examples of personality diagnoses. 
The result, one may fear, is that there will be 
cookbook graphology, along with the prick! 
existing cookbook Rorschach and cookboo 
TAT, until greater efforts are expended upon 
the validation of the projective and expressive 
techniques. 

J Crim Law & Criminology 41:809 meet 
’51, Wladimir Eliasberg. A truly remanen 
book, this one, and one that will be noted in "i 
history of graphology. It contains a vast AE 
of original material together with a historic t н 
vey and a very comprehensive bibliography We 
it is remarkable not only for these traits; de is 
the most original achievements of the aut cable 
the demonstration, in the signature, xm 
configurations of the personality. As this е б 
stration is not amenable to review it 15 mi " 
that every interested reader should w gut 
in the text. It is quite possible that wai 
figurational method will play a big role in 
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in the identification of signatures and forgeries. 
It is therefore brought to the attention of practi- 
cal handwriting experts and should also be re- 
examined to establish its validity. This reviewer 
would like to mention one of the weaker points 
of this book which is the relationship of hand- 
writing and the lore thereof (graphology) to so- 
cial facts and social psychology. It may be sug- 
gested that Wolff himself should give some 
attention to this point in a desirable new edition. 
To sum up: One of the valuable publications 
which the practically and the theoretically in- 
terested criminologists should study. 

Psychol В 47:91—3 Ja '50. Fred Y. Billings- 
lea. * Part I gives a concise, though mostly non- 
critical, survey of the literature and introduces 
Wolff's ideas on how handwriting might be 
made to succumb eventually to experimental pro- 
cedures. * In my opinion: (1) his theoretical 
points are sufficiently sound to warrant investi- 
gation, (2) his individual signature analyses are 
instructive, but (3) his claims for experimental 
verification cannot be confirmed on the basis of 
the material included or referred to in this sec- 
tion. Wolff creates the unfortunate impression 
that he believes if he writes a statement with 
words of sufficient authoritative “ring,” tosses 
in a few figures that have the appearance of 
acceptable size, the reader will accept statistical 
significance and substantiation of claimed rela- 
tionships (pp. 58, 308). He uses eight pages 
(46-53), for example, in reporting one of his 
studies where, due to an inaccurate statisti- 
cal procedure (failing to combine plus-minus 
scores), a non-existent significant trend was 
found; further, the problem the study was con- 
cerned with is only slightly related to the point 
under discussion. Incidentally, the graphs shown 
there are displaced. Wolff refers his “lay reader” 
to Part II in preference to Part I, feeling it con- 
tains “the material demonstrating the princi- 
ples that determine everybody’s form and ex- 
pression in his graphic" productions. Here he 
all but ignores his efforts in Part I and falls back 
upon the classical approach to handwriting anal- 
ysis. * Chapter XIII tabularizes his various 
graphic elements, his grosser groupings of these, 
his list of personality traits with which he be- 
lieves they are correlated, and a recording blank 
for systematizing and simplifying the analysis. 
The procedure is demonstrated on several fam- 
ous signatures in the last chapter. Here, too, 
Wolff desires to appear experimentally sound. 
However, (a) he subjectively selects his trait 
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names and does not define them, (b) the great 
majority of these appear undefinable descrip- 
tively or operationally, (c) the experimental 
work reported lends little support to his claims 
of correlations between traits and graphic ele- 
ments, (d) there are no tables of norms, and 
(e) his only stated caution is for the analyzer to 
be certain that an interpretation is done on the 
basis of the configurational organization of the 
graphic elements. How much better it would 
have been for Wolff to (1) frankly state that 
the work is still based on subjective clinical 
judgment, (2) that he still does not have ex- 
perimental procedures which permit empirical 
evaluation of what he believes to be true from 
observations, and (3) that with these limitations 
he offers his findings to the Professional reader, 
inviting assistance in solving the multitudinal 
problems involved. In that vein, this reviewer 
feels that Wolff will have more success by drop- 
ping his personality traits, returning to his con- 
cept of themes, and employing characteristic, de- 
scribable response patterns with measurable 
properties. This ought to be a worthwhile book. 
In many ways it is. Yet its obvious faults per- 
vade so much of its material and influence, to 
the stage of inaccuracy, so many of its conclu- 
sions that one is hesitant to endorse it in spite 
of them. With adequate research many of its 
suggestions should lead to conclusive results. 
However, since many acceptable projective 
techniques are available, demonstration of 
considerable usefulness for this method is 
mandatory before others would be justified 
in spending the requisite research time and 
money. 

Q R Biol 26:106-7 Mr '51. Joseph С. Frank- 
lin. * by far the most ambitious, work yet pub- 
lished on handwriting as a diagnostic tool in the 
study of personality. * Despite his arduous ef- 
forts to put handwriting analysis ona psycho- 
logically sophisticated and objective basis, he is 
driven again and again back to clinical assump- 
tions he is loathe to admit into his system of 
analysis. Nevertheless, the richness and sugges- 
tiveness of his treatment of a long disreputable 
subject in American scientific circles will un- 
doubtedly do much to bring the projective use 
of handwriting out of the caves of graphologists 
and into the laboratories of psychologists where, 
if ever, Wolff’s promising leads rather than his 
performance may sometime establish the valid 
and reliable use of handwriting analysis in psy- 
chological science. 
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[ B423] 
*Worr, Werner, Ерітов. Projective and Expres- 
sive Methods of Personality Investigation (“Di- 
agnosis"). Personality Symposia on Topical Issues, 
Symposium No. 2. New York: Grune & Stratton, Inc., 
April 1950. Pp. ii, 76. Paper. $2.00. * 

[ B424] 
*Worrsox, Rose. A Study in Handwriting Analy- 
sis. New York: the Author, 1949. Pp. xii, 97. Paper, 
lithotyped. Out of print. * 

J Consult Psychol 13:223-4 Je '49. Laurance 
F. Shaffer. The Lewinson-Zubin scales for eval- 
uating handwriting were applied to matched 
samples of delinquent and nondelinquent boys. 
The groups were differentiated with statistical 
significance, delinquents making more extreme 
scores both in the direction of release and of con- 
traction. The measurements used are too labori- 
ous, and the differentiation insufficiently sharp, 
to warrant the use of handwriting as a practical 
method of diagnosis. 

[ B425] 
*Woop, Ben D. ann Haerner, ЁльРн. Measuring 
and Guiding Individual Growth. Illustrated by 
Joseph Paige. New York: Silver Burdett Co., 1948. 
Pp. viii, 535. $4.90. * (PA 22: 3207) 

Clearing House 23:54-5 S 748. Earl R. 
Gabler. * the authors have torn a page out of 
"Orbis Pictus." The illustrations and the con- 
versational method used are reminiscent of dia- 
logues and pictures in our early school texts. 
The plan of idea-graphing the principles of eval- 
uation and their use adds vitality to the book. 
The same can be said for the conversational 
method used, which allows for a functional ap- 
plication of right practices in the practical situa- 
tions where the need occurs. If you have found 
other treatments of evaluation uninteresting or 
unintelligible, read this book. It may be likened 
unto a fresh summer rain that turns green a 
heretofore rather dry but potentially productive 
pasture. However, one addition seems necessary 
in most books on evaluation. That addition 
would be an adequate discussion of pupil-con- 
structed schemes for self-evaluation. 

Ed Outlook 22:212 My’ 48. Burton P. Fowler. 
* The form of presentation used, that of breezy 
dialogue, is certainly not a usual one for a sub- 
ject as intricate and scientific as tests and meas- 
urements. This style frequently gets in the way 
of a simple, clear exposition which should not be 
beyond the grasp of the “administrators, inter- 
ested laymen, teachers, and students” for whom 
the book purports to be written. The philosophy 
of the book, however, is inspiring—that the guid- 
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ance of individuals is “а task for society, not just 
» 

one for schools." Too long have tests and guid- 
ance been regarded as techniques quite apart 
from their social implications. If the book is 
carefully studied, it will yield an amazing amount 
of detail about the nature of the individual, about 
the wide variety of measuring instruments avail- 
able to determine the nature of his aptitudes and 
achievements, hence his educability, and finally 
about the possibilities of intelligent guidance for 
the individual, once his characteristics and needs 
are ascertained, Personally, I think that the book 
has been written down too much, that the pro- 
spective teacher at Caribou State Teachers Col- 
lege, who is depicted as one continuously wise 
cracking in the vein of “Flanagan’s idea is the 
real McCoy," is a caricature of student teachers 
as I have known them. One can hardly imagine 
Gray's Anatomy being so diluted for a medical 
student. All the same, the book is a mine of 
valuable information and, until it has been con- 
densed by some humorless editor of The Readers 
Digest, will have to be quarried in its present 
form. 

El Sch J 49:304 Ja "49. Gertrude Boyd. * А 
refreshingly different book * This volume 1s 
planned especially for the classroom teacher, but 
it will be of interest to the administrator, layman, 
or student. The discussions are of value both on 
the elementary- and the secondary-school level 
and should be a factor in co-ordinating guidance 
between the two levels. * By far the major por- 
tion of the book is presented in a conversational 
method, with discussions among administrators, 
laymen, teachers, and students. This тео 
the advantage of introducing ideas as they mig | 
arise and places them significantly in a natura 
situation. Most of the discussion is centered in 
presenting evidence of individual different 
among learners. * The authors are well pet 
of the shortcomings of presenting the brin 
through discussions and admit the рози 5 
oversimplification and the omission of 9, d 
facts. Despite these shortcomings both the c e 
room teacher and the student teacher will s 
much of the material applicable to their part n 
lar situations. * an interesting and valuable v 4 
ume for the teacher, administrator, and vri 
The guidance specialist will find it a ш idol 
erence in emphasizing the need for 1n i 
guidance with teachers, parents, and ess | 

Occupations 27:142 N ?48. Adah uj ers 
presents in a unique manner the good 0 Га г 
ples of individual differences, measuring 
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ments, and guidance * The third part....will be 
of most interest to the trained counselor for it 
deals with the guidance of individual children. * 
This quotation appealed particularly to this 
reader : "There are no misfit children, There are 
only misfit courses of study, misfit textbooks, 
and misfit teachers. But in the very nature of the 
case, there can be no misfit children. The child 
is what education is for. One might as well say 
that a man does not fit his clothes as to say that 
the child does not fit the school." 

Sch & Soc 68:221-2 S 25 '48. William W, 
Brickman. * a sugar-coated approach * it appears 
too easy for college students and too difficult for 
high-school pupils. Perhaps it may be best uti- 
lized for reading by parents or school-board 
members who would like to know what all this 
“education business” is about, without having 
to wade through reams of professional double- 
talk. 

Sch R 56:492-3 О ’48. Eleanor Volberding. 
* differs, in some respects, from the usual guid- 
ance publication * there are several excellent 
chapters written especially for teachers in the 
elementary school * planned for the use of the 
classroom teacher * the major portion of the book 
is devoted to a discussion of individual differ- 
ences and the role of guidance in education. The 
discussion on individual differences in students 
and on the meaning which the concept of dif- 
ferences should have for education is excellent. 
Indeed, the philosophy that individual differ- 
ences are at the heart of the guidance program 
pervades the writing. * Although Part I stresses 
that not all information can be gathered by stand- 
ardized tests and that individual personality and 
Interest differences have to be considered in a 
guidance program, few suggestions are given in 
concrete form for implementing such ideas. In- 
stead, most of Part II deals with the usual type 
of academic tests of knowledge and mental abil- 
ity. Personal-adjustment problems seem of little 
interest to the authors, and the counselor is as- 
sumed to be concerned mostly with helping the 
Students choose suitable vocations, selecting 
high-school courses adapted to their needs, and 
assisting them in their academic successes. Other 
defects are the usual oversimplification and gen- 
€ralization which result when a publication of 
this kind compresses the entire discussion of 
tests into one hundred pages. While typical tests 
are listed and described and the reader is told 
where they may be obtained, they are not ade- 
quately evaluated as guidance instruments. Ex- 
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amples are the use of art tests to predict art 
success or group intelligence tests to predict 
academic success. In spite of these shortcomings, 
the average classroom teacher should find this 
book a useful reference. Its authors have written 
in a manner which will interest many teachers 
who would not be attracted to the usual volume 
on guidance. The language is simple and direct, 
and the style is informal. Presenting many cases 
of real children with real names serves both to 
increase the reader's interest in the book and to 
clarify the ideas and generalizations of the au- 
thors. Many teachers know little about the field 
of testing or of record-keeping. The chapter en- 
titled “What Do Test Results Mean?” contains 
a practical explanation of the treatment of test 
scores and includes many actual test scores as 
examples. Teachers who have never really un- 
derstood test terminology, simple statistical 
terms, or the treatment of test results should find 
this chapter of great assistance. Measuring and 
Guiding Growth is a useful publication for the 
average classroom teacher and student teacher, 
It is less useful for the specialist in the field of 
guidance. 


[ B426] 
WRINKLE, WILLIAM L. Improving Marking and 
Reporting Practices in Elementary and Second- 
ary Schools. New York: Rinehart & Co., Inc., 1947. 
Pp. v, 120. $2.00. * 

Cath Ed R 46:330-1 My 48. Walter L. Wil- 
bins. * an interesting and most complete survey 
of the situation in school marking practices. The 
new departures are challenging. 

Clearing House 22:376 F *48. Earl R. Gabler. 
The author has approached the problem of im- 
proving marking and reporting practices in a 
unique yet natural manner. He found that the 
task was first one of formulating the goals of 
education, second, determining the realization of 
the goals, and finally, making the proper reports. 
* The book is a definite contribution to the litera- 
ture in the field in that it gives a method for at- 
tacking our educational problems and presents 
the findings of the method. Most teachers, super- 
visors, and administrators will want to own this 
book. It is well documented, suggests how fac- 
ulties may plan co-operatively, and gives many 
sample record forms that have been constructed 
in the light of real educational outcomes. 

Ed Adm & Sup 34:254-5 Ap "46. A. S. Ed- 
wards. * One will not find in this volume the 
final answers to problems of marking and re- 
porting upon student’s work; there are perhaps 
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no final answers. But the reader will find a 
sensible discussion of many points relating to 
the difficult but apparently necessary problem. * 
With a cheerful sort of pessimism the author 
concludes his final chapter with a statement that 
anything one may do in connection with marking 
and reporting “has little likelihood of being more 
objectionable or less adequate than the practice 
which it replaces." The study is well worth 
thoughtful consideration. A useful index....con- 
cludes the volume. 

Ed Leadership 5:193-4 D "47. Sara M. 
Krentzman. * stimulating and convincing. The 
findings...are challenging and suggest many 
changes in the more conventional evaluation pro- 
grams of many schools. The selected bibliogra- 
phy might be used by a faculty study group as a 
basis for discussions. The author surprises the 
reader by presenting the bibliography in the 
regular text rather than as an appendix, at the 
psychological point when he feels that wide read- 
- ing is necessary. * deserves a good audience be- 
cause it treats a problem needing the attention of 
most educators and because it is one professional 
book that is written in an interesting manner. 

El Sch J 48:400-2 Mr '48. Warren E. 
Gauerke. For those persons who are familiar 
with the concept that teaching objectives should 
be stated in terms of desired behavior outcomes 
for the learner, Improving Marking and Report- 
ing Practices will hold nothing strangely new or 
different. * The main purpose of the 120 pages 
of text, it would appear, is assisting staffs to 
work out a statement of objectives; to tell what 
the learner is to do ; and to help teachers, pupils, 
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and administrators develop their own markin 
and reporting forms and practices. * The treat. 
ment....is highly informal * However, the reader 
gains the impression that a personal conference 
via the printed page is not an entirely adequate 
editorial device. * 

High Points 30:74—5 Ja’ 48. J. Wayne Wright. 
stone. * short, readable and instructive * Here 
in one concise book are the arguments for and 
against current marking and reporting practice 
in schools. Here, also, is the report of a construc- 
tive approach by one school to find its answers 
for the improvement of these practices. This 
book is a valuable guide not only to administra- 
tors and supervisors but also to teachers who are 
interested in the improvement of their marking 
and reporting practices. The new points of view 
and methods are a challenge to the more con- 
ventional practices in New York City schools. 


18427 ] 
XZunIN, JOSEPH; IN COLLABORATION WITH KATHLEEN 
M. Younc. Manual of Projective and Cognate 
Techniques. Madison, Wis.: College Typing Co, 
1948. Pp. 175. Paper, mimeographed. For latest edi- 
tion, see B428. 


[ B428] 
X*ZunmiN, }озЕрн. Quantitative Techniques and 
Methods in Abnormal Psychology. New York, 
N.Y.: Columbia University Bookstore, 1950. Pp. 220 
Paper, mimeographed. $3.75. * (PA 25:6933) 
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ж720віх, Јоѕерн, Eprror. Symposium on Statistics 
for the inician. Journal of Clinical Psychology 
Monograph Supplement No. 7. Brandon, e dy 
of Clinical Psychology, January 1950. Pp. 76. in 
$2.00. * (PA 25: 55, 58-9, 62-4, 67, 71-2, 74 np | 
368) [Reprinted from J Clin Psychol 6: 1-7 Ја "50. 
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15th St, N.W., Washington, D.C.: РЁ, Pt ay for 

Clearing House—The Clearing House: на АТ 
Modern Junior and Senior High Sc Ss per 
(omitting Je, Jl, Ag); vol. 27 started a 
year: 50¢ per issue; Forrest E. Long, edi bb 
С. Fisk and Earl R. Gabler, revien Вл, В 
Fourth Ave., New_York 3, NYS: Реа 
B217, B310, B342, B425-6 
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Col & Univ—College and University: The Journal of 
the American Association of Collegiate Registrars 
and Admissions Officers: 4 issues; vol. 27 started 
O ’51; $3 per year; $1 per issue; William Craig 
Smyser, editor, Miami University, Oxford, Ohio: 
bk, Вп, B362, B364, B368 


DIGEST NEUROL & PSYCHIATRY—Digest of 
Neurology and Psychiatry. Published by The Insti- 
tute of Living. 12 issues; vol. 20 started Ja '52; gra- 
tis; Francis ^E Braceland, editor, 200 Retreat Ave., 
Hartford 2, Conn.: bk, 141, B71, B323 


ED—Education: A Monthly Magazine Devoted to the 
Science, Art, Philosophy and Literature of Educa- 
tion. 10 issues (omitting Jl, Ag) ; vol. 73 started S 
52; $4 per year; 50¢ per issue; Raymond P. Palmer, 
managing editor, William P. Sears, Jr., review edi- 
tor, 370 Atlantic Ave., Boston то, Mass.: bk, B71, 
B231, B368 

Ed Adm & Sup—Educational Administration and 
Supervision. 8 issues (omitting Je, Jl, Ag, S) ; vol. 
38 started Ja '52; $5.50 per year; $1.10 per issue; 
Н. E. Buchholz, editor, ro East Centre St., Baltimore 
2, Md.: bk, B71, B121, B325, B426 

Ed & Psychol Meas—Educational and Psychological 
Measurement: A Quarterly Journal Devoted to the 
Development and Application of Measures of Indi- 
vidual Differences. 4 issues; vol. I2 started spring 
's2; $7 per year; $2 per issue; С. Frederic Kuder, 
editor, Box 6907 College Station, Durham, N.C 
bk, B22, B247, B390 У 

Ed Forum—The Educational Forum. Published by 
Kappa Delta Pi. 4 issues, 2 parts to each issue (N, Ja, 
Mr, My); vol. 16 started N '51; $3.50 per year; 
$1.00 per issue; E. I. F. Williams, editor, 277 East 
Perry St., Tiffin 4, Ohio: bk, B71, B283 

Ed Leadership—Educational Leadership. Journal_ of 
the Association for Supervision and Curriculum De- 
velopment, National Education Association. 8 issues 
(omitting Je, Jl, Ag, S) ; vol. 10 started O '52; $3.50 
per year; 50€ per issue; George W. Denemark, edi- 
tor, 1201 16th St, N.W., Washington 6, D.C.: bh, 


B426 

Ed Outlook—Educational Outlook. Published by the 
School of Education, University of Pennsylvania. 
4 issues; vol. 26 started N '51; $2.50 per year; 65€ 
per issue; John H. Minnick, editor-in-chief, Lee О. 
Garber, review editor, School of Education, Univer- 
sity of Pennsylvania, Philadelphia, Pa.: bk, B155, 
B282, B425 " { 

Ed R— Educational Review. Published by the Institute 
of Education, University of Birmingham. 3 issues 
(N, F, Je); vol. 3 started N ’50; 155. per year; 5$. 
per issue; E. A. Peel, "University of Birmingham, 
Edmund St., Birmingham 3, England: bk, B18, B74, 


Biz1 Д 
Ed es B—Educational Research Bulletin. Published 
by the Bureau of Educational Research, Ohio State 
University. 9 issues (omitting Je, Jl, Ag); vol. 31 
started Ja ’52; gratis; R. H. Eckelberry, editor, Ohio 
PAS niversity, Columbus 1o, Ohio: bk, B144, 
158, B272, B. 

El Soh Ihe Elementary School Journal: Empha- 
sizing Instruction, Administration, ocial Change. 
Published by the Department of Education, Univer- 
sity of Chicago. 9 issues (omitting Je, Л, Ав) ; vol. 
53 started S 's2; $4.50 per year; боё per issue; Mau- 
rice F. Seay, chairman of the editorial board, Uni- 
versity of Chicago, Chicago 37, Il: bk, 53, 352, 
Bror, Bro8, B183, B425-6 — . a : 

Erasmus—Erasmus: Speculum Scientarium. 12 «eth 
vol. 4 started Ja '52; 48 Swiss francs per year; 2. 


Swiss francs per issue; «Rudolf Jud, editor, Hirz- 
bodenweg 80, Basle, Switzerland: bk, Bg16 \ 
Eug R—The Eugenics Review. Published by the Euge- 
nics Society. 4 issues; vol. 43 started Ap '52; 12 s. 
r year; 3 s. per issue; К. Lane, editor, б ccleston 
quare, London S.W.1, England: bk, B18, B7o, B85, 
B143, B254, B256, B361, B406-7 
Excep Child—E xceptional Children, Title was Journal 
"n of Exceptional Children through vol. 17, no. 8, My 
5r. Journal of the International Council for Excep- 
tional Children, National Education Association. 
8 issues (omitting Je, Jl, Ag, S) ; vol. 19 started 
O "52; $3 per year; 40f per issue; Francis E. Lord, 
А m Michigan State Normal College, Ypsilanti, 
ich. 


FED PROBATION—Federal Probation: A Journal 
of Correctional Philosophy and Practice. Published 
by the Administrative Office of the United States 

‘ourt in cooperation with the Bureau of Prisons of 
the Department of Justice. 4 issues; vol. 16 started 
Mr 's2; gratis; Richard A. Chappell, editor, Su- 
preme Court Bldg., Washington 13, D.C.; Benjamin 
Frank, review editor, Federal Bureau of Prisons, 
Department of Justice, Washington 25, D.C.: bk, 121 


H SCH J—The High School Journal. Published by the 
School of Education, University of North Carolina. 
8 issues (omitting Je, Jl, Ag, S) ; vol. 35 started [9] 
э51; $2 per year; 40€ per issue; Samuel M. Holton, 
editor, Box 810, Chapel Hill, N.C. : bk, B71, Br59-60 

Harvard Ed R—The Harvard Educational Review. 
Published for the Graduate School of Education, 
Harvard University. 4 issues; vol. 22 started winter 
's2; $4 per year; $1.10 per issue; John J. O'Neill, 
chairman of the editorial board, Lawrence Hall, 
Kirkland St, Cambridge 38, Mass.: bk, B7o 5, 

High Points—High Points: In the Work of the High 
Schools of New York City. Published by the Board 
of Education of the City of New York. 10 issues 
(omitting Jl, Ag); vol. 34 started Ja '52; $1 per 
year; single issues not sold; Henry I. Christ, editor, 
'Andrew Jackson High School, St. Albans, N.Y.: 
bk, B71, B296, B414, B426 


INDIAN J PSYCHOL—Indian Journal of, Psychol- 
ogy. Official organ of the Indian Psychological Asso- 
ciation. 1 issue only in each of past 5 years; vol, 25 
consisted of parts 1-4 combined, '50; J. M. Sen, 
editor-in-charge, Calcutta University Press, 48 
Hazra Road, Calcutta, India: bk, B70 

Int J Opin & Attitude Res—International Journal of 
Opinion and Attitude Research, Discontinued with 
vol. 5, no. 4, W '51: bk, B20, B161, B361 

Int J Psycho-Analysis—The International Journal of 
Psycho-Analysis: And Bulletin of the International 
Psycho-Analytical Association. 4 шш vol. E: 

arted part I, 52; 50s. per year; single issues mo 
Sold: W Hofer, editor, 63 New Cavendish St., Lon- 
don W.r, England: bk, 140, B43, B85, B332 


ABN & SOCIAL PSYCHOL—The, Journal of 
ү Abnormal and Social Psychology. | shed by the 
American Psychological Associati. nc. 4 issues ; 
vol. 47 started Ja '52; $6.50 per year; 1.75 per issue ; 
J. McV. Hunt, editor, University of Illinois, Urbana, 
Til.; Stanley С. Estes, review editor, Harvard Uni- 
versity, Cambridge, Mass.: bk, 57, 112, 119, 135, 
140-1, B20, Ваз, B83, B86, Brot, Bras B171, B216, 
Bor! B282, В301, B315, B323, B332, B355, a ) 
у Ат Med Assn—The Journal of the American Medi- 
cal Association. 52 issues in 3 volumes ; vol, 148 
started Ja 5 ’52, vol. 149 started My 3 '52, vol. 150 
started S 6°52; $15 per year; 45% per issue; Austin 


A 
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Smith, editor, 535 North Dearborn St., Chicago 10, 
Ill.: bk, 120, 122, 141, B43, B153, B243, B268, B355, 
B359, B422 | ot 

J Am Stat Assn—Journal of the American Statistical 
Association. 4 issues; vol. 47 started Mr ’52 (no. 
257); $8 per year; $2 per issue; W. Allen Wallis, 


editor, University of Chicago, Chicago 37, Ill.: bk, 


B413 

J Appl Psychol—Journal of Applied Psychology. Pub- 
lished by the American Psychological Association, 
Inc. 6 issues; vol. 36 started F '52; $6 per year ; $1.25 
per issue; Donald G. Paterson, editor, University of 
Minnesota, Minneapolis 14, Minn.: bk, 57, IOI, 123, 
B22, B46, B71, Bo2, Воо, B121, B143-4, Brs4, 
B159, B171, B186, B236, B239, B272, B282, B301, 

B325, B359, B364, B368, B383, B407, B414 | 

J Bus Ed—The Journal of Business Education: for 
Collegiate, Private, and Public Business Schools: 
An Independent Magazine for Teachers of Business 
Subjects. 8 issues (omitting Je, Jl, Ag, S) ; vol. 28 
started O '52; $2.50 per year; 35€ per issue; Eliza- 
beth T. Van Derveer, editor, 512 Brooks Building, 
Wilkes-Barre, Pa.: bk, B46, B77, B141, B2or, B310 

J Clin & Exp Psychopathol—Journal of Clinical and 
Experimental Psychopathology. Published by the 
Washington Institute of Medicine. Title was Jour- 
nal of Criminal Psychopathology through vol. 5, no. 
4, Ap '44; title was then Journal of Clinical Psycho- 
pathology through vol. 11, no. 4, O '50; 4 issues; vol. 
13 started Ja-Mr ’52; $11 per year; Arthur M. 
Sackler, editor-in-chief, 19% East 62nd St, New 
York 21, N.Y. 

J Clin Psychol—Journal of Clinical Psychology. 4 is- 
sues; vol. 8 started Ja ’52; $6.50 per year; $2 per 
issue; Frederick C. Thorne, editor, 5 Pearl St., 
Brandon, Vt.: bk, 72, 109, 139, 145, B346, B406 

J Clin Psychopathol—Journal of Clinical Psychopath- 
ology. Title was Journal of Criminal Psychopathol- 
ogy through vol. 5, no. 4, Ap '44; title changed to 
Journal of Clinical and Experimental Psychopath- 
ology (q.v.) beginning with vol. 12, no. 1, Ja-Mr ’51: 


bk, B315 

J Consult Psychol—Journal of Consulting Psychol- 
ogy. Published by the American Psychological Asso- 
ciation, Inc. 6 issues; vol. 16 started F '52; $7 per 
year; $1.25 per issue; Laurance F. Shaffer, editor, 
525 West 120th St., New York 27, N.Y.: test, 49-50, 
65, 92-3, 100, 102-3, 131, 137-8, 220, 282, 347-50, 
354-5, 362-3, 465, 530, 710, 712; bk, 57, 72, 109, 112, 
119, 122, 126-7, 133, 135, 130-40, 357, 564, 651, 748, 
760, B20, B22, B40, B43, B50, B71, B85, Boo, B107, 
Broo, Brra, B121, B123, B147, B153, B155, B159, 
B168, В7о-І, B182-3, Broz, B197, B209, B217, 
B228, B247, B267, B274, B282-3, B323, B326, B332, 
B339, B355-6, B359, B361, B364, B368, B406, B409, 
B419, B422, 424 

Ј Crim Law & Criminology—The Journal of Criminal 
Law, Criminology, and Police Science. Official pub- 
lication of the International Association of Arson 
Investigators, the Illinois Academy of Criminology, 
and the Society for the Advancement of Criminol- 
ogy. 6 issues; vol. 43 started My-Je '52; $5 per year; 
$1.25 per issue; Robert H. Gault, editor-in-chief, 
357 East Chicago Ave, Chicago 11, Ill.: bk, 112, 
B323, B355, B422 

J Ed (London)—The Journal of Education: A Monthly 
Record and Review With Which Is Incorporated 
The School World. 12 issues; vol. 84 started Ja ’52 
(no. 990) ; 18s. per year; ту. 10d. per issue; E. Salter 
Davies, editor, Amen House, Warwick Square, Lon- 
don E.C.4, England: test, 792; bk, 345, B71, B73, 
B84, B160, B169, B171, B255, B335, B352, B360, 
B369, B378 : 

J Ed Psychol—The Journal of Educational Psychol- 


оду. 8 issues (omitting Je, Jl, Ag, S) ; vol, 
Ja '52; $7 per year; $1.20 per Wes Sel ede 
Corey, chairman of the board of editors, Tea niim 
College, Columbia University, New York 27, N.Y 
bk, 135, 145, 357, Brot, Bro6, Brat, B147, Bisy 
B171, B221, B282, B383, B388 ‚ Эу, 
J Ed Res—Journal of Educational Research. 9 issues 
{юне Je, Jl, Ав); vol. 46 started S "52; $4 per 
year; per issue; А. S. Barr, chairman of the edi- 
torial board, University of Wisconsin, Madison 6 
Wis.: bk, 352, 564, B71, B282, B325, B383, B386 ` 
J Excep Child—Journal of Exceptional Children. Title 
changed to Exceptional Children (q.v.) starting vol 
18, no. І, О °51: bk, Воб à 
J General Psychol—The Journal of General Psychol- 
ogy: Experimental, Theoretical, Clinical and His- 
torical Psychology. 4 issues in two volumes; vol. 46 
started Ja '52; vol. 47 started Jl '52; $14 per year; 
$7 per volume; $4 per issue; Carl Murchison, editor, 
2 Commercial St, Provincetown, Mass.: bk, Bi, 
B20, B143, B301 
J Genetic Psychol—The Pedagogical Seminary and 
Journal of Genetic Psychology: Child Behavior, 
Animal Behavior, and Comparative Psychology. 
4 issues in 2 volumes; vol. 80 started Mr '52, vol. 81 
started S ’52; $14 per year; $4 per issue; Carl 
Murchison, editor, 2 Commercial St., Provincetown, 
Mass.: bk, B171 j 
J Higher Ed—The Journal of Higher Education. Pub- 
lished by the Bureau of Educational Research, Ohio 
State University. 9 issues ene Л, Ag, S); vol. 
23 started Ja '52; $4 per year; 50$ per issue; R. H. 
Eckelberry, editor, Ohio State University, Columbus 
10, Ohio: bk, B117, B272 А 
Mental Sci—The Journal о} Mental Science: The 
British Journal of Psychiatry. Published by author- 
ity of the Royal Medico-Psychological Association. 
4 issues; vol. 98 started Ja ’52 (по. 410); 50s. per 
year; I2s. 6d. per issue; G. W. T. H. Fleming, 
editor-in-chief, Barnwood House, Gloucester, Eng- 
land: bk, 140, B18, B191, B332, B407 # 
1 Nerv & Mental Dis—The Journal of Nervous @ 
Mental Disease: An Educational Journal of Меш” 
psychiatry. 12 issues in 2 volumes; vol. 115 start x 
Ja 's2; vol. 116 started Jl '52; $14 per year; 3H 
per issue; Nolan D. C. Lewis, managing editus 
Pine St, New York 5, N.Y.: test, 42; bk, 119) T ] 
128, 140-1, B83, Bror, B153, B268, B279, В315 М 
J Personality—Journal of Personality. 4 issues; al 
20 started S ’51; $5 per year; $1.25 рег day i tam 
Zener, editor, Box 6877, College Station, D T =й 
N.C.; George F. J. Lehner, review editor, en. 
ment of Ра University of California, 
Angeles 24, Calif. : bk, 114 |, A 
J Proj Tech—Journal of Projective Teche Pub 
lished by the Society for Projective Te Seat Mr 
Rorschach Institute, Inc. 4 issues; vol, 10 sre qc 
752; $6 per year; $1.50 per issue; ik ^ “СШ. 
editor, 210 East Wilson Ave, Glenda e Bax 
test, 102, 146; bk, 124, 128, 133, 141, B3]; i Pub- 
J Sch Health—The Journal of School а sociation: 
lished by the American School Health Де] а 
10 issues (omitting Jl, Ag) ; vol. 22 S E eei 33% 
$3 per year; 356 per issue; Charles H. y 
Main St., Buffalo 14, N.Y.: bk, Bis’ 1 Psycholog); 
J Social Psychol—The Journal of Socia 1, 36 


S 


?e2: vol. 
4 issues in 2 volumes; vol. 35 starte m 1 
started Ag '52; $14 per year; $4 Pet был town, 


Murchison, XS 2 Commercial St, Pr í 
Mass.: bk, B83, I son, Pub- 
J Teach Ea Th Јога of Teacher Edicto Edu- 
lished by the National Commission ОП. ed vol. 
cation and Professional Standards. 4 SESS T. 
started Mr ’52; $3 per year; $1 per 597^ 


19) ] 


Stinnett, executive editor, Ray C. Mai i 

Dorothy McCuskey, review БОЕ амаш Каш, 
tion Association, 1201 Sixteenth St, N.W., Wash- 
ington 6, D.C.: bk, B183, B293, B386 

Jun Col J—The Junior College Journal. Official Organ 
of the American Association of Junior Colleges. 9 
issues (omitting Je, Jl, Ag) ; vol. 23 started S ’52; 
$3.50 per year; соб per issue; James W. Reynolds, 
a Crow P of a ea me University ot 

exas, P.O. Box 2118, Austi ex.: b 

B247, B272, B386 4 оъ, 


LANCET—The Lancet: A Journal of British and 
Foreign Medicine, Surgery, Obstetrics, Physiology, 
Pathology, Pharmacology, Public Health, and News. 
52 issues in 2 volumes; vol. 262 started Ja 5 '52, vol. 
263 started Jl 5 '52; 425. рег yeaf ; 15. per issue; Т. F. 
Fox, editor, 7 Adam St., Adelphi, London, W.C2, 
England: bk, 126, 344, B18, B105, B143, B191, B332, 


B359 

Librarian—The Librarian and Book World: The In- 
dependent Professional Journal. 12 issues; vol. 41 
started Ja '52; 125. per year; Is. per issue; A. b. 
Millard, editor, 5 Wardrobe Place, London E.C.4, 
England: bk, B220 


MARRIAGE & FAMILY LIVING—Marriage and 
Family Living. Journal of the National Council on 
Family Relations. 4 issues ; vol. 14 started Е '52; $5 
per year; $r per issue; Gladys Н. Groves, editor, 
Merton D. Oyler, book review editor, 5757 South 
Drexel Ave., Chicago 37, Ш.: test, 488 

Mental Health—Mental Health. Published by the Na- 

tional Association for Mental Health (England). 

3 issues; vol. 11 started winter '51; 5s. per year; 

ee 6d. per issue; К. F. Tredgold, editor, Maurice 

| raig House, 39 Queen Anne St., London W.1, Eng- 
wy bk, 126, B74, B85, B407 

ental Hyg—Mental Hygiene. Published by the Na- 
tional Association for Mental Health, Inc. 4 issues ; 

ү 36 started Ja 52; $6 per year; $1.50 рег issue; 

meee S. Stevenson, editor, 1790 Broadway, New 

М ork 19, N.Y.: bk, 141, B153 
ae R—The Management Review. Published by the 
merican Management Association, Inc. 12 issues; 

ol, 41 started Ja '52; $6 per year; 75¢ per issue; 
. J. Dooher, editor, 330 West 42nd St., New York 

od N.Y.: bk, B171, B187, B239, B301, B368 
у Муш, A Quarterly Review of Psychology and 
hilosophy. Published {ог the Mind Association. 

4 issues; vol. 61 started Ja 's2 (no. 241) ; 165. per 

усаг; 45. 6d. per issue; Gilbert Ryle, editor, Mag- 
alen College, University of Oxford, Oxford, Eng- 

land: bk, 126 


NATURE—Nature. 52 issues in 2 volumes; vol. 169 

started Ja 5 ’52, vol. 170 started Л 5.752; 120s. per 
ties 25. per issue; А. J. V. Gale and L. J. F. Brim- 
1 e, editors, St. Martins St., London W.C.2, Eng- 
and: bk, 126, 344, Bs2, B143, B183, Brot, B356, 


B407 

Nerv Child—The Nervous Child: Quarterly Journal 
of Psychopathology, Psychotherapy, Mental Hy- 
giene, and Guidance of the Child. 4 issues; vol 9 
started Ја '52; $7 per volume; $2.50 per issue; Er- 
nest Harms, editor-in-chief, 30 West 58th St., New 
York 10, N.Y.: test, 42; bk, 110, 121, 125, B50, B106, 
Broo, Brs3, B157, B268, B296, B316, B362, B409 

ew Era—The New Era in Home and School. 10 is- 
Sues (combined Jl-Ag, S-O issues) ; vol. 33 started 
Ja 52; 125. per year; 15. 6d. per issue; Peggy Vol- 
kov, editor, т Park Crescent, London W.1, England : 


bk, B169, B335, B411 
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New Outlook for Blind—The New Outlook fo 
Blind. Published by the American [geil vo M 
the pus Inc. Prior to May 1951, title was Outlook 
for the lind and The Teachers Forum (q.v.). 10 
issues (excepting Jl and Ag) ; vol. 46 started Ja '52; 
ink-print edition : $2 per year, 25¢ per current issue; 
praile wi : А рег уеаг, е per single issue; 

. Liechty, managin ito} 
St, New York i NY. a abiere 


OCCUPATIONAL PSYCH OL—Occupational Psy- 
chology. Published by the National Institute of In- 
dustrial Psychology. 4 issues; vol. 26 started Ja '52; 
30s. per year; 8s. per issue; Alec Rodger, editor, 14 
Welbeck St, London М.І, England: test, 289; bk, 
231, 352, B22, B71, B119, B121, B154, B191, 13256, 
B324, B356, B363, Baro, В : 

Occupations—Occupations : he Vocational Guidance 
Journal. Title changed to The Personnel and Guid- 
ance Journal (q.v.) beginning with vol. 31, no. I, 
О 's2: bk, 119, 748, B46, B71, Bror, Broo, Віш, 
Bizr, B147, Biss, B159, Bi7r, B183, B186, B301, 
B325, B339, B362-4, B368, um Ba25 . 

Outlook for Blind—1 he Outlook for the Blind and The 
Teachers Forum. Title changed to New Outlook for 
a Uu (q.v.) beginning with vol. 46, no. 1, Ja '52: 

, Вп 


PERSONNEL ADM —Personnel Administration. 
Official publication of the Society for Personnel Ad- 
ministration. 6 issues; vol. 15 started Ja '52; $4.50 
per year ; $1 per issue; O. Glenn Stahl, editor, Martin 
Beckford, review editor, Room 103, 1 K St, 
N.W., Washington 25, D.C.: bk, B23, B239, . 

Personnel & Guid J—The Personnel and Guidance 
Journal. Title was Occupations (q.v.) through vol. 
30, no. 9, My '52. Published by the American Person- 
2 mit eee Азда 9 issues (EE Je 

, Ар, H per year; 75 T issue; iliam 22. 
Wilkins, editor, New York University, New York, 
N.Y.; Nancy, Shivers, assistant editor, 1424 91x- 
teenth St, N.W. Washington 6, D.C. j 

Personnel. J—Personnel Journal: The Magazine of 
Labor Relations and Personnel Practices. 11 issues 
(Jl-Ag combined in one), vol. 31 started My '52; $5 
per year; 75% per issue; Edward N. Hay, editor, 
Swarthmore, Pa.: bk, B23, B45, B53, B121, B187, 
B239, B368 

Personnel Psychol—Personnel Psychology: A Journal 
of Applied Research. 4 issues; vol. 5 started spring 
'52; $7 per year; $2 per issue ; ‘Erwin К. Taylor, edi- 
tor, Personnel Research Institute, Western Reserve 
University, 314 Superior Ave. Cleveland 14, Ohio; 

ohn Н. Rapparlie, review editor, 9261 Grayton Rd., 

etroit 24, Mich.: bk, 786, B22 B46, B71, B109, 
B121, B147, Bist В159, B171, B175, B191, B216, 
B363-4, B368, B407 A 4 

Phils. Sci—Philosophy of Science. Organ of the Phi- 

losophy of rp dee uA dein vol. че 
а '52; г year; $1.25 per ! 70. ¥ 

ui Kita Dore ayne University, Detroit 1, 

Mich.: bk, B153 A. 1 

Psychiatric ОТ, Psychiatr Quarterly. Official 
Scientific Organ of the New York State De rtment 
of Mental Hygiene. 4 issues; vol ; phla se 

car; $1.50 per issue; t slow, 
akg ‘Duncan Whitehead, acting editor, Utica State 
Hospital, Utica, N.Y.: bk, 57, 120, 125-8, B86, B143, 
B170, B186, B268, B346, B355, B368 "T 

Psychiai Sup—The Psychiatric Quarterly әй 
ment. Published by authority of the New ork 


tarted part 1, 's2; $3 per year; $1,50 per issue; 
Marton Bigelow, editor, Duncan Whitehead, acting 


Psychiatry 
editor, Utica State Hospital, Utica, N.Y.: test, 107; 


‚145 Ар A 
Psychiatry—Psychiatry: Journal for the Study of In- 
terpersonal Processes. Published by the William 
Alanson White Psychiatric Foundation, Inc. 4 is- 
sues; vol. 15 started F '52; $8 per year; $2 per issue; 
Mabel Blake Cohen, editor, 1711 Rhode Island Ave., 
Washington 6, D.C.: bk, B83, pus К 
Psychoanalytic Q—The Psychoanalytic Quarterly. 4 
issues; vol. 21 started no. I, '52; $8 per year; $2.50 
per issue; Raymond Gosselin, editor, Room 912, 57 
West 57th St, New York ro, N.Y.: bk, 133, 135, 
141, B43, B153, B323, B332 ; | 
Psychoanalytic R—The Psychoanalytic Review: An 
Educational American Journal of Psychoanalysis 
Devoted to an Understanding and Education of Hu- 
man Behavior. 4 issues; vol. 39 started Ja '52; $8 per 
year; $2.25 per issue; Nolan D. C. Lewis, managing 
editor, 70 Pine St., New York 5, N.Y.: bk, B2! 
Psychol B—Psychological Bulletin. Published by the 
American Psychological Association, Inc. 6 issues; 
vol. 49 started Ja '52; $7 per year; $1.25 per issue; 
Wayne Dennis, editor, Edward Girden, review edi- 
tor, Department of Psychology, Brooklyn College, 
Brooklyn, і bk, 112, 135, 141, 352, 357, B20, 
B22, B43, B50, B71, B80, Boo, Втоў, B112-3, B117, 
B121, B123, B143, B147, B153, B155, B168, B182-3, 
B186, B192, B197, B226, B228, B243, B267-8, B274, 
B282-3, B301, B323, B361, B363, B368, B379, B407, 
B419, B422 — 
Psychol Service Center J—The Psychological Service 
Center Journal. Published by the International Psy- 
chological Service Center, Inc. 4 issues; vol. 3 
started with a combined issue nos. 1 and 2, Mr and 
Је °51; $5 per year; $2 per issue; Russell Graydon 
Leiter, editor, 1275 New Hampshire Ave. N.W., 
Washington 6, D.C.; W. Grant Dahlstrom, review 
„editor, Department of Psychology, State University 
_of Iowa, Iowa City, Iowa: bk, 109, 127 
Psychometrika—Psychometrika: A Journal Devoted 
to the Development of Psychology as a Quantitative 
Rational Science. Official journal of the Psycho- 
metric Society. 4 issues; vol. 17 started Mr °52; $10 
per year for current subscriptions (2 copies of each 
issue) ; $5 per back volume; $1.25 per back issue 
(current issues not sold) ; Dorothy C. Adkins, man- 
aging editor, University of North Carolina, Chapel 
Hill, N.C.; Clyde H. Coombs, review editor, Univer- 
sity of Michigan, Ann Arbor, Mich.: bk, B22, B83, 
Broo, Br11-2, B147, B157, B205-6, B325 
Psychosom Med—Psychosomatic Medicine: Experi- 
mental and Clinical Studies. Published under edi- 
torial supervision of the American Psychosomatic 
Society, Inc. 6 issues; vol. 14 started Ja-F '52; $7.50 
per year; $1.75 per issue; Carl Binger, editor-in- 
chief, 551 Madison Ave, New York 22, N.Y.: bk, 
112, 120, 126, 
B301, B332, B355 


Pub Opinion O—The Public Opinion Quarterly. Or- _ 
gan of the American Association for Public Opinion `` 


Research. 4 issues; vol. 16 started spring '52; $5 per 

a year; $1.50 per issue; Richard Snyder, editor, Box 
486, Princeton, N.J.: bk, 57. B282, B361 

Pub Personnel R—Public Personnel Review: The 
Quarterly Journal of the Civil Service Assembly of 
the United States and Canada. 4 issues; vol. 13 
started Ja '52; $6.50 per year; $2 per issue; Ken- 
neth O. Warner, editor, Room 456, 1313 East 6oth 
St., Chicago 37, Ill.: bk, 786, B22, B53, ees B183, 
B187, B239, B379, B383 


| -Q J CHILD BEHAVIOR—OQuarterly Journal of 
Child Behavior. 4 issues; vol. 4 started Ја ’52; $8.50 
per year; single issues not sold; Nolan D. C. Lewis, 


, B43, B71, B83-4, B143, B171, B282-3, 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 1098 


managing editor, 70 Pine St, New Y. А 
o duri ees banh ork §, NY.: 
J Exp Psychol—The Quarterly Journal j 
mental Psychology. Published for the E oM 
Psychology Group. 4 issues ; vol. 4 started "52; 30s. 
р Lordi aii se É mais Russell Davis, editor. 
sychologica ratory, Downing St., idge, 
England: bk, B356 s Cambridge, 
Q R Biol—The Quarterly Review of Biology. 4 issues: 
vol. 27 started Mr '52; $6 per year; $1.75 per issue: 
B. H. Willier, editor, H. B. Glass and Carl P. Swan- 
son, review editors, Johns Hopkins University, Bal- 
timore 18, Md.: bk, 121, 133, 141, B83, Bior, Bro, 
Bros, Birr, B190, B282, B323, B332, B379, B422 
Q R Psychiatry & Neurol—Quarterly Review А. 
chiatry and Neurology: International Record of 
Psychiatry and Neurology. Published by the Wash- 
ington Institute of Medicine. 4 issues; vol. 7 started 
Ja '52; $11 per year; $2.75 per issue; Winifred Over- 
holser, editor-in-chief, Washington Institute of Med- 
icine, 19% East 62nd St, New York 21, N.Y.: bk, 
112, 120, B143, B153 


RORSCH RES EXCH & J PROJ TECH—Ror- 
schach Research Exchange and Journal of Projec- 
tive Techniques. Title changed to Journal of Projec- 
tive Techniques (q.v.) beginning with vol. 14, no. 1, 
'so. Title was Rorschach Research Exchange 
through vol. ro, no. 4, D '46: test, 132; bk, 112, 121, 
126, B43, B332 


SCH & SOC—School and Society. Published by the 
Society for the Advancement of Education, Ine, 52 
issues in 2 volumes; vol. 74 started Jl 7 '51; vol. 75 
started Ja 5 '52; vol. 76 started Jl 5 '52; $7 per year; 
20% per issue; I. L. Kandel, editor, 15 Amsterdam 
Ave., New York 23, N.Y.: bk, B425 dis 

Sch R—The School Review: A Journal of E 
Education. Published by the Department of ] "m 
tion, University of Chicago. 9 issues (omitting id 
Jl, Ag) ; vol. бо started Ja '52; $4.50 per ye al 
per issue; Maurice F. Seay, chairman of the е ib | 
committee, Патет. S er Chicago 37, ++ 
bk, B144, B159, B271, B326, B42 м): 

Sch Sci 4 Mask—School Science апа Ишан 
A Journal for АП Science and Mathematics Ea 
ers. Published by the Central Association © ЖИНА, 
and Mathematics Teachers, Inc. 9 issues fm on 
Л, Ag, S); vol. 52 started Ja '52; $3.50 Per evil 
50¢ per fer dus W. Warner, editor, La 
Ind.: bk, B183, B342 й sation 

Sci—Science. Published by the American дио, 
for the Advancement of Science. 52 ire "d 

umes; vol. 115 started Ја 4 E vs s M. 
259 ре Matsa chusetts Ave; 
B105 

t of 


Advancemen ] 


tart 
Science. 12 issues in 2 volumes; vol. 74 84 per 
's2; vol. 75 started Jl ’52; $7.50 per Yea; 7 1 is 
Massachusetts Ave., N.W., Washington 5: D.C.: % 


B 4 The 
Scottish Ed J—The Scottish Educational Journal bol: 

official organ of the 

land. 52 issues; vol. 

year; 3d. per issue; Janet 

Moray Place, Edinburgh, 3, 


B84, B360 Ж 
5004 Ed- Social Education. Official s 
National Council for the Social = nd 
by the Council in collaboration w! | 


Educational Institute 6 ч 
p B18, 


, 52 » 
35 eum JA or 
Scotland: bk, 34 
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Historical Association. 8 issues (omitting Je Л, Ай ТАТ NEWSLEPTER—The rat ўе. Vols. 


per is- | 


S); vol. 16 started Ja '52; $4 per year; 5 
sue; Lewis Paul Todd, editor, 1201 Sixteenth St., 
N.W., Washington 6, D.C. : bk, B70, B368, B409 
Social Forces—Social Forces: A Scientific Medium of 
Social Study and Interpretation. 4 issues; vol. 30 
started O '51; $5 per year; $1.50 per issue; Howard 
W. Odum and Katharine Jocher, editors, University 
of North Carolina, Chapel Hill, N.C.: bk, 57, B323, 


B361 

Sociol & Social Res—Sociology and Social Research: 
An International Journal. 6 issues; vol. 37 started 
S-O '52; $3.50 per year; 70 per issue; Emory S. 
Bogardus, editor, 3518 University Ave., Los Angeles 
7, Calif.: bk, 57, 101, 126, В217 

Southw Social Sci Q—The Southwestern Social Sci- 
ence Quarterly. Official organ of the Southwestern 
Social Science Association. 4 issues; vol. 33 started 
Je '52; $4 per year; $1 per issue; Oliver Benson, 
editor-in-chief, University of Oklahoma, Norman, 
Okla.; H. Malcolm Macdonald, review editor, The 
University of Texas, Austin, Tex.: bk, 57 


er 


PERIODICAL DIRECTORY & INDEX 


2 
mese Times Lit Sup 


r; 


1-2 were publ 


published b PR Joure 


separates on $ 
TE | as parts of the Jour- 
“Techniques inning. with "e 
1952, The TAT Newsletter is Avail- 
"able only as a part of the Ji of Pro, jective Tech- 
niques; Edwin S. Shneidman, editor, ACA? Neüto- 
psychiatric Hospital, Wilshire and Sawtelle Blvds., 
Los Angeles 25, Calif. : test, 103, 105; bk, 114, 1 40 
Teach Col J—The Teachers College Journal. p^ - 
lished by Indiana State Teachers College. 6 issues 
(О, N, D, SMS and Му-Је) ; vol. 23 started О 
"51; gratis; Charles Hardaway, editor, the College, 
Terre Haute, Ind.: test, 570 
Times Ed Sup—The Times Educational Supplement. 
52 issues; 43rd year started S 5 '52 ш 1940), 245. 
per year; 4d. E issue ; PUR ouse Square, 
London E.C.4, England: bk, B52, S Вт, B247 
Times Lit Sup—The Times Literary монен. 52 
issues; 5154 year started Ја 4 '52 (no. 2,604) ; 6d. per . 
issue; Printing House Square, London E.C.4, Eng- 
land: bk, B83 
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References are to test and book entries, not to pages. Stars indicate test publishers which issue 
catalogs devoted entirely or in large part to tests; asterisks indicate other publishers of one or 


more tests listed in this volume. 


Branch office addresses are given for selected test publishers. 


Numbers preceded by "test" refer to tests; numbers preceded by "bk" refer to books. 


VOCE MEN dowd e Inc., Rockville M rs 
-Y.: test, 20, 67, 152, 161-2, 191-2, 274-5, 406, 4 
485, 536, 549, 590, 603, 626, 664, 666-7, 676-7, 681, 695 
*Addison-Wesley Press, Inc., Kendall Square Bldg., 
Cambridge 42, Mass. : test, 56; bk, 57, 140, B38, B358 
p University, Maxwell Air Force Base, Ala.: bk, 


24 
Almqvist & Wiksell, Box 96, Stockholm 1, Sweden: 
‚ B204 

American Association for Health, Physical Education, 
and Recreation, National ucation Association, 
1201 16th St., N.W., Washington 6, D.C.: bk, B289 

*American Automobile Association, Pennsylvania Ave. 
at 17th St., Washington 6, D.C.: test, 521-2, 790 

American Chemical Society. See Examinations Com- 
mittee. 

*American Council on Education, 1785 Massachusetts 
Ave., Washington 6, D.C.: test, 26, 510; bk, B25, 
B109, Br11, B122, B205-6, B217, B247, B320, B329- 
31, B364, B372, B390 . е 

*American Dental Association, 222 East Superior St., 
Chicago 11, Ill.: test, 788 

American Educational Research Association, 1201 16th 
St., N.W., Washington 6, D.C.: bk, B290-2 

American Foundation for the Blind, Inc., 15 West 16th 
St., New York 11, N.Y.: bk, B118 

American Home Economics Association, 1600 20th St., 
N.W., Washington 1, D.C.: bk, B241 

American Institute for Research, 410 Amberson Ave., 
Pittsburgh 32, Pa.: bk, B149, B412-3 

American Management Association, 330 West 42nd 
St., New York 18, N.Y.: bk, B23, B26, B28, B119 

aoe Optical Co., Southbridge, Mass.: test, 

I 

American Optometric Association, Inc. See Depart- 
ment of Public Information. 

*American Orthopsychiatric Association, Inc., 1790 
Broadway, New York то, N.Y.: test, 144 

American Psychological Association, Inc., 1515 Massa- 
chusetts Ave., N.W., Washington 5, D.C.: bk, B39, 


Bo8, B140, B142, B146, B174, B194, B223, B229, 
B233, B240, B249, B251, B269-70, B302, B334, B350, 
B391, B415 Е 

Antioch College. See Fels Institute. — . ni 

* Aptitude Associates, Док 1128, Washington, D.C.: 
test, 170, 204, 329, 77 р 

* Aptitude Test Eric Box 239, Swarthmore, Pa.: 
test, 459, 725, 827 l 

Arco есы о Lexington Ave, New York 
17, N.Y.: bk, B310-1, B392 

E. 7 Arnold & Son ee 41-43 Maddox St, London 
W.1, England: bk, B343 е б 

Association for Supervision and Curriculum A 
ment, 1201 16th St, N.W., Washington 6, D.C.: 95 
B293 РЕР? C. Jones, 

Association of Technical Institutions, % J. C. , 
Honorary Minus The Polytechnic, 309 Regent 
St., London W.1, England: bk, B16 d National 

Australia. See Department of Labour an 
Service. 3 

Xdustralion Council for Educational Research, К 
Collins St., Melbourne, C.r, Australia 5 test, 149, 
267-73, 308-9, 528, 700, 719, 756; bk, B199 


*BANKS Upshaw & Co., 707 Browder St, Dallas ^ 
Tex.: test, , 
БӨУ ВооЁ Со. wd е Rutland Road, Harrow 
Middlesex, England: bk, B220  , Я i 
*Baster Foundation for Research in Education ШЕ 
315 Central Park West, New York, N. ub "AY: 
Beacon HE тот Park Ave, New York 17, 
k, B248, tate 
DUAE. Bashke Congdon Campus School, Stal 
Teachers College, Potsdam, N.Y.: eshi Ра. bh, 
Bell Publishing Co., Box 71, Drexe ] 


B250 tate 
*Arihur L. Benton, Department of Psychology 5 
University of Iowa, Iowa City, towa: ler, Ind: 
*Benton Review Publishing Co., be бот, 607,701 
test, 159-60, 392, 430, 509, 643, 671, 083, P" 
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1101 | 


Willis i Berg, 1512 Spruce St., Berkeley, Calif.: bk, 
139, 529 
J. J. Berliner & Staff, present address unknown: bk, 


Br 

Basil Blackwell & Mott, Ltd., 49 Broad St., Oxfor 
England: bk, B366-7 А s 

*Blackwell Scientific Publications, Ltd., Oxford, Eng- 
land: test, 111; bk, 112, 122, B44, B153, B192, B196, 
B253, B359 

DUM Blumenthal & Co., Inc., Shelton, Conn.: test, 


65 

Board for Vocational Education, State of Iowa, Des 
Moines r9, Iowa: bk, B89 

*Emory S. Bogardus, 3518 University Ave., Los An- 
geles 7, Calif.: test, 88 

*Book-A-Day Series, Box 41, Hill City, S.D.: test, 189 

British Book Centre, Inc., 122 East 55th St., New York 
22, N.Y.: bk, B407 

British Information Services, 30 Rockefeller Plaza, 
New York 20, N.Y.: bk, B256 

William C. Brown Co., 915 Main St, Dubuque, Iowa: 


bk, B76 

*Martin M. Bruce, Apt. 6A, 624 East 20th St, New 
York 9, N.Y.: test, 824 

Bruce Publishing Co., 400 Broadway, Milwaukee 1, 
Wis.: bk, B75 H 

*Charlotte Buhler, 1127 North Sweetzer Ave., Los 
Angeles 46, Calif.: test, 147; bk, 119, B63 

Bureau of Business Education, Commission for Voca- 
tional Education, State Department of Education, 
Sacramento 14, Calif.: bk, B141 

Bureau of Business Research, июнну of Missis- 
sippi, University, Miss.: bk, B1 

XBurcau of Educational Measurements, Kansas State 
Teachers College of Emporia, Emporia, „2 fest, 
38, 149, 199, 207, 254, 264, 281, 431, 439, 450-1, 472, 
504, 509, 520, 539, 578, 673, 708 К 

Bureau of Educational Research and Service, College 
of Education, University of Wyoming, Laramie, 
Wyo.: bk, B56 А 

Bureau of Educational Research and Service, Medical 
College of Virginia, Richmond 19, Уа. : bk, B203 

*Bureau of Educational Research and Service, State 
University of Iowa, Iowa City, Iowa: test, 16, 164, 
166-7, 197, 224, 393, 395, 426, 621-2, 638-0, 814 

Bureau of Guidance and Records, University of Wis- 
consin, Madison, Wis.: bk, Boo-1 E 

*Bureau of Publications, Teachers College, Columbia 
University, New York 27, N.Y.: test, 19, 132, 200, 
232, 421, 474, 477, 481, 558, 563, 566; bk, 123, Broz, 
B120, B157-8, B210, B222, B383 hehe 

Bureau of Reference, Research and Statistics, New 
York Public Schools, 110 Livingston St., Brooklyn 
2, N.Y.: bk, B30, B234 

Bureau of Research, Iowa State Teachers College, 
Cedar Falls, Iowa: bk, B347 i E 

Burgess Publishing Co., 426 South Sixth St, Min- 
neapolis 15, Minn.: bk, B273 § 

*Burton Bigelow Organization, 274 Madison Ave., 
New York 16, N.Y.: test, 826, 830 


*C.P.S. Co., P.O. Box 42, Gracie Station, New York 
28, N.Y.: test, 103 Ў 

California. See Bureau of Business Education. 

XCalifornia Test Bureau, 5916 Hollywood Blvd., Los 
Angeles 28, Calif.; rro South Dickinson St., adi- 
son 3, Wis.; 206 Bridge St., New Cumberland, Pa.: 
test, 2, 23, 62, 66, 151, 168, 175, 226, 228, 282, 366, 411, 
ae 417, 464, 530, 5 55 540-2, 569, 645, 710, 743. 
46, 750, 761, ; bk, B124 

Cambridge 7 ем Press [London], Bentley 
House, 200 Euston Road, London N.W.1, England: 
bk, 231, B200, B221, B370, B380, B420 
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Crowley 


Cambridge University Press [New York 

А Бет Мет York 28 СЕ bk, Вб пера 
apitol Publishin , Inc, Irvi - 
NY: bh a 0. Inc., Irvington-on-Hudson, 

*A. Carlisle & Co., 135 Post St, San Francisco 8, 
Calif.: test, 511 

* Russell N. Cassell, 344 В St., Hayward, Calif.: test, 


33 

Cassell & Co. Ltd., 37-38 St. Andrew's Hill, Queen 
Victoria St, London ECA England: bk, B37! 

Catholic University of America Prat 620 Michigan 
Ave., N.E., Washington 17, D.C.: bk, B49, B68, B88, 
B125, B162, B417 

*Center for Psychological Service, George Washing- 
ton University, Washington, D.C.: test, 89, 473, 806, 


16, 620 

*Center for M Education, New York University, 
New York, N.Y.: test, 523-4 

*Central Institute for the Deaf, 818 South Kings High- 
way, St. Louis 10, Mo.: test, 646 

Chapman & Hall Ltd., 37 Essex St, London W.C.2, 
England: bk, B183, B257, B379 

*Character Research Association, 946 Goodfellow 
Blvd., St. Louis 12, Mo. : fest, 39, 205 

Chartwell House, Inc., 280 Madison Ave., New York 
16, N.Y.: bk, B307 

Child Care Publications, 30 West 58th St., New York 
19, N.Y.: bk, B66 Е 

*Child Development Laboratories, University of Mich- 
igan, Ann Arbor, Mich.: test, 20; bk, B299-300, B354 

Child Development Publications, Society for Research 
in Child Development, Inc., Fayerweather Hall, 
East, Northwestern University, Evanston, Ill.: bk, 
B190, B232 ў 

*Citizenship Education Project, Teachers College, 
Columbia University, New York 27, N.Y.: test, 36 

Civil Service Assembly of the United States and Can- 
ada, 1313 East 6oth Street, Chicago 37, Ill.: bk, 
B148, B338 

*Walter V. Clarke Associates, 503 Sowans Road, Bar- 
rington, R.I.: test, 27 } 

Clarke, Irwin & Co., Ltd., 103 St. Clair Ave., West, 
Toronto s, Canada: bk, B84, B255 

*College Entrance Examination Board, 425 West 
117th St, New York 27, N.Y.: test, 178, 237, 244, 
248-50, 259, 285-6, 367-8, 526, боо, 617, 633, 662, 808- 

; bk, Bo4-7 : h н 
College Туба Co., 527 State St, Madison 3, Wis.: 


4 2 D п 
Columbia еге See Bureau of Publications ; 
Citizenship Бае мое: Columbia University 
Press; and King's Crown Press. 

C columbis ишү үз 2960 Broadway, New 
York 27, N.Y.: bk, B42 
Columbia. University Press, 2960 Broadway, New 

York 27, N.Y.: bk, 133, B160, B371 
* Committee on Diagnostic Reading Tests, Inc., 419 
West 119th St, New York 27, N.Y.: test, fh 
*Concordia fg emer Бона 3558 South Jefferson, 
St. Louis 18, Mo.: test, д А 
ЖС, aee Test Division, Educational Testing Serv- 
ice, Princeton, N.J. : test, 4-7, 26, 45, 58, 60, 155, 176, 
183-4, m 213, 238, 245, p 2 nies у оса 
о, 370, 387-8, 423, 438, 440, ^ , 557, 
7, 579, S812, 585, 504-5, 601, 606, 618, 623-4, 629, 034, 
663, 668, 684-7, 702 Vv 
Cornell University. See University Testing and Serv- 
ice Bureau. $ d 
* Il University Medical College (orders shoul 
iir fo. "Albert J. Erdmann, Jr., The New York 
Hospital, 525 East 68th St, New York 21, NY): 
test, 470 
Il Publications, Adams, Mass.: bk, B214-5 
Seventy Milner & Co, Detroit 26, Mich.: test, 418 


Crown 


Crown Publishers, Inc., 419 Fourth Ave., New York 
16, N.Y.: bk, B261 

J. Curwen & Sons Ltd., 24 Berners St., Marylebone, 
London W.1, England: bk, B373 


*DARTNELL Corporation, 4660 Ravenswood Ave., 
Chicago 40, Ill.: test, 773, 781, 789; bk, Вто, B304 
Stephen Daye Press, 105 East 24th St, New York 10, 

N.Y.: bk, B268, B284 

*Dembar Publications, Inc., 114 South Carroll St., 
Madison 3, Wis.: test, 793; bk, B78 

*] M. Dent & Sons (Canada) Ltd., 224 Bloor St, 
West, Toronto 5, Ontario, Canada: fest, 792 

Department of Classroom Teachers, National Educa- 
tion Association, 1201 16th St., N.W., Washington 6, 
D.C.: bk, B294 

Department of Educational Research, Ontario College 
of Education, University of Toronto, 371 Bloor St., 
West, Toronto 5, Ontario, Canada: bk, B211 

Department of Labour and National Service, Common- 
wealth of Australia, Melbourne, Australia: bk, B3 

*Department of Neuropsychiatry, Washington Univer- 
sity School of Medicine, St. Louis, Mo.: test, 60 

*Department of Physical Education for Women, State 
University of Iowa, Iowa City, Iowa: test, 

Department of Public Information, American Op- 
tometric Association, Inc., 707 Jenkins Bldg., Pitts- 
burgh 22, Pa.: bk, B8 

Department of Public Instruction, State of Illinois, 
Springfield, Ill.: bk, B219 

*Department of Public Service, General Mills, Inc., 
400 Second Ave., South, Minneapolis 1, Minn.: test, 
502 

Division of Chemical Education, American Chemical 
Society. See Examinations Committee. 

*Division of Educational Reference, Purdue Univer- 
sity, Lafayette, Ind. : test, 46, 83, 705, 798, 800, 803; 
bk, B47, B161, B319, B333, B344, B357 

Division of Industrial Relations, Graduate School of 
er Stanford University, Stanford, Calif.: bk, 

245 

*Division of Maternal апа Child Health, Massachu- 
setts Department of Public Health, 73 Tremont St., 
Boston 8, Mass.: test, 647 

Dennis Dobson Ltd., 12 Park Place, St. James's St., 
London S.W.r, England: bk, B52 

*H. & W. B. Drew Co., Jacksonville т, Fla.: test, 512 

Dryden Press, Inc., 31 West 54th St, New York 19, 

* N.Y.: bk, B212 

Duckworth, Gerald & Co. Ltd., 3 Henrietta St., 
Covent Garden, London W.C.2, England: bk, B348-9 

Dymock's Book Arcade Ltd., 424-426 George St., Syd- 
ney, Australia: bk, B260 


*ECONOMIC Institute, Box 1160, Chicago 90, Ш.: 


test, 807 
*R. W. Edmiston, Miami University, Oxford, Ohio: 
test, 580, 649, 738 
Educational Publishers, Inc. [Minneapolis]. See Edu- 
cational Test Bureau. 
Educational Publishers, Inc. [St. Louis], 122 North 
Seventh St., St. Louis 1, Mo.: bk, 128, B394 
*Educational Records Bureau, 21 Audubon Ave., New 
York 32, N.Y.: test, 202-3, 212, 400, 583, 602, 619, 
о 635, 718, 787; bk, B126-34, B342, B387, B380 
*Educational Test Bureau, Educational Publishers, 
Inc, 720 Washington Ave, S.E, Minneapolis, 
Minn.; 3433 Walnut St., Philadelphia 4, Pa.: 2106 
Pierce Ave., Nashville 5, Tenn.: test, 8, 94, 158, 230, 
246, 255, 265, 351, 372-3, 381, 385, 389, 410, 428, 434, 
436, 443, 447, 572, 506, 614, 627, 640, 650, 665, 674, 
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*Educational Testing Service, Princeton, N.J.: test, 
IO-I, 178, 186, 219, 227, 233, 237, 241, 244, 247-50, 
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259, 263, 285-6, 203, 367-8, 374, 442, 497, 505-6 
526-7, 600, 604, 617, 620, 630, 633, 637, boa je Peg 
703, 709, 797, Зо, о oe 815, 817; bk, Bg. 
207-9, B237, B321. See also Cooperati in 
^ sion, Educational Testing hes етеги 
ngineers Northwest, 100 Metropolitan Li 
Хемера B а test, 466 A SEE 
nglish Language Institute, University of Michi 
Ann Arbor, Mich.: fest, 235 y of Michigan, 
*Evaluation Division, Bureau of Educational Research, 
Ohio State University, Columbus, Ohio: test, 98. 
Evans Brothers Ltd., Montague House, Russell Square, 
London W.C.r, England: bk, B202, B297 
*E xaminations Committee, Division of Chemical Edu- 
cation, American Chemical Society, St. Louis Uni- 
versity, St. Louis 3, Mo.: test, 607-12 


FABER & Faber Ltd., 24 Russell Square, London 
W.C.1, England: bk, 126-7, B275-6 

*Family Life Publications, P.O. Box 337, Durham, 
N.C.: test, 488 

Federal Security Agency. See Office of Vocational Re- 
habilitation; and United States Office of Education. 

*Fels Institute, Antioch College, Yellow Springs, 
Ohio: test, 43 

Foundation Press, Inc., 268 Flatbush Ave. Ext., Brook- 
lyn 1, N.Y.: bk, B309 

Funk & Wagnalls Co., 153 East 24th St, New York 
17, N.Y.: bk, B45 


*GALTON Laboratory, University College, Gower 
St., London W.C.r, England: test, 312 . ў 
General Mills, Inc. See Department of Public Service. 
*Robert Gibson & Sons (Glasgow) Ltd., 45 Queen 

St., Glasgow, Scotland: test, 412 
Gordon Bookshop, Iowa City, Iowa: bk, B244 |. 
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113(18), 136(153, 290) 

Arny, Clara Brown: fest, 491-4, 502 

Aron, Betty: bk, 139, B29; ref, 113 
(18), 136(118, 154, 189, 290) ; f, 
46; other, 71(220 

Aronow, Miriam S.: bk, B30 

Aronson, Albert H.: other, B148 

Aronson, Marvin Lucius: ref, 71 
(241), 102(6), 117 (1075-6) 

An Seth: exc, B155; ref, 92 

2 

Arthur, Grace: test, 335, 359; ref, 
335 (42-3, 45), 349(18), 358 (461, 
480), 350 (1-2 

Arthur, Guy B., Jr.: bk, B28 

Arthur, Helen : exc, 121, 133, B190 

Artley, A. Sterl: ref, 5(1, 3), 155 
(33, 47), 547 (18, 23), 668(1, 3) 
shburn, Frank D.: ref, 526(4) 

Asher, E. J.: test, 301; ref, 660(25), 


751(6) 

Ashford, T. A. : ref, 607 (2), 608(2), 
610(9), 611(1), 612(1); cross 
ref, 621-2 ^ 

Association of Principals of Techni- 
cal Institutions: bk, B16 

Association of Technical Institu- 
tions : bk, B16 

Asthana, Н. S.: ref, 117(622, 656) 

Atkey, Richmond R.: ref, 361(105) 

Atkinson, John W.: ref, 136(119, 
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Axtell, S. B.: ref, 71(97) 
Ayer, Fred C.: fest, 198; bk, B36; 
ref, 198(1) ^ 
Ayer, ee L.: rev, 66, 702; 
cross ref, 15, 150, 408, & 
oe Howard dO ref, 
22 


Ayres, Leonard P.: test, 475 


B., E.: exc, 141 

B., E. K. C.: exc, B356, B406 

B., G. E. R.: exc, B74, B171 

B., L. B.: exc, B169 

Baas, Malcolm L.: ref, 742(116, 


159) 
Babb, Lawrence: bk, B272 
Babcock, Harriet: test, 31; bk, 760, 
B37; ref, 759(54), 358(404) 
Baccari, Francis: test, 249 
Bachrach, Arthur J.: ref, ан) 
Bacon, Arleighn Mae: ref, 358(368 
Baehr, Melany E.: ref, 47(17), 4 
(20), 59(21) 
Baer, L. Opal: ref, 77(126) 
Baggaley, Andrew R.: ref, 742(71) 
Bagley, Jessie W.: ref, 570(16) 
Baier, Donald Е. : exc, B419 а 
Bailar, John С.: test, 620 
Bailey, Albert Ernest: ref, 299(14) 
Bailey, Donald W.: ref, 277 (260) 
Bailey, prt Воо; ref, 277 
(170), 280(1 
Bailey, Helen K.: ref, 2(9), 28 
(26), 302(50), 358(442) 
Bair, John T.: ref, 730(3) 
Baker, Charles A.: ref, 35 (27) 
Baker, Florence: ref, 542(1), 555 
( 


2) 
Baker, С. Derwood: exc, B102 
Baker, Gertrude: ref, 117 (1142) 
Baker, Harry J.: test, 86, 288 n 
Baker, Lawrence M.: ref, HH 
Baker, dips : Ve 382, 391, 09% 
803; ref, 803(24. , 
РААТ L.: test, 43; fi 
Brot, B221 ; ref, 43(7-9 1 13-4), 
58 (425-6) 
Baldwin, Barbara Jean: réf, 650 


Baldwin, Marcella Vig.: ref, ™ 


Bale Robert F.: test, 565 bk, sh 


B38; ref, 56(1-3) ; 
Balinsky, ам! rev, 146, 39 


bk, B33 
Balint, Michael: ref, 134(4) 
Balken, Eva Rui Н pi 133 

Il, Eric G.: test, ; 
Ball Rachel Stutsman: bk, B365 

cross ref, 351 
Ballenger, H. L.: test, 165 tke 
Baller,” Warren R.: rev, 510, 5 

583; test, 794 ) 
Balloch, J. C.: ref, 117( З 
арР M.: test, 57 n 
Banks, Charlotte: rev 

314(50) 
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Bannon, W. J.: ref, 358(427) 
Banzet, Ernest M.: bk, B272 
Barghoorn, Elso S.: test, 600 
Barker, Clarence, Jr.: ref, 472(1) 
Barkley, Bill J.: ref, 144(19) 
Barnabas, Bentley: test, 38; ref, 71 
(98), 77 (191, 219), 747 (199) 
Barnes, Cunliffe: ref, 51(19), 
117(903), 144(28) 
Barnett, Albert: ref, 277(158), 763 


(80) 
Barnett, Gordon James: ref, 747 


(240 

Barnett, Irving: ref, 117(904), 361 
(259) i 

Barnette, W. Leslie, Jr.: bk, B39; 
ref, 621 (17, 19), 730(1, 4), 742 
(117, 135, 187-8), 751(8, 13), 763 
(81, 92), 766(27, 43), 810(3, 5), 
277(200, 261) 

Barnowe, Theodore Joseph: ref, 
282 (18), 302(47) 

Baron, Samuel: ref, 117(905, 1077) 

Barr, A. S.: test, 793; cross ref, 806 

Barrell, Robert Poindexter : ref, 117 


(1078) 
Barrett, Albert M.: ref, 71(192) 
Barrett, Dorothy M.: ref, 460(3) 
Barrett, E. R.: fest, 149 
Barrett, Harry O.: ref, 223(5-6), 
224(22-3, 25) 
Barrett, Linton L.: test, 259 
Barrett, Ruth Evelyn: ref, 129(22) 
Barrett, William G.: exc, B43 
Barron, Emerson M.: ref, 71(242) 
Barthelmess, Harriet M.: cross ref, 
15, 588 
Bartlett, E. M.: exc, 345 
Bartlett, Edward R. : ref, 28(18) 
Bartlett, Е, C.: exc, Bror 
Bartlett, Marian R.: exc, B379 
Bartlett, Marion W.: ref, 77 (89) 
Basiul, Walter Joseph: ref, 277 


(232 

Bass, Bernard M.: exc, B191; ref, 
751(14) 

Bateman, Richard M.: ref, 28(94), 


46 (44), 742(118) 
Batmale, Louis F.: ref, 26(20) 


Bauernfeind, Robert H.: test, 90; 
ref, 90(1) 

Baughman, Emmett E.: ref, 117 
(1079-80) 


Bauman, Mary K.: bk, B40 
Baxter, Brent: rev, 770, 783 
Baxter, Edna D.: test, 32; ref, 32 (1- 


2 
Bay, Margaret S.: ref, 361 (149) 
Bayles, Ernest E.: cross ref, 634 
Bayley, Nancy: exc, 382; ref, 77 
ey 324(60), 358(443), 361 
(196), 742(119, 189); cross ref, 


341, 351 

Beal, A. Elizabeth : ref, 28(31) 

Beamer, George C.: ref, 742(72, 
84), 747 (186), 759(50), 763(73) 

Bear, Robert Murray : rev, 575, 559; 
ref, 531(1) ; cross ref, 155, 547 

Beardsley, Seymour Willis: ref, 10 


2 
Bechtoldt, Harold: rev, 711, 715 
Beck, Betty S.: test, 175 
o Roland L.: fest, 175; cross ref, 
, 12 


Beck, Samuel J.: bk, B41; exc, 
B355; ref, 117(671-2, 780, 906, 
1010, 1081, 1203) ; cross ref, 28 

Becker, George J.: ref, 361(150), 


716(63) 
БЕШИ Albert Sidney: ref, 299 


(8) 

Bedell, Ralph C.: cross ref, 73 

n George Stewart: ref, 134 

I 

Beecher, Dwight E.: test, 796, 804; 
bk, B42; ref, 804(1) 

Beechley, Robert M.: ref, 358(481) 

Beeler, Nelson F.: test, 617 

Beers, Fred S.: test, 213; exc, B22, 
B383; cross ref, 6 

Beers, Lester D.: test, 635 

Beier, Ernst G.: ref, 136(249) 

Bell, Harold Vernon, Jr.: ref, 88 
(13, 18) 

Bell, Hugh M.: test, 28, 76, 84; ref, 


28(16, 65) 

Bell, John Elderkin: rev, 103, 105; 
bk, B43; exc, 112, 114, B355; ref, 
105 (1), 108 (3), 111(3), 113(10), 
115(5, 8), 117 (673, 781), 129 (16, 
23), 130(§), 134(5, 7), 136(120, 
155), 144(13, 20), 147(3), 361 
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B20; ref, 103(1), 113(18), 117 
(907), 136(156, 190-4, 200); f, 
117; other, 115(12), 117 (960, 981, 
994), 132(2), 134(22), 136(192), 


144(35 
Bellak, Sonya Sorel: test, 103; ref, 
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Bellows, Roger M. : exc, B121, B123 

Belman, Н. S.: ref, 48(14, 21), 173 
(7, 9), 277 (262), 742(160, 190), 
762 (4), 77410, 11), 799(3-4) 

Bemb, Goldie: bk, B21 

Benarick, Stanley J.: ref, 71 (243) 
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Berger, Dorothy: bk, B149 
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(645), 136 (113) 

Bergstresser, John L.: ref, 60(2) 

Berk, Norman: ref, 117(782), 120 
(24), 136 (157) 

Berks, Mary Dee Drummond: ref, 
354(3, 5), 361 (149) 

Berkshire, Roger : ref, 277 (234) 

Berman, Abraham B. : ж) 28(49) 
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(16), 356(45), 358(437), 301 


(181) 
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Bernreuter, Robert G.: rev, 320; 
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cross ref, M 283 

Berrien, F. K.: ref, 292(16), 660 


(19) 
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Berryman, George H. : ref, 71 б) 
Bersadsky, Leona: ref, 117(783) 
Bessell, Harold : ref, oo i 305 


(7) 
Bessent, Trent E.: ref, 340(22) 
Best, H. L.: ref, ON 

Betz, William: cross rej, 387 
Bezdek, V.: bk, B17 

Bhatia, C. M.: bk, B48 

Bialick, Irving: ref, 117 

Bickel, Charles L.: test, 619; cross 


ref, 618 
Biegel, Mark М.: ref, golo) 
Bier, William C.: bk, B49; ref, 71 
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Biggy, M. Virginia: ref, s (2) 
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other, 37 (23), Шу (6a), 3004), 
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B422; ref, 77 (189), 144 (10-1, 14) 
Bills, Marion А, : ref, 320(1) ; cross 
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1(340 
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634(1), 718(3) 
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Birch, Jack W.: ref, 292 $2), 349 
(21, 24), 358(47 
Birch, Jane R.: ref, (24) 
Bird, Robert G.: ref, 714 (20) 
Birge, William R.: ref, 65(2 
Birkner, Edward C.: ref, 71 (152) 
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20), 361(270-0, 
Bischof, Tedford eet ref, 714 
(21), 277 (235, 
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Bitterman, M. E.: ref, 281(84) 
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Blackburn, J. M.: cross ref, 334 
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Blacker, С. P.: exc, B143 
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Blair, W. R. N.: ref, 71 (193) 
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(194), 117(630, 677, 910, 1056), 
129 (17), 361 (151) 
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Blaker, C. P.: exc, B1 

Blanchard, Howard L.: ref, 277 
(201), 560(4), 763(82) 

Blanton, Richard: ref, 71(244), 117 
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(1083) 

Blessing, Harold D.: ref, 134(16) 
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Block, Stanley L.: ref, 117(1015 

Blommers, Paul J.: rev, 368, 378, 
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cross ref, 6 

Bluett, С. G.: ref, 315(9), 751(4) 

Blum, or S.: test, 102; ref, 102 
(1-3 d 

Blum, Lawrence Philip: ref, 71 
(74), 747 (187) 

Blum, Milton L.: rev, 714, 741; bk, 
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Blumberg, Eugene: bk, B229; ref, 
361 (348) 

Blumenfield, Walter: ref, 324(58) 

Blyth, M. Isobel: test, 388 

Bock, Carson: ref, 136(158) 

Bodily, Gerald P.: test, 311 

Bodine, J. H.: test, 604 
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Boehncke, Freida C.: ref, 292(26), 
349(2) 

Bogardus, Emory S.: test, 88; exc, 
126, B217; ref, 88(1-3, 7, 9, 14- 
6) ; f, 46, 88 

Boileau, Vance: ref, 117(1186), 120 
(72), 136(287) 

Bolin, Byron J.: ref, 117(911) 

Bolton, Ina A.: ref, 358(371) 

Bond, Elden А. : ref, 209(1) 

Bond, Guy L.: ref, 358(463) ; cross 
ref, 561 

ker, Ivan A.: rev, 544, 558; 
cross ref, 2, 18, 530, 543, 556 
Booth, Bradford A.: test, 186 
Gotthard: exc, 120, B43, 


rev, 4, 628; 


Bordin, Edward S.: rev, 742, 747; 
ref, 742(114), 747(209); cross 


ref, 743 

Borg, Walter R.: ref, 28(108), 47 
(21), 48(24), 59(24), 277 (202), 
250 90 -3), 747(242), 
70003 

Boring, Edwin G.: ref, 281 (81) 

Borko, Harold: ref, 715(5 

Borow, Henry: test, 34, 810; exc, 
B272; ref, 34 (1-2). o 

Borstelmann, Lloyd J.: ref, 134(17- 
8, 30, 39-40) 

Boswell, Charles Alan: ref, 63(4) 

Botel, Morton : ref, sar (17) 

Botsford, Frances R.: test, 457, 463 

dd PN Jack: ref, 31(45), 361 


(336) 
Bou, Ismael Rodriguez : ref, 176(1), 
557 (1), 576(1), #70) 
Boulding, Kenneth Е. : test, 672 
Boulger, Catherine: ref, 359(2) 
Bourke, William Theodore ; ref, 117 


(785) , 
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716 (64) 

Bovard, John F.: bk, B55 

Bovée, Arthur Gibbon: ref, 238(2), 
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Bowers, Henry: test, 792; ref, 702 
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Bowles, J. W., Jr.: bk, B79 
Bowman, : ref, 71(120), 
361( 163) 
Bowman, Mary Jean: test, 672 
Boyd, Gertrude: bk, B56; exc, B425 
Boyd, J. B.: exc, B3 
Boyer, Roscoe A.: bk, B57 
Boyle, Carrie E.: test, 418 
Boynton, Paul: ref, 77 (162) 
Boys, Floyd: bk, B104 
Brackbill, Betty J.: ref, 136(252) 
Brackbill, Glen A.: ref, 136(251-2) 
Braden, George D.: ref, 815 (2) 
Bradfield, Anne Frederiksen: ref, 
71(195), 117(912), 304(9), 742 
(164), 747 (243) 
Bradford, E. J. G.: rev, 105, 132, 201 
Bradford, Jeanne M. : test, 5, 6, 155, 
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Bradley, Gladyce H.: ref, 280(23) 
Bradshaw, G. D.: exc, B183 
Bradway, Katherine Preston: ref, 

94(59), 117 (1084), 358(401) 
Brady, Elizabeth Hall: bk, B372 
Bragdon, Henry W.: test, 662 
Brainard, Paul P.: test, 737 
Brainard, Ralph T.: test, 737 
Bene Lawrence M.: ref, 277 


( 
Brandenburg, G. C.: ref, 803(1-2) 
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Branham, V. C.: exc, B315 
Branom, M. E.: test, 8 

Bransford, Thomas L.: exc, B183 
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(72) 
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Braverman, Sydell: ref, 136(156) 
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Breed, Frederick S.: ref 
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regman, Elsie O.: test, зіл? үг 
281 (76); ў, ауу Ted 
Brener, Roy: ref, 117 (678) 


Brenke, W. C.: cross ref, 387 — 
Bresnahan, Т. J.: ref, ИШ, 
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e Ann: cross Ө 2s 
» 45 
Brickman, William W.: exc, B425 
Рибе, Claude F. : bk, B22; ref, 18 
12 

Bridgman, С. S.: exc, Boo 
Brierley, Marjorie: exc, B85 
Briggs Нер s. bk, B58 

rill, Moshe: ref, 292(17, 2 
Brinkley, Stuart R.: test, rth 
Bristow, William H.: test, 513 
par, Steuart Henderson: erc, 


361 
Broadbent, Todd H.: ref, 37(32, 


39) 
Broadhurst, John C.: ref, 763 (74) 
Broadley, Charles V.: bk, B59 
Broadley, Margaret E.: bk, B59 
Brodman, Keeve: test, 37,479; rel, 
37 (2, 4-5, 9, 15-6, 21-2, 32, 39) 
Brody, Abraham Barnet: ref, 17 
(913), 361(261), 716(87) 
Brody, David S.: ref, 71(196), 742 
6 
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Brody, M. B.: exc, B18, B332, B407 
Brogden, Hubert E.: rev, 47, 59; 


ref, 77 (163) 
Bromley, Ann: ref, 5(8), 277 (230), 


379(2) 
Brooks, Esther: ref, 96(3) 
Brooks, Fowler D.: ref, 358059) 
Brooks, Leah E.: ref, 51(12), 301 


(126) 
Brooks, Lee M.: ref, 88(8) 
Brooks, Nelson: rev, 242-3; ref, 237 


Bikers, Norris L.: bk, B104 
Broom, М. E.: ref, 535(4), 5420) 
2 
MA ET Elizabeth: ref, 17 
(612) 
Brosin, Henry: ref, 117 (679) 4 
Brotemarkle, R. А.: cross 7), 47 


Ва, Leo Joseph : ref, 6(23) ) 
Brothers, Wilbur Leo: ref, 71 s 
155(64), 277 (237), 547 (29^ 


= . 
Ее Paul ].: ref, 60(2) 5 other, 


Brower, Daniel: ach 2880), э, 
‚ 75, 100), Д 
А 136(121, 253), 361 (152) 
Brower, Edna: ref, 136 (105 2 
Brown, Alfred S.: cross 7¢ e) 
Brown, Andrew v € 203; 
: cross ref, \ 
ud acies Manley: reh 74 
(о) W.: bk, В? 
Brown, Clarence W.: Р» (1997) 
Brown, Clinton C.: ref, 117 
Brown, Douglass: test; 429 
Brown, E. K.: test, I 
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Brown, M. A.: ref, 71(198) 

Brown, Malcolm: ref, 117(914, 
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Brown, Martha Marlin: ref, 129 


(12) 
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Brown, Ralph R.: ref, 117(681) 
Brown, Roger C.: ref, 350(4) 
Brown, Sara Ann: bk, B89 
Brown, Warren T.: ref, 139(291) 
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743(8) o 
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Brozek, Josef: exc, B71, В83-4, 
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(102-3, 136, 245), 129 (18, 28) 
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B187 
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ref, 117 (631) 

Bryan, Alice I.: ref, 77 (98, 142) 

Bryan, J. R.: ref, 28(109) 

Bryan, Miriam M.: rev, I5, 4I 
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Buchanan, Paul Cramer: ref, 715 


(10) 
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Bucklow, M.: exc, Вт19 
Budd, Thomas A.: ref, 787 (3) 
Budrow, Gladys F.: ref, 550(1) 
Buegel, Hermann F.: other, 714 
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119-20, B63-6, B189-20; ref, 117 
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5-6) 
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Lierle, Dean M.: other, B106 
Lifton, Walter M.: ref, 75(12) 
Likert, Кепѕіѕ : test, 763; f, 46, 88 
Limbert, Paul M.: cross ref, 4 
Lincoln, A. L.: test, 202-3 
Lindahl, Lawrence G.: test, 775 
Linder, Thelma D.: ref, 117(829) 
Lindesmith, Alfred R.: exc, 125 
Lindgren, Henry Clay: exc, B71, 
B326; ref, 742(79), 743(3) 
Lindner, Robert M.: ref, 117(649, 


I 

Lindquist, E. F.: test, 4, 7, 15, 17, 
150, 408, 554, 588, 685; bk, B246- 
7; ref, 15(10), 559(3); f, 805; 
other, 157, 180, 214, 365, 573-4, 
587, 591, 669, nd 

Lindsay, Charles: ref, 77 (149) 

Lindz, Theodore: ref, 117(1162) 

Lindzey, Gardner : ref, 129(49) 

Lindzey, Gardner E.: test, 284, 729; 

ref, 361 (130) 

Line, William: exc, B83, B301, 
„B316; cross ref, 288 

Linn, Louis : ref, 117(729) 

Lins, Leo Joseph: ref, 6(19), 26 
(26), 155 (43), 277 (175, 189, 246), 
299(19), 547(22), 802(28) 

Lippitz, Ivan: test, 55 

Lippman, Hyman S.: exc, 133 

Lipsky, Esther: ref, 136(194) 

Lipton, H. : ref, 134 (61) 

Lipton, Mortimer B. : ref, 117 (1149- 
50), 361 (350) 

E Edith S.: ref, 117(730, 


830) E 
Little, James William: ref, 71 (177) 


Little, Kenneth: ref, 26(26), 277 


(189) 

Little, Kenneth B. : bk, B346; ref, 71 
(219, 281), 113(18), 136200) ; 
other, 71(260), 111(11), 117 
(141), 144(39), 361 (347), Вот 

Liveright, Alice К.: cross ref, 539, 


551 
Livesay, T. M.: ref, 277(164) 
Lloyd, Eunice Ann: test, 663 
Loevinger, Jane: rev, 333 
Logie, lona R.: exc, B362 
Loman, William: test, 829 
Lomax, Paul S.: cross ref, 445-6, 
452-3, 458, 462 
Long, John A.: test, 388, 423, 438 
E ref, 277 (181), 812(4), 
13(5 
Longacre, Andrew: cross ref, 634 
Longstaff, Howard P.: ref, 742 
(97), 747 (213) 
Loomis, Charles P.: bk, B248 
Loos, Frank M.: rev, 61 
Lord, Edith : bk, B249; ref, 117 (831, 


982-3) 

Lord, Frederic: ref, 812(10) 

Lord, Joseph P.: exc, 135 

Lorge, Irving: test, 479, 558; bk, 
B247; ref, 37(32, 39), 317(1), 
358 (468), 558 (1) ; cross ref, 309; 


f, 729 
Lorge, Sarah Wolfson: test, 238 
Lotesta, P.: ref, 117(1150), 361 


(350) 
Lott, Hiram V.: ref, 173(3), 544 


(10) 
[ош om М. : ref, 71 (221, 269), 
95(25 
Lourie, Doris K.: ref, 358(384) 
Louttit, C. M.: rev, 76, 356; exc, 
B121; ref, 356 (18, 23) ; cross ref, 
28, 94; other, B79 
Love, L. L.: ref, 26(15) 
Lovegren, Levi Alton: ref, 304(1) 
Lovell, Constance: bk, B178-80 
Lovell, George D. : ref, 28 (82) 
Lowell, E. L. A.: ref, 136(224) 
Lowenbach, V. J.: other, 117 (1065) 
Lowenfeld, Ln ai test, 115, 
147; ref, 147 (1 
Lowenfeld, Viktor: f, 111 
Lowrey, Lawson G.: other, 117 


(672) 

Lozoff, Milton: ref, 134(2c) 

Lubar, Gerald H.: ref, 117(731) 

Lubin, Ardie: rev, 61, 87, 134; ref, 
134(56, 60) 

Luborsky, Lester B.: test, 61; exc, 
B407 ; ref, 61 (2-3). 136(169) 

Lucas, DeWitt: bk, B250 

Lucas, Winafred: exc, 124 

Ludlow, Herbert Glenn: ref, 581 
(17), 711 (16) 

Ludwig, Ethel: test, 160 

Lueck, William R.: fest, 384 

Lufburrow, N. A.: test, 736, 744 

Vut, С. W.: ref, 358 (366) 

Luker, Albert-G.: ref, 155(68), 277 


(260 
Lum, Vernon K.: ref) 144(40) 
Lumry, Gayle Kelly : bk, B66; ref, 


1 ,6) 
v Robert W.: bk, Bast; ref, 


229(110) 


McElwain 


Lundin, William H.: bk, B252; ref, 
117 (1151) 

Lutz, Rose E.: test, 370 

Lyerly, Samuel B.: bk, Bar 

Lyman, Howard B.: test, 301 


М., А. M.: exc, B220 

M., H.: exc, B236 

M., J.: exc, 231 

McAdoo, Harold L.: bk, B22 
McAllister, Robert J.: ref, 28(102) 
McBrearty, J. F.: ref, 335(49), 303 


(13) 
McBride, John R.: ref, 117 (1046) 
McBroom, Maude: test, 15, 150, 408, 


554, 588 
McCall, Raymond Joseph: ref, 117 


(2152) 
McCall, William А. : test, 205, 421 
McCallister, James M.: cross ref, 


556 
Me Cambell, Robert H.: ref, 117 


(832) 
McCandless, Boyd R. : rev, 358, 363; 
exc, 123; ref, 117(650, 833) 
McCarthy, Dorothea: ref, 292(24, 


3 
McCarthy, Thomas J.: ref, 2851), 


77(150) 
McCarty, Billy S.: ref, 361 (151) 
McCary, roc L.: ref, 129(50, 70 
McCaul, Robert L.: cross ref, 536, 


558, 560 

мё ейапа, David С.: ref, 77 (175, 
212), 136(119, 170, 275) 

Месса bars ais ref, 71 

270), 798(3) ; other, B25, 

McClosky, Herbert: ref, 710289) 

McCloy, C. H.: exc, B 

McComb, H. G.: test, 816 

McConagha, Glenn L.: ref, 26 (30) 

MeConsughy, ммо Wentworth : 

› Wh 

eee R.: f, 6; other, B109 

McCormick, E. J.: ref, 7780) 

McCornack, R. L.: ref, 4 (1) 

syed Josephine; ar 325(2), 

2), 576(2), 577(2 
мі оту, Зана йог: ref, 71 


120) f 
McCrum, Muriel: test, a85 
McCulloch, Thomas L.: ref, 115(9) 
мес Шо Betsey R.: ref, 358 
6, 
м обрӯ, Constance М. ; ad 
ref, 15, 152, 545, 554, 559; /, 
май Milun We ref, 361 


stone, George H.: bk, B280 
McCurdy, Harold Grier: ref, 202 


(48) 
tchen, S. P.: cross ref, 685-6 
MeDerott, Dell Park: ref, 45 ais 


ther, 
uum MC ш Lundy: ref, 28 


‚ 308(43), 544 (22 
IE iun e Regina: ref, 356 


Oats Dorothea B.: ref, 144 
(15) 
Dougal, Russell: test, 592 
Ме Norman C.: ref, AT d 
McEtheny, W. T.: ref, 7 (28 
McElwain, D. W.: rev, 719, 756 


{ 


McElwee . 


McElwee, Edna Willis : ref, 292(10, 


15) 
McFadden, J. H.: ref, 358(367) 
McFarland, Muriel G.: test, 495-6, 


498-501 
McFarland, Robert Leo: ref, 117 


(732) 

Мйне: Ј. W.: ref, 358(433), 
MESI 

Men arguerite Q.: ref, 117 


5 
мВб J.: ref, 117 (1153) 
McGarrett, Vincent: test, 695 
McGary, J. Keith: test, 505 
McGann, WWE ref, 155(45) 
McGehee, William: ref, 155(35) 
McGillicuddy, Kathleen: test, 499 
McGillicuddy, Marjorie: exc, B364 
McGinnies, Elliott: ref, 92(71) 
Morem Joseph D.: ref, 117 


(83 

McGrath, Richard F.: 458 (390) 

McGuire, S. H.: ref, 188(2) 

Machen, Lorelle Horing: ref, 277 
(247) 

Machover, Karen: test, 111; bk, 
112, B253; ref, 111(1, 4-5, 11-2), 
117(733 

McHugh, Gelolo: test, 488; ref, 292 


(41-2) 
ме osh, Douglas M.: bk, B254-5 
McIntyre, G. A.: ref, 272(1) 


McK., R. S.: exc, B323 
McKay, B. Elizabeth : ref, 358 (402) 
Mackay, Donald: bk, B255 
McKee, J. H.: test, 169, 173 
Mackell, James F.: test, 642 
Mackenzie, Gordon N. : rev, 9; cross 
ref, 1,8 
меч, Richard E.: ref, 361 


351 
McKim, Margaret G.: rev, 165, 
543; cross ref, 18, 25, 529, 543, 


5 
мау, М.: other, B143 
MORE J. Charnley : test, 71; ref, 
71(130 
McKee, Florence: test, 492 
McKinney, Fred: exc, B183, B326 
Mackworth, N. H.: bk, B256 
MacLaren, Malcolm: test, 248 
McLauchlin, James Argyle: ref, 277 


149 

MacLean, Malcolm S.: bk, B186 
McLeish, John : rev, 229-30; ref, 229 

(114), 230(5) 
s En ul eng wt (084 

c ‚ Hugh: ref, 117 ) 
McMahan, Herbert С. : ref, 134(53) 
McMorries, James C.: ref, 28(29) 


77 (101) 
McMullen, Harold G.: ref, 606(3) 
McMullen, T. E.: test, 301 
McMurry, Robert N. : test, 318, 772- 
3, 781, 789; exc, B154 
McNamara, Walter у bk, B414; 
ref, 28(36), 70(7-8 
cNeal, Benjamin F.: ref, 361 


(305) 
McNeil, Elton B.: ref, 361(268) 
y Y Vincent Alfred: ref, 77 
253 
Macneil, F. E.: exc, B154 
McNemar, Olga W.: ref, 77 (259) 


McNemar, Quinn: test, 324; bk, 
B257; exc, 352, B183; ref, 361 
(306) ; f, 46, 88; other, 77 (154) 

MacPhail, w H.: ref, 742 


(200) 
MacPhee, Н. М. : ref, 346(33), 361 


(257) . 

Ma rrie, T. W.: test, 759 

MacQuarrie, W. C. H.: test, 633 

ry, John P.: bk, B78 

McQuitty, John V.: rev, 4, 6; exc, 
B272; cross ref, 5, 379, 595, 668 

McQuitty, Louis L.: rev, 63, 279; 
ref, 77 (133, 238) 

McReynolds, Paul: ref, 117(837-8, 
1091, 11 ) 

Macy, M. T.: test, 584 

Madden, Faith: rev, 493 

Maddock, Marie E.: ref, 144(25) 

Madeley, Charlotte Boehncke: ref, 
349 (16) 

a as P. о, Ві pred, 358 

419, 472), 361 (141, 17. 

Magdsick, Winifred E : ] 363 

Mager, B. B.: test, 469 

Magruder, F. A.: test, 698 

Maher, Howard: ref, 326(2) 

Mahin, Albert R.: test, 402 

Mahler, Walter R. : exc, B3or 

Mailloux, Noél: exc, 112 

Mais, Robert Dale: ref, 63(9, 11) 

Maiti, H. P.: exc, B70 

Major, C. L.: ref, 308(30) 

Malcolm, David Donald: ref, 742 
(98-9, 177), 743(5-6, 15), 747 
(214, 253) 


Malk Laura: exc, 120 
Maller, J. B.: cross ref, 84 
мн, Donald Roger: ref, 28(19- 


20 
Mallinger, Betty R.: ref, 117(985) 
AN Helga Tait: ref, 117(611, 


13 
Malloy, M. : ref, 134(56) 
Manahan, Neil: ref, 340(8) 
Mandell, Milton M. : rev, 774, 785 
Mandler, George: ref, 361(352) 
Mann, Arthur: ref, 358(486) 
Mann, Mary: bk, B78 
Mann, William A.: ref, 37 (36-7) 
Manne, Sigmund H. : ref, 117 (1073) 
Manor, Harold Carl: ref, 220(115) 
Manson, Morse P. : test, 30, 68, 467 ; 
ref, 30(1), 37(14, 34), 68(1-4), 
71(178), 282(32) 
Manuel, Herschel T. : rev, 350; test, 
176, 325, 333, 557, 576-7. 632; bk, 
205, В258; exc, B272, B386; ref. 
176(3), 202(11), io as 
(379), 475(6), 557(4), 576(4), 
577 (4) ; cross ref, 19, 737; f, 325; 
other, 7 (1) 
Marcuse, F. L.: ref, 77(176), 281 


Margolet, B. L.: exc, 135 
Margolis, Muriel Franklin: ref, 111 


2 
Maricle, LeClaire Robert: ref, 77 
(108) 


Marie, Sister Rita: test, 506 
Marine, Edith Lucile: ref, 358(361) 
Markin, Karl E.: ref, 361(362) 
Marks, Eli S.: ref, 358(421) 
Marnell, William H.: test, 248 
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Marquart, Dorothy Irene: 
(225) 

Marquis, Donald G.: f, 28 

Marquis, Vivienne: bk, Bir 

Marsh, Charles J.: ref, 28 60) 

Marsh, S.: ref, ои 

Marshall, Helen: ref, 43(3), 77 


ref, 77 


277(249), 547 (43) 
.: lest, 47-8, 726, 
749; ref, 7717 


) 
artin, Jeremy: [i B261 
Martin, Margaret P.: test, 387-8, 


423 
Martin, Rose E.: test, 259 
Martin, W. Edgar : bk, B262 
Martinez-Arango, Carlos: ref, 136 

(114, 139) 

Marum, O.: other, B143 

Marvin, Doris: ref, 28 (82) 

Marx, Barbara: ref, 358(360) 
Maslow, Баш: bk, 125, B263-4; ref, 

117(11 
Massey, Benjamin H.: bk, B104 
Mathematical Association of Amer- 

ica: test, 369 А 
Mathieu-Fortin, Claire: ref, 117 

(839) t 
Matteson, Ross W.: rev, 84, 7: 
Matthews, Jack: ref, 349(21) 
Maucker, J. William: test, 17, 669. 
Maurer, Katharine M.: fest, 351; 

bk, 352, B265; ref, 358 (416) 
Mavis, Frederic T.: test, 811 
Maxfield, Francis N.: cross ref, 299, 


8 
Maxfield, Kathryn E.: exc, Бш 
Maxwell, James: rev, 85, 291; 676, 
B356; cross ref, 380, 713 din 
May, Miriam. See Bryan, Miri 
M 


h , Lewis B.: bk, B272 
меке Martin: ref, 117 (652), 
361(353).. 
M › F. Blair: bk, B266 
Mayo, Thomas B., Jr.: ref, 55(1), 
MD Mary Elizabeth. See Ful- 
ler, Elizabeth Mechem. e 
Meehl, Paul E.: bk, 72; {к E 
B43, Bros; ref, 71 (130, 222 
263-4) ; cross, ref, P^» 
Meeker, Marchia: 5 d (586 ) 
rd: ref, 
Meet, presi Charles: test, 224 
‘oss ref, 222 
Mies, lem D у s ) 
i es: ref, Я 
Meister, Ralph К.: ла, 358(42 
) 


8 be 
м з, Margaret: ref, 70) 
Melberg, Merritt E.: re Ў ise S 
Meldrum, ems B.: oc 

-9, 618, 623 р 
Mele, bouis Kamel: ref, ui (17 
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Mellenbruch, P. L.: test, 301 

Mellone, Margaret A.: fest, 306; 
exc, 169; ref, 306 (1-3) 

Melton, Arthur W.: bk, B2, B267 

Meltzer, H.: cross ref, 62 

Meltzoff, Julian: ref, 117(1143), 
136(221), 138(3) 

Melville, S. D.: test, 579; ref, 747 


(260) 
Mendel, Alfred О. : bk, B268 
Mendola, Vincent S.: ref, 361 (307) 
Mensh, Ivan Norman : rev, 342, 346, 
360, 364; ref, 117 (922, 987-8), 346 
(30), 358(434), 361 (167-8) 
Menzel, Emil W.: ref, 292(20) 
Mercer, Margaret: ref, 117(840, 
989, 1157), 134(31, 57), 136 (225), 
361 (234, 308), 750(46), 763 (60 
Meredith, Laura Marie: ref, 25 (38) 
Meredith, P.: bk, B149 
Merlino, Camillo: test, 249 
Merriam, S. L.: test, 432 
Merrill, Edward H.: test, 695 
Merrill, Maude A.: test, 358; other, 


77 (154) 
Merrill, Ruth Atherton: ref, 623 (1) 
Merritt, Curtis B.: ref, 747 (254) 
Meshke, Edna: test, 494 
Metcalf, Lawrence: ref, 98(1) 
Mettler, Fred A.: ref, 117(1215) 
Meyer, Bill T.: ref, 117(1158) 
Meyer, Charles A.: exc, 786 
Meyer, Herbert H.: bk, B269; ref, 
716 (99) 
Meyer, John H.: cross ref, 238 
Meyer, John K.: ref, 91 (6) 
Meyer, Mortimer M. : exc, 119; ref, 
117(787, 841), 136(171), 361 


235 
Miale, Florence R.: ref, 117 (653) 
Michael, Joseph C.: ref, 147 (2) 
Michael, William B. : rev, 220, 232; 
bk, B181, B270; exc, B247, B390; 
ref, 715(3, 6, 11, 15), 716(76, 
100), 761 (1) 
Michaelis, John U.: ref, 50(2), 62 
(6), 71 (271) 
Michal-Smith, H.: ref, 102(7) 
Michaux, William: ref, 742(100) 
Michea, Claude Angus: ref, 349 


(13) \ 
Micheels, William J.: rev, 503-4; 


, B271 

Michel, Eleanor L. : test, 237 

Michigan State College, Board of 
Examiners : bk, B272 

Middleton, Jean Ellis: fest, 504 

Middleton, Warren C.: ref, 54(1), 


77 (134-5). 
Middleton, William: test, 820 
Miles, Lester F.: test, 780; ref, 780 


(1) 

Miles, Murphy: exc, Br55 

Miles, Ray W.: ref, 742 (101). 
Miles, Walter R.: test, 654 
n e Y bk, B273; ref, 25 

43), 555(3 

Miller, Allen Duane: ref, 742(192) 
Miller, Carroll H.: ref, 77 (102), 


743 (16) 
Miller, Charles: ref, 763 (71) 
Miller, Daniel R. : ref, 117 (990) 
Miller, Ezra A.: test, 616 
Miller, Floyd D.: ref, 46 (19-20) 
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Miller, Harold E.: test, 46m; ref, 46 


10, 2I) 
Miller, James G.: bk, B192; other, 
117 (780), 136(147), 140, B41, 


В35 

Miller, Joseph M.: ref, 285(11) 
ket uera 3 еи 

10), 117 (1117), 361(341), B1 
Miller, Kenneth D.: ref, 489(9) 
Miller, L. W.: test, 167, 621 
Miller, Minnie M.: test, 264 
Miller, Neal E.: bk, B2, B274 
Miller, Richard B.: ref, 725(6) 
Miller, W. S.: test, 304 
Milligan, E. E.: ref, 26(26), 277 


Mils L. Fes ref, 207(2), 307(1), 


314(25 
Mills, Theodore M.: ref, 56(3) 
Milton, E. Ohmer, Jr.: ref, 117 (815, 
842-3, 968, 991 
Mindess, Harvey: ref, 117 (1159) 
Ministry of Education, Great Brit- 
ain: bk, B173 
Minnick, J. H.: cross ref, 422 
Mintzer, A.: ref, 77 (109) 
Mirmow, Esther Lee: ref, 129(57) 
Mitchell, Howard E.: ref, 136(221, 
226, 276), 138(3) 
х Philip Harold: ref, 92 


(82 
Mitteímann, Bela: test, 37; ref, 37 
(1-5, 9, 15-6, 21-2) ; other, B355 
Mochel, Marguerite: ref, 71(179) 
Mock, Sanford J.: ref, 213(4) 
Modell, Arnold Н.: ref, 111(6, 13) 
Modlin, Herbert C.: ref, 37 (26) 
Molino, Henry Samuel : ref, 349 (9) 
Molish, Herman B.: exc, 119; ref, 
117 (906, 1160, 1203) 
Molitch, Matthew: ref, 25 (40) 
Mollenkopf, William G.: rev, 380, 
383, 385; ref, 285(18) ; cross ref, 


713 
Monachesi, Elio D.: ref, 71(223-4, 


265) 
Monroe, acre ref, 361(354) 
Monroe, Marion. See Cox, Marion 
Monroe. 3 
Monroe, Marjorie Bates: ref, 742 


8o 
ТА Ruth L.: ref, 136(172) 
Monroe, Walter S.: ref, 179 (1) 
Mons, NA 3 bk, 126-7, В275-6; ref, 
I 
Nu, M. F. Ashley: ref, 281 


83) 
Ме ошер, Ruth P.: ref, 356(13) 
Montoye, Henry J.: bk, B104 
Moodie, Mary: ref, 759 (47) 
Moodie, William : other, B84 
Mooney, Ross L.: test, 73, 81-2; ref, 


1 
NA James H.: ref, 361(265) 
Moore, Joseph E.: rev, 731, 813; 
test, 750; ref, 750 (1-6) ; cross ref, 
12, 223, 752 
Moore, mond С.: test, 630 
Moore, Roland C.: ref, 37 (27) 


Morgan, 


Murphy 


Moore, Wilbert E.: test, 709 
Moran, Maurice J.: ref, 115(10) 
Moreno, J. L.: bk, B277-9 

Morey, Elwyn A.: ref, 77 (239), 742 


142 
Morgan, Antonia : test, 170, 204, 329, 


776 
Morgan, J. J. B.: ref, 354(1) 
уйан J.: test, 170, 204, 
329, 377, 776 
Morgenthau, Dorothy Ruth: ref, 


350(7) 
Morison, Luella J.: test, 82; ref, 82 


I 

Morris, Ben S.: exc, B407 

Morris, Charles M.: ref, 299(4), 
356(25), 763(59)..... 

Morris, Woodrow Wilbert: ref, 71 
(86), 117(844, 992), 817(0) 

Morrison, J. Cayce: test, 205 

Morrison, Lester M.: ref, 
(1186), 129(72), 136 (287) 

Morrison, Thomas F.: test, 

Morrow, Mary: exc, 12 

Morrow, Robert 5. : br 224(21) 

piod Wr f, 46; other, 71 

220), 136(1 

Morse, seen bk, B280; exc, 
Вўо; cross ref, 4, 6 

Mort, Paul R.: test, 19 

Mortensen, Donald Grant: ref, 747 
(255), 828(6) 

Morton, Mary A.: ref, 71(131) 

Morton, №. W.: rev, 106, 766; cross 
ref, 747 

Morton, В. B.: ref, 130(1) 

Mosak, Harold H.: ref, 117 (1161) 

Moser, W. E.: ref, 26(31-2), 684(4- 


) 
MS Charles I.: bk, B247; exc, 
B22, B383; cross ref, 77, 774: 1, 


117 


Moss, F. A.: test, 89, 806, 829 
Moss, Mary prio ref, 77 (214) 


Mueller, Kate : 

yv eo Frederick Charles: 
ref, 117(993 

Mueller, он J.: test, 38 

Muench, George А.: bk, B192; ref, 


28 (8, 
27 Joseph V.: ref, 26(31-2), 


684 (4-5) 
Muirhead, Peter P.: bk, B384 
Mukerji, D. N.: bk, B281 
Mull, Helen К.: ref, 28(64),77 (145) 


149, 184) 
Mullen, Frances A.: ref, 71(132) 
Muller-Thym, Bernard exe, 


B368 
ев Richard A.: ref, 26(36) 
Muncie, Wendell: exc, B323 
Mundy, John Price: ref, 361 (176) 
Munn, Norman L.: test, 5 
Mos Ruth е ү, 770150), 117 
0, =$, 27 
a о: ref, 278(10) 
Murphy, Charles T.: fest, 250; ref, 
250(2 
Murphy, Gardner: f, 88; other, 
B216, B277 
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Murphy, Harold D.: ef, » UNS mb, Theodore: cross ref, 46, — Ols , Marjorie: 
ME ыл ннн, Ns Olson, Wild cr i) 


7 77 
Murphy, Helen A.: test, 571; ref, Newman, Joseph: rev, 360, 364 other, B3s4 » 299-300; 
571(1) Newmark, Maxim: test, 244 Olstead, Margery: ref, 129(51) 
Murphy, Laura White: ref, 302 Newton, Abba V.: test, 367-8 Omwake, K. T.: test 


(42), 324(55), 750(45), 763(58) ^ Newton, Bernauer W.: ref, 134(10) Onarheim, James. ref, 
Murphy, Maxine M.: ref, 134(10) Newton, Richard L.: ref, 31(44), (2. J Ў 77 (15), 766 
Murphy, Miles : exc, B155, B282 117 (845), 361 (309) O'Neil, W. M.: exc, B283 
Murphy, William F.: exc, 135, B323 — Neyhart, Amos E.: test, 789-91 O'Neill, Desmond: ref, 117 (969) 
Murra, Wilbur F.: cross ref, 685 - Neyhart, Helen L.: test, 789, 791 Ontario Commercial Teachers’ АУ 
Murray, Elsie: rev, 656-7, 659 Nichols, Albert 5. : ref, 802(24) sociation: bk, B13; ref, 4бо($); 
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Spoerl, Dorothy en: exc, 
ref, 28(61), 88(12), 92(64), 136 
(103), 292(30) 


Stacey, Chalmers L.: ref, 136(249), 
216(15), VALEN 491), 361(324, 


(7, 19), 751 (15) 
Stafford, John W.: ref, 10(5), 293 
2 


` Stagner, Ross: ref, 77(80) ~ 


Stalnaker, John. M.: rev, 187; bk, 
B247 ; exc, B362; ref, 285(0, 1), 
621 (16), 638(5), 662(5), 803(5 
817(6, 11) ; cross ref, 216 

Stalnaker, Ruth C. : ref, 285(9), 662 


5 
Stambaugh, Charles J., Jr.: ref, 77 


9; 
сеоба, Margaret J.: ref, 144(26) 
Stick Julian C. : rev, 746, 755; ref, 


92 
Stannard, Cedric: ref, 759(44) 
Stanton, John M.: ref, 361(249) 
Stapleton, Mary R.: ref, 155(58), 
277(219) 


` Star, Shirley A.: bk, B361 ; f, 46, 88 


зы we те 117 (1036), 

1 » 3 325 

Starke, t P.: rev, 369, 388 

Starr, Anna S.: cross ref, 

Staton, Thomas F.: ref, RN à) 

Staudt, Virginia M.: ref, 28(115), 
71(142, 185) 

Stedman, Louise A.: bk, B357 

Stefflre, Buford: ref, 743(4 

Stefic, Edward C.: ref, 361 (326) 

Stein, Harry: ref, 117(1197 

Stein, Harry L.: ref, 282(25) 

Stein, Morris I.: bk, 140, B358; ref, + 
117(878-9, 1198), 131(3), 136 
(147, 179) 

Stein, Seymour P.: test, 283 

Steinberg, Arthur : ref, 117(880) 

Steinberg, David Louis : ref, 28(62) 

Steiner, Matilda E.: test, 117; bk, 
o" B193, B359; ref, 117(757, 
112 

Steinmetz, Harry C.: test, 66 

Steinzor, Bernard: exc, B216, B359 

Steisel, І. : ref, 117(881) 

Steisel, Ira M.: ref, 361(133) 
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23) 
Stephens, Dean H.: ref, 747(206) 
Stephens, J. M.: exc, Вїїї : 
Stephenson, William: rev, 49, 92; 
test; 323; bk, B360; ref, 117 


( 
Sterling, Mac E.: ref, 117(1037, 


1199 
Stern, Karl: ref, 117(613) 
terry, Eliza : test, 632 
Stevason, C. C.: test, 816 
Stevenson, Charles L.: test, 505 


Steward, Verne: test, 828; ref, 828 


(7 
Stewart, Barbara MacMichael : ref, 
117(1038), 742(82, 186) 
Stewart, Frank Walter : ref, 28(92) 
Stewart, Lawrence H.: ref, 747 


Stewart, Naomi: rev, 111, 292; ref, 
387(2), 618(4) 

Stice, G.: test, 87 

Stickland, Charles A.: rev, 334 

ret: ref, 117(1073) 

Stinson, Mildred C.: ref, 742(182) 

Stockford, Lee: f, 785 

Stoddard, A. J.: ref, 802(5) 

Stoddard, George D.: 
621-2, 638-9; те, 


оюк 
I 
зо C. Harold: ref, 466(5), 747 


(267) 

Stone, Calvin P.: test, 364; ref, 281 
(85), 364(4) 

Stone, Clarence R.: ref, 1(1), 543 


12 
Stone, D. R.: ref, 71(251) 
Stone, G. Raymond: exc, B107 
Stone, Gideon Barto, II: ref, 117 
(1039), 136(237) 
o L. Gordon; ref, 73(21), 660 
21 
Stone, L. Joseph: rev, 103; bk, 
Stones EA f, 202(55) 
tonesifer, Fri .: ref, 292(5 
Эюшишу Mary McCauley: "n 77 
III 
Stothers, Carman Е. : test, 336 
Stotsky, Bernard Alexandrevich: 
ref, 117(1200), 361 (364) 
Stott, Leland H.: test, 41 
wu Samuel A.: bk, B361; f, 


40, 
Stout, Marjory: ref, 71(186) 
Stovall, Evelyn L.: test, 190 
Stovall, Franklin L. : ref, 325(1, 7) 
Strang, Ruth: test, 474, 481; bk, 
B362; ref, 474(1), 4811) 
Strange, J. R.: ref, 751(5) 
Stratton, Dorothy C.: test, 474 
Straus, Murray A.: ref, 282(39) 
trawn, Bernice: test, 493 
trayer, Floyd J.: ref, 77(112) 
Strayer, Joseph R.: test, 690 
Strodtbeck, Fred L. : ref, 56(3) 
Stromberg, Eleroy L.: test, 755; ref, 
755(1 
Stromson, Karl E.: ref, 77(153) 
Stromwell, С. Eva Linnia : ref, 117 


(599 
Strong, Edward K., Jr.: test, 747, 
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787; ref, 466(1), 742(1 
(176, 196, 236-7, 268.70). Ab 0 
Strother, Charles R.: rev, 113, 131; 
ref, 358(412) ; cross ref, 42 > 
Strother, George B.: re » 742(84) 
759(50), 763(73) | 
Strou B.: test, 568; exc, Bisr; 
ref, 15(10) ; cross ref, 155, 47° 
Strydom, Nicolaas : bk, B104 
Stucki, Ralph E.: ref, 751 (14) 
Stucky, Philip : test, 397 
Studebaker, M. E.: test, 457 
Student Counseling Bureau, Uni- 
versity of Minnesota: bk, Baor 
Stuit, Dewey B.: rev, 812-3; test 
16, 167, 621, 638, 814; bk, 8363-4: 
exc, B22, B147, B407; ref, 28 
(27), 527 (24), 814(4) ; cross ref, 


702-3 
Stump, N. Franklin: ref, 96(6) 
Stutsman, Rachel. See Ball, Rachel 
Stutsman. 
Stutz, Frederick H. : test, 686-7; bk, 


B272 
Suchman, Edward A.: bk, B361; f, 
6, 88 


40, 
Suci, George: ref, 801(9) 
Sueltz, Ben A.: test, 372 
Suesholtz, Z.: ref, 136(148) 
Suessmilch, Frederick L.: ref, 117 
(1201), 136(293) 
Sukov, May: ref, 28(30) 
Sullivan, Ben A. : ref, 277(221), 810 


(4) 
Sullivan, Elizabeth T. : test, 282, 645 
Sullivan, Ellen B.: ref, 77 (259) 
Sullivan, Helen Blair : test, 562; ref, 
562(1, 3) 
Sullivan, James J.: ref, 783(2) 
Sullivan, John P.: ref, 25(40) 
Summerfield, Arthur: exc, Br 
Sumner, F. C.: ref, 77 (202, 232) 
Sumner, W. L.: rev, 412, 424; 0% 
B 


Sutherland, 


(1040) 
Suttell, Barbara J.: 
145, B303; ref, 1 


I 

gue, Naser n ref, 277 (233) 

618(3). 763 (689) 
Sutton, Dorothy M.: ref, 117 
Sutton, Richard M.: test, 637 
Swaim, Evelyn: fest, 4 ) 
Sward, Keith: ref, 77 (86-7 ) 
Sweeney, J. W.: ref, 812(7. 
Sweeney, Leo: test, 4 @ 
Sweetland. Anders: ref, 71 : 
Swenson, Wendell M. : гё, i 
Swickard, Mary T.: test, 49. 


test, 144; Db 
17 (1002), 144 


m 
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Swineford, B. M.: test, 457 

Swineford, Frances: bk, B370; ref, 
526(7) 

Swisher, Carl B.: test, 703 

Symonds, Percival M.: rev, 116, 
129; test, 79, 132, 232; bk, 133, 
B371; ref, 79(1-2), 113(18), 117 
(758), 132(1-2), 136(149, 181, 
290), 258(1), 535(3) ; cross ref, 


28, 75 

Szollosi, E.: ref, 134(62) 

Szondi, Lipot: test, 134; other, 135, 
Birs5 


T., К. F.: exc, B361 

Taba, Hilda: rev, 474, 663; bk, 
B217, B372; cross ref, 6 

Taber, Howard A.: test, 633 

Tagiuri, Renato: exc, B216; ref, 361 


131) 
Takala, Reino: test, 429 
Talley, Howard V.: exc, B144 


Tamarin, S.: ref, 117(1150), 361 
(350) ? 
Tamminen, A. W.: ref, 280(29), 

361(365) 


Tanzer, Charles: test, 600 

Tanzer, Miriam: ref, 117(1053) 
Tarlau, Milton: ref, 117(1202) 
Tarpley, Elizabeth: test, 494 

Tate, Miriam E.: ref, 349 (23) 
Tatom, Mary H.: ref, 117(884), 361 


(251) |. , 

Taylor, Calvin W. : ref, 714(24, 33); 
817(5, 7). 

Taylor, Colin: bk, B373 

Taylor, Doris: test, 156 

Taylor, Earl A.: ref, 660(18) 

Taylor, Erwin K.: rev, 276, 331; 
exc, B407 

Taylor, С. J.: ref, 361 (157) 

Taylor, Garland Miller: test, 193 

Taylor, Glenn J.: ref, 648 (т) 

Taylor, Harold: test, 658 

Taylor, Howard R.: rev, 714, 741; 
cross ref, 89, 784 

ЖЫШ, мыо; V., Jr.: ref, 28 

114), 129(75 

Taylor, О. M.: ref, 129(75) 

Taylor, Vivian: ref, 117 (1033) 

Taylor, Wallace: test, 685, 687 
cross ref, 681 

Tead, Ordway: exc, B364 

T Florence M.: 
cross ref, 341, 351 

Telford, C. : ref, 2902 (12), 356 


(14, 20) 
Templin, Mildred C.: rev, 353-4 
Terman, Lewis M.: test, 25, 174 
195, 324, 358, 419, 555, 593; ref, 77 
(88, 259) ; other, B83 
Terry, Dorothy: ref, 136 (206, 238) 
Teuber, Hans-Lukas : bk, B192 
Thacker, Ernest W.: bk, B76 
Thaler, M.: ref, 361 (156) 

Tharp, James B. : rev, 260, 266; ec, 
B158; ref, 237 (1) 
Thayer, J. ‘A. = test, 250 
Theaman, Milton: ref, 


361 (327) 
Thelen, Herbert A.: cross ref, 624 
Theron, Р. A.: ref, 28(85) 
$ Ted William N.: ref, 117 (906, 
1203 


rev, 94; 


117 (1041), 
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Thiel, George A.: test, 630 

Thiele, C. L. : cross ref, 2, 405, 411 

Thiele, W. B.: ref, 71 (111) 

Thiesen, J. Warren: ref, 117(1 

Thiesen, Warren G.: ref, 117( j 

ES Don F.: ref, 277 (196), 
278(11 

Thomas, Dorothy M. See Thomas- 
Baines, Dorothy M. 

Thomas, Garth J.: ref, 657 (2) 

Thomas, Richard W.: other, 136 


(196) 

Thomas, William F.: ref, 77(163) 

Thomas-Baines, Dorothy M.: test, 
46g; ref, 46(26, 35) 

Thompson, Albert $.: rev, 825, 828; 
exc, B379 

Thompson, Anton : rev, 15, 199, 205; 
ref, 2(12), 15(12), 18(20), 25 
(53) ; cross ref, 150, 408, 554, 5 

Thompson, Charles E.: test, 138; 
ref, 77 (140, 257) 136(150, 182-3, 
216, 239-40, 204), 138(1-2) 

Thompson, Clare Wright: 
Brot, B282; ref, 358(472) 

pr Claude Edward: ref, 742 

I 
коре» б G.: ref, 28 
116), 117(1094 

Thompson, Grace M.: ref, 71(144- 
5), 117 (660, 759-61, 775, 885, 1042- 
3, 1205) p 

Thompson, Helen: exc, 140 

Thompson, John W., Jr.: ref, 714 


exc, 


(27) 
Thompson, Lola Rivers: fest, 586 
Thompson, Louise M.: exc, B143 
Thompson, Paul O.: ref, 715(13) 
"Thompson, R. B.: ref, 277 (168), 280 
(15), 308(33) 
Thompson, W. R.: test, 177 
Thompson, William R.: test, 800 
Thomson, Edith I. M. : cross ref, 552 
Thomson, Godfrey H.: bk, В374-8, 
B352; ref, 358 (454-5) ; other, 306, 
В18; cross ref, 71 
Thomson, Kenneth F. : ref, 575 (4) 
Thomson, William A. : ref, 277 (1 57) 
Thomure, Eugene: ref, 353(4 
Thorn, Katherine Ё.: ref, 71(236) 
reru e L.: test, 477, 5585 ref, 
477 (1, 4) } f, 477, 729 
Thorndike, Robert L. : rev, 662, 808 ; 
bk, B2, B209, B247, B379; ref, 277 
, 285(10), 358(3 3); 
24; ot 
.: ref, 
117 (1195), 136 (292) 
B323, 


er, 

Thorne, 1( 

107(19), 
Thornton, 


B355 , 

Thorpe, Louis P. : test, 228, 14, 746 

"Thrower, Florence L. : ref, 136 (291) 

Thurman, C. G.: ref, 71417) 

Thurstone, L. L.: test, 93, 319, 326, 
116, 4 ДК 5 ds б 
25), 48(27), 59(27), 93117. 
(702 ,716(70, 83) ; f, 46, 88, B84; 
other, 277-8, B22 f 

Thurstone, Thelma Gwinn: test, 
277-8, 319, 326, 716; bk, B381; ref, 
716(62, 71, 84, 95, 101); other, 


277-8 
Tibbitts, Clark: other, 
Tiedeman, David V.: rev, 


B117 
16, 301 


1 Trigg 


Tiedeman, Herman R.: bk, B382 
Tiegs, Ernest W.: test, 2, 151, 282, 
à , 411, 530, 645; cross ref, 7 
Tiffin, Joseph: test, 731, 762, ` 

816; exc, B121; ref, 751 (6), 762 
тр К ‘gd ( à 

ilton, J. W.: ref, 302 (48), 
Timmons, William M. : Н e 


Tinkelman, Sherman: bk, 4 


Tinker, Miles А, : ео, bahy $54 y 
exc, фб, Bro6, B221, B383, B388; 
ref, a ; cross ref, 500-1 

Tinsley, Willa Vaughn: test, 491, 


502, ` 

Tireman, L. S.: ref, 562(2) 

Tizard, jJ. ref, 31 (as, 5 , , 
НИЕ 


I 

Todd, J. Edward : ref, 658(1) ~ 
Toliver, Hazel M.: cross ref, 251 
Toll, Charles H.: ref, 277 (145), 281 


(77), 324(52) 
Tomkins, Elizabeth J. : bk, 141 
Tomkins, Silvan S.: bk, 141, 
ref, 136(184) 
Tomlin, Frank E.: test, 471 
‘Tomlinson, Helen; test, 586. 
Tomlinson, J. P.: test, 332 
Toops, Herbert A.: test, 308, 763; 
cross ref, duo * 
Tooth, Geoffrey: ref, 361(144) 
Topal, John R.: ref, 55(1), 783(1) 
Topetzes, Nick J.: ref, 


(14) Y 
Tordella, C. L.: ref, 117(1040) 
Torgerson, T. L.: test, 226; cross 
rej, 563 я 
Torrance, Paul: ref, 361(187) 
Torrie, А.: other, Brot 
Tougas, R. V.: ref, 117(914) 
Towne, Laura: test, 238 
Townsend, Agatha: bk, B342; ref, 
18(15), 25(50, 54), 20261), 203 
ane БАТУ 387 (3), 388(3), 423 
1), 531 ( sd» 
Townsend, M. E.: ref, Boat) oh 
Trabue, M. R.: cross ref, 216, N 
Tracht, Vernon S.: ref, 314 (33 
Trafton, Helen: ref, 285(1 
Traster, Stella: ref, 324 (51) 
Traut, Gladys M.: f. (25097) 
Travers, Robert M. W.: rev, 702; 
bk, B117, B386; exc, B22; ref, 155 


re], 


D „в (9) ТЫЙ @ A 


38, 810; other, 79(2 
Travis, Lee E.: test, 142 
Traxler, Arthur E.: rev, 


ФМ 


18), 

бо(5), 583(1), 6o2(1- 

3)» das Q2), eri 271), 787 

q 4-5, 10, 14-5 

21 

Зу, John P.: ref, 797 (2) 

TE Mil ins: ref, 
‚ 810(4 

Д ie test, 600 

Trigg, Charles Wilderman : ref, 324 


(53) 


B385 . 
В385; - 


155(71), 166 


Triggs 


Triggs, Frances Oralind; ref, 277 
Ктр элем 531 Cer 13-5), 
'42(113) ; cross ref, 

Trimble, Harold C.: ref, 606(3) 

Triplett, Bernice: ref, 714(8) 

Trites, David Keightley: ref, 277 


(222) 
Troup, Evelyn: ref, 94(66) 
^ ; Loretta: ref, 34(3), 
742 (154 
Troyer, Maurice E.: test, 468, 534; 
ref, 6(24), 480); cross ref, 10; 
other, 804(1), B42 
Trubin, Beatrice: ref, 94(64) 
Tryon, R. M.: cross ref, 68 
Tryon, Robert C.: cross ref, 716 
Tschechtelin, "3 Ant БУ test, 
795; ref, 795(1, 2 
übbs, William R.: ref, 710158) 
Tucher, John E.: ref, 117 (886, 1044) 
Tucker, Ledyard К.: ref, 237(5), 
(3), 250(1), 2; (2) 
Tuckman, Jacob: ref, 28(77), 460 


Тойда, Rea D.: ref, 280(20), 
15(10) ; f, 
T win N: ref, 559(4), 660 


24 
Tunis, Martin M.: ref, 117(797, 
1206 
Tu о Ernest C.: bk, B391; ref, 47 
I 
Turnbull, Mary E.: rev, 236; ref, 
26(8) 


5 

Turnbull, William W.: rev, 531, 
550; ref, 526(2-3), 633(3) ; cross 
rej 


› 309 
Turner, Clarence Е.: rev, 234, 240; 
cross ref, 238 
Turner, David: bk, B392 
Turney, Austin H.: cross ref, 28, 


278, 302, 481 
Тиге, Paul L.: test, 460; ref, 460 
I 
Tussing, Lyle: rej, 28(56), 46(41), 
77 (159), ges 
Tuthill, C. E.: ref, 747(203) 
Tuttle, Florence Piper: bk, B393 


Tuttle, Stephen D.: test, 227 
Twichell, Constance M.: ref, 825 


(18 

Twitchell-Allen, Doris: test, 143; 
ref, 143(1 

Tyler, Alice Felt: test, боо 

Tyler, Fred T.: ref, 17(3), 50(2), 
62(3, 6), 71(271, 277), 277 (251), 
300(1) ; cross ref, 204 

Tyler, Leona E.: rev, 45, 77; exc, 
Br43: ref, 716(102) 

Tyler, Priscilla: test, 178 

Tyler, Ralph W.: test, 45, 58, 60, 
183, 187-8, 484. 581-2, 585, 504, 
606, ; bk, B247; ref, asta), 
бо(т), 183(1), 188(1), 484(1), 
581 (14), 582(5), 585(1), 504(3), 

Gor; ether, 45 (5), ЭЁ СЕ), 188025 
1; other, 45(5), 58(5), 188(2), 

606 (3) 


UHRBROCK, Richard Stephen: 
ref, 16(11). 77(244). 02(65), 166 
(13), 167(20). 308(45). 422(1), 
427 (5). 621 (18), 622 (15), 638 (6). 
630 (2), 718(7), 747 (182), 813(7) 


Ulett, George A.: bk, 128, B394; 
ize m zeh, 317 (1045-6) 
. L.: test, 197 
pee John P.: ref, 346(34), 
I 
UA m B.: exc, B 
United Business Education Associa- 
tion: test, 445-6, 452-3, 458, 462, 


722 
United States Armed Forces In- 
stitute: test, 26 - 

United States Civil Service Com- 
mission: bk, B395-9 к 
United States Employment Service: 
test, 714, 717, (a ref, 714(5) 
United States Military Academy, 

Department of Tactics: bk, B400 
University of Minnesota, Student 
Counseling Bureau: bk, B401 
University of Toronto, Ontario Col- 
lege of Education, Department of 
Educational Research: test, 294, 
405; bk, B13; other, 294(2) 
University Testing and Service 
Bureau, Cornell University: bk, 


Broo 
Upshall, C. C.: cross ref, 446, 453 


V., M. D.: exc, B186 

Vail, Curtis C. D.: cross ref, 245 

Vail, James P.: ref, (115) 

Valentine, C. W. : test, 343; bk, 344- 
5, B402-3; exc, B378, B407; ref, 


Valentine, W. 


(780 
Van Alen, Rita: bk, B404 
Van Allyn, Keith: test, 771 
Van den Akker, J. A.: ref, 658(1) 
Van der Kolk, M. J.: ref, 356(29) 
Van Der Lugt, M. J. A.: test, 653; 
ref, 653(1) 
van der Merwe, A. B.: ref, 28(85) 
укку, Marguerite G.: ref, 136 
255 
Van Dyke, L. A.: test, 437 
Van Engen, Harry: rev, 409 
van Lennep, D. J.: test, 105; ref, 105 


3 

Van Nostrand, Hannah: test, 193 
Van Nostrant, Virginia: test, 493 
Van Ormer, Edward B.: bk, B405 
Van Orsdall, Otie P. : test, 397 
Van Peursem, L. : test, 608-9 
Van змена: rev, 48-9 
LN хора, мт R.: bk, B312 

an Wagenen, M. J.: test, 351 
ш унн», Ralph O.: ref, 117 


7 
Van Жез, Raymond: ref, 783(3) 
Varnado, Gladys R.: ve]; 17304). 


544(12) 
Varva, Frank: ref, 71(188) 
Vaughn, Kenneth W.: test, 17, 214, 
E34 587; bk, B1oo, B247; exc, 
183; ref, 527(9, 12, 14-8), 718 


7(1), 527(18), 747(197), 787(1), 

van oe Wait A.: ref ) 
erburg, allace A.: ref, 

Vernallis, Francis Fee Ц 


(36) 
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Vidor, Martha: ref, 2 (2/8) 
Vieg, pono A.: test, 703 
Vinacke, W. Edgar: ref, 61 (5) 
Vinette, Roland: cross ref, 238 
Vinton, Kenneth W.: ref, 618(1) 
Virene, Edgar Paul: ref, 285(5 
Virtue, Charles F. Sawhill: ref, 45 
(9) ; other, 45(5) 
Vistica, Nicholas J.: ref, 361(252) 
Volberding, Eleanor: exc, B425 
Vonachen, Harold A.: ref, 37(5) 
Von Lackum, William J.: ref, 117 


(932) 

Vordenberg, Wesley : ref, 155(74), 
738(1) |. 

Votaw, David F.: test, 12, 632; ref, 
586 (1) 


ref, 816 
Wed, John Vergil : ref, 277(148) 
Wakelam, B. B.: ref, 343(1 
Walch, A. E.: ref, 71(90) 
Walcott, William O.: ref, 136(185) 
Waldfogel, ee : exc, 112, B283; 
, 361 (366 
ма ref, 117 (1209) 
Walker, David ve bk, e 
Walker, George A.: ref, 
Walker, Helen M.: bk, B408; cross 
ref, 387 
wake K. F.: ref, 763(72) ) 
Walker, Robert es аа 19), 
17 (1048, 1210), 12 
ор, O.: ref, 742(184) 
Walker, Virgil R.: test, 43 
Wall, W. D.: rev, 314 397 
Wallace, F. C.: fest, ) 
Wallace, J. G.: ref, 314(45 76, 189; 
Wallace, S. Rains, Jr. : rev, 7 iem E ^ 
ref, 48 (13), 742 (155), 825 (10 
8 (72 
Wallace, Wimburn L.: ref, 155 
83), 277 (223, 257, 95 6 E 
(m ИБИ Oa 
sites, Richard: ref, 117 (764) 194 
WO, J. E. Wallace: ref, 3580355) 


Walsh, Donald D.: test, 2595 rej, 
wees, Barrier M.: fth 7 


wales, Annette: ref, 45(2, 10); 
waiter, £5. test, 424 
Walton, Willi 

dt, Edwin: rev, 792, 79, 
entm, Morey J.: rev 817 


am E: ref, 7000 ) 
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Williams 


War Manpower Commission, Di- 
vision of Occupational Analysis : 
ref, 714(1) i fito 741 

Warburton, Е. W.: rev, 290, 779; 
exc, B406 

Ward, Leo C.: other, 136(198) 

Ward, Leo R.: test, 506 

Ward, Lewis B.: bk, B2; ref, 77 
(160), 787 (12) 

Ward, William D.: ref, 803(13) 

Ware, Caroline F.: ref, 662(1-3) 

Warfel, Harry R.: test, 186 

Warnecke, Warren T.: test, 617 

Warner, Kenneth O.: other, B148, 
B338 

Warner, Nathaniel : ref, 37 (6-7) 

Warner, Samuel J.: ref, 117 (1211), 
361 (328) 

Warner, W. Lloyd: bk, B409 

Warren, Elmer Chapman: ref, 277 


(150) 
Washburne, Annette C.: ref, 117 


(609) 
Washburne, John N.: test, 95 
Wasson, Margaret M. : ref, 96(7) 
Waterhouse, I. K.: exc, I 
Waters, Aaron C.: test, 630 
Watkins, John G.: exc, 112 
Watkins, Ralph K.: cross ref, 634 
Watkins, Richard Walker : ref, 742 


(185) 
Watrous, Blanche G.: ref, Meri 
Watson, Donald T. : ref, 117 (1049 
Watson, G. E.: ref, 714(8) 
Watson, Goodwin : cross ref, 38-9 
Watson, Neal: ref, 359 (4) 
Watson Robek b bt, Buong 
28(57), 37 (18), 69(5), 776101); 
( ) 6 50296) 


ref, 361 

Watson, Walter: exc, B364 n 

Watts, A. F.: test, 215, 537, 7545 
bk, B411; ref, 215(1) 

Watzinger, Arthur J.: test, 244 

Wauck, LeRoy A.: ref, 71 (237) 

Weathers, Edra: test, 38 

Weaver, Herbert B.: ref, 350 (4) 

Weaver, J. F.: ref, 634(2) 

Кее, Lillas Jean: ref, 178(2), 
526(1 

Webb, Sam C.: ref, 6(26), 155(84) 

Webb, Wilse B.: exc, B301; ref, 361 
(253, 313, 368) 

Weber, Christian O.: ref, 237(2), 
277 (169), 285(7), 617(4), 633 
(2), 662(6) 

Weber, Ernst: test, 811 

Webster, Edward C.: ref, 304 (16), 
742 (206), 747 (273) 

Webster, Harold D., Jr.: ref, 136 


(297) 
Мет Florence O.: ref, 37 
(31), 120(77) f 
Wechsler, David: test, 37; 361, 363- 
41 ref, 37 (1-4, 9, 16-7, 21-2), 361 
(369), 363 (8 20) ; cross ref, 287, 
303, 314; f, 363 
ede 8 ) UE exc, B282, B383 
Weeks, Edward J.: bk, B1 
Wegman, Margaret: ref, ye (458) 
Weick, D. V.: ref, 71 (157 


Weidemann, Charles C.: cross ref, 
e 427 itg 
eider, ur: test, 37; exc, B. 
Bag, Brsg; ref, 37 (19,89, 16:7, 
ros 358(493), 361(123), 303 


Weider, Norma: ref, 37 (5) 

Weinstein, Benjamin: ref, 358( 3) 

Weinstock, Harry I.: ref, 37(1 ) 

Weintraub, Ruth G.: test, 703; ref, 
277(217) +; 

Weir, A. J.: ref, 117(1155) 

Weisgerber, Charles A.: ref, 71 


(279) 

Wer Mary H.: bk, B149, 
412-3 

Weisman, Leah L.: ref, 581(15), 


3 

Weisskopf, Edith A.: ref, 136(243- 
4, 298 

Weitz, Henry : rev, 531 

Weitzman, Ellis: bk, B414 

Welborn, Ernest L.: ref, 277(176) 

Welch, Livingston: ref, 117(696) 

Welford, A. T.: rev, 757, 769 

Wellisch, Е.: ref, 117 (888-91, 1050) 

Wellman, Beth L.: cross ref, 351 

Wells, Charles A.: bk, B415 

Wells, Enid: b&, B416 

Wells, F. L.: test, 305, 315-6; exc, 
B301 ; ref, 316(1) ; cross ref, 716, 
358, 3615 f, 31 

Wells, John: bk, B416 

Wells, R. G.: ref, 762(6) 

Wells, Stephen: ref, 134(36) 

Welsh, George S.: ref, 71 (146, 189, 
280), 361(254) ; f, 28 

Wentworth, Helen C.: ref, 224 Go) 

Wentworth, Mary M. See Mc- 
Conaughy, Mary Wentworth. 

Wentworth-Rohr, Ivan: ref, 117 
(1051), 136(245), 361 (329) 

Weppner, Mary Theodore : bk, B417 


"Werner, Francis J.: test, 276 


Werner, Simon: ref, 71(119) 
Wertham, Frederic: ref, 


12) 
Weschler, Irving R.: ref, 48( 19) 
Wesley, Edgar B.: rev, 600, 8; 
test, 8; cross ref, 7, 063, 684; f, 


688 
Wesley, S. M.: ref, 31(37), 117 
(787), 742 (82, 186) 
Wesman, Alexander G.: rev, 814- 
5; test, 331, 7115 ref, 18( y, 358 
(457), 361 (209, 255), ae ,71 
(4, 10-1, 20, 22), 710 85) 
West, Edith: f, oe 
West, Elmer D.: bk, B149 
West, Robert: other, B106 
Westby, George: rev, 207, 313-4 
Westover, Frederick L.: rev, 557 
Westover, Henry Tudor: ref, 26 


(27) 
Westwood, Dale: ref, 714(18) 
Weite um C.: test, 489; ref, 
489(1-2, 4, 7 
Wexler, Murray : d 117(966) 
мета ду le M.: ref, 111(1), 
11 
Wheat, Parr Grove: cross ref, 2, 
18, 411, 410, 543 
Wheatley, Luis Andrés; ref, 77 
(202) 


115(2, 


Wheeler, Erma Т, : ref, 71(91), 11 
(661), 136(117), 361 hos 4 
Wheeler, oun L, Jr.: ref, 301(370) 
Wheeler, Lester R.: ref, 277 (224-5, 


275) 
Wheeler, Norman C.: ref, 71(80 
Wheeler, Viola D.: Mi nu. 


275) 
Wheeler, William Marshall : ref, 71 
(219, 238, 281), 117 (787, 892-3, 
ics 61 o) M S doa) 
erry, Robert J.: ref, 71(204 
Whisler, Laurence D. : test, 46d; ref, 


6 (40, 42 
Whistler ЕЙ ы S.: test, 228 
White, Dorrance Stinchfield : test, 


258 
White Howard R.: rev, 144 
White, M. J.: exc Br 
White, M. S.: re}, 661(10) 
White, Mabel Frances Rodger: ref, 


202 (44) х 
White, Mary Alice: ref, 117 (1053), 


136 (246) 
White, Ralph K. : bk, B418; ref, 113 
(18), 136(290) 
Whiteman, Mordecai: ref, 361 (331) 
Whiteman, Paul H.: ref, 134(37, 


wWidlock John В., Jr. : ref, 77 (245), 


Woe 
hitmer, Carroll A.: cross ref, 335 
Whitmyre, John Warren: ref, 134 


(38 

Mere Delos D.: exc, B226; test, 
5 

Wickersham, Francis Myron: ref, 
37(38) К 

Wickert, Frederic R.: bk, B2, B175, 


B419 
Wiegel, Donald F.: ref 26(17) 
Wiener, Daniel N.: ref, 71 (147-50, 
239, 282), 742(207) ; other, 361 
(180), B305 
Wiggins, Emma: test, 401 
Wightwick, Beatrice : ref, 763(87) 
Wikler, Abraham: ref, £17 (768) 
Wilcox, George M.: ref, 26 ) 
Wilcox, Katherine W.: cross ref, 


107 
Wilcox, Paul H.: test, 44; ref, 44 


282(34), 361 


: 0.010909), n7 
r (271) 


pga: WA "are ; cross ref, 


Willhauk, Ralph C.: ref (4) 
Williams, Clarence i d ү 
Williams, Edna К. : test, 

Williams, H. M. : ref, 358 (458) 
Williams, Harold M.: ref, 28017, 


witless Herbert Howard: ref, 129 


УШ J. Harold: ref, 292 (21) 


Williams 
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Williams, Marion L.: ref, 292(5) 
Williams, Meyer: ref, 117(1087), 
346(30), 358(434), 361 (167-8) 
Williams, Roger K.: ref, 77(203) 
Williams, Ruth Margaret: ref, 349 


(12) 
Williams, sed B.: exc, 141, B282 
Williamson, E. G.: test, 70; ref, 28 
(30), 70(4-5) ; other, B109, B364, 


Williamson, W. G.: test, 401 
Willing, M. H.: ref, 179(1) 
Willis, Margaret: cross ref, 684 
Willis, Mary : test, 155, 184, 213, 547 
Willoughby, Raymond R.: test, z 3 
ref, 96(1, 2) 
Wilmore, Waldo W.: ref, 569(2) 
Wilson, Claude E. : ref, 28 (106), 136 


(186) 
Wilson, Donald Murray: ref, 117 
(1054), 136(247), 142(2) 
Wilson, Earl: ref, 742(114) 
Wilson, Glen P., b ref, 71(194), 
117(910, 1055-6 
Wilson, Guy 


Wilton, Howard: test, 807 

Wilson, James T.: ref, 322(2), 380 
(3), 713(3) 

Wilson, Logan: test, 709 

Wilson, Margaret m ref, 94(65), 


358(392) 
Wilson, Maye Alexander: test, 193 
Wilson, N. A. B.: exc, B191, B363 
Wilson, Robert G. : ref, 114(34) 
Winans, J. Merritt: ref, 282(35) 
Winberg, Wilma C.: ref, 71 (92) 
Winchester, T. H. : ref, 136(151) 
lys cad Clarence L.: ref, 71(82, 


126) 
Winetrout, Kenneth: ref, 802(20) 
Wing, Cecilia: test, 230 
Wing, Herbert D.: rev, 226, 229; 
test, 230; bk, 231, B420; ref, 230 


(1-4, 

Wingfield, Robert C.: ref, 77(233) 

Wink, Willmer A.: ref, 658(1) 

Winn, Alexander : ref, 304(16), 742 
(206), 747 (273) 

Winn, James Charles : ref, 742(156) 

Winne, John F.: ref, 71 (240, 283), 
134 (61) 

Winters, С. L. Jr.: exc, B108 

Wipf, Adeline: test, 539 

Wischner, George J.: ref, 117(662), 
130(4), 144(12) 

Wisham, Wayne: ref, 117(1137, 


1142) 

Wishner, Julius : ref, 117 (766, 1057, 
1212), 361(188 

Wisler, Maxine L.: ref, 335(36), 
358(374) 

Withall, John: exc, 53 

Wittenborn, J. R.: rev, 61, 87; exc, 
B368; ref, 16(8), 115 (8947, 1058- 
60, 1213-5), 136(187, 248, 299), 
166 (11), 167 (17-8),361(256,371), 
763(68-9) ; cross ref, 117, 136 

Wittman, Mary Phyllis: ref, 28(62, 


71) 
Wittson, C. L.: ref, 346 (a7) 
Witty, Paul A. : cross ref, 2, 530, 566 
Wold, Jane A.: ref, 51 (16) 
Wolf, Elizabeth B.: ref, 117(625, 
1096) 


.: cross ref, 18, 416, 


Wolf, Herbert S. : bk, B104 

Wolf, Jacob G.: bk, B104 

Wolff, Charlotte: bk, B421 

үөн, Чеш G.: test, 37, 479 У ref, 
37 (1-5, 9, 10, 19, 21-2, 32, 39 

Woltt, W. M.: ref, 742(208), 766 


(47) 

Wolff, Werner : test, 104; bk, B422- 
3; exc, ; ref, 104(1-2) ; 
other, 92(76), 136(230) 

Wolfle, Dael: rev, 305, 315-6 

Wolfson, Rose: bk, В 

Wollach, Lorraine: "d. 06 (15) 

Woltmann, Adolf G.: exc, 103, 130, 
d B49 aya, B171, B323, B332; 
ref, 144(3 

Wonderlic, E F.: cross ref, 730 

Wood, Arthur L.: ref, 28(89) 

Wood, Ben D.: test, 386, 422, 787; 
bk, B425; ref, 787 (5), 802(6, 21, 
26) ; other, 2 n И), 91(5), 
155(82), 203(1), 547(36), 711 
(19), 742(197), 743(20), B126-34 

Wood, E. К. : test, 149, 281 

Wood, Hugh B.: test, 584, 631; 
cross ref, 2, 18, 20, 151, 411, 416, 


530, 543 м 

Wood, La Duskie: test, 193 

Wood, Ray G.: cross ref, 25, 555, 667 

Wood, Wendell F.: ref, 229 (108) 

cse Marjorie J.: ref, 714 
19 

Woodrow, Herbert: ref, 18(11) 

Woodruff, Asahel D.: ref, 308 


(42) 
Woodruff, Louise: ref, 28(64) 
Woods, Roy C. : ref, 277 (276) 
Woods, Velma E.: ref, 562(2) 
Woods, Walter A.: ref, 742(115) 
Woodward, L. С. : test, 89 
Woodworth, G. W.: test, 227 
Woody, Clifford: test, 421, 551 
Woodyard, Ella: f, 477 
Mirari School, Austin, Texas: 


Woolf, Maurice D.: ref, 28(63) 

Worcester, D. А. : rev, 799-800; exc, 
357, B71; ref, 317(2) ; cross ref, 
1б, 543 


Worthington, Myrtle Raymaker : 
ref, 356(11) 
Wosika, Paul H. : ref, 117(791), 361 


(204) 
Wren, Harold A.: ref, 812(8) 
Wrenn, C. Gilbert: exc, B363; ref, 
277 (227), иу, 308(46), 358 


(459), 747(202), 763(76) ; cross 
ref, 55 


9 
Wright, Annette М. : test, 631 
Wright, Beatrice А. : bk, 122, B193; 
ref, 117 (1125) 
Wright, Clare: ref, 356(37), 358 


419) А 

Wright, David G.: other, 144(41), 
145, B303 

be H. F.: ref, 346(33), 361 


257 

Wright, M. Erik: bk, 122, B193; 
rcf, 117(1125) 

Wright, S. C.: ref, 134(31), 136 
(225), 361(308) 

Wrightstone, J. Wayne: test, 67, 
274-5, 761; bk, B291; exc, B296, 
B426; ref, 358(485), 363(12); 


cross ref, 581, 588 


Wrinkle, William L.: bk, B, 
Wrinn, Mary J. J.: test, od 
Wu, T. M.: rej, 358(366) 

Wyatt, Frederick : exc, 141; ref, 136 


W 452) С. S.: 1 
ykott, С. S.: test, 1 ; 
f j 69, 173; ref, 


3(7 
Wyle, Herbert: test, 829 
Wyndham, A. J.: exc, B239 


ur M.: re B143 
a, Mariano: ref, 324(61), 
Yeomans, Alice V.: PA aM 
Yepsen, Lloyd N.: ref, 292(3-4) 
Yerbury, Edgar C.: ref, 117(1216) 
Yocom, Rachael Dundven : bk, B238 
Yoder, Dale: rev, 785 
York, Richard H. : ref, M fetta 
Youmans, John B.: ré, 71(80) 
Young, Florence M.: ref, 363(22) 
Young, Kathleen M.: bk, B427; ref, 
117(768) ; other, 136(127) 
Young, Nellie M.: ref, 117(1186), 
129(72), 136(287) 
ieee Reginald J.: ref, 117(767, 


1061 

Young, Richard H.: ref, 817(3) 
Young, Robert V.: test, 1 

Yu, Poe Eeng: ref, 77(82) 
Yule, D. L. G.: ref, 294(3) 


ZABIN, Dorothy H.: ref, 358(370) 

Zaccaria, Michael A.: ref, 325(5), 
659(2) 

зро, Henry Joseph: ref, 304 


гы, F. C.: ref, 28(107), 38(4), 


246), 95(24), 759(53) 
ud iliam R.: ref, 77 (170, 


193) 
Zander, Alvin: exc, 57 7 
Zangwill, О. L.: ref, 117 (1153); 
cross ref, 35, 51 


‘Zeaman, Jean B.: ref, 361 (237) 


Zeckel, A.: ref, 356(29) 
Zehrer, Deu А.: 
ref, 117(121 
A riae L.: ref, 117 (1062) 
Zeligs, Rose: ref, 88(4-6, 10) 
Zieber, Naomi W. : test, 259 ; 
Ziegfeld, Edwin: rev, 220, 222-3; 
, 221, 224 
pascal Frederick T.: ref, 117 
121 
7 САИ John G.: test, 5, 595 
Zimmerman, Wayne S.: Rs 
715, 739; ref, 715 (2, 4 7, п, 15), 
716(100), 739(2 yer 4) 
Zintz, Е. Raymond: MX 3(4 
Zirkle, Conway: test, 
Zook, George F.: f, б; other, 26 
8), Br22 i 
zin. Joseph: bk, B192, B198 


3 + ref, 77 
B427-0; exc, B315, B422; ref, 
(197), 1 17 (768, 1063) ; other, 136 
127, 20 | 
e Tue: ref, 117 663, 760 
nee Зийа B.: ref, 17 
; етс, B323 
24079), Hans: test, 1175 feb 17 
1065- 
тйл, Virginia T.: ref, i (60), 
277 (228), 742 (157). 745 
Zyve, D. L.: test, 813 


exc, B359; 
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This index serves as an expanded table of contents for the section “Tests and Reviews” by pre- 

senting a complete list of the tests and reviewers, Numbers preceding titles indicate the entry 

numbers to be consulted for full bibliographic entries, test references, and reviews, Stars indicate 

new tests not previously listed in this series; asterisks indicate tests which have been revised in 
some way or have been issued in new form since they were last listed. 


ACHIEVEMENT BATTERIES 


1. *American School Achievement Test. Review by 
Ralph C. Preston. 

2. *California Achievement Tests. Reviews by War- 
ren G. Findley, Alvin W. Schindler, and J. Harlan 
Shores. 

3. XContemporary A ffairs: Every Pupil Test. 

4. *Cooperative Contemporary Affairs Test for Col- 
lege Students. Reviews by John V. McQuitty and Ben- 
jamin S. Bloom. А 

5. *Cooperative General Achievement Tests, Revised 
Series. Review by Paul L. Dressel. 

6. *Cooperative General Culture 
John V. McQuitty. ў 
х fi *Cooperative Test on Recent Social and Scien- 
tific Developments. Review by Roger Т. Lennon. 

8. *Coordinated Scales of Attainment, Review by 
Alvin W. Schindler. 

о. XEssential High School Content Battery. Reviews 
by Herbert S. Conrad, J. Thomas Hastings, and A. 
Harry Passow and Gordon N. MacKenzie. 

10. *Graduate Record Examinations Profile Tests. 
Review by Max D. Engelhart. 

тї. *Graduate Record Examinations Tests of Gen- 
eral Education. 

12. *The Gray-Votaw-Rogers General. Achievement 
Tests. Review by Oliver F. Anderhalter. 

13. kThe Harlow Achievement Tests for Texas. 

14. XThe Harlow Battery Achievement Test. 

15. *Iowa Every-Pupil Tests of Basic Skills. Re- 
views by Miriam M. Bryan and Anton Thompson. 

16. Iowa High School Content Examination, 1943 
Edition. Review by, David V. Tiedeman. 

17. *The Iowa Tests of Educational Development. 
Review by Eric F. Gardner. { 

18. *Metropolitan Achievement Tests. Review by 


Warren С. Findley. 


Test. Review by 


‚19. *Modern School Achievement Tests: Skills Edi- 
tion. Review by Charles R. Langmuir. 

20. *Municipal Botey: ational Achievement 
Tests. Review by Ral . Preston. 

21. *Ohio Eighth Grade Test. 

22. *Ohio General Scholarship Test for High School 


Seniors. 

23. Progressive Tests in Social and Related Sci- 
ences. Reviews by Harry D. Berg and J, Raymond 
Gerberich. 

24. KSeven Plus Assessment: Northumberland Se- 
ries. Review by Stanley D. Nisbet. , 

25. Stanford A chievement Test. Reviews by Paul R. 
Hanna and Claude E. Norcross, and Virgil E. Herrick. 

26. Tests of General Educational Development. Re- 


view by Gustav J. Froehlich. 


CHARACTER AND PERSONALITY 
NONPROJECTIVE 


27. ж Activity Vector Analysis. ^ 
28. The Adjustment Inventory. Reviews by Nelson 
G. Hanawalt and Theodore R. Sarbin. | 

29. kAffectivity Interview Blank. Reviews by Mor- 
ris Krugman and Verner M. Sims. 

зо. XThe Alcadd Test. Reviews by Charles H. Hon- 
zik and Albert L. Hunsicker. муку n 

31. Babcock Test of Mental Deterioration. Reviews 
by D. Russell Davis and Seymour G.K ebanoff, 

32. *[The Baxter Group Test of Child Feeling.] 

33. ЖСііепі Centered Counseling Progress Record- 
ing Chart. 1 А 

*The College Inventory of Academic Adjust- 

un Reviews by Lysle W. Croft and Harrison G. 


Gough. Ў 
as Concept Formation Test. Review by Kate Levine 

illiam S. 

36. *Contemporary Problems. 
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37. *Cornell Index. Reviews by Hans J. Eysenck, 73. *Mooney Problem Check List: s Revision, 


Nelson G. Hanawalt, and Laurance F. Shaffer. 

38. *The Cowan Adolescent Adjustment Analyzer: 
An Instrument of Clinical Psychology. 

39. C-R Opinionaire. Review by George W. Hart- 
mann. 

40. *The Empathy Test. 

4I. Every-Day Life: A Scale for the Measure of 
Тин Varieties of Self-Reliance. Review by Harold E. 

ones, 

42. Examining for Aphasia and Related Disturb- 
ances. Review by D. Russell Davis; and excerpts from 
two reviews, 

43. *Fels Parent Behavior Rating Scales. Review by 

ale B. Harris. 

44. *Gardner Behavior Chart. 

45. *General Goals of Life Inventory: General Edu- 
О, Series. Reviews by C. Robert Pace and Leona 

. Tyler. 

46. [Generalized Attitude Scales.] Reviews by Ken- 
neth E. Clark and Donald T. Campbell. 

47. *The Guilford-Martin Inventor ys Factors 
GAMIN, Abridged Edition, Review by Hubert E. 


Brogden. 

4 The Guilford-Martin Personnel Inventory, Re- 
view by Neil Van Steenberg. 

49. *The Guilford-Zimmerman Temperament Sur- 
vey. Reviews by William Stephenson and Neil Van 
Steenberg ; and an excerpt from а review. 

50. *Heston Personal Adjustment Inventory. Re- 
views by Albert Ellis, Hans J. Eysenck, and E. Lowell 
Kelly; and an excerpt from a review. 

51. Hunt-Minnesota Test for Organic Brain Dam- 
age. Review by Seymour G. Klebanoft. 

52-3. *[/llinois Opinion Inventories.] Review by 
Kenneth E. Clark. 

54. *Industrial Subjective Fatigue and Euphoria 
Scales: Present Feelings, 

55. *Industrial Subjective Fatigue Scale: Retro- 
spective Work Curve Feelings: National Research 
Program on Employee Feelings at Work. 

257. XInteraction Process Analysis. Review by 
Launor F. Carter. 

58. *Interest Index: General Education Series. 

59. An Inventory of Factors STDCR. Review by 
Hubert E. Brogden. 

60. *Inventory of Personal-Social Relationships: 
General Education Series. Reviews by N. L. Gage and 
Theodore А. Sarbin. 

бт. ЖТле IPAT Humor Test of Personality. Re- 
views by W. Grant Dahlstrom, Ardie Lubin and Frank 
M. Loos, J. R. Wittenborn. 

62. Johnson Temperament Analysis. Review by Al- 
bert Ellis. 

63. *Jurgensen Classification Inventory. Reviews by 
Robert G. Demaree and Louis L. McQuitty, and Wil- 
liam J. E. Crissy. 

oe *KD Proneness Scale and Check List. Reviews 
by Douglas Courtney and Dale B. Harris. 

*Kuder Preference Record —Personal. Excerpt 
from a review, T 

66. Lewerenz-Steinmets Orientation Test: Concern- 
ing Fundamental Aims of Education. Reviews by Fred- 
eric L, Ayer and Roger T. Lennon. 

67. *The Life Adjustment Inventory. Reviews by 
Helen Shacter and John W. M. Rothney. 

68. *The Manson Evaluation. Reviews by Charles 
H. Honzik and Albert L. Hunsicker. y 

69. *Memory-for-Designs Test. 

70. The Minnesota Inventory of Social Attitudes, 
Review от Тетик M. Sims. 

71-2. * finnesota _ Multiphasic Personality Inven- 
tory, Revised Edition. Review by Arthur L. Ben- 
ton. i 


Reviews by Harold E. Jones and Morris 
74. kNorthampton Activity Rating Scale, 
75. Personal Audit. Review by William Seem 
76. *ТАе Personal Preference Inventory: Student 
Form. Reviews by E. Lowell Kelly and С. M. Louttit, 
77. The Personality Inventory. Review by Leona E, 


yler. 
E *Personality Record. Review by Verner М 
ims E ; 


79. kThe Personality Survey, Reviews 
Courtney and John W. M. R ney. by Douglas 

80. *Pre-Counseling Inventory. Review by Charles 
H. Honzik, К 

81. *Problem Check List: Form for Rural Youth, 

82. *Problem Check List: Form for Schools of 
Nursing. 

83. *The Purdue Rating Scale for "Administrators 
and Executives. Review by Kenneth L. Heaton. 

The School Inventory, Review by Ross W, Mat- 

teson, 

85. Selective Vocabulary Test. Review by James 
Maxwell. ) i 

86. *Self Analysis Inventory. 


87. *The Sixteen Personality Factor Questionnaire, 
Reviews by Ardie Lubin, Charles М. arsh, and J. 
Richard Wittenborn. 


88. *Social Distance Scale, Seventh. Experimental 
Edition. Review by Donald T. Campbell. i 
*Social Intelligence Test: George Washington 
University Series, Revised Form. Р 
AS. А Junior Inventory. Review by Dwight L. 
rnold, 

91. XSRA Youth Inventory. Reviews by Kenneth 
E. Clark and Frank S. Freeman. ( 

92. *Study of Values: А Scale for Measuring the 
Dominant Interests in Personality, Revised Edition. 
Reviews by Harrison G. Gough and William Steplien- 
son; and an excerpt from a review. А 

93. ЖТлиғѕіопе Temperament Schedule. Кее 
Hans J. Eysenck, бала M. Harsh, and David С. 
Ryans; and an excerpt from a review. ч 

94. Vineland ‘Social Maturity Scale, Reviews by 
William M. Cruickshank and Florence M. Teagarden. 

95. Washburne Social-Adjustment Inventory, Thas- 
bic Edition, Review by William Seeman. i 

96. Willoughby Emotional Maturity Scale. Review 
by Lysle W. Croft. ч 

97. *The Wishes and Fears Inventory. 

98. Hi rad имао к. ; 

. ork Preference Inventory. 

HN *Your Activities and Atiitudes. Excerpt from 

а review. 


PROJECTIVE 


7 i the 
102. kThe Blacky Pictures: A Technique for the 
Exploration of Personality Dynamics. Review by Al 
bert Wee рер оше тее а Edi- 
103. ildren's erception , 1 
tion. Reviews by John. E. Bell and L. Joseph Stone; 
and excerpts from five reviews. 
104. *Expressive Movement Chart. , by John 
105. *Four-Picture Test (1930). Reviews by J 
Е. Bell, E J. С. Bradford, and Ephraim Rosen; ; 
excerpt from a review. | he Study 0 
106. *Group Projection Sketches for the IY. 
» тай Groups. Reviews by Robert R. Holt and N 
со 5 t 
107. *H-T-P; А Projective Device and о МИН 
of Adult Intelligence. Reviews by Alberi (өшү; 
Ephraim Rosen; and an excerpt from a revi W philip 
108-9. *Horn-Hellersberg Test. Reviews ^ 
L. Harriman and T. W. Richards. 
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110. ЖТле Kahn Test of Symbol Arrangement, Re- 
view by Edward Joseph Shoben, Jr. ^ E 

11-2, *Machover Draw-A-Person Test. Reviews 
by des dr Mini. ricerca Stewart. 

113-4. ake a Picture Story. Reviews by Albert 1. 
Rabin and Charles R. Strother.” аА 

її $ XI Mos Test. 

110. *Plot-Completion Test. Reviews by Robert C. 
Challman and Percival M. Symonds. il 
117-28. Rorschach, Review by Helen Sargent. 
‚129. osensweig Picture-Frustration Study. Re- 
pesa by Robert C. Challman and Percival M. Sy- 
monds. f 

„130. The Rotter Incomplete Sentences Blank. Re- 
views by Charles N. Cofer and William Schofield ; and 
an excerpt from a review. 

131. *Sentence Completions Test. Reviews һу 
Charles N. Cofer and Charles Ё. Strother; and an 
excerpt from a review. 

ae Symonds Picture Story Test. Review by 
E. J. С. Bradford; and an excerpt from a review. 

134-5. "Szondi Test. Reviews by Ardie Lubin and 
Albert I. Rabin. 

136-41. Thematic Apperception Test. Review by Ar- 
thur L. Benton, 

142. *The Travis-Johnston Projective Test: For 
the Exploration of Parent-Child Relationships. Review 
by Robert R, Holt. 

143. XTwitchell-Allen Three-Dimensional Apbercep- 
tion Test. Review by Edward [н Shoben, Jr. 

144-5. *Visual Motor Gestalt Test. Reviews by Ar- 
thur L. Benton and Howard R. White, 

146. XThe Vocational Apperception Test. Reviews 
by Benjamin Balinsky and William E. Henry; and an 
excerpt from a review, 

147. kThe World Test. 


ENGLISH 


148. XA.C.E.R. English Usage Tests. 
149. *Barrett-Ryan English Test. | 
150. *Basic Language Skills: Iowa Every-Pupil 
Tests of Basic Skills, Test C, New Edition. 
151. *California Language Test. Reviews by Gerald 
V. Lannholm and Robert C. Pooley. i 
ТЇЙЇ *College English Test: National Achievement 
ests. 
153. College Placement Test in English. Review by 
Charlotte Croon Davis. Ё 
154. College Preparatory Test in English. Review 
by Charlotte Croon Davis, f 
155. *Cooperative English Test: Lower and Higher 
Levels. Review by Chester W. Harris. 
156. *Correct English Usage Test. 
„157. “Correctness and Appropriateness of Expres- 
225 lowa Tests of Educational Development, 
est 3. 
158. *English IX-XII: Achievement Examinations 
for Secondary Schools. 
150. *English—oth Grade: 20th Century Test. 
160, kEnglish—roth Grade: 20th Century Test. 
161. XEnglish Test: Municipal Tests: National 
Achievement Tests. * 
162. English Test: National Achievement Tests. 
Review by Winifred L. Post. / 
163. *English Usage: Every Pupil Testi ў 
164. Iowa Grammar Information Test. Review Ьу 
Robert C. Pooley. A À 
165. *Jowa Language Abilities Test, Review by 
Margaret G. McKim. д : ; 
166. Iowa Placement Examinations: English Apti- 
tude: Series EAr, Revised. Reviews by Clarence Der- 
rick and W. C. Kvaraceus. ‹ А 
167. Iowa Placement Examinations: English Train- 


English 
ing. Reviews by Clarence Derrick and W, C, Kvara- 


ceus, 

168, Los Angeles Diagnostic Tests: Language (A 
Test in Capitalisation, Punctuation and Language Us- 
age). Review by Gerald V. Lannholm, 

169. *Methanics of Written English: State High 
School Tests for Indiana, 

„170. *Morgan Test of Modern English Usage. Re- 
view by Walter N. Durost. 

171. XObjective Test in Grammar. 

172. *Objective Tests in English [Grammar]. 

173. *The Purdue Placement Test in English: The 
Clapp-Young Self-Marking Tests. 

174. *Stanford Achievement Test [Language Arts]. 

175. XTest of English Usage. Review by Charlotte 
Croon Davis. 

үө XTests of Language Usage: Active Vocabulary 
and Expression: Cooperative Inter-American Tests. 
Review by Walter V. Kaulfers. 

177. *Tools of Written English: State High School 
Tests for Indiana, 


COMPOSITION 


178. *College Entrance Examination Board Achieve- 
ment Test in English Composition. Review by Char- 
lotte Croon Davis and Frederick B. Davis. 

179. Hudelson’s Тунг Composition Ability Scale, 
Review by Worth J. Osburn. 


LITERATURE 


180. * Ability to Interpret E Materials: Iowa 
Tests of Educational Development, Test 7. 
181. *American Literature: Every Pupil Test. 4 
182. *An Awareness Test in 20th Century Litera- 


ture. h 
183. *Check List of Novels. Review by John S. 
Diekhoff, v 
184. *Cooperative Literary Comprehension and Ap- 
preciation Test. j i 
185. *English Literature: Every Pupil Test. 4 
186, *Graduate Record Examinations Advanced Lit- 
erature Test. М 
187. kInterpretation of Literature Test: General 
Education Series. Reviews by John S. Diekhoff and 


hn M. Stalnaker. ү h 
Joba XInventory of Satisfactions Found in Reading 
Fiction: General Education Series. Review by Holland 


Roberts. / 
row XThe Jones Book-A-Day Tests: For Checking 


Outside Reading of High School Pupils, 
190. * Literature Appreciation Tests, б , 
191. *Literature Test: Municipal Tests: National 


Achievement Tests. y j 
192, *Literature Test: National Achievement Tests. 


193. Objective Tests in English [Perfection Form 
De "Objective Tests in English [Turner E. Smith 
д y 


195. *kStanford Achievement Test. [Literature]. Re- 


i inifred L, Post. А 
kr зи Test in English Literature. Review by 


. Diekhoff. ] 
Јо 5, ior Clark Test on Classical References 


and Allusions. 
- SPELLING 
198. *kAlyer. Standardised Spelling Test. Review by 


. Bixler. ^ ү 
КЫЫ ДМ | Spelling Test. Review by 
h r * * D 
en ipsi ce Spelling- Diagnosis Test, Review 
Spache. MNA 
cn Жайга Word Spelling Testi. - ——— 


Fine Arts 


202. *Lincoln Diagnostic Spelling Test. Reviews by 
Walter Scribner Guiler and rge Spache. 

203. *Lincoln Intermediate Spelling Test. Reviews 
by Walter Scribner Guiler and George Spache. 

204. kMorgan Spelling Test for Business and Indus- 
try. Review by Harold H. Bixler. 

205. Morrison-McCall Spelling Scale. Review by An- 
ton Thompson. " | 
$ уң *The New Standard High School Spelling 

cale. 

207. kRich-Engelson Spelling Test. Review by 
Henry D. Rinsland. i 

208. * Spelling and Vocabulary: Every Pupil Test. 

209. * Spelling Errors Test. 

210. *Spelling: Seven Plus Assessment: Northum- 
berland Series. к 

211. *Spelling Test for Clerical Workers. Review 
by Harold Н. Bixler. $ 

212. Traxler High School Spelling Test. Review by 
Henry D. Rinsland. 


VOCABULARY 


213. *Cooperative Vocabulary Test. 

214. *General Vocabulary: lowa Tests of Educa- 
tional Development, Test 8. f 

215. kHolborn Vocabulary Test for Young Chil- 
dren. Review by C. M. Fleming. 

216. *Michigan Vocabulary Profile Test. Review by 
David Segel. 

217. *Test of Literary Essentials. 

218. *Word Dexterity Test. 


FINE ARTS 


ART 


219. kGraduate Record Examinations Advanced 
Fine Arts Test. 

220. kGraves Design Judgment Test. Reviews by 
William B. Michael and Edwin Ziegfeld; and an ex- 
cerpt from a review. 

221. *Horn Art Aptitude Inventory. 

222. Knauber Art Ability Test. Review by Edwin 
Ziegfeld. 

223. Knauber Art Vocabulary Test. Review by Ed- 
win Ziegfeld. 

224. The Meier Art Tests: I, Art Judgment. Review 
by Harold A. Schultz. 


MUSIC 


225. *Conrad Instrument-Talent Test. 

226. kDiagnostic Tests of Achievement in Music. 
Reviews by William S. Larson and Herbert D. Wing. 

227. kGraduate Record Examinations Advanced 
Music Test. 

228. *Musical Aptitude Test: Series A. Review by 
William S. Larson. 

220. Seashore Measures of Musical Talents, Revised 
repr Reviews by John McLeish and Herbert D. 


ing. , 

230-1. kWing Standardized Tests of Musical Intel- 
ligence: A Test of Musical Ability on 10 Records. Re- 
view by John McLeish. 


FOREIGN LANGUAGES 


232. Foreign Language Prognosis Test. Review by 
William, B. Michael. 


ENGLISH 


233. *English Examination for Foreign Students: 
Including a Test of Non-Verbal Reasoning. 

234. XEnglish Language Test for Foreign Students. 
Review by Clarence E. Turner. 
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235. kTest of Aural Comprehension ij 5 
а Foreign Language. " on in English as 


FRENCH 
$ m *Cohen French Test. Review by Mary E, Turn- 
u 


237. *College Entrance Examination Boar, ieve- 
ment Test in French Reading. Review by ee 
E French Ti 

238. *Cooperative French Test: Lower a i 
Levels. Review by Elton Hocking. fug 

239. *French I and II: Achievement Examinations 
for Secondary Schools. Review by Elton Hocking, 
240. *French Recognition Vocabulary Test: State 
High School Tests for Indiana, Review by Clarence E, 
Turner. 

241. *Graduate Record Examinations Advanced 
French Test. 

242. kA Standardised French Grammar Test. Re- 
views Nelson Brooks and Donald G. Burns, 

243. KA Standardized French Vocabulary Test, Re- 
views by Nelson Brooks and Donald G. Burns. 


GERMAN 


244. *College Entrance Examination Board Achieve- 
ment Test in German Reading. Review by Herbert 
Schueler, 

245. *Cooperative German Test: Advanced Form. 
Review by Herbert Schueler. E 

246. *German I and 11: Achievement Examinations 
for Secondary Schools. 

247. *Graduate Record Examinations Advanced 
German Test. 


GREEK 


248. College Entrance Examination Board Achieve- 
ment Test in Greek Reading. 


ITALIAN 


249. *College Entrance Examination Board Achieve- 
ment Test in Italian Reading. 


LATIN 


250. *kCollege Entrance Examination Board Achieve- 
rigs Test in Latin Reading. Review by Harold B. Dun 

el. y 

251. oeste 8 щш С; Lower and Higher 
Levels. Review by Konrad Gries. 4 

252. *First Year Latin Test: State High School 
Tests for Indiana. р 

ар Latin Composition Test. Review by 
Konrad Gries. 7 

254. Бойна; Second Year Latin Test. Review by 
У. C.K i КЫ 

255. lano T and II: Achievement Examinations 
for Secondary Schools. : 

256. *Latin I and II: Every Pupil Test. , ig Scholl 

257. *Second Year Latin Test: State Hig 
Tests for Indiana. 

258. White Latin Test. Review by Konrad Gries. 


SPANISH 


250. *College Entrance omg Board Achieve- 
ment Test in Spanish Reading. ‘her 

260. SCooperative 5 ому у Ден Lower and H igh 
Levels. Review by James B. Tharp. І 0 

261. Mn уй АЫ Spanish Test: State High Scho 
Tests for Indiana. 

262. *Furness 
Spanish. : 

263. *Graduate Record Examina 
Spanish Test. А 


: A 
Test of Aural Comprehension * 


tions Advanced 
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264. *Kansas First Year Spanish Test. 

265. *Spanish I and II: Achievement Examinations 
for Secondary Schools. 

266. The Stanford Spanish Tests. Review by James 
B. Tharp. 


INTELLIGENCE 
GROUP 
267. *A:C.E.R. Adult Test (B40). 
268. *A.C.E.R. General Ability Test: Advanced M. 
269. *A.C.E.R. General Test A. 
270. kA.C.E.R. General Test C. 
271. kA.C.E.R. Junior A. 


272. A.C.E.R. Non-Verbal Test. Review by F. J. 
Schonell, 

273. kA.C.E.R. Test 1. 

274. Academic Aptitude Test: Non-Verbal Intelli- 
gence: Acorn National. Aptitude Tests. Review by 
William B. Schrader. 

275. *Academic Aptitude Test: Verbal Intelligence: 
Acorn National Aptitude Tests. Review by William B. 
Schrader. 

276. kAkron Classification Test. Review by Erwin 
K. Taylor. 

277. *American Council on Education Psychological 
Examination for College Freshmen. 

278. *American Council on Education Psychological 
Examination for High School Students. 

279. The Army Alpha Examination: First Nebraska 
Revision. Review by Robert G. Demaree and Louis L. 
McQuitty. 

280. *Army General Classification Test, First Civil- 
ian Edition. Review by John T. Dailey. 

281. Army Group Examination Alpha. Reviews by 
John T. Dailey and Willis C. Schaefer. 

282. *California Test of Mental Maturity and adap- 
tations. Excerpt from a review. 

283. *Chicago Non-Verbal Examination. 

284. *Classification Test for Industrial and Office 
Personnel. 

285. ЖСоПеде Entrance Examination Board Scho- 
lastic Aptitude Test. Review by Frederick B. Davis. 

286. *College Transfer Test. x 

ay. Culture-Free Test. Review by Raleigh М. 

rake, 

288. *[Detroit Intelligence Tests.] y 

280. *Duplex Series of Ability Tests. Reviews by 
W. G. Emmett and Stanley D. Nisbet; and an excerpt 
from a review. Ў 

290. *The Essential Intelligence Test. Review by 
F. W. Warburton. 

201. XFigure Reasoning Test: A Non-Verbal Intel- 
ligence Test. Reviews by E. J. G. Bradford and James 
Maxwell; and an excerpt from a review. : 

292. Goodenough Intelligence Test. Review by 
Naomi Stewart. EAS 4 

293. kGraduate Record Examinations Aptitude 
Test. Reviews by J. Р. Guilford and Carl I. Hoyland. 

204. *Group Test of Learning Capacity: Dominion 
Tests. Review by W. G. Emmett. ] J 

295. *Group Test 33 (Intelligence) : National Insti- 
tute of Industrial Psychology. Д А 

296. kGroup Test 36 [Intelligence]: National Insti- 
tute of Industrial Psychology. : 

297. kGroup Test 70 [Nonverbal Intelligence] : Na- 
dd Institute of Industrial Psychology. Review by 

eorge Westby. Н i 

peel Group. Test 90A [Intelligence]: National In- 
stitute of Industrial Psychology. Pe a 

. *The Henmon-Nelson Tests of Mental Ability: 

The Clapp-Young Self-Marking Tests. Review 
H. M. Fowler. J Test 
300. kThe IPAT Culture Free Intelligence Тез. 


Intelligence 


Бейеү by Gladys C. Schwesinger and Raleigh M. 
301. kKentucky Classification Battery. i 

David V. Tiedeman. je =: кке, pe tad 
302. *Kuhlmann-Anderson Intelligence Tests, Sixth 

E iir Reviews by Henry E. Garrett and David Se- 


gel. 
303. OMM, ЫШ te: e Scale. 
304. er Analogies Test, Reviews Carl I. 
Hovland irs ied Alpha Ex by 
305. odifie a Examination Form 9. Review 
by Dael Wolfle. : f 
‚306, *Moray House Picture Intelligence Test. Re- 
views by Gertrude Keir and M. L. Kellmer Pringle. 
307. *Non-Verbal Test. Review by E. A. Peel. 
.308. *Ohio State University Psychological Test. Re- 
view by George A. Ferguson. 
309. *Oral Directions Test: An Intelligence Test on 
Phonograph Records. 
зто. kOtis Employment Tests. 
311. XP.C. Verbal Capacity Sampler. 
WE *Pattern Perception Test, Review by Alice W. 
eim. 
313. *The Peel Group Test of Practical Ability. Re- 
view by George Westby. 
314. *Progressive Matrices. Reviews by Charlotte 
Banks, W. D. Wall, and George уен, 
315. *Revised Alpha Examination, Forms 5 and 7. 
Review by Dael Wolfle. 
316. Revised Alpha Examination Form 6, Short 
Form. Reviews by Edwin R. Henry and Dael Wolfle. 
317. *Revision of Army Alpha Examination. Re- 
views by Edward E. Cureton and Edwin R. Henry. 
318. *SRA Non-Verbal Form. Review by W. D. 
Commins. л 
319. *SRA Verbal Form. Reviews by W. D. Com- 
mins and Willis C. Schaefer. й 
320. *Scovill Classification Test. Reviews by Robert 
С. Bernreuter and Edward E. Cureton. 
321. kThe Simplex Group Intelligence Scale. 
322. *[The Simplex Junior Intelligence Tests.] 
323. Southend Group Test of Intelligence. Review by 
Gertrude Keir. p 
324. *Terman-McNemar Test of Mental Ability. 
325. Tests of General Ability: Cooperative Inter- 
American. Tests. Reviews by Walter N. Durost and 
Raleigh M. Drake. 
326. *Thurstone Test of Mental Alertness. 
327. V.G.C. Intelligence Indicator. Review by 
George A. Ferguson. 
328. XVerbal and Non-Verbal Test., ? 
329. *Verbal Intelligence Test. Review by William 
B. Pag icit AO 
o. *Verbal Test I. Р 
pes *Wesman Personnel. Classification Test. Re- 
views b; John C. Flanagan and Erwin К. Taylor. 
332. * West Yorkshire" Group Test of Intelligence. 
333. Word-Number Test of Scholastic Aptitude, Re- 
view by Jane Loevinger. 
INDIVIDUAL 
Alexander Performance Scale: A Performance 
sale for the Measurement of Practical Ability, Re- 


i les A. Stickland. 
view ictor Pob Scale of Performance Tests. Re- 


i illiam К. Grove. T d 
ees oom In ANM Examination, 1947 Re- 
Gwen F. Arnol 


а а old. 
un йу Crichton Vocabulary Scale. Reviews by 


Banks and W. D. Wall. à 
сее Ситі Classification Form. Review by Harold 


= The FR-CR Test. Review by William S. Ko- 


gan and Kate Levine Kogan. 


Mathematics 


340. *Full-Range Picture Vocabulary Test. Reviews 
by William D. Altus and William M. Cruickshank. 

341. *Gesell Developmental Schedules. 

342. *The Immediate Test: A Quick Verbal Intelli- 
gence Test. Reviews by Jerome Doppelt and Ivan 
Norman Mensh. 

343-5. *Intelligence Tests for Children. 

346. Kent Series of Emergency Scales. Review by 
Ivan Norman Mensh. 

347. *kThe Leiter Adaptation of Arthur's Stencil De- 
sign Test. An excerpt from a review. 

348. kThe Leiter Adaptation of the Painted Cube 
Test. An excerpt from a review. 

349. *Leiter International Performance Scale. Re- 
view by Gwen F. Arnold; and an excerpt from a re- 
view. 

350. *Leiter-Partington Adult Performance Scale. 
Reviews by Harold A. Delp and Herschel Manuel; 
and an excerpt from a review. 

351-2. Minnesota Preschool Scale. 

353. Nebraska Test of Learning Aptitude for Young 
Deaf Children: A Non-Verbal Individual Scale Espe- 
cially Devised for Deaf and Hard of Hearing Children 
From Four to Ten Years of Age. Review by Mildred 
C. Templin. 

354. *Northwestern Intelligence Tests. Review by 
Mildred C. Templin; and an excerpt from a review. 

355. *Partington’s Pathways Test. Review by 
George K. Bennett; and an excerpt from a review. 

350-7. *Porteus Maze Test, Vineland Revision. Re- 
views by C. M. Louttit and Gladys C. Schwesinger. 

358. Revised Stanford-Binet Scale. Review by Boyd 
R. McCandless. | 
i 359. *Stencil Design Test. Review by Benjamin Ba- 
insky. 


360. *Visual Retention Test for Clinical Use. Re- 
views па Schofield, Ivan Norman Mensh, and 
oseph Newma: 


n. 

361-2. *Wechsler-Bellevue Intelligence Scale. Re- 
views by Murray Aborn and William D. Altus. 

363. Жж Wechsler Intelligence Scale for Children. Re- 
views by James M. Anderson, Harold A. Delp, and 
Boyd R. McCandless ; and an excerpt from a review. 

364, W echsler Memory Scale. Reviews by Ivan Nor- 
man Mensh and Joseph Newman. 


MATHEMATICS 


365. *Ability to Do Quantitative Thinking: Iowa 
Tests of Educational Development, Test 4. 

366. *California Mathematics Test. 

367. College Entrance Examination Board Achieve- 
ment Test in Advanced Mathematics. Review by Paul 
L. Dressel. =. 

368. *College Entrance Examination Board Achieve- 
ment Test їп Intermediate Mathematics. Review by 
Paul J. Blommers. A 

369. *Cooperative Mathematics Pre-Test for College 
Students. Review by E. P. Starke. 

, 370; *Cooperative Mathematics Tests for Grades 7, 
, and 9. 

371. *Davis Test of Functional Competence in Math- 

ematics: Evaluation and Adjustment Series. 

372. kFunctional Evaluation in Mathematics. Re- 
view by Charles S. Ross. 

373. *General Mathematics III: Achievement Ex- 
aminations for Secondary Schools. 

374. kGraduate Record Examinations Advanced 
Жиен Тен, " A E 

375. athematics: Every Pupil Test. 

376. Morti Test 1. ў С 

377. kRogers Achievement Test in Mathematics for 
Technical and Industrial Schools. 4 

378. *Snader General Mathematics Test: Evalua- 
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tion and Adjustment Series. Reviews b P. 

mers п Нотага " nh y Раш J. Blom- 
37 est of General Proficiency in the Field of 
Mat. tics: Cooperative General Achi 

Revised Series, Test Ш. руа 


ALGEBRA 


380. Ability for Algebra: Fife Tests of Ability, Test 
3. Review by William G. Mollenkopf. 

381. *Advanced Algebra: Achievement Examina- 
tions for Secondary Schools. 

382. *Advanced Algebra Test: State High School 
Tests for Indiana. 

383. * Algebra Prognosis Test. Reviews by William 
G. Mollenkopf and Paul J. Blommers. 

384. Algebra Readiness Test. Review by Harold 
Gulliksen. 

385. "California Algebra Aptitude Test. Review by 
William G. Mollenkopf. 

386. Columbia Research Bureau Algebra Test. Re- 
view by Stanley Clark. 

387. vata ah wee Algebra Test: Elementary Alge- 
bra Through Quadratics. Review by Stanley Clark. 

388. *Cooperative Intermediate Algebra Test: Quad- 
ratics and Beyond. Reviews by Lucian B. Kinney and 
E. P. Starke. 


389. *Elementary Algebra: Achievement Examina- ` 


tions iP aspect yeu т Рин Test 
90. ementary Algebra: Every Pupu Lest. 
». *First Year Algebra Test: State High School 
Tests for Indiana. * NN 
392. *First Year Algebra: 20th Century Test. 
393. Iowa Algebra Aptitude Test, Revised Edition. 
Reviews by Harold Gulliksen and Emma Spaney. 
304. *Lankton First-Year Algebra Test: Evalua- 
tion and Adjustment Series. Review by Stanley Clark. 
s. kLarson-Greene Unit Tests in First-Year Al- 


gebra. : р 
‚ *Orleans Algebra Prognosis Test, Revised Edi- 
урык by Emma Spaney and Harold Gulliksen. 
307. *Seattle Algebra Test: Evaluation and Adjust- 
ment Series. 


ARITHMETIC 


„ *A.C.E.R. Arithmetic Tests. ‘ : 
2 XA.C.E.R. Number Test. Review by Leslie M. 
Haynes. - 
400. *Arithmetic Essentials Test. Reviews by Foster 
E. A yore and Chida IN : 
I. *Arithmetic: Every Pu est. К 
от *Arithmetic Fundamentals Test: State High 
SENON ане for gadana. жиы 
ү rithmetic Reasoning 1 est. ү 
y *Arithmetic: Seven Plus Assessment: North- 
umberland Series. Joss 
405. *Arithmetic Test, Fundamental Operations: 
Dominion Tests. ^ 
406. Arithmetic Test (Fundamentals and Ка 
ing): Municipal Tests: National Achievement By 
Reviews by Foster E. Grossnickle and 


055. Р " 
407. *Arithmetical Reasoning Test. Review by Wi 
iam L. f. ds 
any "Past Arithmetic Skills: Towa Every Pupil 
Tests of Basic Skills, Test D, New Edition, Reviews 

400. *Д Brief Survey of Ашим Е Ша : 
by William A. B and Henry V. r is. 
7410. Көн, “Diagnostic “Arithmetic Tests 
i Herbert F, Spitzer, з rt 
Ww "California “Arithmetic Test. Review by А 
Arith- 


. Burch. ТЫ V 
412. *Cotswold Measurement of Ability 1: 
metic. Review by W. L. Sumner. 
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413. Diagnostic Chart for Fundamental Pr i 
Arithmetic. Review by Leo J. Brueckner. mara 

414. kGraded Arithmetic-Mathematics Test, 

415. *Los Angeles Diagnostic Tests: Fundamentals 
of Arithmetic. 

416. *Metropolitan Achievement Tests [Arithme- 
tic]. Review by Robert L. Burch. 

417. Number Fact Check Sheet. Review by Miriam 
MB kRetail Arithmetic W 
418. etail Arithmetic Worksample. Revi 
William J. E. Crissy. кй 

419. *Stanford Achievement Test [Arithmetic]. Re- 
view by Robert L. Burch. 

420. Test of Mechanical Arithmetic. 

421. XWoody-McCall Mixed Fundamentals in Arith- 
metic. Review by William A. Brownell. 


GEOMETRY 


422. Columbia Research Bureau Plane Geometry 
esi. Review by Cyril J. Hoyt and Theodore E. Kel- 
ogg. 

423. *С ooperative Plane Geometry Test. Review by 
Cyril J. Hoyt and Theodore E. Kellogg. 

424. kGeometry Attainment Test. Reviews by I. 
Macfarlane Smith and W. L. Sumner. 

425. *Geometry: Every Pupil Test. 

426. *Lane-Greene Unit Tests in Plane Geometry. 

427. *Orleans Geometry Prognosis Test, Revised 
Edition. 

428. ЖРІапе Geometry: Achievement Examinations 
for Secondary Schools. 

429. *Plane Geometry Test: State High School 
Tests for Indiana. 

430. *Plane Geometry: 20th Century Test. 

431. *Schrammel-Reed Solid Geometry Test. 

432. *Seattle Plane Geometry Test: Evaluation and 
Adjustment Series. į 

433. XShaycoft Plane Geometry Test: Evaluation 
and Adjustment Series. Reviews by Harold P. Fawcett 
and Cyril J. Hoyt and Theodore E. Kellogg. 

434. kSolid Geometry: Achievement Examinations 
for Secondary Schools. 

43s. *Solid Geometry Test: State High School 
Tests for Indiana. 

436. Survey Test in Plane Geometry. Review by 
Harold P. Fawcett. 

437. *ТАе Van Dyke Solid Geometry Test. 


TRIGONOMETRY 


438. *Cooperative Plane Trigonometry Test. 

439. *Rasmussen Trigonometry Test. 

440. kTrigonometry Test: State High School Tests 
for Indiana, 


MISCELLANEOUS 


AGRICULTURE 
441. Farm Shop Tools: Recognition and Use: State 
High School Tests for Indiana. Review by M. Ray 


rnes. 
442. *Graduate Record Examinations Advanced Ag- 


riculture Test. 1 
BUSINESS EDUCATION 


443. XBookkeeping: Achievement Examinations for 
Secondory енна I: Every Pupil Test 

444. XBookkeeping I: Every ГИ, m 

445. И д Тезї: National Business En- ^-^ 
trance Tests. 

446. *Business Fundamentals and General Informa- 
tion Test: National Business Entrance Tests, , . 

447. XBusiness Relations and Occupations: Achieve- 
ment Examinations for Secondary Schools. 


Miscellaneous 


448. *Commercial Arithmetic Test: St i 
School Tests for Indiana. ripe Sieh 

„449. *Cooperative Commercial Arithmetic Test, Re- 
view by Bertram Epstein. 

‚ 450. XGilbert Business Arithmetic. Review by Wil- 
liam ws f. 

451. iett Simplified Shorthand Test (Gregg). 

452. *Machine Calculation Test; Nasional г АА 
Entrance Tests, 

453. *National Business Entrance Tests. 

t SRA Dictation Skills. Review by Harold F. 
othe. 

455. Seashore-Bennett — Stenographic Proficiency 
Tests: A Standard Recorded Stenographic Work- 
sample. Review by Harold F. Rothe. 

456. *Shorthand I and II: Every Pupil Test. 

457. *Simplified Shorthand Test: State High School 
Tests for Indiana. 

458. *Stenographic Test: National Business En- 
trance Tests. 

459. ЖТезі for Bie or Skill. Reviews by 
Reign. H. Bittner and Clifford E. Jurgensen. 

460. Turse Shorthand Aptitude Test. Review by 
Leslie M. Haynes. 

461. *Typewriting I: Every Pupil Test. y 

462. *Typewriting Test: National Business En- 
trance Tests. 

463. *Typewriting Test: State High School Tests 
for Indiana. i 


COMPUTATIONAL AND SCORING DEVICES 
464. жле and І.О. Calculator. 

465. XThe Delp Г.О. Computer. An excerpt from a 
review. 
466. XkHanke's Answer Sheets. j 3 
467. KThe Multiscore Profile Form and Scoring 


Codes. 

468. *SRA Self-Scorer. Reviews by James M. An- 
derson and Arthur E. Traxler. А 

469. ЖТле Tweeddale 1.0. Conversion Tables. 

470. XV.G.C. Answer Strips. 


ETIQUETTE 


471. The Best Thing to Do: A Test of Knowledge of 
Social Standards. Review by Helen Shacter. 
472. Furbay-Schrammel ‘ocial Comprehension Test. 


Review by James H. Ricks, Jr. У | 
yo t of Etiquette: ' George Tu Uni- 
versity Series, Review by James H. Ricks, Jr. . 
474. Test of Knowledge of Social Usage. Review by 


Hilda Taba. 
HANDWRITING 
475. Ayres Measuring Scale for Handwriting. Re- 
view by Worth sburn, 


; The Thorndike Scale Jor Handwriting of Chil- 


3 HEALTH 
478. *College Belt Knowledge Test. Review by 
ison Clarke. ыш, 

PM Cornell Medical Index—Health vertionpoirt. 

480. *[French Tests for Pres а Courses, n 
Knowledge of Sports] : hysical ducation Major Ex- 
aminations, Review by H. Harrison Clarke, Y 

481. Gates-Stran Health Knowledge Tests. Re- 


i 1 E. Kambly. н ^ 
view DY Health and Safety Education Test: National 


T t Tests. 1 > 
ДОМЕ s Health Education and Hygiene: Every Pupil 
Test. j 


Reading 


484. *Health Inventories. Review by Benjamin 


imberg. 
48s. *Health Test: National Achievement Tests. 
Review by Jacob S. Orleans. 
486. *Patient's Self History Form. 
487. *Personal Health Inventory. Review by Wil- 
lard W. Patty. * 
488. *&Sex Knowledge Inventory: Experimental Edi- 
tion. Review by Albert Ellis and an excerpt from a re- 


view. 

489. *[Wetzel Grid Charts.) 

.490. *Width-Weight Tables, Second Revised Edi- 
tion. 


HOME ECONOMICS 


491. * Cooperative Test in Foods and Nutrition, Re- 
view by Robert L. Ebel. 

492. * Cooperative Test in Home Management. 

493. *Cooperative Test in Household САГ оды 
Reviews by Faith Madden and Victor Н. Noll. 

494. *Cooperative Test in Textiles and Clothing. 

495. *Foods I, Food Selection and Preparation: 
State High School Tests for Indiana. 

496. *Foods II, Planning for Family Food Needs: 
State High School Tests for Indiana. 

497. kGraduate Record Examinations Advanced 
Home Economics Test. 

498. *Helping With Food in the Home: State High 
School Tests for Indiana. Review by Jean D. Amber- 


son. 
499. Helping With the Housekeeping: State High 
ЕБ Tests for Indiana. Review by Hester Chadder- 
оп. 

00. *Home Care of the Sick Test: State High 
School Tests for Indiana. 

sor. *Housing the Family: State High School Tests 
for Indiana. Review by Jean D. Amberson. 

502. *Nutrition Information. 


INDUSTRIAL ARTS 


503. Mechanical Drawing Test: State High School 
Tests for Indiana. Review by William J. Micheels. 

504. *Middleton Industrial Arts Test. Review by 
William J. Micheels. 


PHILOSOPHY 
sos. *Graduate Record Examinations Advanced 


оору Test. 
506. *Graduate Record Examinations Scholastic 
Philosophy Test. 


PSYCHOLOGY 


507. *Comprehensive Examination in Psychology. 
Review by Edith M. Huddleston. 

508. *Graduate Record Examinations Advanced 
Psychology Test. 

509. *Hogan Psychology Test. 


RECORD AND REPORT FORMS 


510. *American Council on Education Cumulative 
Recosrd Folders. Reviews by Warren R. Baller and 
Arthur H. Brayfield. 

511. XCalifornia Cumulative Guidance Record for 
Elementary Schools. Review by Warren R. Baller. 

512. *Cumulative Guidance Record. 

513. *Cumulative Personnel Record. 

514. *Ontario School Record System. 

515. *Permanent Record Card. 

516. *Secondary-School Record. 

517. *V.G.C. Cumulative Record Folder. 
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RELIGIOUS EDUCATION 


518. *Jewish Information Test. 
519. *Opinion Survey. 
520. *Peters Biblical Knowledge Test. 


SAFETY EDUCATION 


521. X[American Automobile Associati i 
Testing Apparatus.] pod ш 

522. *Examination for Drivin 

523. *General Achievement 
алови T ius Pr dos 

524. *Rating Scale for Automobile-Driver Skills: 
The Abercrombie Driver Test. d 

525. *Revere Safety Test, Second Edition. Reviews 
by Willard A. Kerr and Harold G. Seashore. 


Instructors, 
est in Fire Safety: 


TESTING PROGRAMS 


526. *College Entrance Examination Board Admis- 
sions Testing Program. 
527. kThe Graduate Record Examinations. 


READING 


‚528. * A.C.E.R. Silent Reading Test, Form C. 

529. * Achievement Tests in Silent Reading: Domin- 
ion Tests. Review by Henry P. Smith. 

530. *California Keading Test. Reviews by John C. 
Flanagan and James R. Hobson; and an excerpt from 
а review. : 

531. *Diagnostic Reading Tests. Reviews by Fred- 
е B. Davis, William W. Turnbull, and Henry 

eitz. 

532. *Elementary Reading: Every Pupil Test. 

533. *Garvey Primary Reading Test. — 

TAA *General Reading Test: Ohio Senior Survey 
ests. > 

5s Haggerty Reading Examination. Review by 
William S. Gray. Р $ 

536. *High School Reading Test: National Achieve- 
ment Tests. Review by Holland Roberts. М 

1537; *Holborn Reading Scale. Review by C. M. 


eming. , 
538. 1 raham-Clark Diagnostic Reading Tests. Re- 
therine G. Keneally. ; Nila 
B 539. Kansas Primary Reading Test. Review by 
anton Smith. 4 
о. YLee-Clark Reading Test—First Reader, 1943 
Edition. р : 
sn. Los Angeles Elementary Reading Test. Review 
b P. Smith. rj 
ща. Los Angeles Primary Reading Test. Review by 
Nila Banton Smith. nent Tests [Reading]: 
Я etropolitan Achievem i 
ra ait by Сн R. Hobson and Margaret G. Mc 
yy ; : abulary 
The Nelson-Denny Reading Test: V ocabuta 
and "Paragraph: i The Clap -Young ‘Self-Marking 
sts. R van A. Booker. 
Sas. The Nelson Silent Reading Test: A 
and Paragraph: The Clapp-Young Self- 
Tests. Review by William D. Sheldon. Tesi 
546. *Primary Reading: Every Pupi dioe. English 
547. *Reading Comprehension: Coopera " "a 
Test: Lower and Higher Levels, Test Ст a Worthum- 
548. X*Reading: Seven Plus Assessment: 
berland Series. - and speed): 
549. kReading Test (Comprehension Tests. 
Municipal Tests: National Achievement William W. 
sso. SRA Reading Record. Review by 
Turnbull. 


| 
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sr. Sangren-Woody Reading Test. Revi à 
vid H. Russell, Mee. gium 

552. "The Schonell Reading Tests. Review by M. L. 
кше, Pe , 

553. Shank Tests of Reading Comprehension. Re- 
view by William D. Sheldon. s denen 

554. "Silent Reading Comprehension: Iowa Every- 
Pupil Teste саза Skills, Test A. 

555. Stanford Achievement Test [Reading]. Revi 
by James R. Hobson. | ed: Revie 

556. *Techniques in Reading Comprehension for 
Junior-Senior High School: Every Pupil Test. 

557. XTests of Reading: Cooperative Inter-American 
Tests. Reviews by Jacob S. Orleans and Frederick L. 
Westover. 

558. *Thorndike-Lorge Reading Test. Review by 
Ivan A. Booker. 

559. *Traxler High School Reading Test. Review by 
Harold D. Carter. 

560. *Traxler Silent Reading Test. Review by J. 
Thomas Hastings. 


MISCELLANEOUS 


sór. Durrell Analysis of Reading Difficulty. Review 
by Helen M. Robinson. 

562. *Durrell-Sullivan Reading Capacity and 
Achievement Tests. Review by Helen M. Robinson. 
_ 563-4. Gates Reading Diagnostic Tests, Revised Edi- 
tion. Review by Worth J. Osburn. 


ORAL 


565. *Oral Diagnostic Test ој W ord-Analysis 
Skills, Primary: Dominion Tests. Review by Nila 
Banton Smith. E 


READINESS 


566. Gates Reading Readiness Tests. Review by F. J. 
Schonell. 
7207 Group Test of Reading Readiness: Dominion 

ests. 

568. *The  Harrison-Stroud Reading Readiness 
Tests. Review by William S. Gray. 

‚560. *Lee-Clark Reading Readiness Test, 1951 Re- 
vision. 

570. *Metropolitan Readiness Tests. Review by Eric 
F. Gardner ; excerpt from one review. А б 

571. kMurphy-Durrell Diagnostic Reading Readi- 
ness Test. \ 

‚572. *School Readiness Inventory. Review by Da- 
vid H. Russell. 


SPECIAL FIELDS 


573. * Ability to Interpret Reading Materials in the 
Natural Sciences: Iowa Tests of Educational Develop- 
ment, Test 6. {= 

574. *Ability to rea hae Reading Materials in the 
Social Studies: Iowa Tests of Educational Develop- 
ment, Test 5. i 

575. *[Robinson-Hall Reading Tests] Review by 
Robert Murray Bear. 

576. kTests of Natural Sciences: Vocabulary and 
Interpretation of Reading Materials: Cooperative In- 
ter-American Tests. Review by Clarence H. Nelson. 

577. kT ests of Social Studies: Vocabulary and In- 
terpretation of Reading Materials: Cooperative Inter- 
American Tests. Reviews by Gustav J. Froehlich and 
Martha E. Layman. 


STUDY SKILLS 


578. *Bennett Use of Library Test. Review by Louis 
Shores. 


Science 


579. *Cooperative Dictionary Test. 

580. *Edmiston How to Stud Test. 
pip dei назъи Щщ Data Test: General. Educa- 

eries. Reviews y 
Vietor fi ix y J. Raymond Gerberich and 

582. *Logical Reasoning Test: General Educati 
Series. Review by Robert 1. Ebel. rene 

583. Survey of Study Habits, Experimental Edition. 
Review by Warren К. Baller. 

584. *T'est of Critical Thinking. 

585. *Test on the Use of Books and Libraries: Gen- 
eral Education Series. Reviews by Henry D. Rinsland 
DUE J Lib 

586. e Use of Library and Study Materials. Re- 
view by Robert Murray Ban Y $247 

587. *Use of Sources of Information: Iowa Tests of 
Educational Development, Test 9. 

588. *Work-Study Skills: Iowa Every-Pupil Tests 
of Basic Skills, Test B, New Edition. 


SCIENCE 


» 589. *Elementary Science and Health:, Every Pupil 
est. 

590. *Elementary Science Test: National Achieve- 
ment Tests. 

sor. *General Background in the Natural Sciences: 
Iowa Tests of Educational Development, Test 2. 

592. *General Science Test: State High School 
Tests for Indiana. > 

593. XStanford Achievement Test [Elementary Sci- 
ДК Reviews by Bertram Epstein and Paul E. Kam- 

y. 

504. *Test of Application of Principles in. Physical 
Science: General Education Series. Review by Palmer 
O. Johnson. { Ў 

595. *A Test of General Proficiency in the Field of 
Natural Sciences: Cooperative General Achievement 
Tests, Revised Series, Test 11. 


BIOLOGY 


506. Noon Achievement Examinations for Sec- 
ondary Schools. 

507; *Biology: Every Pupil Test. 

598. *Biology Test: State High School Tests for In- 
diana, 

599. *Biology: 20th Century Test. 4 

боо. *College Entrance Examination Board Achieve- 
ment Test in Biology. Review by Clark W. Horton. 

601. *Cooperative Biology Test. Review by Leland 
P. Johnson. 1 

602. *Cooperative Biology Test: Educational Rec- 
ords Bureau Edition. s 4 

боз. *General Biology Test: National Achievement 


Test. 
p^ Graduate Record Examinations Advanced 


iology Test. ; 
PN AN Biology Test: Evaluation and Adjust- 
ment Series. Reviews by Clark W. Horton and Leland 

. Johnson. b e 
US Test of Application of Principles in Biology: 
General Education ‘cries, Reviews by Clark W. Hor- 
ton and Clarence Н. Nelson. 


CHEMISTRY 


‚ *A.C.S. Cooperative Biochemistry Test. ý 
608. Acs. Cooperative ЖАНЫ Rg in Quali- 
i is. Review m. Rieman III, í 
uS mes C Do тоно Chemistry Test in Quanti- 
tative Analysis. А 

610. *A.C.S. Cooperative General. Chemistry Test 
for College Students. Review by Kenneth E. Anderson. 


Sensory-M otor 


бїт. *A.C.S. Cooperative Organic Chemistry Test. 
612. *A.C.S. Cooperative Physical Chemistry Test. 
613. kAnderson Chemistry Test: Evaluation and 
Adjustment Series. Review by Wm. Rieman III. 
614. *Chemistry: Achievement Examinations for 
Secondary Schools. 
615. *Chemistry: Every Pupil Test. 
616. *Chemistry Test: State High School Tests for 


iana. 

617. *College Entrance Examination Board Achieve- 
ment Test in Chemistry. Review by Evelyn Raskin. 

618. *Cooperative Chemistry Test. 

619. XCooperative Chemistry Test: 
Records Bureau Edition. 

620. *Graduate Record Examinations Advanced 
Chemistry Test. 

621. Iowa Placement Examinations: Chemistry Ap- 
titude. Review by: Kenneth E. Anderson. 

622. Iowa Placement Examinations: eviter std 
Training: Series CTI, Revised. Review by Kenneth E. 
Anderson. 


Educational 


GENERAL SCIENCE 


623. *Cooperative General Science Test. Review by 
John S. Richardson. 

624. *Cooperative Science Test for Grades 7, 8, and 
9. Review by R. Will Burnett. 

625. *General Science: Every Pupil Test. 
fae. *General Science Test: National Achievement 

ests. 

627. kGeneral Science III: Achievement Examina- 
tions for Secondary Schools. 

628. *Read General Science Test: Evaluation and 
Adjustment Series. Reviews by Benjamin S. Bloom 
and John S. Richardson. 


629. *Test of Кріт of Principles іп General 
Science: General Education Series. Review by R. Will 


Burnett. 


GEOLOGY 


630. *Graduate Record Examinations Advanced Ge- 
ology Test. 


MISCELLANEOUS 


631. *Measurement of Observation and Understand- 
ing of Physical Phenomena and Life Processes. 
632. *The New Air World. 


PHYSICS 


633. *College Entrance Examination Board Achieve- 
ment Test in Physics. Review by Palmer O. Johnson. 

634. *Cooperative Physics Test. 

635. *Cooperative Physics Test: Educational Rec- 
ords Bureau Edition. . 

636. *Dunning Physics Test: Evaluation and Ad- 
justment Series. Review by С. P. Cahoon. . 

637. kGraduate Record Examinations Advanced 
Physics Test. у 

38. Iowa Placement Examinations: Physics Apti- 

tude. Review by John W. French. 

639. Iowa Placement Examinations: Physics Train- 
ing: Series PTI, Revised. Review by С. P. Cahoon. 

640. kPhysics: Achievement Examinations for Sec- 
eer. РМ Every Pupil T. 

I. ysics: Every Pupil Test. 
642. *Physics Test: State High School Tests for In- 


diana. t 
643. *Physics: 20th Century Test. 


SENSORY-MOTOR 


644. Harris Tests of Lateral Dominance. Revi 
Witham G, Рей god Miles A. Tinker, ere wa һу 


THE FOURTH MENTAL MEASUREMENTS YEARBOOK [ 1160 


645. XPre-Tests of Vision, Hearing, 
ordination. 9 ра Motor Co. 


„* 
wh HEARING 


646. kAuditory Tests (Spondaic Word Li 
647. *The Massachusetts Hearing Тай T 
648. *Sonotone Screening Audiometer Model 30. 


MOTOR 


649. X*Edmiston Motor Capacity Test. 

650-1. Oseretsky Tests of Motor Proficiency: А 
Translation From the Portuguese Adaptation. Review 
by Anna Espenschade. 

652. *The Rail-Walking Test. Review by William 


Sloan. 

653. *V.D.L. Psychomotor Scale for the Measure- 
ment of Manual Ability. Reviews by Anna Espenschade 
and William Sloan. 


VISION 


654. A-B-C Vision Test for Ocular Dominance. Re- 
view by Miles A. Tinker. 

655. *Eames Eye Test. i 

650. Farnsworth Dichotomous Test for Color Blind- 

ness: Panel D-15. Review by Elsie Murray. 

657. *The Farnsworth-Munsell тоо-Ние Test for 
the Examination of Color Discrimination. Review by 
Elsie Murray. 

658. *Glenn Colorule. M 

659. *The Illuminant-Stable Color Vision Test. Re- 
view by Elsie Murray. А 

ббо. *The Master 
Miles A. Tinker. : $ 

+ 661. *Pseudo-Isochromatic Plates for Testing Color 
Perception. 


Ophthalmograph. Review by” 


SOCIAL STUDIES 


662. *College Entrance Examination Board Achieve- 

ment Test in Social Studies. Review by Robert L. 
Thorndik 

663. *Cooperative оо des à Test for Grades 7, 

1 iew by Hilda Ta E, Я 

Р aS For. [^ ра Test: Municipal Ten 

National Achievement Tests. Review by Harry ^ 


к жеди to Soc Баин: Айин 
moe eet бө Eagar eleg ЖҮ. 
TAE Social Studies Test: Natio | — 
Socal Ко: аячы oficiency in 1 ieveme 
E SÜnderstonding paren Social Concepts: 1000 
Tests of Educational Development, Test I. 


ECONOMICS 


- З or 
670. *Economics Test: State High School Tests Í 
iana. | 
Ж ics: 20th Century Test. ‘a 
on т Record Examinations Aavane 
Economics Test. J 
673. kHills Economics Test. 


GEOGRAPHY 


674. kEconomic Geography: Achievement 
nations for Secondary Schools. . A { 

675. *Geography: Every Pupil Tes Tests: National 

676. *kGeography Test: Municipal Н, Reeder. 
Achievement Tests. Review by Edwin £^ 


Exami- 
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Vocations 


677. Geography Test: National Achievement Tests. 
678. *Modern Geography and Allied Social Studies. 
Reviews by Edith М. Huddleston and Edwin Н. 
eeder, ‘ > 


HISTORY 


679. American History: Achievement Examina- 
tions for Secondary Schools. 

680. *American History: Every Pupil Test. 

681. *American History Test: National Achieve- 
ment Tests. 

682. *American History Test: State High School 
Tests for Indiana. 

683. *American History: 20th Century Test. 

684. *Cooperative American History Test. Reviews 
by Dorothy C. Adkins and Martha E. To 

685. Cooperative Ancient History Test. 

686. *Cooperative Modern European History Test. 

687. *Cooperative World History Test. 

688. kCrary American History Test: Evaluation 
and Adjustment Series. Review by Edgar B. Wesley. 

689. ЖСиттіпдѕ World History Test: Evaluation 
and Adjustment Series. Reviews by Dorothy C. Adkins 
and Howard R. Anderson. 

690. *Graduate. Record Examinations Advanced 
History Test. 

691. *Indiana History. 

боз. XModern World History: Achievement Exam- 
inations for Secondary Schools. ( 

693. *Understanding of American History, 
Revision. Review by Elizabeth C. Adams. 

604. *World History: Every Pupil Test. ! 

695. KWorld History Test: Acorn National Achieve- 
ment Tests. { 

696. *World History Test: State High School Tests 
for Indiana. 

697. kWorld History: 20th Century Test. 


POLITICAL SCIENCE 


698. *k American Civics and Government Tests for 

High Schools and Colleges. kt ; 
00. k American Government and Citizenship: Every 

Pupil Test. Review by Elizabeth C. Adams. 

700. *Civic Vocabulary Test. 

701. *Civics: 20th Century Test. 

702. *Cooperative American Government Test. Re- 
view by Frederic L. Ayer. 

703. kGraduate Record 
Government Test. Я ү. 

704. *Junior High School Civics Test: State High 
School Tests for Indiana. MM 

705. *Patterson's Tests on the Federal Constitution. 

a Senior High School Civics Test: For a One- 
phy. d Course: State High School Tests for In- 
iana. 

707. *Senior High School Civics Test: State High 
School Tests for Indiana. 


SOCIOLOGY 


708. *Black-Schrammel Sociology Test, 
709. Graduate Record Examinations Advanced So- 
ciology Test. j 


"1940 


Examinations Advanced 


f 
VOCATIONS 


710. Aptitude Tests for Occupations. Review by 
Clifford P. Froehlich; excerpt from one review. 

11. XDifferential Aptitude Tests. Reviews by Har- 
old Bechtoldt, Ralph F. Berdie, and Lloyd G. Hum- 


hreys. d 
д 712, &Factored Aptitude Series. Review by D. 


712. t 
Welty Lefever; excerpt from one review. 


E 713. Fife Tests of Ability, Review by 1, Macfarlane 
mith, 
714. *General Aptitude Test Battery. Reviews by 


~ Milton L, Blum, Edward B. Greene, and Howard R. 


Taylor. 

715. *The Guilford-Zimmerman Aptitude Survey. 
Reviews by Anne Anastasi, Harold Bechtoldt, John B 
Carroll, and P. E. Vernon. 

716. Meria Mental Abilities]. Reviews by Anne 
Anastasi, Ralph F. Berdie, John B. Carroll, Stuart A. 
Courtis, and P. E, Vernon. 


717. X[United States Employment Service Special 


Aptitude Tests.) 
718. Yale Educational Aptitude Test Battery. 


CLERICAL 


719. XA.C.E.R. Speed and Accuracy Tests. Review 
by D. W. McElwain. 

720. *Alphabetizing Test. 

721. kFiling Test. 
. 722. kGeneral Office Clerical Test (Including Fil- 
ing): National Business Entrance Tests, 

723. kGroup Test 20 [Checking] : National Institute 
of Industrial Psychology. Review by E. G. Chambers. 

724. ЖСтоир Test 25 (Clerical) : National Institute 
of Industrial vj ona y. Review by E. G. Chambers. 

725. k[Hay Tests ДУ Clerical Aptitude.] Reviews 
by Reign H. Bittner and Edward E. Cureton. 

726. *Martin Office Aptitude Tests. Reviews by 
D. Welty Lefever and Ross W. Matteson. 

727. XName Comparison Test for Clerical and In- 
dustrial Inspection Operations. 

728. *Number Comparison Test for Clerical and In- 
dustrial Inspection Operations, 

729. *[Personnel Research Institute Clerical Bat- 
tery.) Reviews by Louise Witmer Cureton and Albert 

^ rtz. 


urtz, 
730. *Psychological коеш General Clerical 
Test. Reviews by Edward E. Cureton and С. A. Satter. 

731. Purdue Clerical Adaptability Test: Purdue 
Personnel Tests. Reviews by Edward N. Hay, Joseph 
E. Moore, and Alec Rodger. f 

732. *SRA Clerical Aptitudes. Reviews by Edward 
N. Hay and G. A. Satter. 

733. kThe Short Employment Tests. 

734. XTabulation Test. ) 

735. XV.G.C. Clerical Indicator. Review by George 
А. Ferguson, 


INTERESTS 


736. *ABC Occupational Inventory. 

737. *Brainard Occupational Preference 
Review by Elmer D. Hinckley. х 

738. xEdmiston Inventory of Interest. Review by 
Arthur E. Traxler. ‚ ^ 

739. *The Guilford SM ИН ИН Interest 
Survey. Reviews by George K. Bennett and Wilbur L. 
Layton. » 

o. Henderson Analysis of Interest [Second Edi- 

do). Reviews by Wilbur L. Layton and Donald E. 

pro эген Check List. Reviews by Milton ts 
Blum and Howard R. Taylor. 

pies *Kuder Preference Record—Vocational. Re- 
Де by Edward S. Bordin, Harold D. Carter, and* 
H. M. Fo 


wler. 
743. Occupational Interest Inventory. Review by 


Arthur H. Brayfield. 
. *Self-Administering Vocational Interest Loca- 
tor With ork Interest Picture. Review by Donald E. 
ge i Thurstone Interest Schedule: Reviews by Don- 
ald E. Super and Norman Frederi Й 


Inventory. 


Vocations 


746. Vocational Interest Analyses: А Six-Fold 
Analytical Extension of the Occupational Interest In- 
ventory. Review by Julian C. Stanley. 

747-8. *Vocational Interest Blank for Men, Revised. 
une by Edward S. Bordin and Elmer D. Hinck- 
ey. 


` MANUAL DEXTERITY 


749. kMartin Peg Board (Finger Dexterity Test). 

750. *Moore Eye-Hand Coordination and Color- 
кш Test. Reviews by Norman Frederiksen and 
Jay L. Otis. 

751. *Purdue Pegboard. 

752. *Small Parts Dexterity Test, Review by Ray- 
mond A. Katzell. 

753. *Spatial Test I. Review by E. A. Peel. 

754. *Spatial Test II (Three Dimensional). 


с? *Stromberg Dexterity Test. Review by Julian’ 


tanley. 


MECHANICAL ABILITY 


756. kA.C.E.R. Mechanical Comprehension Test. 
Review by D. W. McElwain. 

Ӯ. ^ orm Relations Group Test. Review by A. T. 
elford. 

758: *Group Test 81 [Space Perception Test]: Na- 
tional Institute of Industrial Psychology. Review by 
E. С. Chambers. 

759-60. MacQuarrie Test for Mechanical Ability. 

761. Prognostic Test of Mechanical Abilities. Re- 
views by Willard A. Kerr and Ate G. Schultz, 

762. *Purdue Mechanical Adaptability Test: Purdue 
Personnel Tests. 

763. *Revised Minnesota Paper Form Board Test. 
Reviews by Clifford E. Jurgensen and Raymond A. 


Katzell. 

764. *SRA Mechanical Aptitudes. Reviews by Alec 
Rodger and Douglas G. Schultz. 

765. *Survey of Space Relations Ability. 

766. *Test of Mechanical Comprehension. Review by 
N. W. Morton. * 

767. XV.G.C. Object Visualization Indicator. 

768. *V.G.C. Space Relations Ability Indicator. 

769. kThe Vincent Mechanical Models Test A (In- 
dustrial). Review by A. Т. Welford. 


MISCELLANEOUS 


E 770. *Employee Evaluation Series. Review by Brent 
axter. 
771. kEmployee Merit Report. 
772. k[Employee Selection Forms.] 
773. *k[Executive, Industrial, and Sales Personnel 
Forms.) Review by Floyd L. Ruch. 
á q^ How Supervise? Review by Milton M. Man- 
e 


775. kInterview Aids: Purdue Vocational Series. 
Review by William W. Waite. 

776. *Ínterview Rating Scale for Prospective Em- 
ployees. Reviews by Jay L. Otis and S. Rains Wallace, 


К. 
777. kKahn Career Orientation Questionnaire: А 
Preliminary to Vocational or Educational Counseling, 
* Student Form. Review by Arthur E. Traxler. 
778. *Lawshe-Kephart Personnel Comparison Sys- 
stem, Review by Reign H. Bittner. 
779. Mathematical and Technical Test. Reviews 
by Charles R. Langmuir and F. W. Warburton. 
780. *Miles Career Evaluation Inventory. 
781. *Patterned Merit Review Form. Reviews by 
Harry W. Karn and Floyd L. Ruch. 
782. kPurdue Blueprint Reading Test: Purdue Vo- 
cational Test, No. 4. 
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783. *The Tear Ballot for Industry: EE 
ions. Review by Brent Baxter. ustry: General Opin- 


784. *Test of Practical Judgment. 


SPECIFIC VOCATIONS 


785-6. *Probst Rating System. Revi i 
iW. Mandell thd Dale Yoder. ae ee 


ACCOUNTING 


787. kAmerican Institute of Accountants Testing 
Program. 


DENTISTRY 
788. *Dental Aptitude Testing Program. 
DRIVERS 


789. *[Driver Selection and Training Forms.] Re- 
view by S. Rains Wallace, Jr. 

790. kRoad Test—In Traffic. 

791. *Traffic and Driving Knowledge. 


EDUCATION 


792. x Aptitude Test for Elementary School Teach- 
ers-in-Training. Reviews by Edwin Wandt and Rob- 
ert M. W. Travers; excerpt from one review. 

793. *Barr-Harris Teacher's. Performance Record. 

794. *The Case of МЧ Murphy: A Case-Study 
Instrument for Evaluating Teachers’ Understanding of 
Child Growth and Development (Second Revised Edi- 
tion). Review by Frank S. Freeman. 

795. *Diagnostic Teacher-Rating Scale. Ў 

796. kExceptional Teacher Service Record. Reviews 
by Edwin Wandt and Leo J. Brueckner. 

797. kGradwate Record Examinations Advanced 
Education Test. + 

98. ЖН от I Counsel. Reviews by Clifford P. Froeh- 
lich and Milton E. Hahn. 1 E 

799. How I Teach: Analysis of Teaching Practices. 
Roe d May V. Seagoe and D. A. Worcester. 

800. kH ow Feach and Learn in College? Experi 
mental Edition. Review by Dean А. Worcester. 


801. XMinnesota Teacher Attitude Inventory. Ке, 


iews by Dwight L. Arnold and Lee J. Cronbach. 
"бз. X National Teacher Examinations. Review by 


Harry N. Rivlin. 


803. *The Purdue Rating Scale for Instruction. Re- а 


view by Kenneth L. Heaton. и 
804. жерше for Rating Effettive Teachers Behanor: 
Reviews by Leo J. Brueckner and Edwin Wir e 
80s. *Teacher Opinionaire on Democracy. 5° 
by VAS e W. Hartmann and C. Robert Pace. mo 
Teaching Aptitude Test: George Washing 
University Series. Review by May V. Seagoe. 
807. x Wilson Teacher-Appraisal Scale. 
ENGINEERING 


808. College Entrance Examination Во p 
Aptitude Test in Spatial Relations. Review by Robe 
L. Thorndi Тув in 

809. ЖС, ollege Entrance Examination Borri Test 

-Engi i omprehensior NT 
pru ped Physical Science Aptitud 

t. iew b hn W. French. , = " 
i I. er rec Examinations Advanced E 

ineering Test. К... 
12 *Pre-Engineering Ability Test. Reviews 
Jerome E. Doppelt and Dewey B. Stuit., Reviews by 

813. Stanford Scientific Aptitude Test. 

Joseph E. Moore and Dewey B. Stuit. 


LAW scion, Ве 
814. XIowa Legal Aptitude Test: 1946 Rensi 


view by Alexander G. Wesman. 


ard Special 
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815. *Law School Admission Test. Review by Alex- 
ander G. Wesman. 
MACHINISTS 
816. *Purdue Test for Machinists and Machine Oper- 
ators: Purdue Vocational Test, No. 2. 
MEDICINE 
817. *Medical College Admission Test, Review by 
Morey J. Wantman. е 
NURSING 5 


818. *George Washington University Series of Nurs- 
ing Tests. 

819. XNLNE Achievement Tests for Basic Profes- 
sional Nursing Program. 

820. XkNLNE Graduate Nurse Qualifying Examina- 


tion. 
821. *NLNE Practical Nurse Achievement Exami- 
nations. 


Vocations 


822. XNLNE [Practical Nurse] Pre-Admission and 
Classification Examination, 

823. XNLNE Pre-Nursing and Guidance Test Bat- 
tery. 


SALESMEN 


824, Aptitude Associates Test of Sales Aptitude: 
A Test for Measuring Knowledge of Basie Principles 
of Selling. Reviews by Milton E. Hahn and Donald С. 
Paterson. 

825. * Aptitude Index. Reviews by Donald С, Pater- 
son and Albert S. Thompson. 

826. *[Bigelow] Hiring Kit. 

827. *Sales Situation Test. 

828, *Steward Selection System, 1951 Edition. Re- 
views by Donald G. Paterson and Albert S. Thomp- 


son. 
829. *Test for Ability to Sell: George Washington 
University Series. ‘ 
830. *Test of Sales Judgment. 


~ 
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